Data Platform Lead
Sober Sidekick
Role Overview
We’re seeking a full-stack data professional to join our fast-growing team to build, scale, and optimize our data infrastructure and insights. This role will immediately own critical ETL frameworks for file ingestion and support analytics across clinical, claims, and member engagement data. You’ll also play a key role in our next evolution: building intelligent systems that help us personalize care pathways, match members to the right support, and unlock insight from our own rich data sources. This role blends architecture and hands-on implementation, ideal for someone excited by greenfield builds, machine learning, and applying modern data tools to real-world healthcare problems.
What You’ll Do
- Architect and implement a scalable ETL ingestion framework for file ingestion and transformations
- Design and build a unified data lake, linking disparate datasets used in supporting internal and external facing dashboards for clinical programs
- Create modular, reusable architecture that scales with new programs and data sources.
- Collaborate with product and engineering teams to support future initiatives such as classifying and analyzing sentiment across user posts, developing matching algorithms to pair members in sobriety with ideal recovery navigators, building adaptive engagement journey algorithms for optimizing outcomes
- Lead data governance efforts, establishing best practices for data quality, security, and access
- Create and maintain clear technical documentation and data dictionaries.
Who We Are
Empathy Health Technology ensures no one faces life’s challenges alone. Our mission is to scale the most epic wave of comeback stories the world has ever seen by combining compassionate support with innovative technology to reduce isolation and improve well-being. Through our flagship platform, Sober Sidekick, we provide access to 24/7 peer support, 12-step recovery meetings, and personalized resources, creating a supportive, stigma-free space where people can connect and heal in real time. We partner with health plans, providers, and Employee Assistance Programs to deliver scalable, outcomes-driven solutions. Our vision is a world where support is accessible to all, unlocking resilience, fostering connection, and empowering people to thrive.
Who We’re Looking For
- 4+ years of experience in data engineering and/or data science-related roles.
- Strong SQL, Python, and programming skills.
- Experience building and maintaining ETL frameworks, data pipelines, and data warehousing.
- Familiarity and experience with Airflow, Kafka, dbt, Spark, Docker, Kubernetes, Terraform, or managed service alternatives
- Experience using Google Cloud (GCS, Cloud Functions, Dataflow, BigQuery)
- Hands-on experience with modern data modeling tools (Snowflake, Databricks)
- Experience with healthcare data (Ex: HIPAA compliance, PHI security best practices, claims data, EMR data, eligibility files)
- Experience with machine learning frameworks (Scikit, TensorFlow, PyTorch, etc..)
- Experience working with or fine-tuning large language models (LLMs)
- Familiarity with BI tools (Domo, Amplitude, Looker, etc.)
Why This Role
- Shape the long-term data strategy of a growing organization on the front lines of healthcare transformation.
- High visibility, real ownership, and a seat at the table as we build.
- Opportunity to make a meaningful impact on people’s lives through thoughtful technology.
- Work alongside a small, mission-driven team where your input matters.