Senior Data Engineer for Healthcare Data Platform

rapid eagle inc • United State
Remote
Apply
AI Summary

We are seeking a Senior Data Engineer to build a modern healthcare data platform. The ideal candidate will have strong Databricks and streaming experience. Key responsibilities include porting a high-volume event pipeline into Databricks, integrating with Kafka, and implementing complex patient matching and event handling logic.

Key Highlights
Port high-volume event pipeline into Databricks
Integrate with Kafka
Implement complex patient matching and event handling logic
Key Responsibilities
Port the core ingestion pipeline into a Databricks-native architecture conforming to standard ingestion layer patterns
Implement per-job success/failure tracking and metrics capture in alignment with platform engineering standards
Integrate with bulk patient matching libraries to accurately process patient update signals and lifecycle events
Build event handlers for patient merges, practice merges, and facility-level changes
Develop Databricks-to-Kafka (D2K) jobs for ingestion model outputs and downstream event streams
Technical Skills Required
Databricks Delta Lake Jobs cluster management Notebooks Apache Kafka PySpark Spark Structured Streaming Python Delta Live Tables Databricks Asset Bundles CI/CD for Databricks Confluent Kafka AWS MSK dbt on Databricks
Benefits & Perks
401(k) matching
Dental insurance
Health insurance
Nice to Have
Delta Live Tables (DLT)
Databricks Asset Bundles / CI/CD for Databricks
Confluent Kafka or AWS MSK
dbt on Databricks
Healthcare data or patient identity matching experience

Job Description


Benefits:

  • 401(k) matching
  • Dental insurance
  • Health insurance


AWS Healthcare Data Engineer

100% Remote

Skills:-

We are looking for a Senior Data Engineer with strong Databricks and streaming experience to build the core

ingestion layer of a modern healthcare data platform. You will port a high-volume event pipeline into

Databricks, integrate with Kafka, and implement complex patient matching and event handling logic.

Key Responsibilities

  • Port the core ingestion pipeline into a Databricks-native architecture conforming to standard ingestion


layer patterns

  • Implement per-job success/failure tracking and metrics capture in alignment with platform engineering


standards

  • Integrate with bulk patient matching libraries to accurately process patient update signals and lifecycle


events

  • Build event handlers for patient merges, practice merges, and facility-level changes
  • Develop Databricks-to-Kafka (D2K) jobs for ingestion model outputs and downstream event streams
  • Ensure the solution is low-maintenance, well-documented, and observable in production


Required Skills & Experience

  • Databricks — Delta Lake, Jobs, cluster management, Notebooks
  • Apache Kafka — producer/consumer patterns, event-driven architecture
  • PySpark / Spark Structured Streaming
  • Python — advanced data engineering
  • High-volume stateful event stream processing
  • Experience porting or refactoring large-scale data pipelines


NICE TO HAVE

  • Delta Live Tables (DLT)
  • Databricks Asset Bundles / CI/CD for Databricks
  • Confluent Kafka or AWS MSK
  • dbt on Databricks
  • Healthcare data or patient identity matching experience


This is a remote position.

Similar Jobs

Explore other opportunities that match your interests

AI Field Engineer (Enterprise)

Devops
•
31m ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Entry level

medilinkers llc

United State
Visa Sponsorship Relocation Remote
Job Type Part-time
Experience Level Not Applicable

agentic ai: wir bauen apps und...

United State

AI Field Engineer (Enterprise)

Devops
•
5h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Entry level

medilinkers llc

United State

Subscribe our newsletter

New Things Will Always Update Regularly