Senior Data Engineer / Databricks Developer (DQX & Clinical Data)

Remote
Apply
AI Summary

Design and implement a scalable Data Quality Monitoring framework using DQX on Databricks for clinical data domains. Build data pipelines, create dashboards, and establish data governance across Study Personnel & Milestones. Must have strong Databricks, Spark, SQL, Python, and Delta Lake experience with clinical data background.

Key Highlights
Databricks-native Data Quality Monitoring with DQX
Clinical data experience mandatory (pharma/hospital)
End-to-end ownership of data pipelines and dashboards
Key Responsibilities
Design and implement a Data Quality Monitoring framework using DQX on Databricks
Develop and configure data quality rules
Build scalable data pipelines (ETL/ELT) and data models
Create dashboards and reporting layers for business users
Ensure linkage between data quality rules, business usage, and critical data elements
Document technical architecture, rule logic, and governance model
Support scaling across additional clinical data domains
Collaborate with both technical teams and business stakeholders
Technical Skills Required
Databricks Spark SQL Python
Benefits & Perks
100% Remote international project
High-impact role with visibility and ownership
Nice to Have
Experience in regulated or compliance-heavy environments
Knowledge of clinical operations / trial data / life sciences
Experience with dashboarding and data quality reporting
Familiarity with Azure DevOps / CI-CD
Experience with APIs, integrations, and downstream systems
Knowledge of Snowflake
Experience with Veeva or similar clinical systems

Job Description


Data Engineer / Databricks Developer (DQX & Clinical Data)

Remote | International Project



About the Rol

eWe are looking for a Senior Data Engineer / Databricks Developer to join an international, fully remote project focused on building a DQX-based Data Quality Monitoring solution within a clinical (pharma/hospital) environment

.This role will play a key part in designing and implementing a scalable framework that allows business users and data owners to monitor, understand, and proactively manage data quality across critical clinical data domains


.
Project Overvi

ewThe objective is to develop a Databricks-native Data Quality Monitoring capability leveraging DQX t

  • o:Enable business users to monitor data quality via dashboards and tren
  • dsEstablish a clear link between data input, usage, and business-critical rul
  • esAllow data owners to define and enforce data quality standar
  • dsProvide transparency into data quality performance over ti
  • meIdentify recurring issues and critical data poin

tsThe initial scope focuses on Clinical Study Management (Study Personnel & Milestones), with a roadmap to scale across additional clinical domain


s.
Key Responsibilit

  • iesDesign and implement a Data Quality Monitoring framework using DQX on Databri
  • cksDevelop and configure data quality ru
  • lesBuild scalable data pipelines (ETL/ELT) and data mod
  • elsCreate dashboards and reporting layers for business us
  • ersEnsure linkage between data quality rules, business usage, and critical data eleme
  • ntsDocument technical architecture, rule logic, and governance mo
  • delSupport scaling across additional clinical data doma
  • insCollaborate with both technical teams and business stakehold


ers
Must-Have Sk

  • illsStrong hands-on experience with Databr
  • icksExpertise in Spark, SQL, Python, and Delta
  • LakeProven experience designing and implementing data pipelines (ETL/
  • ELT)Experience with data quality frameworks (DQX is mandat
  • ory)Solid understanding of data governance and data owner
  • shipExperience translating business requirements into technical solut
  • ionsExperience working in complex enterprise environm
  • ents✅ Fluent English (mandat
  • ory)✅ Mandatory experience with clinical data (pharma, clinical trials, or hospital environme


nts)
Nice-to-Have S

  • killsExperience in regulated or compliance-heavy environ
  • mentsKnowledge of clinical operations / trial data / life sci
  • encesExperience with dashboarding and data quality repo
  • rtingFamiliarity with Azure DevOps /
  • CI-CDExperience with APIs, integrations, and downstream sy
  • stemsKnowledge of Snow
  • flakeExperience with Veeva or similar clinical sy


stems
Tech Stack / Ke

ywordsDatabricks · DQX · Spark · SQL · Python · Delta Lake · ETL/ELT · Data Pipelines · Azure · Azure DevOps · Snowflake · Data Governance · Data Engineering · Cloud · Bi


g Data
Profile We’re Look

  • ing ForSenior, self-driven professional with an end-to-end ownership
  • mindsetStrong analytical and problem-solving
  • skillsComfortable working in dynamic, evolving envir
  • onmentsAbility to communicate effectively with both technical and business au
  • diencesProactive, structured, and solution-oriented


mindset
What

  • We Offer🌍 100% Remote international
  • project📈 Opportunity to shape a scalable, enterprise-grade data quality
  • solution🤝 Collaboration with global stakeholders in a clinical/life sciences env
  • ironment🚀 High-impact role with visibility and o


wnership

Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

KCS iT

Portugal

Junior Data Engineer

Data Science
5d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

mau

Portugal
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

bridge351

Portugal

Subscribe our newsletter

New Things Will Always Update Regularly