Vaiticka Solution

Sr. Data Engineer (Healthcare)

This is a remote contract role for a Sr. Data Engineer (Healthcare). It requires 10+ years in data engineering, expertise in healthcare data integration (EDI 837/835, HL7, FHIR), and proficiency in Oracle Cloud Infrastructure (OCI). Strong Python and DevOps skills are also needed.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
456
🗓️ - Date
March 13, 2026
🕒 - Duration
Unknown
🏝️ - Location
Remote
📄 - Contract
Unknown
🔒 - Security
Unknown
📍 - Location detailed
United States
🧠 - Skills detailed
#Documentation #Terraform #Data Ingestion #XML (eXtensible Markup Language) #FHIR (Fast Healthcare Interoperability Resources) #Data Management #Scala #SQL (Structured Query Language) #Data Governance #ETL (Extract, Transform, Load) #Data Architecture #UAT (User Acceptance Testing) #Data Engineering #Oracle Cloud #GIT #Oracle #DevOps #Spark (Apache Spark) #Deployment #Data Lake #Data Quality #Metadata #Python #Delta Lake #Automated Testing #Spark SQL #Observability #Leadership #Cloud #Monitoring #Automation #PySpark #Data Lineage #DataOps #JSON (JavaScript Object Notation) #Data Pipeline #Data Integration
Role description
Position: Sr. Data Engineer (Healthcare)
Location: Remote
Employment: Contract

Job Description:
Seeking a Senior Data Engineer with deep healthcare domain expertise to lead the design and delivery of a large-scale OCI-based Healthcare Analytics Data Lake, integrating clinical and claims data while serving as the primary onshore technical lead for end-to-end development, client coordination, and production deployment.

Roles and Responsibilities:
• We are seeking a highly experienced Senior Data Engineer to support the design and development of a large-scale Healthcare Analytics Data Lake that integrates clinical (HL7, FHIR, CCDA) and claims (EDI 837/835) data.
• The engineer will work onsite as the primary technical liaison, supporting end-to-end development, collaborating with offshore teams, and coordinating with client stakeholders for QA and deployment activities.
• This role requires strong hands-on engineering skills, deep healthcare data expertise, and proficiency with modern cloud-native data architectures, specifically on Oracle Cloud Infrastructure (OCI).

Key Responsibilities:

Data Pipeline Development
• Design and implement large-scale data ingestion, parsing, and transformation pipelines using Python, Spark, PySpark, and Spark SQL.
• Build and optimize metadata-driven pipelines for flexible ingestion and transformation.
• Process multi-format healthcare data including EDI 837/835, HL7 v2, CCDA, and FHIR bundles.

Cloud-Native Engineering (OCI Preferred)
• Develop and operate data pipelines using OCI services: OCI Data Integration, OCI Data Flow (Spark), OCI Delta Lake, OCI Autonomous Database, and OIC Integration Engine for parsing clinical/claims data.
• Ensure performance tuning, scalability, cost optimization, and production stability.

Data Lake & Medallion Architecture
• Build Delta Lake/Parquet-based data lakes following Medallion Architecture (Bronze → Silver → Gold).
• Implement CDC, schema evolution, data quality checks, and validation frameworks.

Data Modelling & Healthcare Domain Expertise
• Develop canonical clinical and claims data models aligned to healthcare CDMs.
• Map and normalize data to industry terminologies such as LOINC, SNOMED CT, ICD-9/10, CPT, and RxNorm.

DevOps, DataOps & Orchestration
• Implement CI/CD pipelines using Git, Terraform, and automated deployment workflows.
• Develop orchestrations/workflows with built-in data lineage, auditability, monitoring, and governance.
• Establish DataOps best practices for automated testing, observability, and metadata management.

Onsite Leadership & Client Coordination
• Act as the primary onshore engineering lead between offshore teams and client stakeholders.
• Facilitate handovers to QA for SIT/UAT, coordinate deployment cycles, and support production readiness.
• Conduct architecture walkthroughs, design reviews, and requirement mapping sessions.

Mandatory skills:
• 10+ years in Data Engineering with strong hands-on development experience
• Expert-level skills in: Python; PySpark / Spark SQL; JSON and XML processing
• Experience with: EDI 837/835, HL7, CCDA, FHIR; Delta Lake, Parquet, schema evolution
• Strong understanding of: data modelling; healthcare CDMs; data governance, lineage, and audit frameworks; metadata-driven architectures; data pipeline orchestration

Cloud & DevOps
• Experience with cloud-native platforms, preferably OCI
• Hands-on with Git, CI/CD, Terraform, DataOps automation

Domain Knowledge
• Familiarity with healthcare terminology standards: LOINC, SNOMED, ICD, CPT, RxNorm

Soft Skills
• Strong communication, client-facing presence, and ability to work independently onsite
• Ability to coordinate offshore development teams
• Excellent documentation and technical leadership capability