eStaffing Inc.

Senior Data Engineer (AI)

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer (AI) on a contract through the end of 2026, based in Pittsburgh, PA. Requires 7+ years of experience, expertise in Azure services, Databricks, and MLOps, with strong skills in Python and SQL.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 11, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Hybrid
-
📄 - Contract
W2 Contractor
-
🔒 - Security
Unknown
-
📍 - Location detailed
Pittsburgh, PA
-
🧠 - Skills detailed
#Synapse #SaaS (Software as a Service) #Data Engineering #Azure Data Factory #Deployment #Azure ADLS (Azure Data Lake Storage) #ADLS (Azure Data Lake Storage) #ML (Machine Learning) #Security #Storage #Spark (Apache Spark) #"ETL (Extract #Transform #Load)" #Data Ingestion #Data Pipeline #Data Lake #AI (Artificial Intelligence) #Data Modeling #Python #Azure #Data Processing #Data Architecture #Logging #Data Quality #SQL (Structured Query Language) #Kafka (Apache Kafka) #Scala #Data Science #Azure Event Hubs #Batch #Automation #Data Transformations #Data Governance #Databases #Databricks #PySpark #Monitoring #ADF (Azure Data Factory) #Cloud
Role description
Job Title: Senior Data Engineer (AI)-HYBRID Job Type: Contract + extension Location: Pittsburgh, PA Job Duration: Contract through end of 2026 (extension eligible) !! NO C2C only W2 !! Note: Contract through end of 2026 (extension eligible) Please read the job description carefully Before APPLY! Job Description: We are seeking a Senior Data Engineer with strong expertise in modern data engineering and working knowledge of MLOps practices to support the Knowledge Navigator platform. This individual will play a critical role in building and maintaining scalable data pipelines that feed downstream analytics and AI/ML use cases. The ideal candidate is a hands-on engineer who thrives in Azure-based environments, is experienced with large-scale data processing using Databricks, and can operate effectively within established enterprise architectures. This role requires a strong focus on data reliability, pipeline performance, and adherence to enterprise data governance and security standards, while collaborating across engineering, analytics, and AI teams. Responsibilities • Design, build, and maintain scalable data ingestion and processing pipelines into Azure Data Lake Storage (ADLS) and Databricks • Implement batch and/or streaming data pipelines to support analytics and AI/ML workloads • Develop and optimize data transformations using Spark (PySpark) and SQL within Databricks • Integrate data from enterprise source systems (e.g., APIs, relational databases, SaaS platforms) into centralized data platforms • Ensure high data quality, integrity, and availability for downstream consumers • Monitor, troubleshoot, and optimize pipeline performance, reliability, and cost efficiency • Support pipeline automation, orchestration, and scheduling using tools such as Azure Data Factory or Synapse Pipelines • Collaborate with data scientists and ML engineers to support data and model pipeline workflows • Implement logging, monitoring, and alerting to proactively identify issues in production pipelines • Adhere to enterprise data governance, security, and access control policies • Work within established architectural frameworks while contributing improvements and best practices • Partner with cross-functional teams to ensure stable and scalable data operations Requirements- • 7+ years of professional experience (post-graduate) in data engineering or related field • Strong hands-on experience with Azure data services, including: • Azure Data Lake Storage (ADLS) • Azure Data Factory and/or Synapse Pipelines • Deep experience with Databricks, including Spark / PySpark development and performance optimization • Proven experience building and maintaining enterprise-scale data pipelines • Experience integrating data from diverse enterprise systems (databases, APIs, cloud platforms) • Solid understanding of data architecture principles, data modeling, and ETL/ELT patterns • Experience with pipeline orchestration, automation, and monitoring • Working knowledge of MLOps concepts, including supporting data workflows for model training and deployment • Strong troubleshooting and performance tuning skills in distributed environments • Familiarity with data governance, security, and access control frameworks • Proficiency in Python and SQL • Strong collaboration and communication skills Experience supporting AI/ML or GenAI platforms and workflows Familiarity with CI/CD pipelines for data engineering or ML workflows Experience with real-time/streaming data technologies (e.g., Kafka, Azure Event Hubs)