

firstPRO, Inc
Senior Data Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer on a contract-to-hire basis, located in Philadelphia, PA (Hybrid). Requires 5+ years in Data Engineering, expertise in Databricks, PySpark, and Azure Cloud Services, with a focus on real-time streaming data solutions.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
545
-
🗓️ - Date
June 27, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Hybrid
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Philadelphia, PA
-
🧠 - Skills detailed
#Azure cloud #Batch #SQL (Structured Query Language) #Azure Data Factory #PySpark #Big Data #Deployment #Microservices #Apache Spark #Data Pipeline #Synapse #DevOps #Spark (Apache Spark) #Kubernetes #Data Processing #IoT (Internet of Things) #Scala #Databricks #Cloud #Data Engineering #Datasets #"ETL (Extract #Transform #Load)" #Storage #Data Quality #Spark SQL #Data Ingestion #Azure #Docker #Data Integration #ADF (Azure Data Factory) #Data Science #Delta Lake #ML (Machine Learning) #Data Modeling #AI (Artificial Intelligence) #Security #Computer Science
Role description
Our client is seeking an experienced Data Engineer to join a growing data engineering team responsible for building modern, scalable data platforms that support real-time analytics, operational applications, and AI/ML initiatives. This is an exciting opportunity to work in a medical device IoT environment where your work will directly enable data-driven decisions using live device telemetry and streaming data.
While the team supports traditional batch data processing, the primary focus of this role is designing and developing real-time streaming data solutions, robust data services, and high-performance data pipelines that power APIs, microservices, and machine learning applications.
Location: Philadelphia, PA (Hybrid – 2-3 Days Onsite)
Type: Contract-to-Hire
Responsibilities
• Design, develop, and maintain scalable real-time streaming data pipelines.
• Build data ingestion and processing frameworks for IoT device telemetry and operational data.
• Develop ETL/ELT solutions supporting both cloud and on-premises environments.
• Create and optimize data services that support APIs, microservices, and downstream applications.
• Build data pipelines that enable analytics, reporting, and AI/ML model development.
• Utilize Databricks and Apache Spark technologies to process large-scale datasets efficiently.
• Optimize Spark jobs using PySpark, Spark SQL, and Delta Lake.
• Implement CI/CD pipelines and DevOps best practices for automated deployment and testing.
• Collaborate with software engineers, data scientists, architects, and business stakeholders to deliver scalable data solutions.
• Monitor, troubleshoot, and improve performance, reliability, and scalability of data platforms.
• Ensure data quality, governance, and security across enterprise data environments.
Required Qualifications
• Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field.
• 5+ years of experience in Data Engineering or related data platform development.
• Strong experience building enterprise data pipelines in both cloud and on-premises environments.
• Hands-on expertise with Databricks, including: PySpark, Spark SQL, Delta Lake
• Strong knowledge of Apache Spark and distributed data processing.
• Experience developing ETL/ELT data integration solutions.
• Experience building data services supporting APIs and microservices.
• Experience working with Azure Cloud Services.
• Understanding of DevOps methodologies and CI/CD deployment pipelines.
• Experience working with Big Data technologies and large-scale distributed systems.
• Strong SQL development and data modeling skills.
Preferred Qualifications
• Experience with streaming technologies such as Spark Structured Streaming or similar platforms.
• Experience working with IoT, connected devices, or telemetry data.
• Familiarity with Azure Data Factory, Event Hubs, Azure Storage, or Azure Synapse.
• Experience supporting machine learning or AI data pipelines.
• Knowledge of containerization technologies such as Docker and Kubernetes.
• Experience working in regulated industries.
Our client is seeking an experienced Data Engineer to join a growing data engineering team responsible for building modern, scalable data platforms that support real-time analytics, operational applications, and AI/ML initiatives. This is an exciting opportunity to work in a medical device IoT environment where your work will directly enable data-driven decisions using live device telemetry and streaming data.
While the team supports traditional batch data processing, the primary focus of this role is designing and developing real-time streaming data solutions, robust data services, and high-performance data pipelines that power APIs, microservices, and machine learning applications.
Location: Philadelphia, PA (Hybrid – 2-3 Days Onsite)
Type: Contract-to-Hire
Responsibilities
• Design, develop, and maintain scalable real-time streaming data pipelines.
• Build data ingestion and processing frameworks for IoT device telemetry and operational data.
• Develop ETL/ELT solutions supporting both cloud and on-premises environments.
• Create and optimize data services that support APIs, microservices, and downstream applications.
• Build data pipelines that enable analytics, reporting, and AI/ML model development.
• Utilize Databricks and Apache Spark technologies to process large-scale datasets efficiently.
• Optimize Spark jobs using PySpark, Spark SQL, and Delta Lake.
• Implement CI/CD pipelines and DevOps best practices for automated deployment and testing.
• Collaborate with software engineers, data scientists, architects, and business stakeholders to deliver scalable data solutions.
• Monitor, troubleshoot, and improve performance, reliability, and scalability of data platforms.
• Ensure data quality, governance, and security across enterprise data environments.
Required Qualifications
• Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field.
• 5+ years of experience in Data Engineering or related data platform development.
• Strong experience building enterprise data pipelines in both cloud and on-premises environments.
• Hands-on expertise with Databricks, including: PySpark, Spark SQL, Delta Lake
• Strong knowledge of Apache Spark and distributed data processing.
• Experience developing ETL/ELT data integration solutions.
• Experience building data services supporting APIs and microservices.
• Experience working with Azure Cloud Services.
• Understanding of DevOps methodologies and CI/CD deployment pipelines.
• Experience working with Big Data technologies and large-scale distributed systems.
• Strong SQL development and data modeling skills.
Preferred Qualifications
• Experience with streaming technologies such as Spark Structured Streaming or similar platforms.
• Experience working with IoT, connected devices, or telemetry data.
• Familiarity with Azure Data Factory, Event Hubs, Azure Storage, or Azure Synapse.
• Experience supporting machine learning or AI data pipelines.
• Knowledge of containerization technologies such as Docker and Kubernetes.
• Experience working in regulated industries.





