New York Technology Partners

Data Engineer (PySpark)

⭐ - Featured Role | Apply directly with Data Freelance Hub
This role is for a Data Engineer (PySpark) on a contract basis, requiring expertise in PySpark, SQL, and AWS. Key skills include building data pipelines, optimizing workflows, and collaborating with global teams. Experience with Airflow and data warehousing is essential.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
April 14, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Unknown
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Irvine, CA
-
🧠 - Skills detailed
#Data Quality #GIT #AWS (Amazon Web Services) #Datasets #Kafka (Apache Kafka) #Version Control #Data Engineering #PySpark #Cloud #Spark (Apache Spark) #Data Modeling #Data Warehouse #SQL (Structured Query Language) #Data Lake #Distributed Computing #Data Processing #Python #Data Pipeline #Redshift #Databricks #Data Governance #Scala #Airflow
Role description
We are looking for a talented Data Engineer to design and deliver scalable data solutions that power analytics and business insights. This role requires deep technical expertise in modern data engineering practices, along with the ability to collaborate effectively with both internal teams and external stakeholders. The ideal candidate is experienced in building robust data pipelines, working with large-scale distributed systems, and optimizing performance across cloud-based data platforms.

Key Responsibilities
• Design, build, and maintain scalable data pipelines and data processing frameworks
• Process and manage large datasets using distributed computing technologies
• Partner with cross-functional teams and stakeholders to understand data needs and deliver effective solutions
• Optimize and tune data workflows across platforms such as Databricks and Kafka
• Implement best practices in data modeling, data warehousing, and data governance
• Develop and manage workflow orchestration using tools like Airflow
• Contribute to global delivery efforts, including coordination with offshore teams

Required Qualifications
• Strong hands-on experience with PySpark, Hive, SQL, and Python
• Experience working with cloud platforms (AWS preferred)
• Proficiency with workflow orchestration tools (e.g., Airflow)
• Experience using version control systems such as Git-based platforms
• Familiarity with MPP data warehouses (e.g., Redshift or similar technologies)
• Solid understanding of data warehousing principles and data modeling techniques
• Exposure to Databricks and modern data lake architectures
• Strong communication skills with the ability to work in client-facing environments
• Experience collaborating with distributed and offshore teams

What We're Looking For
• A problem-solver who can build efficient, reliable, and scalable data systems
• Strong attention to performance optimization and data quality
• Ability to translate business requirements into technical data solutions
• Collaborative mindset with experience working across global teams
• Ownership mentality with a focus on delivering high-quality outcomes
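To give a concrete flavour of the pipeline work described above, here is a minimal PySpark sketch of a batch job that reads raw data from a data lake, applies a SQL transformation, and writes a partitioned output. The bucket, dataset, and column names are illustrative assumptions, not details of the actual engagement.

```python
# Minimal batch-pipeline sketch (hypothetical paths and column names).
from pyspark.sql import SparkSession, functions as F

spark = (
    SparkSession.builder
    .appName("daily_orders_pipeline")  # hypothetical job name
    .getOrCreate()
)

# Read raw events from a hypothetical S3 data lake path.
orders = spark.read.parquet("s3://example-bucket/raw/orders/")

# Basic data-quality filter and a derived date column.
clean = (
    orders
    .filter(F.col("order_id").isNotNull())
    .withColumn("order_date", F.to_date("order_ts"))
)

# SQL step: aggregate revenue per day.
clean.createOrReplaceTempView("orders_clean")
daily = spark.sql("""
    SELECT order_date, SUM(amount) AS revenue
    FROM orders_clean
    GROUP BY order_date
""")

# Write partitioned output to the curated zone of the data lake.
(
    daily.write
    .mode("overwrite")
    .partitionBy("order_date")
    .parquet("s3://example-bucket/curated/daily_revenue/")
)

spark.stop()
```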
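Workflow orchestration with Airflow, mentioned in both the responsibilities and the qualifications, typically follows the pattern sketched below: a daily DAG that submits the Spark job and then runs a validation step. The DAG id, schedule, and file paths are hypothetical, and the `schedule` argument assumes Airflow 2.4+ (older versions use `schedule_interval`).

```python
# Hypothetical orchestration sketch; names and paths are placeholders.
from datetime import datetime
from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="daily_orders_pipeline",
    start_date=datetime(2026, 1, 1),
    schedule="@daily",   # run once per day
    catchup=False,
) as dag:
    # Submit the PySpark job; spark-submit path and script location are assumptions.
    run_spark_job = BashOperator(
        task_id="run_spark_job",
        bash_command="spark-submit /opt/jobs/daily_orders_pipeline.py",
    )

    # Downstream data-quality check (placeholder command).
    validate_output = BashOperator(
        task_id="validate_output",
        bash_command="python /opt/jobs/validate_daily_revenue.py",
    )

    run_spark_job >> validate_output
```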