TalentOla

Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Data Engineer with 10+ years of experience in AWS Glue and Snowflake. It is a contract position based in Midtown Manhattan, NYC, offering a hybrid work model. Key skills include Python, PySpark, and SQL. AWS certification preferred.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date

June 25, 2026

🕒 - Duration

Unknown

🏝️ - Location

Hybrid

📄 - Contract

Unknown

🔒 - Security

Unknown

📍 - Location detailed

New York City Metropolitan Area

🧠 - Skills detailed

#Deployment #Airflow #Scala #Data Management #Docker #dbt (data build tool) #Lambda (AWS Lambda) #Python #Cloud #Metadata #Data Engineering #GIT #Data Pipeline #Terraform #Snowflake #AWS Glue #Infrastructure as Code (IaC) #Data Quality #Kafka (Apache Kafka) #Programming #Data Warehouse #S3 (Amazon Simple Storage Service) #Spark (Apache Spark) #"ETL (Extract #Transform #Load)" #Data Ingestion #SQL (Structured Query Language) #DevOps #Monitoring #AWS (Amazon Web Services) #Apache Spark #Data Processing #PySpark #IAM (Identity and Access Management) #Computer Science #Data Lineage #Data Integration #BI (Business Intelligence) #Automation #Datasets #Data Catalog

Role description

Role : Data Engineer – AWS Glue & Snowflake Location – Midtown Manhattan, NYC- preferred at least 2 days onsite in a week Contract role Looking for 10+ years of experience Position Summary We are seeking a highly skilled Senior Data Engineer with deep expertise in AWS Glue and cloud-based data engineering to design, build, and optimize scalable data integration solutions. This role will be responsible for developing enterprise-grade ETL/ELT pipelines, managing large-scale data movement, and supporting modern analytics platforms built on AWS and Snowflake. The ideal candidate combines strong software engineering practices with hands-on experience in AWS data services, PySpark development, and cloud-native architecture. Key Responsibilities Data Pipeline Development • Design, develop, and maintain scalable ETL/ELT pipelines using AWS Glue. • Build and optimize Glue Jobs, Crawlers, Data Catalogs, and Glue Workflows. • Develop PySpark and Python-based transformation logic for large-scale data processing. • Create reusable data ingestion frameworks and automation capabilities. • Implement robust error handling, monitoring, and recovery mechanisms. AWS Data Platform Engineering • Design and maintain data solutions leveraging AWS services including: • AWS Glue • Amazon S3 • IAM • Lake Formation • CloudWatch • Lambda • Step Functions • EventBridge • Ensure secure, scalable, and cost-effective data platform operations. • Optimize AWS resource utilization and processing performance. Snowflake Integration • Build and maintain data pipelines that ingest, transform, and load data into Snowflake. • Optimize Snowflake loading patterns, staging strategies, and transformation workflows. • Collaborate with data modelers and analytics teams to support reporting and business intelligence initiatives. Data Quality & Governance • Implement data quality controls, validation routines, and monitoring processes. • Support data lineage, metadata management, and governance requirements. • Troubleshoot data anomalies and performance bottlenecks across the platform. Engineering & DevOps • Develop solutions using Infrastructure as Code and CI/CD best practices. • Utilize Git-based source control and automated deployment pipelines. • Participate in architecture reviews and technical design discussions. • Mentor junior engineers and promote engineering standards. Required Qualifications • Bachelor's degree in Computer Science, Information Systems, Engineering, or related field. • 5+ years of experience in Data Engineering or ETL development. • 3+ years of hands-on AWS Glue development experience. • Strong Python and PySpark programming skills. • Advanced SQL development skills. • Experience building cloud-native data pipelines on AWS. • Experience integrating with Snowflake or similar cloud data warehouses. • Knowledge of data warehousing concepts and dimensional modeling. • Experience working with large datasets and distributed processing frameworks. Preferred Qualifications • AWS Certified Data Engineer – Associate or Specialty certification. • Experience with Apache Spark optimization and tuning. • Experience with dbt. • Experience with Airflow, Step Functions, or other orchestration tools. • Experience with Kafka, Kinesis, or streaming data platforms. • Experience with healthcare, insurance, financial services, or other regulated industries. • Familiarity with Kimball dimensional modeling methodologies. Technical Skills Required • AWS Glue • PySpark • Python • SQL • Amazon S3 • Snowflake • Data Warehousing • ETL/ELT Development • Git Preferred • dbt • Airflow • Step Functions • Lambda • Lake Formation • Terraform • CloudFormation • Kafka/Kinesis • Docker Success Factors • Strong software engineering mindset. • Expertise in building reliable and scalable data pipelines. • Ability to troubleshoot complex data integration issues. • Strong communication and collaboration skills. • Focus on automation, performance optimization, and operational excellence. Ideal Candidate Profile An engineer who can independently design a data ingestion framework, write PySpark transformations in AWS Glue, optimize Snowflake loads, automate deployments through CI/CD, and serve as the technical expert for enterprise data integration initiatives.

Apply now Apply with DFH

← See all roles