

Galaxy i technologies Inc
AWS Tech Lead / Senior Data Engineer (AWS, Python, PySpark)
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AWS Tech Lead / Senior Data Engineer in Malvern, PA, on a W2 contract for over 6 months, offering a competitive pay rate. Requires 15+ years of experience with AWS, Python, PySpark, and event-driven architectures.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 30, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
On-site
-
📄 - Contract
W2 Contractor
-
🔒 - Security
Unknown
-
📍 - Location detailed
Malvern, PA
-
🧠 - Skills detailed
#Libraries #Terraform #Spark (Apache Spark) #Data Modeling #Monitoring #Datasets #Kafka (Apache Kafka) #ML (Machine Learning) #DynamoDB #Athena #Data Engineering #AWS (Amazon Web Services) #Deployment #GIT #Security #Data Pipeline #Data Science #Data Processing #Observability #Data Quality #Python #Cloud #PySpark #Datadog #AWS S3 (Amazon Simple Storage Service) #Redshift #Airflow #SQL (Structured Query Language) #Batch #Data Security #Data Transformations #IAM (Identity and Access Management) #S3 (Amazon Simple Storage Service) #Java #SNS (Simple Notification Service) #Data Lake #Compliance #DevOps #Lambda (AWS Lambda) #SQS (Simple Queue Service) #"ETL (Extract #Transform #Load)" #Scala
Role description
Hi, Everyone
•
•
•
•
•
• W2 CONTRACT ONLY
•
•
• W2 CONTRACT ONLY
•
•
• W2 CONTRACT ONLY
•
•
•
•
•
• 100% Closure & Long-term project, Immediate Interview Surely
Job Title: AWS Tech Lead / Senior Data Engineer (AWS, Python, PySpark)
Location: Malvern, PA
Contract : w2 Contract / c2c Contract
Local to Malvern, PA only!
JD :
Job Description: Tech Lead - AWS (15+ Years Experience)
Skills: Java| AWS | Python | PySpark | Architecture
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Summary
We are seeking an experienced Tech lead (15+ years) with a strong background in Java, AWS, Python, PySpark, and event-driven architectures. You will design and build scalable batch and streaming data pipelines, optimize cloud data platforms, and deliver high-quality, reliable datasets that support analytics, reporting, and machine learning workloads.
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Key Responsibilities
• Architect, build, and maintain driven data pipelines event- using AWS services such as Kinesis, MSK/Kafka, Lambda, Step Functions, SQS/SNS, and Glue/EMR.
• Develop ETL/ELT workflows using Python and PySpark, ensuring performance, scalability, and cost efficiency.
• Implement and optimize Spark-based data transformations, partitioning strategies, and data processing frameworks.
• Design and manage data lake and warehouse structures using S3, Glue Catalog, Athena, and/or Redshift.
• Build streaming solutions with checkpointing, stateful transformations, idempotency, and schema evolution.
• Ensure high standards of data quality, observability, monitoring, and alerting (CloudWatch, Datadog, etc.).
• Implement data security best practices including IAM, encryption (KMS), networking, and governance.
• Create reusable frameworks, internal libraries, and CI/CD pipelines for automated deployments.
• Collaborate with data scientists, analysts, and business teams to deliver well-modeled, reliable datasets.
• Lead design reviews, mentor junior engineers, and contribute to engineering best practices.
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Required Qualifications
• 15+ years of professional experience.
• Strong expertise in Python and PySpark for large-scale data processing.
• Advanced hands-on experience with AWS (S3, Glue, EMR, Lambda, Step Functions, Kinesis/MSK, DynamoDB, Athena, Redshift).
• Deep experience building event-driven and streaming data pipelines.
• Strong SQL experience for analytical and ETL workloads.
• Hands-on experience with workflow orchestration tools such as Airflow or Step Functions.
• Experience with CI/CD, Git, and Infrastructure-as-Code (Terraform or CloudFormation).
• Strong understanding of distributed systems, Spark performance tuning, data modeling, and cloud cost optimization.
• Knowledge of data security, encryption, networking, and compliance best practices in cloud environments.
Soft Skills
• Strong design and architectural understanding
• Excellent communication and stakeholder interaction skills
• Ability to work in a globally distributed team
Role Descriptions: Sr Developer
Essential Skills: Sr Developer
Desirable Skills:
Keyword:
Skills: Digital : Python~Digital : Amazon Web Service(AWS) Cloud Computing~Digital : DevOps
Experience Required: 8-10
NOTE: Please share your updated resume to c2c@galaxyitech.com
Hi, Everyone
•
•
•
•
•
• W2 CONTRACT ONLY
•
•
• W2 CONTRACT ONLY
•
•
• W2 CONTRACT ONLY
•
•
•
•
•
• 100% Closure & Long-term project, Immediate Interview Surely
Job Title: AWS Tech Lead / Senior Data Engineer (AWS, Python, PySpark)
Location: Malvern, PA
Contract : w2 Contract / c2c Contract
Local to Malvern, PA only!
JD :
Job Description: Tech Lead - AWS (15+ Years Experience)
Skills: Java| AWS | Python | PySpark | Architecture
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Summary
We are seeking an experienced Tech lead (15+ years) with a strong background in Java, AWS, Python, PySpark, and event-driven architectures. You will design and build scalable batch and streaming data pipelines, optimize cloud data platforms, and deliver high-quality, reliable datasets that support analytics, reporting, and machine learning workloads.
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Key Responsibilities
• Architect, build, and maintain driven data pipelines event- using AWS services such as Kinesis, MSK/Kafka, Lambda, Step Functions, SQS/SNS, and Glue/EMR.
• Develop ETL/ELT workflows using Python and PySpark, ensuring performance, scalability, and cost efficiency.
• Implement and optimize Spark-based data transformations, partitioning strategies, and data processing frameworks.
• Design and manage data lake and warehouse structures using S3, Glue Catalog, Athena, and/or Redshift.
• Build streaming solutions with checkpointing, stateful transformations, idempotency, and schema evolution.
• Ensure high standards of data quality, observability, monitoring, and alerting (CloudWatch, Datadog, etc.).
• Implement data security best practices including IAM, encryption (KMS), networking, and governance.
• Create reusable frameworks, internal libraries, and CI/CD pipelines for automated deployments.
• Collaborate with data scientists, analysts, and business teams to deliver well-modeled, reliable datasets.
• Lead design reviews, mentor junior engineers, and contribute to engineering best practices.
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_
Required Qualifications
• 15+ years of professional experience.
• Strong expertise in Python and PySpark for large-scale data processing.
• Advanced hands-on experience with AWS (S3, Glue, EMR, Lambda, Step Functions, Kinesis/MSK, DynamoDB, Athena, Redshift).
• Deep experience building event-driven and streaming data pipelines.
• Strong SQL experience for analytical and ETL workloads.
• Hands-on experience with workflow orchestration tools such as Airflow or Step Functions.
• Experience with CI/CD, Git, and Infrastructure-as-Code (Terraform or CloudFormation).
• Strong understanding of distributed systems, Spark performance tuning, data modeling, and cloud cost optimization.
• Knowledge of data security, encryption, networking, and compliance best practices in cloud environments.
Soft Skills
• Strong design and architectural understanding
• Excellent communication and stakeholder interaction skills
• Ability to work in a globally distributed team
Role Descriptions: Sr Developer
Essential Skills: Sr Developer
Desirable Skills:
Keyword:
Skills: Digital : Python~Digital : Amazon Web Service(AWS) Cloud Computing~Digital : DevOps
Experience Required: 8-10
NOTE: Please share your updated resume to c2c@galaxyitech.com





