

Data Lake AWS Engineer
Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Lakehouse Engineer (AWS + Python + DevOps) on a long-term contract in Dallas, TX, requiring independent visa holders. Key skills include AWS Lake Formation, Python, PySpark, and DevOps tools.
Country: United States
Currency: $ USD
Day rate: -
Date discovered: July 12, 2025
Project duration: Unknown
Location type: On-site
Contract type: Unknown
Security clearance: Unknown
Location detailed: Dallas, TX
Skills detailed: #Data Storage #Spark (Apache Spark) #DynamoDB #Python #Version Control #Data Governance #AWS Glue #Terraform #Batch #Scala #AWS IAM (AWS Identity and Access Management) #Programming #Infrastructure as Code (IaC) #Data Lake #PySpark #Data Engineering #AWS Lambda #Data Lakehouse #DevOps #S3 (Amazon Simple Storage Service) #Cloud #IAM (Identity and Access Management) #GitLab #Security #AWS (Amazon Web Services) #ETL (Extract, Transform, Load) #Lambda (AWS Lambda) #Storage #Data Processing
Role description
Job Title: Data Lakehouse Engineer (AWS + Python + DevOps)
Location: Dallas, TX - 1 week onsite every quarter
Type: Long Term Contract
Note: Looking for independent visa holders
Job Summary:
We are seeking a highly skilled Data Lakehouse Engineer to join our team and help solve critical business and technology challenges. You will play a key role in building and maintaining a scalable Data Lakehouse solution that ensures the right data reaches the right users at the right time, empowering both business and technical teams with trusted, governed data.
Key Responsibilities
• Design and implement a modern Data Lakehouse architecture using AWS-native services.
• Develop and maintain ETL pipelines using AWS Glue, Lambda, Step Functions, and Python.
• Configure and manage AWS Lake Formation for secure, governed access to data in S3.
• Integrate and optimize data workflows using PySpark and serverless technologies.
• Build and deploy infrastructure as code using CloudFormation, Terraform, Stacker, and the Serverless Framework.
• Collaborate with DevOps teams using GitLab for version control and CI/CD pipelines.
• Work with IAM for access control and DynamoDB for data storage requirements.
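As an illustrative sketch only (not part of the posting), the Lambda piece of such a pipeline often starts by unpacking an S3 event notification into bucket/key pairs that a downstream Glue job or Step Functions state can consume. All names below are hypothetical:

```python
import json
import urllib.parse


def lambda_handler(event, context):
    """Extract bucket/key pairs from an S3 event notification.

    Returns a JSON payload a downstream Glue job or Step Functions
    state could consume; purely a sketch of the pattern.
    """
    objects = []
    for record in event.get("Records", []):
        s3 = record["s3"]
        objects.append({
            "bucket": s3["bucket"]["name"],
            # S3 event keys arrive URL-encoded (spaces become '+')
            "key": urllib.parse.unquote_plus(s3["object"]["key"]),
        })
    return {"statusCode": 200, "body": json.dumps(objects)}
```

In practice a handler like this would be wired to an S3 `ObjectCreated` notification and hand its output to the next state in the workflow.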
Technical Skills Required
Cloud & Data Engineering (AWS):
• AWS Lake Formation
• S3
• AWS Glue (Crawler, Catalog, Registry, Glue Jobs)
• AWS Step Functions
• AWS Lambda
• AWS IAM
• DynamoDB
Programming & Data Processing:
• Python (AWS-specific development)
• PySpark
DevOps & Infrastructure as Code:
• GitLab (CI/CD)
• Serverless Framework
• Stacker
• CloudFormation
• Terraform
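For a sense of what the infrastructure-as-code work involves, a minimal CloudFormation fragment (illustrative only, with hypothetical resource names) might declare the storage and catalog layer of a lakehouse like so:

```yaml
AWSTemplateFormatVersion: "2010-09-09"
Description: Minimal data-lakehouse storage layer (illustrative sketch)
Resources:
  # Versioned S3 bucket serving as the raw landing zone
  RawZoneBucket:
    Type: AWS::S3::Bucket
    Properties:
      BucketName: !Sub "${AWS::StackName}-raw-zone"
      VersioningConfiguration:
        Status: Enabled
  # Glue Data Catalog database over the raw zone
  LakehouseDatabase:
    Type: AWS::Glue::Database
    Properties:
      CatalogId: !Ref AWS::AccountId
      DatabaseInput:
        Name: lakehouse_raw
```

The same resources could equally be expressed in Terraform or deployed via the Serverless Framework; Lake Formation permissions would typically be layered on top of the catalog database.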
Preferred Qualifications
• Proven experience in building data lakehouse or lake-based data platforms on AWS.
• Strong knowledge of data governance and access management using AWS Lake Formation.
• Hands-on experience with real-time and batch data processing.
• Familiarity with best practices in cloud security, DevOps, and CI/CD pipelines.
• Excellent problem-solving, communication, and collaboration skills.