

AWS AI Data Engineer
Featured Role | Apply direct with Data Freelance Hub
This role is for an AWS AI Data Engineer on a remote contract basis, requiring expertise in AWS ETL services, Python or Scala, and machine learning frameworks. Key skills include data classification and AWS infrastructure management, with a focus on AI/ML workflows.
Country: United States
Currency: $ USD
Day rate: -
Date discovered: May 31, 2025
Project duration: Unknown
Location type: Remote
Contract type: Unknown
Security clearance: Unknown
Location detailed: United States
Skills detailed
#Cloud #EC2 #Indexing #S3 (Amazon Simple Storage Service) #Classification #Compliance #Databases #RDS (Amazon Relational Database Service) #Scala #Programming #Data Processing #ML (Machine Learning) #Python #Data Privacy #PyTorch #Datasets #SageMaker #AI (Artificial Intelligence) #Data Engineering #ETL (Extract, Transform, Load) #Lambda (AWS Lambda) #AWS (Amazon Web Services) #TensorFlow #AWS Glue #Data Pipeline #Data Architecture #VPC (Virtual Private Cloud) #IAM (Identity and Access Management) #AWS IAM (AWS Identity and Access Management)
Role description
Job Title: AWS AI Data Engineer
Location: Remote in US
Job Type: Contract
Role Scope
We are looking for an experienced AWS AI Data Engineer to join our dynamic team, responsible for developing, managing, and optimizing data architectures that support AI and Machine Learning (ML) workflows. The ideal candidate will have extensive experience in integrating large-scale datasets, building scalable and automated data pipelines, and working with advanced ML frameworks and tools. The candidate should also have experience with AWS ETL services (such as AWS Glue, Lambda, and Data Pipeline) to handle data processing and integration tasks effectively.
Overview: It's one of the workstreams of Project Acuity. The PASD Data Platform includes a centralized web application for internal PASD users across the Recruitment Business to support marketing and operational use cases. Building a database at the patient level will provide significant benefit to PASD's future reporting capabilities and engagement of external stakeholders.
Must Have Skills
• Proficiency in programming languages such as Python, Scala, or similar.
• Solid understanding of machine learning frameworks such as TensorFlow and PyTorch.
• Strong experience in data classification, including the identification of PII data entities.
• Knowledge and experience with retrieval-augmented generation (RAG) and agent-based workflows.
• Deep understanding of how to re-rank and improve LLM outputs using index and vector stores.
• Ability to leverage AWS services (e.g., SageMaker, Comprehend, Entity Resolution) to solve complex data and AI-related challenges.
• Ability to manage and deploy machine learning models and frameworks at scale using AWS infrastructure.
• Strong analytical and problem-solving skills, with the ability to innovate and develop new approaches to data engineering and AI/ML.
• Experience with AWS ETL services (such as AWS Glue, Lambda, and Data Pipeline) to handle data processing and integration tasks effectively.
• Experience in core AWS services including AWS IAM, VPC, EC2, S3, RDS, Lambda, CloudWatch, and CloudTrail.
Nice To Have Skills
• Experience with data privacy and compliance requirements, especially related to PII data.
• Familiarity with advanced data indexing techniques, vector databases, and other technologies that improve the quality of AI/ML outputs.