

Lorven Technologies Inc.
Lead Data Engineer
This role is for a Lead Data Engineer with 13+ years of experience, focusing on Python, PySpark, and AWS. The contract duration is unspecified, and the role is remote. Key skills include data pipeline optimization and AI/ML integration.
🌎 Country: United States
💱 Currency: $ USD
💰 Day rate: Unknown
🗓️ Date: February 19, 2026
🕒 Duration: Unknown
🏝️ Location: Remote
📄 Contract: Unknown
🔒 Security: Unknown
📍 Location detailed: Atlanta, GA
🧠 Skills detailed: #Data Science #Scala #ML (Machine Learning) #Data Processing #Automation #Data Quality #Data Engineering #Data Modeling #Metadata #S3 (Amazon Simple Storage Service) #Python #Data Pipeline #Cloud #PySpark #Batch #AI (Artificial Intelligence) #ETL (Extract, Transform, Load) #Spark (Apache Spark) #AWS (Amazon Web Services) #Datasets #SQL (Structured Query Language)
Role description
Job Title: Data Engineer – Generative AI
Location: Charlotte, NC or Atlanta, GA (Remote)
Duration: Contract
Experience: 13+ years
Job Description:
• Design, develop, and optimize data pipelines using Python and PySpark for batch and incremental processing.
• Build and manage AWS-based data solutions leveraging services such as S3, Glue, and cloud-native processing frameworks.
• Prepare, transform, and curate datasets to support AI/ML and GenAI model development.
• Integrate data pipelines with AI/ML workflows, ensuring data quality, consistency, and traceability.
• Implement data validation, profiling, and performance tuning to improve reliability and scalability.
• Collaborate with data scientists, ML engineers, and platform teams to deliver end-to-end GenAI solutions.
Required Qualifications
• Strong hands-on experience with Python for data engineering and automation.
• Proven expertise in PySpark / Spark for large-scale data processing.
• Experience working in AWS cloud environments for data engineering workloads.
• Solid understanding of data engineering fundamentals, including ETL, data modeling, and performance optimization.
• Experience supporting or working alongside AI/ML or GenAI initiatives.
Nice to Have
• Exposure to GenAI pipelines, model data preparation, or LLM-driven workflows.
• Experience with CI/CD, data quality frameworks, or cloud cost optimization.
• Familiarity with SQL-based analytics and metadata-driven data processing.






