

AI Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Data Engineer with a 6+ year data engineering background, focusing on generative AI technologies. Contract length is W2 only, pay rate is unspecified, and it's a hybrid position in Reston, VA.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
July 12, 2025
π - Project duration
Unknown
-
ποΈ - Location type
Hybrid
-
π - Contract type
W2 Contractor
-
π - Security clearance
Unknown
-
π - Location detailed
Reston, VA
-
π§ - Skills detailed
#Databases #Data Quality #ML (Machine Learning) #Python #Computer Science #Scala #Monitoring #Teradata #Snowflake #Data Engineering #Data Pipeline #AI (Artificial Intelligence) #Cloud #Graph Databases #AWS (Amazon Web Services) #Amazon Neptune #Neo4J #"ETL (Extract #Transform #Load)" #Data Processing
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Hi Everyone,
CloudIngest is hiring for AI Data Engineer
Interested Candidates can share your profile to jyothi@cloudingest.com
Role: AI Data Engineer
Location: Hybrid 3 days/week in Reston, VA
Visa: USC, GC, H4EAD, and L1 only
Contract: W2 Only
Job Description:
We are seeking a talented and motivated AI Data Engineer to join our innovation-focused team. Youβll contribute to cutting-edge generative AI systems and scalable AI pipelines using state-of-the-art cloud and open-source tools. Ideal candidates will have hands-on experience with prompt engineering, unstructured data processing, and agentic workflows, and will be familiar with integrating vector and graph databases into AI solutions.
Primary Responsibilities
β’ Design and develop scalable AI/ML data pipelines, particularly for Retrieval-Augmented Generation (RAG) architectures, using Python, Snowflake, and AWS services including Bedrock
β’ Build and manage data pipelines for ingesting and transforming structured, semi-structured, and unstructured data into Snowflake, vector DBs (e.g., Pinecone, Weaviate, FAISS), and graph DBs (e.g., Neo4j, Amazon Neptune)
β’ Implement data quality, validation, and monitoring mechanisms
β’ Develop and deploy end-to-end generative AI pipelines
β’ Engineer solutions for processing unstructured data and building autonomous agent workflows
β’ Collaborate with product and engineering teams to integrate AI models into existing systems
β’ Stay current with GenAI trends and apply best practices in production environments
β’ Mentor junior team members on GenAI technologies and data engineering techniques
β’ Apply GenAI to insurance-specific use cases when relevant
Required Qualifications
β’ Bachelor's degree in Computer Science, AI, or related field
β’ 6+ years of data engineering experience
β’ Direct or strong exposure to generative AI technologies
β’ Proven delivery of enterprise-grade GenAI pipelines
β’ Familiarity with prompt engineering and RAG pipeline implementation
β’ Experience with vector and graph databases for AI applications
β’ Ability to develop solutions handling unstructured data
β’ Experience with agentic AI workflows
Preferred Tools & Technologies
β’ Python
β’ Snowflake
β’ Teradata
β’ AWS (including Bedrock)
β’ Vector databases: Pinecone, Weaviate, FAISS
β’ Graph databases: Neo4j, Amazon Neptune