

Ravh-IT
Senior Data Engineer – GenAI
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer – GenAI in Irvine, offering a contract position with a pay rate of "unknown." Candidates should have 10–15+ years of experience, strong skills in Python, SQL, RAG, LangChain, and vector databases.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 26, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
On-site
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Irvine, CA
-
🧠 - Skills detailed
#Athena #Data Lake #Data Quality #"ETL (Extract #Transform #Load)" #Spark (Apache Spark) #SQL (Structured Query Language) #Data Pipeline #Knowledge Graph #AWS Glue #Docker #Dataflow #REST (Representational State Transfer) #Snowflake #ADF (Azure Data Factory) #Kafka (Apache Kafka) #Cloud #Kubernetes #Data Engineering #GCP (Google Cloud Platform) #Azure #Microsoft Azure #Azure DevOps #Synapse #FastAPI #MLflow #NLP (Natural Language Processing) #Deployment #S3 (Amazon Simple Storage Service) #GIT #Hugging Face #Python #Langchain #dbt (data build tool) #AWS (Amazon Web Services) #Databases #Databricks #Scala #ADLS (Azure Data Lake Storage) #BigQuery #REST API #ML (Machine Learning) #Security #Apache Spark #GitHub #AI (Artificial Intelligence) #DevOps #Airflow #Lambda (AWS Lambda) #Data Science #Apache Airflow #Model Deployment #PySpark #Delta Lake
Role description
Job Title: Senior Data Engineer – GenAI / RAG / LangChain / LangGraph
Location: Irvine
Experience: 10–15+ Years
Employment Type: Contract
Job Description
We are seeking a highly experienced Senior Data Engineer with expertise in modern data engineering and Generative AI technologies. The ideal candidate should have hands-on experience designing scalable data platforms while building AI-powered applications using RAG (Retrieval-Augmented Generation), LangChain, LangGraph, LLMs, and Vector Databases.
The candidate should possess strong cloud data engineering expertise along with practical experience integrating Large Language Models into enterprise applications.
Mandatory Skills
Data Engineering
• 8+ years of experience in Data Engineering
• Strong expertise in Python and SQL
• Apache Spark / PySpark
• Databricks
• ETL/ELT Pipeline Development
• Delta Lake
• Data Warehousing & Data Lake Architecture
• Apache Airflow or equivalent orchestration tools
• CI/CD for Data Pipelines
• Git / Azure DevOps / GitHub
Cloud Platforms (Any One)
• Microsoft Azure (ADF, Synapse, ADLS)
• AWS (Glue, EMR, Lambda, S3, Athena)
• Google Cloud Platform (BigQuery, Dataflow)
Generative AI / LLM
• Hands-on experience building RAG (Retrieval-Augmented Generation) solutions
• LangChain
• LangGraph
• OpenAI / Azure OpenAI / Anthropic Claude / Gemini APIs
• Prompt Engineering
• AI Agents / Multi-Agent Workflows
• LLM Orchestration
• Function Calling / Tool Calling
• LLM Evaluation and Optimization
Vector Databases
Experience with one or more:
• Pinecone
• ChromaDB
• FAISS
• Weaviate
• Milvus
• Azure AI Search
AI/ML
• Machine Learning fundamentals
• Embedding Models
• Semantic Search
• Document Processing
• NLP
• Model Deployment (preferred)
Additional Skills
• REST APIs / FastAPI
• Docker
• Kubernetes (Preferred)
• MLflow
• Kafka (Preferred)
Responsibilities
• Design and develop scalable enterprise data pipelines.
• Build Retrieval-Augmented Generation (RAG) applications.
• Develop AI Agents using LangChain and LangGraph.
• Integrate enterprise data sources with LLMs.
• Build semantic search solutions using vector databases.
• Optimize prompt engineering and LLM performance.
• Work with structured and unstructured data sources.
• Collaborate with Data Scientists, ML Engineers, and Business stakeholders.
• Ensure data quality, governance, scalability, and security.
Preferred Experience
• Financial Services / Asset Management
• Banking
• Healthcare
• Insurance
• Retail
• Manufacturing
Nice to Have
• Microsoft Fabric
• Snowflake
• DBT
• MLOps
• Hugging Face
• LlamaIndex
• CrewAI / AutoGen
• MCP (Model Context Protocol)
• Knowledge Graphs
• GraphRAG
Recruiter Screening Checklist
Candidates must have:
• ✔️ 8+ years of Data Engineering experience
• ✔️ Strong Python & SQL
• ✔️ Databricks / Spark
• ✔️ Azure or AWS
• ✔️ RAG implementation experience
• ✔️ LangChain
• ✔️ LangGraph
• ✔️ OpenAI / Azure OpenAI
• ✔️ Vector Database experience
• ✔️ AI Agent development
• ✔️ Production deployment of LLM applications
• ✔️ Strong communication skills
Search Keywords for Recruiters
Senior Data Engineer, GenAI Engineer, AI Engineer, LLM Engineer, RAG Engineer, LangChain, LangGraph, OpenAI, Azure OpenAI, Databricks, PySpark, Python, SQL, AI Agents, Vector Database, Pinecone, ChromaDB, FAISS, Weaviate, Azure AI Search, LlamaIndex, MLflow, Semantic Search, Prompt Engineering.
Job Title: Senior Data Engineer – GenAI / RAG / LangChain / LangGraph
Location: Irvine
Experience: 10–15+ Years
Employment Type: Contract
Job Description
We are seeking a highly experienced Senior Data Engineer with expertise in modern data engineering and Generative AI technologies. The ideal candidate should have hands-on experience designing scalable data platforms while building AI-powered applications using RAG (Retrieval-Augmented Generation), LangChain, LangGraph, LLMs, and Vector Databases.
The candidate should possess strong cloud data engineering expertise along with practical experience integrating Large Language Models into enterprise applications.
Mandatory Skills
Data Engineering
• 8+ years of experience in Data Engineering
• Strong expertise in Python and SQL
• Apache Spark / PySpark
• Databricks
• ETL/ELT Pipeline Development
• Delta Lake
• Data Warehousing & Data Lake Architecture
• Apache Airflow or equivalent orchestration tools
• CI/CD for Data Pipelines
• Git / Azure DevOps / GitHub
Cloud Platforms (Any One)
• Microsoft Azure (ADF, Synapse, ADLS)
• AWS (Glue, EMR, Lambda, S3, Athena)
• Google Cloud Platform (BigQuery, Dataflow)
Generative AI / LLM
• Hands-on experience building RAG (Retrieval-Augmented Generation) solutions
• LangChain
• LangGraph
• OpenAI / Azure OpenAI / Anthropic Claude / Gemini APIs
• Prompt Engineering
• AI Agents / Multi-Agent Workflows
• LLM Orchestration
• Function Calling / Tool Calling
• LLM Evaluation and Optimization
Vector Databases
Experience with one or more:
• Pinecone
• ChromaDB
• FAISS
• Weaviate
• Milvus
• Azure AI Search
AI/ML
• Machine Learning fundamentals
• Embedding Models
• Semantic Search
• Document Processing
• NLP
• Model Deployment (preferred)
Additional Skills
• REST APIs / FastAPI
• Docker
• Kubernetes (Preferred)
• MLflow
• Kafka (Preferred)
Responsibilities
• Design and develop scalable enterprise data pipelines.
• Build Retrieval-Augmented Generation (RAG) applications.
• Develop AI Agents using LangChain and LangGraph.
• Integrate enterprise data sources with LLMs.
• Build semantic search solutions using vector databases.
• Optimize prompt engineering and LLM performance.
• Work with structured and unstructured data sources.
• Collaborate with Data Scientists, ML Engineers, and Business stakeholders.
• Ensure data quality, governance, scalability, and security.
Preferred Experience
• Financial Services / Asset Management
• Banking
• Healthcare
• Insurance
• Retail
• Manufacturing
Nice to Have
• Microsoft Fabric
• Snowflake
• DBT
• MLOps
• Hugging Face
• LlamaIndex
• CrewAI / AutoGen
• MCP (Model Context Protocol)
• Knowledge Graphs
• GraphRAG
Recruiter Screening Checklist
Candidates must have:
• ✔️ 8+ years of Data Engineering experience
• ✔️ Strong Python & SQL
• ✔️ Databricks / Spark
• ✔️ Azure or AWS
• ✔️ RAG implementation experience
• ✔️ LangChain
• ✔️ LangGraph
• ✔️ OpenAI / Azure OpenAI
• ✔️ Vector Database experience
• ✔️ AI Agent development
• ✔️ Production deployment of LLM applications
• ✔️ Strong communication skills
Search Keywords for Recruiters
Senior Data Engineer, GenAI Engineer, AI Engineer, LLM Engineer, RAG Engineer, LangChain, LangGraph, OpenAI, Azure OpenAI, Databricks, PySpark, Python, SQL, AI Agents, Vector Database, Pinecone, ChromaDB, FAISS, Weaviate, Azure AI Search, LlamaIndex, MLflow, Semantic Search, Prompt Engineering.






