

MethodHub
Generative AI Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Generative AI Engineer on a 12-month contract, hybrid from various US locations. Key skills include Python, prompt engineering, LangChain proficiency, and experience with cloud LLM providers. Strong data science and iterative prompt development experience required.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
October 14, 2025
π - Duration
More than 6 months
-
ποΈ - Location
Hybrid
-
π - Contract
Unknown
-
π - Security
Unknown
-
π - Location detailed
Mountain View, CA
-
π§ - Skills detailed
#Langchain #Monitoring #Azure #Cloud #AWS (Amazon Web Services) #Python #AI (Artificial Intelligence) #Security #Data Science #Observability #Airflow
Role description
GenAI Engineer ( Strong Data Science )
Location: Hybrid from San Francisco, CA / Santa Clara, CA / Mountain View CA / Southfield, MI / St. Louis, MO / Princeton, NJ / Austin, TX / Seattle, WA
Duration: 12 months contract with possible extension
βGiven the nature of some of the most urgent work, it is iterative prompt development. It is critical the folks know how to improve a prompt to develop the business outcomes we are looking for. If they canβt easily articulate a business problem, it is hard for them to improve a promptβ
Customer Need
The project is focused on building production-grade GenAI solutions with emphasis on:
β’ RAG pipelines leveraging LLMs
β’ Prompt engineering (system/tool prompts, function calling, versioning with evals)
β’ Evaluation & observability (ground truth setup, confusion metrics, LLM-as-judge with human review, cost & latency monitoring)
β’ Retrieval strategies & prompt patterns (context management, hallucination mitigation)
β’ LangChain/LlamaIndex (or equivalent) proficiency
β’ Cloud LLM providers (Azure OpenAI, AWS Bedrock, Vertex AI)
β’ Workflow orchestration (Airflow, Dagster)
β’ Security & privacy (PII handling, RBAC)
β’ Python expertise with strong practical experience
Success criteria:
β’ A production-ready prompt + RAG pipeline system with measurable quality lift (accuracy/confusion metrics) and reduced hallucinations, backed by a repeatable evaluation harness.
GenAI Engineer ( Strong Data Science )
Location: Hybrid from San Francisco, CA / Santa Clara, CA / Mountain View CA / Southfield, MI / St. Louis, MO / Princeton, NJ / Austin, TX / Seattle, WA
Duration: 12 months contract with possible extension
βGiven the nature of some of the most urgent work, it is iterative prompt development. It is critical the folks know how to improve a prompt to develop the business outcomes we are looking for. If they canβt easily articulate a business problem, it is hard for them to improve a promptβ
Customer Need
The project is focused on building production-grade GenAI solutions with emphasis on:
β’ RAG pipelines leveraging LLMs
β’ Prompt engineering (system/tool prompts, function calling, versioning with evals)
β’ Evaluation & observability (ground truth setup, confusion metrics, LLM-as-judge with human review, cost & latency monitoring)
β’ Retrieval strategies & prompt patterns (context management, hallucination mitigation)
β’ LangChain/LlamaIndex (or equivalent) proficiency
β’ Cloud LLM providers (Azure OpenAI, AWS Bedrock, Vertex AI)
β’ Workflow orchestration (Airflow, Dagster)
β’ Security & privacy (PII handling, RBAC)
β’ Python expertise with strong practical experience
Success criteria:
β’ A production-ready prompt + RAG pipeline system with measurable quality lift (accuracy/confusion metrics) and reduced hallucinations, backed by a repeatable evaluation harness.