MethodHub

Generative AI Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Generative AI Engineer on a 12-month contract, hybrid from various US locations. Key skills include Python, prompt engineering, LangChain proficiency, and experience with cloud LLM providers. Strong data science and iterative prompt development experience required.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
October 14, 2025
πŸ•’ - Duration
More than 6 months
-
🏝️ - Location
Hybrid
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Mountain View, CA
-
🧠 - Skills detailed
#Langchain #Monitoring #Azure #Cloud #AWS (Amazon Web Services) #Python #AI (Artificial Intelligence) #Security #Data Science #Observability #Airflow
Role description
GenAI Engineer ( Strong Data Science ) Location: Hybrid from San Francisco, CA / Santa Clara, CA / Mountain View CA / Southfield, MI / St. Louis, MO / Princeton, NJ / Austin, TX / Seattle, WA Duration: 12 months contract with possible extension β€œGiven the nature of some of the most urgent work, it is iterative prompt development. It is critical the folks know how to improve a prompt to develop the business outcomes we are looking for. If they can’t easily articulate a business problem, it is hard for them to improve a prompt” Customer Need The project is focused on building production-grade GenAI solutions with emphasis on: β€’ RAG pipelines leveraging LLMs β€’ Prompt engineering (system/tool prompts, function calling, versioning with evals) β€’ Evaluation & observability (ground truth setup, confusion metrics, LLM-as-judge with human review, cost & latency monitoring) β€’ Retrieval strategies & prompt patterns (context management, hallucination mitigation) β€’ LangChain/LlamaIndex (or equivalent) proficiency β€’ Cloud LLM providers (Azure OpenAI, AWS Bedrock, Vertex AI) β€’ Workflow orchestration (Airflow, Dagster) β€’ Security & privacy (PII handling, RBAC) β€’ Python expertise with strong practical experience Success criteria: β€’ A production-ready prompt + RAG pipeline system with measurable quality lift (accuracy/confusion metrics) and reduced hallucinations, backed by a repeatable evaluation harness.