Gen AI Architect (Eval Framework)

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Gen AI Architect (Eval Framework) in Fremont, CA, with a contract length of "unknown" and a pay rate of "unknown." Requires 15 years of experience, expertise in Langfuse, Azure AI services, LLMOps, and proficiency in Python.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

🗓️ - Date discovered

September 24, 2025

🕒 - Project duration

Unknown

🏝️ - Location type

On-site

📄 - Contract type

Unknown

🔒 - Security clearance

Unknown

📍 - Location detailed

Fremont, CA

🧠 - Skills detailed

#Debugging #Observability #TypeScript #Scala #Azure #AI (Artificial Intelligence) #Python #Data Science #Cloud #Documentation

Role description

Hi, I hope you are doing well. We have an urgent below position .If you are interested, please share your updated resume with the rate expectation. Role: Gen AI Architect (Eval Framework) Location: Fremont, CA, USA Experience: 15 years Job description: Mandatory skills: Langfuse (including v3 features) Evaluation SDK. Azure AI services LLMOps, prompt engineering, and GenAI lifecycle management. Python Skills: Hands-on experience with Langfuse (including v3 features) and integrations. · Experience with other GenAI observability tools (e.g., TruLens, W&B, Helicone). · Knowledge of Retrieval-Augmented Generation (RAG), fine-tuning, and multi-agent orchestration. · Strong understanding of Azure AI services, especially the Evaluation SDK. · Deep expertise in LLMOps, prompt engineering, and GenAI lifecycle management. · Proficiency in Python, TypeScript, or similar languages used in GenAI frameworks. · Experience with cloud-native architectures (Azure preferred). · Familiarity with Tracing tools, observability platforms, and evaluation metrics. · Excellent communication and documentation skills. Key Responsibilities: · Set-up and deploy Langfuse v3 in production environment. · Architect and implement the upgrade of Langfuse v2 to v3 within the LamBots framework, ensuring backward compatibility and performance optimization · Design modular components for prompt management, tracing, metrics, evaluation, and playground features using Langfuse v3. · Leverage Langfuse’s full feature set: ○ Prompt Management – versioning, templating, and optimization ○ Tracing – end-to-end visibility into GenAI workflows ○ Metrics – performance, latency, and usage analytics ○ Evaluation – automated and manual scoring of model outputs ○ Playground – interactive testing and debugging of prompts · Integrate Azure AI Evaluation SDK into LamBots to enable scalable enterprise-grade evaluation pipelines/workflows, including: · Build reusable components and templates for evaluation across diverse GenAI use cases. · Collaborate with cross-functional teams to integrate evaluation capabilities into production pipelines/ systems. · Ensure scalability and reliability of evaluation tools in both offline and online environments. · Define and enforce evaluation standards and best practices for GenAI agents, RAG pipelines, and multi-agent orchestration. · Collaborate with product, engineering, and data science teams to align evaluation metrics with business KPIs. · Drive observability, debugging, and traceability features for GenAI workflows. Stay current with emerging GenAI evaluation tools, frameworks, and methodologies. -- Thanks & Regards, Anil Kumar Raas Infotek Corporation. 262 Chapman Road, Suite 105A, Newark, DE -19702 Direct No: 302-286-9932 Ext: 133 Email: anil.kumar@raasinfotek.com

Apply now Apply with DFH Sign up

← See all roles

Go to role

Gen AI Architect (Eval Framework)

Premium Members Land Roles Faster—Upgrade today.

Cloud AI/ML Engineer

Technical Architect

Python AI Developer

Python AI Developer

Premium Members Land Roles Faster—Upgrade today.

Book a

chat

with us

Company