Wall Street Consulting Services LLC

GEN AI

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for an AI Lead Engineer with 11+ years in AI/ML, including 3+ years in Generative AI. Contract length is "unknown," with a pay rate of "$/hour." Requires expertise in Python, LLMs, RAG workflows, and cloud services (Azure, AWS, GCP).

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date

May 20, 2026

🕒 - Duration

Unknown

🏝️ - Location

Unknown

📄 - Contract

Unknown

🔒 - Security

Unknown

📍 - Location detailed

Plano, TX

🧠 - Skills detailed

#Langchain #Libraries #SageMaker #Microservices #Graph Databases #OpenSearch #GitHub #Scala #Flask #Cloud #Computer Science #Python #Model Evaluation #Leadership #ML (Machine Learning) #Programming #AWS (Amazon Web Services) #Security #Knowledge Graph #Logging #Databases #Monitoring #Observability #Data Science #AI (Artificial Intelligence) #FastAPI #Azure #Automation #"ETL (Extract #Transform #Load)" #GCP (Google Cloud Platform)

Role description

Experience Required 11+ years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications. Role Overview The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms. Key Responsibilities Technical Leadership & Development • Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families). • Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization. • Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks. • Design agentic systems with robust tool calling, memory management, and reasoning patterns. • Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems. • Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems. • Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot. • Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs. • Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs. • Implement agent-based memory management using Mem0, LangMem or similar libraries. • Fine-tune and evaluate LLMs for specific domains and business use cases. • Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and GCP (Vertex AI, Generative AI Studio). • Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring. • Ensure scalability, reliability, security, and cost-efficiency of production AI applications. • Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns. • Translate business requirements into robust technical designs, architectures, and implementation roadmaps. • Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation). Required Skills & Experience Core Technical • Programming: Expert-level Python with production-quality code, testing, and performance tuning. • GenAI Frameworks: Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries. • LLM Integration: Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs. • RAG & Search: Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting). • Vector Databases: Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS. Cloud & AI Services • Azure: Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search. • AWS: Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc. • GCP: Vertex AI (models, endpoints), Agentspace, Agent Builder. Preferred Qualifications • Master's degree in Computer Science, AI/ML, Data Science, or related field. • Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems. • Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry. • Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.

Apply now Apply with DFH

← See all roles