

Wall Street Consulting Services LLC
GEN AI
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Lead Engineer with 11+ years in AI/ML, including 3+ years in Generative AI. Contract length is "unknown," with a pay rate of "$/hour." Requires expertise in Python, LLMs, RAG workflows, and cloud services (Azure, AWS, GCP).
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
May 20, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Unknown
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Plano, TX
-
🧠 - Skills detailed
#Langchain #Libraries #SageMaker #Microservices #Graph Databases #OpenSearch #GitHub #Scala #Flask #Cloud #Computer Science #Python #Model Evaluation #Leadership #ML (Machine Learning) #Programming #AWS (Amazon Web Services) #Security #Knowledge Graph #Logging #Databases #Monitoring #Observability #Data Science #AI (Artificial Intelligence) #FastAPI #Azure #Automation #"ETL (Extract #Transform #Load)" #GCP (Google Cloud Platform)
Role description
Experience Required
11+ years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications.
Role Overview
The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms.
Key Responsibilities
Technical Leadership & Development
• Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families).
• Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization.
• Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks.
• Design agentic systems with robust tool calling, memory management, and reasoning patterns.
• Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems.
• Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems.
• Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot.
• Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs.
• Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs.
• Implement agent-based memory management using Mem0, LangMem or similar libraries.
• Fine-tune and evaluate LLMs for specific domains and business use cases.
• Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and GCP (Vertex AI, Generative AI Studio).
• Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring.
• Ensure scalability, reliability, security, and cost-efficiency of production AI applications.
• Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns.
• Translate business requirements into robust technical designs, architectures, and implementation roadmaps.
• Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation).
Required Skills & Experience
Core Technical
• Programming: Expert-level Python with production-quality code, testing, and performance tuning.
• GenAI Frameworks: Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries.
• LLM Integration: Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs.
• RAG & Search: Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting).
• Vector Databases: Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS.
Cloud & AI Services
• Azure: Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search.
• AWS: Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc.
• GCP: Vertex AI (models, endpoints), Agentspace, Agent Builder.
Preferred Qualifications
• Master's degree in Computer Science, AI/ML, Data Science, or related field.
• Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems.
• Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry.
• Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.
Experience Required
11+ years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications.
Role Overview
The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms.
Key Responsibilities
Technical Leadership & Development
• Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families).
• Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization.
• Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks.
• Design agentic systems with robust tool calling, memory management, and reasoning patterns.
• Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems.
• Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems.
• Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot.
• Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs.
• Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs.
• Implement agent-based memory management using Mem0, LangMem or similar libraries.
• Fine-tune and evaluate LLMs for specific domains and business use cases.
• Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and GCP (Vertex AI, Generative AI Studio).
• Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring.
• Ensure scalability, reliability, security, and cost-efficiency of production AI applications.
• Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns.
• Translate business requirements into robust technical designs, architectures, and implementation roadmaps.
• Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation).
Required Skills & Experience
Core Technical
• Programming: Expert-level Python with production-quality code, testing, and performance tuning.
• GenAI Frameworks: Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries.
• LLM Integration: Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs.
• RAG & Search: Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting).
• Vector Databases: Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS.
Cloud & AI Services
• Azure: Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search.
• AWS: Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc.
• GCP: Vertex AI (models, endpoints), Agentspace, Agent Builder.
Preferred Qualifications
• Master's degree in Computer Science, AI/ML, Data Science, or related field.
• Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems.
• Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry.
• Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.






