

PranaTree
LLM Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an LLM Engineer specializing in Generative AI, offering a remote contract for over 6 months at a competitive pay rate. Requires 12+ months of GenAI experience, strong Python skills, and proficiency in AWS or Azure.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
May 14, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Remote
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Chicago, IL
-
🧠 - Skills detailed
#AWS (Amazon Web Services) #Scala #Monitoring #API (Application Programming Interface) #Programming #Databases #ML (Machine Learning) #Data Science #AI (Artificial Intelligence) #Model Deployment #Strategy #Cloud #Debugging #Azure #FastAPI #Deployment #Python
Role description
Job Overview
Pranatree LLC is looking for a skilled LLM Engineer specializing in Generative AI to join our team. This role is suitable for candidates with relevant GenAI development experience and a strong foundation in backend engineering. The position is remote and involves working on diverse projects, including both client-facing and internal initiatives focused on developing GenAI use cases such as AI assistants or content generation tools.
Key Responsibilities
• RAG Pipeline Development: Design and develop Retrieval-Augmented Generation (RAG) pipelines for efficient and context-aware information retrieval.
• Prompt Tuning and Strategy: Implement prompt engineering and fine-tuning techniques to enhance the quality and relevance of AI outputs.
• API Development: Create, maintain, and optimize backend APIs using FastAPI or similar frameworks to support AI-driven applications.
• Model Deployment: Deploy and manage large language models (LLMs) such as those from OpenAI or equivalent providers, ensuring production readiness.
• Cloud Integration: Utilize cloud services (AWS or Azure) for model deployment, scaling, and performance optimization.
• Validation and Testing: Validate and test GenAI outputs to ensure high-quality performance and reliability.
• Vector Databases and Caching: Work with vector databases and caching layers to improve data retrieval and reduce latency.
• Collaboration: Partner with data scientists, engineers, and stakeholders to deliver robust solutions for both internal and client-facing projects.
• Performance Monitoring: Profile, debug, and optimize large-scale machine learning systems for scalability and system performance.
Qualifications and Requirements
• Experience:
• 12+ months of relevant GenAI development experience.
• Overall 2+ years of experience in machine learning engineering.
• Technical Skills:
• Programming: Strong proficiency in Python.
• Frameworks: Experience with FastAPI for backend API development.
• LLM Expertise: Hands-on experience with large language models, including OpenAI or equivalent.
• Cloud Platforms: Proficiency in cloud platforms such as AWS or Azure.
• Production Deployment: Experience in deploying LLM-based products into production environments.
• Project Experience:
• Involvement in projects that include GenAI use cases like developing AI assistants or content generation tools.
• Familiarity with prompt tuning, RAG pipeline creation, and validation of generative outputs.
• Working with vector databases and caching for optimizing data retrieval.
• Other Skills:
• Strong understanding of machine learning system performance, including profiling, debugging, and scalability.
• Good communication skills for collaborative work and client interactions when needed.
Performance Expectations
• Design and deploy efficient RAG pipelines and GenAI solutions that align with project requirements.
• Optimize backend systems and APIs for seamless integration with GenAI models.
• Ensure scalable, reliable, and high-performance solutions.
• Collaborate effectively with cross-functional teams and stakeholders.
• Participate in client interviews for external projects if required.
Job Overview
Pranatree LLC is looking for a skilled LLM Engineer specializing in Generative AI to join our team. This role is suitable for candidates with relevant GenAI development experience and a strong foundation in backend engineering. The position is remote and involves working on diverse projects, including both client-facing and internal initiatives focused on developing GenAI use cases such as AI assistants or content generation tools.
Key Responsibilities
• RAG Pipeline Development: Design and develop Retrieval-Augmented Generation (RAG) pipelines for efficient and context-aware information retrieval.
• Prompt Tuning and Strategy: Implement prompt engineering and fine-tuning techniques to enhance the quality and relevance of AI outputs.
• API Development: Create, maintain, and optimize backend APIs using FastAPI or similar frameworks to support AI-driven applications.
• Model Deployment: Deploy and manage large language models (LLMs) such as those from OpenAI or equivalent providers, ensuring production readiness.
• Cloud Integration: Utilize cloud services (AWS or Azure) for model deployment, scaling, and performance optimization.
• Validation and Testing: Validate and test GenAI outputs to ensure high-quality performance and reliability.
• Vector Databases and Caching: Work with vector databases and caching layers to improve data retrieval and reduce latency.
• Collaboration: Partner with data scientists, engineers, and stakeholders to deliver robust solutions for both internal and client-facing projects.
• Performance Monitoring: Profile, debug, and optimize large-scale machine learning systems for scalability and system performance.
Qualifications and Requirements
• Experience:
• 12+ months of relevant GenAI development experience.
• Overall 2+ years of experience in machine learning engineering.
• Technical Skills:
• Programming: Strong proficiency in Python.
• Frameworks: Experience with FastAPI for backend API development.
• LLM Expertise: Hands-on experience with large language models, including OpenAI or equivalent.
• Cloud Platforms: Proficiency in cloud platforms such as AWS or Azure.
• Production Deployment: Experience in deploying LLM-based products into production environments.
• Project Experience:
• Involvement in projects that include GenAI use cases like developing AI assistants or content generation tools.
• Familiarity with prompt tuning, RAG pipeline creation, and validation of generative outputs.
• Working with vector databases and caching for optimizing data retrieval.
• Other Skills:
• Strong understanding of machine learning system performance, including profiling, debugging, and scalability.
• Good communication skills for collaborative work and client interactions when needed.
Performance Expectations
• Design and deploy efficient RAG pipelines and GenAI solutions that align with project requirements.
• Optimize backend systems and APIs for seamless integration with GenAI models.
• Ensure scalable, reliable, and high-performance solutions.
• Collaborate effectively with cross-functional teams and stakeholders.
• Participate in client interviews for external projects if required.





