

Infojini Inc
Senior GenAI Engineer
This role is for a Senior GenAI Engineer in Philadelphia, PA (Hybrid) for 6+ months at a competitive pay rate. Key skills include open-source LLM deployment, Python proficiency, vector databases, RAG implementation, and an understanding of data security.
🌎 Country: United States
💱 Currency: $ USD
💰 Day rate: Unknown
🗓️ Date: April 9, 2026
🕒 Duration: More than 6 months
🏝️ Location: Hybrid
📄 Contract: Unknown
🔒 Security: Unknown
📍 Location detailed: Philadelphia, PA
🧠 Skills detailed: #C++ #Docker #Consulting #ETL (Extract, Transform, Load) #Databases #LangChain #Metadata #Python #Security #Documentation #Logging #Kubernetes #Hugging Face #Data Privacy #Transformers #Deployment
Role description
Below is the job description for the position:
Job Title: Senior GenAI Engineer
Location: Philadelphia, PA (Hybrid – 3 Days Onsite, 2 Days Remote)
Duration: 6+ Months (Possible Extension)
Interview Process: 1st Round – Virtual | 2nd Round – In-person
Job Description
We are seeking an experienced Senior GenAI Engineer to support on-premise LLM and vector database implementation. The ideal candidate will have strong hands-on experience with open-source LLMs, retrieval-augmented generation (RAG), and enterprise-grade deployments.
Core Experience
• Hands-on experience deploying open-source LLMs such as Meta Llama 3, Mistral, or Mixtral in on-premise or private environments
• Strong proficiency in Python for LLM inference, prompt engineering, and system integration
• Experience with CPU-based inference, model quantization, and performance tuning
Vector Databases & RAG
• Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
• Proven experience implementing Retrieval-Augmented Generation (RAG) pipelines
• Experience generating and managing embeddings, including metadata filtering
Security & Governance
• Strong understanding of data privacy, air-gapped deployments, and enterprise security requirements
• Experience implementing access controls and audit logging
Nice to Have
• Experience with LangChain or LlamaIndex
• Exposure to Rust, Go, or C++ for high-performance services
• Familiarity with Docker and Kubernetes for on-premise deployments
• Knowledge of inference frameworks such as vLLM, llama.cpp, or Hugging Face Transformers
• Prior experience working in regulated or enterprise environments
Key Deliverables
• Reference architecture and deployment guidance
• Working prototype (LLM + Vector Database + RAG pipeline)
• Documentation and knowledge transfer to internal teams
Thanks & Regards
Infojini Consulting
Website: https://www.infojiniconsulting.com
Address: 10015 Old Columbia Road, Suite B 215, Columbia, MD 21046






