Infojini Inc

Senior GenAI Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior GenAI Engineer in Philadelphia, PA (Hybrid) for 6+ months at a competitive pay rate. Key skills include open-source LLM deployment, Python proficiency, vector databases, RAG implementation, and data security understanding.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
April 9, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Hybrid
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Philadelphia, PA
-
🧠 - Skills detailed
#C++ #Docker #Consulting #"ETL (Extract #Transform #Load)" #Databases #Langchain #Metadata #Python #Security #Documentation #Logging #Kubernetes #Hugging Face #Data Privacy #Transformers #Deployment
Role description
Below is the job description for the position: Job Title: Senior GenAI Engineer Location: Philadelphia, PA (Hybrid – 3 Days Onsite, 2 Days Remote) Duration: 6+ Months (Possible Extension) Interview Process: 1st Round – Virtual | 2nd Round – In-person Job Description We are seeking an experienced Senior GenAI Engineer to support on-premise LLM and vector database implementation. The ideal candidate will have strong hands-on experience with open-source LLMs, retrieval-augmented generation (RAG), and enterprise-grade deployments. Core Experience • Hands-on experience deploying open-source LLMs such as Meta Llama 3, Mistral, or Mixtral in on-premise or private environments • Strong proficiency in Python for LLM inference, prompt engineering, and system integration • Experience with CPU-based inference, model quantization, and performance tuning Vector Databases & RAG • Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector • Proven experience implementing Retrieval-Augmented Generation (RAG) pipelines • Experience generating and managing embeddings, including metadata filtering Security & Governance • Strong understanding of data privacy, air-gapped deployments, and enterprise security requirements • Experience implementing access controls and audit logging Nice to Have • Experience with LangChain or LlamaIndex • Exposure to Rust, Go, or C++ for high-performance services • Familiarity with Docker and Kubernetes for on-premise deployments • Knowledge of inference frameworks such as vLLM, llama.cpp, or Hugging Face Transformers • Prior experience working in regulated or enterprise environments Key Deliverables • Reference architecture and deployment guidance • Working prototype (LLM + Vector Database + RAG pipeline) • Documentation and knowledge transfer to internal teams Thanks & Regards Infojini Consulting Website: https://www.infojiniconsulting.com Address: 10015 Old Columbia Road, Suite B 215, Columbia, MD 21046