Senior GenAI/LLM Engineer:

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Senior GenAI/LLM Engineer on a contract basis, remote but requires Bay Area residency. Key skills include deep learning, NLP, and LLM optimization. Experience with PyTorch, TensorFlow, and advanced fine-tuning techniques is essential. Competitive pay rate offered.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

🗓️ - Date discovered

May 22, 2025

🕒 - Project duration

Unknown

🏝️ - Location type

Remote

📄 - Contract type

W2 Contractor

🔒 - Security clearance

Unknown

📍 - Location detailed

San Francisco Bay Area

🧠 - Skills detailed

#Monitoring #Hugging Face #PyTorch #Documentation #Mathematics #TensorFlow #Transformers #Data Science #Deep Learning #Libraries #Databases #"ETL (Extract #Transform #Load)" #Compliance #Data Pipeline #Deployment #NLP (Natural Language Processing)

Role description

I’m recruiting for a client working at the bleeding edge of LLM infrastructure and GenAI deployment. They are based in the Bay Area California, however the role can be remote, however must live in The Bay Area for adhoc face to face meetings. It’s a Contract role and can be W2, C2C or 1099 and they are willing to pay a competitive market rate: GenAI/LLM Engineer: Implementing GenAI requires specialized expertise in large language models. Traditional data scientists often haven't had the opportunity to dive deep into the practical intricacies of LLMs-particularly advanced fine-tuning techniques, model compression strategies, memory optimization approaches, and specialized training workflows. This role requires a hands-on deep learning practitioner comfortable with modern frameworks and libraries specific to LLM development. Key Responsibilities: Implement and optimize advanced fine-tuning approaches (LoRA, PEFT, QLoRA) to adapt foundation models to Client's domain Develop systematic prompt engineering methodologies specific to utility operations, regulatory compliance, and technical documentation Create reusable prompt templates and libraries to standardize interactions across multiple LLM applications and use cases Implement prompt testing frameworks to quantitatively evaluate and iteratively improve prompt effectiveness Establish prompt versioning systems and governance to maintain consistency and quality across applications Apply model customization techniques like knowledge distillation, quantization, and pruning to reduce memory footprint and inference costs Tackle memory constraints using techniques such as sharded data parallelism, GPU offloading, or CPU+GPU hybrid approaches Build robust retrieval-augmented generation (RAG) pipelines with vector databases, embedding pipelines, and optimized chunking strategies Design advanced prompting strategies including chain-of-thought reasoning, conversation orchestration, and agent-based approaches Collaborate with the MLOps engineer to ensure models are efficiently deployed, monitored, and retrained as needed Expected Skillset: Deep Learning & NLP: Proficiency with PyTorch/TensorFlow, Hugging Face Transformers, DSPy, and advanced LLM training techniques GPU/Hardware Knowledge: Experience with multi-GPU training, memory optimization, and parallelization strategies LLMOps: Familiarity with workflows for maintaining LLM-based applications in production and monitoring model performance Technical Adaptability: Ability to interpret research papers and implement emerging techniques (without necessarily requiring PhD-level mathematics) Domain Adaptation: Skills in creating data pipelines for fine-tuning models with utility-specific content

Apply now Apply with DFH Sign up

← See all roles

Go to role

Senior GenAI/LLM Engineer:

Premium Members Land Roles Faster—Upgrade today.

DBA, Data Visualization Specialist

Encounter Data Analyst

Contractor - Oracle APEX Engineer / Developer (Remote)

Business Analyst

Premium Members Land Roles Faster—Upgrade today.

Book a

chat

with us

Company