TALENDICA

LLM Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an LLM Engineer, offering a remote contract for 2-3 months at a competitive pay rate. Key skills include Python, deep learning (PyTorch), and extensive experience with NLP/LLM systems. A Master's/PhD in a relevant field is required.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
December 3, 2025
πŸ•’ - Duration
3 to 6 months
-
🏝️ - Location
Remote
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#NLP (Natural Language Processing) #BERT #Computer Science #Scala #AI (Artificial Intelligence) #Data Management #"ETL (Extract #Transform #Load)" #Python #Batch #Classification #PyTorch #Deep Learning #Cloud #Deployment
Role description
LLM Engineer Remote 2-3 Months Key Responsibilities 1. LLM Development & Optimization Β· Fine-tune and deploy large language models (GPT, BERT, T5, etc.) and NLP-related tasks with vision language models (Florence-2, PaliGemma) for domain-specific tasks such as text classification, summarization, and entity extraction. Β· Advanced NLP techniques, ensuring accurate text recognition of plan annotations (e.g., pipe materials, dimensions). 1. Annotation Workflow Integration Β· Design custom techniques effective annotations to thereby achieve desired project goal. Β· Automate or streamline annotation tasks wherever possible (e.g., partial auto-labeling) to reduce manual effort and error rates. 1. Experimentation & Evaluation Β· Establish robust evaluation metrics (Perplexity, Precision, recall, F1-score, Levheinstein distance, character array, Bleu score ) for NLP components, including text extraction quality. Β· Set up an iterative experimentation framework to track model versioning, data changes, and performance gains over time. 1. Data Management & Accountability Β· Coordinate with the Project Manager and AI Engineer to ensure new data (annotated or collected) is properly versioned and accessible. Β· Implement best practices for dataset growth, including tracking who annotated or curated the data and how these changes affect model performance. 1. Deployment & Scalability Β· Integrate with existing infrastructure and pipelines, ensuring minimal disruption to ongoing model development. Β· Optimize model sizes and queries to reduce latency between input and output (quantization, batch optimization, etc.) Required Qualifications Β· Education & Experience Β· Master’s/Phd in Computer Science/Engineering, Computational Linguistics. Β· Intensive hands on experience on custom LLM. Technical Expertise Β· Proficiency in Python and deep learning frameworks (PyTorch preferred). Β· Proven track record of deploying NLP or LLM systems in production (Cloud or on-prem). Β· Solid understanding of tokenization, embedding techniques, and advanced fine-tuning strategies. Β· Experience in leveraging open, closed, or custom annotation tools (for example- labelStudio) to coordinate between multiple annotation formats and annotation teams