LLM - Sr. SW Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Software Engineer focused on evaluating and validating Large Language Models. It is a short-term, 1-month contract with flexible hours, requiring 5+ years of software engineering experience and expertise in Fullstack Engineering. Remote work available.
🌎 - Country
United Kingdom
💱 - Currency
£ GBP
-
💰 - Day rate
-
πŸ—“οΈ - Date discovered
September 8, 2025
🕒 - Project duration
1 to 3 months
-
🏝️ - Location type
Remote
-
📄 - Contract type
Fixed Term
-
🔒 - Security clearance
Unknown
-
πŸ“ - Location detailed
England, United Kingdom
-
🧠 - Skills detailed
#React #TypeScript #C++ #Datasets #Hugging Face #Automation #Debugging #Python #Angular #JavaScript #Code Reviews #jQuery #Databricks #Java #AI (Artificial Intelligence)
Role description
Job Title: LLM – Senior Software Engineer (Evaluation & Repository Validation)

About the Role
Turing is seeking 300 Senior Software Engineers for a short-term project to evaluate and validate Large Language Models (LLMs) on real-world software engineering tasks. You’ll work at the intersection of AI, software engineering, and open-source ecosystems, building high-quality evaluation datasets and testing model performance on complex codebases.

Engagement Details
• Type: Short-term Contract (1 month, possible extension)
• Engagement: 10–40 hrs/week (flexible, partial PST overlap required)
• Start Date: Immediate
• Rate Range: As per market
• Location: US, UK, Canada, France, Germany, Switzerland, Denmark, Finland, Netherlands, Sweden, Iceland, Italy, Austria, Ireland, Norway

Key Responsibilities
• Review and compare model-generated code outputs, providing structured evaluations.
• Assess code diffs for correctness, efficiency, and style.
• Identify edge cases and ambiguities in model behavior.
• Build and validate agents for coding copilots, home automation, and creative design assistants.
• Collaborate with the team to improve LLM performance on real-world coding tasks.

Required Skills & Experience
• 5+ years of software engineering experience.
• 2+ years at top-tier product/research companies (e.g., Google, Meta, Microsoft, Stripe, Amazon, Apple, Netflix, Shopify, Nvidia, Databricks, Hugging Face, etc.).
• Strong expertise in Fullstack Engineering:
  • Backend: Java, Rust, Go, Node, Python, C++
  • Frontend: TypeScript, JavaScript, React, Vue, Angular, jQuery
• Deep understanding of software architecture, debugging, and code reviews.
• Proven ability to evaluate correctness, maintainability, and efficiency of code.
• Excellent written/oral communication for structured evaluation rationales.

Vetting Process
• ICF assessment (mandatory)
• Technical interview

Why Join Us?
At Turing, you’ll contribute to the next generation of AI systems, empowering LLMs to reason about and interact with real-world software repositories. This is a unique opportunity to shape frontier AI while working alongside world-class engineers and researchers.