

LLM - Sr. SW Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Software Engineer focused on evaluating and validating Large Language Models. It is a short-term, 1-month contract with flexible hours, requiring 5+ years of software engineering experience and expertise in Fullstack Engineering. Remote work available.
π - Country
United Kingdom
π± - Currency
Β£ GBP
-
π° - Day rate
-
ποΈ - Date discovered
September 8, 2025
π - Project duration
1 to 3 months
-
ποΈ - Location type
Remote
-
π - Contract type
Fixed Term
-
π - Security clearance
Unknown
-
π - Location detailed
England, United Kingdom
-
π§ - Skills detailed
#React #TypeScript #C++ #Datasets #Hugging Face #Automation #Debugging #Python #Angular #JavaScript #Code Reviews #jQuery #Databricks #Java #AI (Artificial Intelligence)
Role description
Job Title:
LLM β Senior Software Engineer (Evaluation & Repository Validation)
About the Role
Turing is seeking 300 Senior Software Engineers for a short-term project to evaluate and validate Large Language Models (LLMs) on real-world software engineering tasks. Youβll work at the intersection of AI, software engineering, and open-source ecosystems, building high-quality evaluation datasets and testing model performance on complex codebases.
Engagement Details
β’ Type: Short-term Contract (1 month, possible extension)
β’ Engagement: 10β40 hrs/week (flexible, partial PST overlap required)
β’ Start Date: Immediate
β’ Rate Range: As per market
β’ Location: US, UK, Canada, France, Germany, Switzerland, Denmark, Finland, Netherlands, Sweden, Iceland, Italy, Austria, Ireland, Norway
Key Responsibilities
β’ Review and compare model-generated code outputs, providing structured evaluations.
β’ Assess code diffs for correctness, efficiency, and style.
β’ Identify edge cases and ambiguities in model behavior.
β’ Build and validate agents for coding copilots, home automation, and creative design assistants.
β’ Collaborate with the team to improve LLM performance on real-world coding tasks.
Required Skills & Experience
β’ 5+ years of software engineering experience.
β’ 2+ years at top-tier product/research companies (e.g., Google, Meta, Microsoft, Stripe, Amazon, Apple, Netflix, Shopify, Nvidia, Databricks, Hugging Face, etc.).
β’ Strong expertise in Fullstack Engineering:
β’ Backend: Java, Rust, Go, Node, Python, C++
β’ Frontend: TypeScript, JavaScript, React, Vue, Angular, jQuery
β’ Deep understanding of software architecture, debugging, and code reviews.
β’ Proven ability to evaluate correctness, maintainability, and efficiency of code.
β’ Excellent written/oral communication for structured evaluation rationales.
Vetting Process
β’ ICF assessment (mandatory)
β’ Technical interview
Why Join Us?
At Turing, youβll contribute to the next generation of AI systems, empowering LLMs to reason about and interact with real-world software repositories. This is a unique opportunity to shape frontier AI while working alongside world-class engineers and researchers.
Job Title:
LLM β Senior Software Engineer (Evaluation & Repository Validation)
About the Role
Turing is seeking 300 Senior Software Engineers for a short-term project to evaluate and validate Large Language Models (LLMs) on real-world software engineering tasks. Youβll work at the intersection of AI, software engineering, and open-source ecosystems, building high-quality evaluation datasets and testing model performance on complex codebases.
Engagement Details
β’ Type: Short-term Contract (1 month, possible extension)
β’ Engagement: 10β40 hrs/week (flexible, partial PST overlap required)
β’ Start Date: Immediate
β’ Rate Range: As per market
β’ Location: US, UK, Canada, France, Germany, Switzerland, Denmark, Finland, Netherlands, Sweden, Iceland, Italy, Austria, Ireland, Norway
Key Responsibilities
β’ Review and compare model-generated code outputs, providing structured evaluations.
β’ Assess code diffs for correctness, efficiency, and style.
β’ Identify edge cases and ambiguities in model behavior.
β’ Build and validate agents for coding copilots, home automation, and creative design assistants.
β’ Collaborate with the team to improve LLM performance on real-world coding tasks.
Required Skills & Experience
β’ 5+ years of software engineering experience.
β’ 2+ years at top-tier product/research companies (e.g., Google, Meta, Microsoft, Stripe, Amazon, Apple, Netflix, Shopify, Nvidia, Databricks, Hugging Face, etc.).
β’ Strong expertise in Fullstack Engineering:
β’ Backend: Java, Rust, Go, Node, Python, C++
β’ Frontend: TypeScript, JavaScript, React, Vue, Angular, jQuery
β’ Deep understanding of software architecture, debugging, and code reviews.
β’ Proven ability to evaluate correctness, maintainability, and efficiency of code.
β’ Excellent written/oral communication for structured evaluation rationales.
Vetting Process
β’ ICF assessment (mandatory)
β’ Technical interview
Why Join Us?
At Turing, youβll contribute to the next generation of AI systems, empowering LLMs to reason about and interact with real-world software repositories. This is a unique opportunity to shape frontier AI while working alongside world-class engineers and researchers.