

Staff Machine Learning Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This is a 12-month Staff Machine Learning Engineer contract in London, paying £1000 – £1200 per day. The role requires 7+ years of experience with large-scale ML systems, strong Python/PyTorch skills, and expertise in GPU programming and distributed training frameworks.
🌎 - Country
United Kingdom
💱 - Currency
£ GBP
💰 - Day rate
1000
🗓️ - Date discovered
July 18, 2025
🕒 - Project duration
More than 6 months
🏝️ - Location type
Hybrid
📄 - Contract type
Outside IR35
🔒 - Security clearance
Unknown
📍 - Location detailed
London Area, United Kingdom
🧠 - Skills detailed
#Programming #Python #PyTorch #Generative Models #Transformers #AI (Artificial Intelligence) #ML (Machine Learning) #ETL (Extract, Transform, Load) #Kubernetes #Cloud #Scala
Role description
We’re teaming up with one of the leading names in AI, known for pushing the boundaries of what’s possible with large-scale generative models and next-gen cloud infrastructure. This is a rare opportunity to step into a Staff Machine Learning Engineer role and play a key part in shaping the platforms powering millions of users across the globe.
You'll be joining a team of exceptional researchers and engineers, all passionate about advancing the field and delivering world-class AI experiences.
Location: London, Oxford Street (hybrid; onsite in London once a week)
Rate: £1000 – £1200 per day, outside IR35
Start date: ASAP, 12-month contract
What you'll be doing
• Leading the development of scalable, reliable systems for training and fine-tuning transformer-based models
• Optimising inference pipelines for real-time applications, targeting low latency and high throughput
• Exploring and applying advanced fine-tuning methods like LoRA, prefix-tuning, and adapters
• Tuning performance across GPUs and systems using tools like DeepSpeed, Triton, and TensorRT, as well as custom kernels
• Working closely with research, platform, and product teams to deliver new features and enhance the developer experience
• Identifying and resolving bottlenecks through profiling, benchmarking, and performance tuning
• Helping to define best practices for building, testing, and maintaining production ML services and APIs
• Mentoring other engineers and helping to foster a culture of technical excellence and innovation
What our client is looking for
• 7+ years of experience building and deploying large-scale ML systems in production
• Strong Python and PyTorch skills, with a deep understanding of transformers, LLMs, and multimodal models
• Hands-on experience with distributed training frameworks like DeepSpeed or FSDP
• Solid background in GPU programming (CUDA, ROCm) and inference optimisation
• Practical experience with parameter-efficient fine-tuning techniques in real-world applications
• Familiarity with container orchestration tools (Kubernetes, Kubeflow) and cloud-native environments
• Knowledge of serving frameworks like Triton, vLLM, or similar
• Clean, maintainable coding style and a strong testing discipline
• Great communication skills and a collaborative mindset
Your recruitment consultant
Cameron Dalziel is a recruitment specialist who assembles teams in data, AI, design, and technology across Europe. He engages with top talent and is committed to providing a high-quality service that delivers results. Connect with Cam to discuss this role.