

Reinforcement Learning Engineer (6-month Contract)
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Reinforcement Learning Engineer on a 6-month contract, offering $120,000 - $180,000 annually. Key skills include Python, reinforcement learning, and ML frameworks. A Master's degree and experience with real-world robotics arms are required.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
818.1818181818
-
ποΈ - Date discovered
September 7, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
On-site
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
San Francisco, CA
-
π§ - Skills detailed
#TensorFlow #PyTorch #Data Engineering #Deployment #Computer Science #Python #Reinforcement Learning #Programming #ML (Machine Learning)
Role description
About us
β’ Our mission is to build the worldβs first labor agency that deploys dexterous robots as its primary workforce.
β’ Youβll be joining a team of extremely hardcore and self-motivated engineers, scientists, and operators who focus on winning 24/7.
β’ You will develop and own entire systems from design to deployment, playing a foundational role in deploying 5000+ robots by 2031.
What you'll do
β’ Design and develop robot autonomy software stack and algorithms to enable capabilities including grasping and more dexterous behaviors in unstructured environments
β’ Setup a simulation environment to train/test various reinforcement learning policies
β’ Research and implement state-of-the-art reinforcement learning policies
β’ Optimize robot policies for distributed training at scale and real-time edge deployment
β’ Ship production quality, safety-critical software
Responsibilities
β’ Design overall architecture from data engineering, model architecture to shipping production ready automony software stack
β’ Train reinforcement learning policy to do manipulation tasks.
β’ Close the sim2real gap between policies trained in simulation and real.
β’ Work with autonomy, system teams to ship RL policies to the production fleet.
β’ 3+ years of Python programming experience.
β’ Strong abilities to identify root cause of the performance bottlenecks in RL training.
β’ Experience with tuning reward models, hyperparameters and exploration strategies to solve complex tasks in deep RL.
Required Qualifications
β’ Master's degree in Computer Science, Machine Learning, Robotics, or equivalent technical discipline
β’ Have trained real-world robotics arms with RL
β’ Deep expertise in machine learning fundamentals, reinforcement learning, and associated frameworks (PyTorch, TensorFlow, Ray, etc.)
β’ Proven track record developing and deploying ML systems from research through production implementation
β’ Hands-on experience with model lifecycle management including training, deployment, and maintenance in production settings
Nice to have
β’ Published RL research in top ML conferences (NeurIPS, CoRL, RSS, ICML, etc.)
β’ Robotics and control theory knowledge
Benefits
β’ Medical, dental & vision plans
β’ Daily meals stipend
Expected Compensation
β’ $120,000 - $180,000 per year + benefits
β’ This is a 6-month contract role with the opportunity for conversion to a full-time position upon successful completion of the contract and business needs.
Hiring process
β’ phone screen + 3-4 technical interviews + onsite
About us
β’ Our mission is to build the worldβs first labor agency that deploys dexterous robots as its primary workforce.
β’ Youβll be joining a team of extremely hardcore and self-motivated engineers, scientists, and operators who focus on winning 24/7.
β’ You will develop and own entire systems from design to deployment, playing a foundational role in deploying 5000+ robots by 2031.
What you'll do
β’ Design and develop robot autonomy software stack and algorithms to enable capabilities including grasping and more dexterous behaviors in unstructured environments
β’ Setup a simulation environment to train/test various reinforcement learning policies
β’ Research and implement state-of-the-art reinforcement learning policies
β’ Optimize robot policies for distributed training at scale and real-time edge deployment
β’ Ship production quality, safety-critical software
Responsibilities
β’ Design overall architecture from data engineering, model architecture to shipping production ready automony software stack
β’ Train reinforcement learning policy to do manipulation tasks.
β’ Close the sim2real gap between policies trained in simulation and real.
β’ Work with autonomy, system teams to ship RL policies to the production fleet.
β’ 3+ years of Python programming experience.
β’ Strong abilities to identify root cause of the performance bottlenecks in RL training.
β’ Experience with tuning reward models, hyperparameters and exploration strategies to solve complex tasks in deep RL.
Required Qualifications
β’ Master's degree in Computer Science, Machine Learning, Robotics, or equivalent technical discipline
β’ Have trained real-world robotics arms with RL
β’ Deep expertise in machine learning fundamentals, reinforcement learning, and associated frameworks (PyTorch, TensorFlow, Ray, etc.)
β’ Proven track record developing and deploying ML systems from research through production implementation
β’ Hands-on experience with model lifecycle management including training, deployment, and maintenance in production settings
Nice to have
β’ Published RL research in top ML conferences (NeurIPS, CoRL, RSS, ICML, etc.)
β’ Robotics and control theory knowledge
Benefits
β’ Medical, dental & vision plans
β’ Daily meals stipend
Expected Compensation
β’ $120,000 - $180,000 per year + benefits
β’ This is a 6-month contract role with the opportunity for conversion to a full-time position upon successful completion of the contract and business needs.
Hiring process
β’ phone screen + 3-4 technical interviews + onsite