

Jobs via Dice
Data Scientist + ML Engineer (Gen AI)
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist + ML Engineer (Generative AI) on a 12-month contract in Cupertino, CA (Hybrid). Requires 2+ years of ML experience, proficiency in Python and PyTorch, and expertise in diffusion models and LLMs. Pay rate: $65-75/hour.
🌎 Country: United States
💱 Currency: $ USD
💰 Day rate: Unknown
🗓️ Date: October 28, 2025
🕒 Duration: More than 6 months
🏝️ Location: Hybrid
📄 Contract: Unknown
🔒 Security: Unknown
📍 Location detailed: Cupertino, CA
🧠 Skills detailed: #Data Pipeline #Debugging #Data Analysis #Data Engineering #Automation #Scala #Deep Learning #AI (Artificial Intelligence) #Distributed Computing #Python #PyTorch #ML (Machine Learning) #Computer Science #Libraries #Documentation #Data Science #Datasets
Role description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, OSI Engineering, Inc., is seeking the following. Apply via Dice today!
A globally leading technology company is looking for a highly skilled Data Scientist + ML Engineer (Generative AI) to join the team. In this role, you will be responsible for developing, fine-tuning, and applying advanced generative AI models including diffusion models, large language models (LLMs), and other state-of-the-art architectures. You will collaborate closely with cross-functional partners in research, data engineering, and operations to deliver high-quality machine learning solutions and scalable datasets.
This position requires a balance of technical depth and creative problem-solving. You should be comfortable working with large, complex datasets and possess a strong grasp of modern ML frameworks, distributed computing environments, and end-to-end data pipelines.
RESPONSIBILITIES:
• Design and Implement LLM-Driven Synthetic Data Pipelines: Design and build workflows using LLMs and Gen AI techniques to create high-volume, high-quality synthetic data for model training and testing.
• Design, implement, and deploy machine learning models with a focus on generative AI (diffusion models, LLMs, and related architectures)
• Fine-tune, evaluate, and optimize large language models for specific downstream tasks and data needs
• Develop and maintain scalable data pipelines supporting training, evaluation, and inference workflows
• Conduct exploratory data analysis to surface insights and identify opportunities for model or data improvement
• Partner cross-functionally with researchers, engineers, and data program managers to define requirements and deliver high-impact ML solutions
• Build and enhance internal tools, libraries, and automation workflows to accelerate experimentation and iteration
QUALIFICATIONS:
• Bachelor's degree in Computer Science or related field from an accredited U.S. institution
• 2+ years of experience in Machine Learning or Software Engineering
• Expert-level proficiency in Python and familiarity with deep learning frameworks such as PyTorch
• Strong foundation in machine learning algorithms, data preprocessing, and evaluation techniques
• Demonstrated experience working with diffusion models, stable diffusion, or large language models (LLMs)
• Excellent analytical, problem-solving, and debugging skills
• Strong communication and documentation skills with the ability to explain complex concepts clearly
• Ability to work independently in a fast-paced, iterative development environment
Type: Contract
Duration: 12+ months
Work Location: Cupertino, CA (Hybrid)
Pay range: $65-75/hour (DOE)





