

Alignerr
Data Scientist – AI Training & Evaluation
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist – AI Training & Evaluation on a flexible, fully remote hourly contract (10–40 hours/week). Requires a degree in a quantitative field; proficiency in Python, R, SQL, or similar data analysis tools; and experience in model evaluation and data analysis.
🌎 Country: United States
💱 Currency: $ USD
💰 Day rate: 320
🗓️ Date: April 11, 2026
🕒 Duration: Unknown
🏝️ Location: Remote
📄 Contract: Unknown
🔒 Security: Unknown
📍 Location detailed: Chicago, IL
🧠 Skills detailed: #Visualization #Mathematics #AI (Artificial Intelligence) #Deep Learning #Data Science #NLP (Natural Language Processing) #TensorFlow #R #Data Wrangling #Computer Science #ML (Machine Learning) #SQL (Structured Query Language) #Data Analysis #Python #Statistics #PyTorch #A/B Testing #Model Evaluation
Role description
About The Role
AI is only as good as the experts who train it. We're looking for data scientists to help evaluate, refine, and improve next-generation AI systems — bringing your quantitative expertise directly to bear on how the world's most advanced models reason, analyze, and communicate.
This is a fully remote, flexible contract role. You set your hours and work at your own pace, contributing to projects that sit at the frontier of applied AI research.
• Organization: Alignerr
• Type: Hourly Contract
• Location: Remote
• Commitment: 10–40 hours/week
What You'll Do
• Evaluate AI model outputs for statistical soundness, reasoning quality, and analytical accuracy
• Design and apply data-driven evaluation criteria and scoring rubrics
• Analyze patterns in AI-generated responses to surface systematic errors or biases
• Create high-quality training data — including prompts, worked solutions, and expert annotations — across data science and ML domains
• Review AI-generated code, visualizations, and statistical analyses for correctness and best practices
• Provide structured, detailed feedback that directly improves model performance
• Work independently and asynchronously on your own schedule
Who You Are
• Degree in Data Science, Statistics, Computer Science, Mathematics, or a related quantitative field (MS or PhD preferred)
• Strong foundation in statistics, probability, and machine learning concepts
• Proficient in Python, R, SQL, or similar data analysis tools
• Experienced with data wrangling, exploratory data analysis, and model evaluation
• Sharp analytical thinker with excellent attention to detail
• Clear written communicator — able to explain complex technical concepts concisely
• Self-motivated and comfortable working independently in an async environment
Nice to Have
• Experience with deep learning frameworks such as PyTorch or TensorFlow
• Familiarity with NLP, large language models, or AI evaluation workflows
• Published research or hands-on industry experience in applied machine learning
• Background in A/B testing, causal inference, or experimental design
Why Join Us
• Work on cutting-edge AI projects alongside top research labs and AI teams globally
• Get rare, inside exposure to how state-of-the-art LLMs are trained and evaluated
• Fully remote and async — work when and where it suits you
• Complete autonomy over your schedule and workload (10–40 hrs/week)
• Join a growing community of expert contributors who are actively shaping the future of AI
• Potential for ongoing work and long-term contract extension




