YO IT CONSULTING

Data Scientist - LLM

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Data Scientist - LLM, a part-time, fully remote contract position focused on building data-driven infrastructure for AI systems. Key skills include Python, SQL, and experience with machine learning, statistical analysis, and large-scale data systems.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

100

🗓️ - Date

December 6, 2025

🕒 - Duration

Unknown

🏝️ - Location

Remote

📄 - Contract

Unknown

🔒 - Security

Unknown

📍 - Location detailed

United States

🧠 - Skills detailed

#A/B Testing #Python #Data Pipeline #SQL (Structured Query Language) #Data Science #Snowflake #TensorFlow #Libraries #Pandas #Visualization #AI (Artificial Intelligence) #Datasets #ML (Machine Learning) #Scala #PyTorch #NumPy #"ETL (Extract #Transform #Load)" #Monitoring #BigQuery #Statistics #Data Engineering #Clustering

Role description

hiring a Data Scientist to help build advanced analytics and data-driven infrastructure for its AI lab partner focused on developing intelligent agent-based systems. This role is ideal for analytical thinkers who excel at turning large-scale data into actionable insights and enjoy working at the intersection of machine learning, experimentation, and real-world applications. You’ll be designing data pipelines, statistical models, and performance metrics that drive the next generation of autonomous systems. You’re a great fit if you: Have a strong background in data science, machine learning, or applied statistics. Are proficient in Python, SQL, and familiar with libraries such as Pandas, NumPy, Scikit-learn, and PyTorch/TensorFlow. Understand probabilistic modeling, statistical inference, and experimentation frameworks (A/B testing, causal inference). Can collect, clean, and transform complex datasets into structured formats ready for modeling and analysis. Have experience designing and evaluating predictive models, using metrics like precision, recall, F1-score, and ROC-AUC. Are comfortable working with large-scale data systems (Snowflake, BigQuery, or similar). Are curious about AI agents, and how data can shape the reasoning, adaptability, and behavior of intelligent systems. Enjoy collaborating with cross-functional teams — from engineers to research scientists — to define meaningful KPIs and experiment setups. Primary Goal of This Role To design and implement robust data models, pipelines, and metrics that support experimentation, benchmarking, and continuous learning for agentic AI systems. The role focuses on building data-driven insights into how agents reason, perform, and improve over time across algorithmic and real-world tasks. What You’ll Do Develop data collection and preprocessing pipelines for structured and unstructured data from multiple agent simulations. Build and iterate on machine learning models for performance prediction, behavior clustering, and outcome optimization. Design and maintain dashboards and visualization tools for monitoring agent performance, benchmarks, and trends. Conduct statistical analyses to evaluate the efficacy of AI systems under various environments and constraints. Collaborate with engineers to design evaluation frameworks that measure reasoning quality, adaptability, and efficiency. Prototype data-driven tools and feedback loops to automatically improve model accuracy and agent behavior over time. Work closely with AI research teams to translate experimental results into scalable, production-grade insights. Why This Role Is Exciting Work at the forefront of AI agent intelligence and help define how data shapes their evolution. Blend machine learning, experimentation, and data engineering in one role. Collaborate with top-tier AI engineers on new agent benchmarks and feedback mechanisms. Contribute to a mission that merges algorithmic reasoning, real-world performance, and human-like decision-making. Pay & Work Structure You’ll be classified as an hourly contractor to. Paid weekly via Stripe Connect, based on hours logged. Part-time (20 hrs - 40 hrs/week) with fully remote, async flexibility — work from anywhere, on your own schedule. Weekly bonus of $500 - $1000 USD per 5 task created.

Apply now Apply with DFH Sign up

YO IT CONSULTING

Data Scientist - LLM

M365 Solutions Architect

Advanced Growth Insights & AI Analyst, Data and Analytics

Machine Learning Engineer (Remote | $80 – $120/hr)

PowerApps Developer

Book a

chat

with us

Company