Data Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Data Scientist on a 6-month contract-to-hire, offering expertise in large language models, generative AI, and data engineering. Required skills include Python, PySpark, SQL, AWS, and strong machine learning fundamentals.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

🗓️ - Date discovered

July 23, 2025

🕒 - Project duration

More than 6 months

🏝️ - Location type

Unknown

📄 - Contract type

Fixed Term

🔒 - Security clearance

Unknown

📍 - Location detailed

United States

🧠 - Skills detailed

#Hugging Face #Regression #Automation #dbt (data build tool) #PySpark #Pandas #Spark SQL #Scala #Data Lake #S3 (Amazon Simple Storage Service) #Clustering #REST API #Model Evaluation #Data Engineering #Airflow #Classification #Data Science #pydantic #Cloud #Python #AWS (Amazon Web Services) #Data Pipeline #Docker #AWS Glue #Lambda (AWS Lambda) #Spark (Apache Spark) #REST (Representational State Transfer) #Time Series #GIT #Langchain #"ETL (Extract #Transform #Load)" #Observability #Jupyter #AI (Artificial Intelligence) #SQL (Structured Query Language) #Programming #Data Lakehouse #ML (Machine Learning)

Role description

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Block quote

Ordered list

Item 1
Item 2
Item 3

Unordered list

Item A
Item B
Item C

Text link

Bold text

Emphasis

^Superscript

_Subscript

Job Title: Data Scientist Our client is looking for a modern data scientist who blends advanced analytical skills with hands-on experience in large language models, generative AI, and data engineering. This individual works closely with business leaders to frame use cases, prototypes rapidly, and builds scalable solutions that bridge insight and automation. Equally comfortable with notebooks and production pipelines, they are a full-spectrum contributor from idea to deployed intelligence. Length: 6-month Contract-to-hire Associate Vendors: We are accepting applications from candidates who are currently authorized to work in the US for any employer without sponsorship. Role & Responsibilities • Deep knowledge of LLMs (OpenAI, Claude, Cohere, LLaMA, etc.) and prompt engineering • Experience fine-tuning and embedding LLMs for domain-specific applications • Classical ML: regression, classification, time series, clustering, model evaluation • AWS stack preferred • Proficient in PySpark, SQL, dbt, and cloud-native pipelines (e.g., AWS Glue, Lambda, S3, Step Functions) • Strong ETL/ELT design skills; experience with data lakehouse and modern data stack • Familiar with orchestrators like Airflow or Dagster • Python (pandas, scikit-learn, LangChain, Hugging Face, Pydantic) • Jupyter, VS Code, Git, Docker • Experience deploying models via REST APIs or event-driven architectures • Skilled at scoping analytics and GenAI use cases from business questions • Builds POCs to validate value quickly, then scales to production • Great communicator who can explain trade-offs, model limitations, and data caveats to non-technical audiences • Cross-functional partner to product, engineering, and ops teams • Code-first mindset with a strong sense of reproducibility, testing, and observability • Advocates for responsible AI and MLOps best practices Required Qualifications: • 5+ years in Data Science • Expertise in Large Language Models (OpenAI, Claude, Cohere, LLaMA, etc.) • Well-versed in Prompt Engineering • Strong Machine Learning Fundamentals (regression, classification, time series, clustering, model evaluation) • Deep understanding of Generative AI and use cases in the world of Data Science • Strong in programming with tools like Python (PySpark), dbt, Jupyter, REST APIs • Strong ELT/ETL design background • Experience with Cloud Native data pipelines in AWS Desired Qualifications: • Strong AWS stack knowledge for AI/ML

Apply now Apply with DFH Sign up

← See all roles