

Data Scientist
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist on a 6-month contract-to-hire, offering expertise in large language models, generative AI, and data engineering. Required skills include Python, PySpark, SQL, AWS, and strong machine learning fundamentals.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
July 23, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
Unknown
-
π - Contract type
Fixed Term
-
π - Security clearance
Unknown
-
π - Location detailed
United States
-
π§ - Skills detailed
#Hugging Face #Regression #Automation #dbt (data build tool) #PySpark #Pandas #Spark SQL #Scala #Data Lake #S3 (Amazon Simple Storage Service) #Clustering #REST API #Model Evaluation #Data Engineering #Airflow #Classification #Data Science #pydantic #Cloud #Python #AWS (Amazon Web Services) #Data Pipeline #Docker #AWS Glue #Lambda (AWS Lambda) #Spark (Apache Spark) #REST (Representational State Transfer) #Time Series #GIT #Langchain #"ETL (Extract #Transform #Load)" #Observability #Jupyter #AI (Artificial Intelligence) #SQL (Structured Query Language) #Programming #Data Lakehouse #ML (Machine Learning)
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Job Title: Data Scientist
Our client is looking for a modern data scientist who blends advanced analytical skills with hands-on experience in large language models, generative AI, and data engineering. This individual works closely with business leaders to frame use cases, prototypes rapidly, and builds scalable solutions that bridge insight and automation. Equally comfortable with notebooks and production pipelines, they are a full-spectrum contributor from idea to deployed intelligence.
Length: 6-month Contract-to-hire
Associate Vendors: We are accepting applications from candidates who are currently authorized to work in the US for any employer without sponsorship.
Role & Responsibilities
β’ Deep knowledge of LLMs (OpenAI, Claude, Cohere, LLaMA, etc.) and prompt engineering
β’ Experience fine-tuning and embedding LLMs for domain-specific applications
β’ Classical ML: regression, classification, time series, clustering, model evaluation
β’ AWS stack preferred
β’ Proficient in PySpark, SQL, dbt, and cloud-native pipelines (e.g., AWS Glue, Lambda, S3, Step Functions)
β’ Strong ETL/ELT design skills; experience with data lakehouse and modern data stack
β’ Familiar with orchestrators like Airflow or Dagster
β’ Python (pandas, scikit-learn, LangChain, Hugging Face, Pydantic)
β’ Jupyter, VS Code, Git, Docker
β’ Experience deploying models via REST APIs or event-driven architectures
β’ Skilled at scoping analytics and GenAI use cases from business questions
β’ Builds POCs to validate value quickly, then scales to production
β’ Great communicator who can explain trade-offs, model limitations, and data caveats to non-technical audiences
β’ Cross-functional partner to product, engineering, and ops teams
β’ Code-first mindset with a strong sense of reproducibility, testing, and observability
β’ Advocates for responsible AI and MLOps best practices
Required Qualifications:
β’ 5+ years in Data Science
β’ Expertise in Large Language Models (OpenAI, Claude, Cohere, LLaMA, etc.)
β’ Well-versed in Prompt Engineering
β’ Strong Machine Learning Fundamentals (regression, classification, time series, clustering, model evaluation)
β’ Deep understanding of Generative AI and use cases in the world of Data Science
β’ Strong in programming with tools like Python (PySpark), dbt, Jupyter, REST APIs
β’ Strong ELT/ETL design background
β’ Experience with Cloud Native data pipelines in AWS
Desired Qualifications:
β’ Strong AWS stack knowledge for AI/ML