Senior Data Scientist

This is a contract role for a Senior Data Scientist in Raleigh, NC. It requires 12+ years of experience and strong skills in machine learning, NLP, and deep learning frameworks. Proficiency in Python and cloud platforms is essential.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

🗓️ - Date discovered

August 9, 2025

🕒 - Project duration

Unknown

🏝️ - Location type

On-site

📄 - Contract type

W2 Contractor

🔒 - Security clearance

Unknown

📍 - Location detailed

Raleigh, NC

🧠 - Skills detailed

#ETL (Extract, Transform, Load) #Data Modeling #Transformers #Libraries #ML Ops (Machine Learning Operations) #Databases #BERT #NoSQL #Python #Classification #Clustering #Code Reviews #Cloud #OpenSearch #SpaCy #PyTorch #AI (Artificial Intelligence) #GCP (Google Cloud Platform) #Keras #Elasticsearch #Data Science #API (Application Programming Interface) #Datasets #AWS (Amazon Web Services) #Scala #Distributed Computing #Spark (Apache Spark) #NLP (Natural Language Processing) #Langchain #ML (Machine Learning) #Azure #Hugging Face #Deep Learning #Deployment #TensorFlow

Role description

Role: Sr Data Scientist
Location: Raleigh, NC - On-Site
Type: Contract C2C/W2
Overall Experience: 12+ years (required)
Interview Process: 3 rounds with a coding task

RESPONSIBILITIES

• Develop and implement LLM-based applications tailored for in-house legal needs, ensuring they align with the client’s commitment to excellence and innovation.
• Evaluate and maintain our data assets and training/evaluation datasets, ensuring they meet the highest standards of integrity and quality.
• Design and build pipelines for preprocessing, annotating, and managing legal document datasets, fostering a customer-centric mindset.
• Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs, working as one team.
• Conduct experiments and evaluate model performance to drive continuous improvements, raising the bar in all deliverables.
• Evaluate AI/ML and GenAI outcomes, using both human and automated evaluation, to ensure accuracy, reliability, and alignment with business objectives.
• Interface with other technical personnel or team members to finalize requirements, demonstrating ownership of outcomes.
• Work closely with other development team members to understand complex product requirements and translate them into software designs, ensuring ethical choices in all actions.
• Implement development processes, coding best practices, and code reviews for production environments, embodying the client’s values in every task.

REQUIREMENTS

• Strong hands-on experience and foundations in machine learning, including dimensionality reduction, clustering, embeddings, and sequence classification algorithms.
• Experience with deep learning frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
• Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, and BERT.
• Practical experience with large language models, prompt engineering, fine-tuning, and benchmarking using frameworks such as LangChain and LlamaIndex.
• Strong Python background.
• Knowledge of AWS, GCP, Azure, or other cloud platforms.
• Understanding of data modeling principles and complex data models.
• Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB).
• Knowledge of Scala, Spark, Ray, or other distributed computing systems is highly preferred.
• Knowledge of API development, containerization, and machine learning deployment is highly preferred.
• Experience with ML Ops/AI Ops is highly preferred.
