

Senior Data Scientist
β - Featured Role | Apply direct with Data Freelance Hub
This role is a Senior Data Scientist contract position in Raleigh, NC, requiring 12+ years of experience. Key skills include Natural Language Processing, Machine Learning, Python, and deep learning frameworks. Local candidates only; on-site work is mandatory.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
July 25, 2025
π - Project duration
Unknown
-
ποΈ - Location type
On-site
-
π - Contract type
W2 Contractor
-
π - Security clearance
Unknown
-
π - Location detailed
North Carolina, United States
-
π§ - Skills detailed
#Cloud #Transformers #Hugging Face #AWS (Amazon Web Services) #Databases #NoSQL #Deployment #Spark (Apache Spark) #Scala #ML Ops (Machine Learning Operations) #Elasticsearch #Deep Learning #GCP (Google Cloud Platform) #Keras #Code Reviews #API (Application Programming Interface) #SpaCy #Clustering #Langchain #"ETL (Extract #Transform #Load)" #BERT #Distributed Computing #NLP (Natural Language Processing) #OpenSearch #Data Science #ML (Machine Learning) #PyTorch #Libraries #AI (Artificial Intelligence) #Azure #Classification #TensorFlow #Data Modeling #Python #Datasets
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Role: Senior Data Scientist
Location: Raleigh, NC On-Site
Need Local Profiles Only
Type: Contract C2C/W2
Exp: 12+ Years Must
Must Have:
Strong Foundations in : Natural Language Processing, Machine Learning and Python, Search and Re-Ranking, torch, transformers
Nice to Have: GenAI, LLM, RAG, OpenSearch, ElasticSearch, MLOps, AWS
RESPONSIBILITIES
β’ Develop and implement LLM-based applications tailored for in-house legal needs, ensuring they align with commitment to excellence and innovation
β’ Evaluate and maintain our data assets and training/evaluation datasets, ensuring they meet the highest standards of integrity and quality.
β’ Design and build pipelines for preprocessing, annotating, and managing legal document datasets, fostering a customer-centric mindset.
β’ Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs, working as one team.
β’ Conduct experiments and evaluate model performance to drive continuous improvements, raising the bar in all deliverables.
β’ Evaluate AI/ML and GenAI outcomes, both human and automated, to ensure accuracy, reliability, and alignment with business objectives.
β’ Interface with other technical personnel or team members to finalize requirements, demonstrating ownership of outcomes.
β’ Work closely with other development team members to understand complex product requirements and translate them into software designs, ensuring ethical choices in all actions.
β’ Successfully implement development processes, coding best practices, and code reviews for production environments, embodying values in every task.
REQUIREMENTS
β’ Strong hands-on experience and foundations in machine learning, including dimensionality reduction, clustering, embeddings, and sequence classification algorithms.
β’ Experience with deep learning frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
β’ Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT.
β’ Practical experience with large language models, prompt engineering, fine-tuning, and benchmarking using frameworks such as LangChain and LlamaIndex.
β’ Strong Python background.
β’ Knowledge of AWS, GCP, Azure, or other cloud platforms.
β’ Understanding of data modeling principles and complex data models.
β’ Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB).
β’ Knowledge of Scala, Spark, Ray, or other distributed computing systems is highly preferred.
β’ Knowledge of API development, containerization, and machine learning deployment is highly preferred.
β’ Experience with ML Ops/AI Ops is highly preferred.