Senior Data Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Scientist in Raleigh, NC, on a contract basis. Requires 12+ years of experience, strong skills in machine learning, NLP, and deep learning frameworks. Proficiency in Python and cloud platforms is essential.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
-
πŸ—“οΈ - Date discovered
August 9, 2025
πŸ•’ - Project duration
Unknown
-
🏝️ - Location type
On-site
-
πŸ“„ - Contract type
W2 Contractor
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
Raleigh, NC
-
🧠 - Skills detailed
#"ETL (Extract #Transform #Load)" #Data Modeling #Transformers #Libraries #ML Ops (Machine Learning Operations) #Databases #BERT #NoSQL #Python #Classification #Clustering #Code Reviews #Cloud #OpenSearch #SpaCy #PyTorch #AI (Artificial Intelligence) #GCP (Google Cloud Platform) #Keras #Elasticsearch #Data Science #API (Application Programming Interface) #Datasets #AWS (Amazon Web Services) #Scala #Distributed Computing #Spark (Apache Spark) #NLP (Natural Language Processing) #Langchain #ML (Machine Learning) #Azure #Hugging Face #Deep Learning #Deployment #TensorFlow
Role description
Role: Sr Data Scientist Location: Raleigh NC - On-Site Type: Contract C2C/W2 Overall Experience: 12+ Years Must Interview Process: 3 Round with Coding task RESPONSIBILITIES β€’ Develop and implement LLM-based applications tailored for in-house legal needs, ensuring they align with client’s commitment to excellence and innovation β€’ Evaluate and maintain our data assets and training/evaluation datasets, ensuring they meet the highest standards of integrity and quality. β€’ Design and build pipelines for preprocessing, annotating, and managing legal document datasets, fostering a customer-centric mindset. β€’ Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs, working as one team. β€’ Conduct experiments and evaluate model performance to drive continuous improvements, raising the bar in all deliverables. β€’ Evaluate AI/ML and GenAI outcomes, both human and automated, to ensure accuracy, reliability, and alignment with business objectives. β€’ Interface with other technical personnel or team members to finalize requirements, demonstrating ownership of outcomes. β€’ Work closely with other development team members to understand complex product requirements and translate them into software designs, ensuring ethical choices in all actions. β€’ Successfully implement development processes, coding best practices, and code reviews for production environments, embodying client’s values in every task. REQUIREMENTS β€’ Strong hands-on experience and foundations in machine learning, including dimensionality reduction, clustering, embeddings, and sequence classification algorithms. β€’ Experience with deep learning frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers. β€’ Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT. β€’ Practical experience with large language models, prompt engineering, fine-tuning, and benchmarking using frameworks such as LangChain and LlamaIndex. β€’ Strong Python background. β€’ Knowledge of AWS, GCP, Azure, or other cloud platforms. β€’ Understanding of data modeling principles and complex data models. β€’ Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB). β€’ Knowledge of Scala, Spark, Ray, or other distributed computing systems is highly preferred. β€’ Knowledge of API development, containerization, and machine learning deployment is highly preferred. β€’ Experience with ML Ops/AI Ops is highly preferred.