

Data Scientist
β - Featured Role | Apply direct with Data Freelance Hub
This role is a Data Scientist position based in Raleigh, NC, with a contract length of "unknown" and a pay rate of "unknown." Key skills include machine learning, NLP, Python, and experience with LLMs and cloud platforms.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
September 23, 2025
π - Project duration
Unknown
-
ποΈ - Location type
On-site
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Raleigh, NC
-
π§ - Skills detailed
#Deep Learning #BERT #SpaCy #AI (Artificial Intelligence) #NLP (Natural Language Processing) #AWS (Amazon Web Services) #Databases #Distributed Computing #Deployment #Azure #Keras #API (Application Programming Interface) #TensorFlow #Langchain #Spark (Apache Spark) #"ETL (Extract #Transform #Load)" #Scala #ML (Machine Learning) #NoSQL #Code Reviews #Data Modeling #ML Ops (Machine Learning Operations) #Hugging Face #Classification #Python #Libraries #Elasticsearch #Transformers #Cloud #Datasets #Data Science #Clustering #PyTorch #OpenSearch #GCP (Google Cloud Platform)
Role description
Position: Data Scientist
Location: Raleigh, NC
RESPONSIBILITIES
β’ Develop and implement LLM-based applications tailored for in-house legal
β’ Fine-tune and deploy large language models to enhance their performance on legal text processing tasks
β’ Evaluate and help maintain our data assets and training/evaluation data sets
β’ Design and build pipelines for preprocessing, annotating, and managing legal document datasets
β’ Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs
β’ Conduct experiments and evaluate model performance to drive continuous improvements
β’ Interface with other technical personnel or team members to finalize requirements.
β’ Work closely with other development team members to understand moderately complex product requirements and translate them into software designs.
β’ Successfully implement development processes, coding best practices, and code reviews for production environments.
REQUIREMENTS
β’ Formal training in machine learning: dimensionality reduction, clustering, embeddings, and sequence classification algorithms
β’ Experience with deep learning frameworks such as PyTorch, Tensorflow and Hugging Face Transformers.
β’ Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT
β’ Practical experience with large language models, prompt engineering, fine-tuning and benchmarking, using frameworks such as LangChain and LlamaIndex
β’ Strong Python background
β’ Knowledge of AWS, GCP, Azure, or other cloud platform
β’ Understanding of data modeling principles and complex data models.
β’ Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB)
β’ Knowledge of Scala, Spark, Ray, or other distributed computing systems highly preferred
β’ Knowledge of API development, containerization, and machine learning deployment highly preferred
β’ Experience with ML Ops/AI Ops highly preferred
Regards
Patrick Fernandez
Talent Acquisition Group - Strategic Recruitment Manager
Position: Data Scientist
Location: Raleigh, NC
RESPONSIBILITIES
β’ Develop and implement LLM-based applications tailored for in-house legal
β’ Fine-tune and deploy large language models to enhance their performance on legal text processing tasks
β’ Evaluate and help maintain our data assets and training/evaluation data sets
β’ Design and build pipelines for preprocessing, annotating, and managing legal document datasets
β’ Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs
β’ Conduct experiments and evaluate model performance to drive continuous improvements
β’ Interface with other technical personnel or team members to finalize requirements.
β’ Work closely with other development team members to understand moderately complex product requirements and translate them into software designs.
β’ Successfully implement development processes, coding best practices, and code reviews for production environments.
REQUIREMENTS
β’ Formal training in machine learning: dimensionality reduction, clustering, embeddings, and sequence classification algorithms
β’ Experience with deep learning frameworks such as PyTorch, Tensorflow and Hugging Face Transformers.
β’ Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT
β’ Practical experience with large language models, prompt engineering, fine-tuning and benchmarking, using frameworks such as LangChain and LlamaIndex
β’ Strong Python background
β’ Knowledge of AWS, GCP, Azure, or other cloud platform
β’ Understanding of data modeling principles and complex data models.
β’ Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB)
β’ Knowledge of Scala, Spark, Ray, or other distributed computing systems highly preferred
β’ Knowledge of API development, containerization, and machine learning deployment highly preferred
β’ Experience with ML Ops/AI Ops highly preferred
Regards
Patrick Fernandez
Talent Acquisition Group - Strategic Recruitment Manager