Data Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is a Data Scientist position based in Raleigh, NC, with a contract length of "unknown" and a pay rate of "unknown." Key skills include machine learning, NLP, Python, and experience with LLMs and cloud platforms.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
-
πŸ—“οΈ - Date discovered
September 23, 2025
πŸ•’ - Project duration
Unknown
-
🏝️ - Location type
On-site
-
πŸ“„ - Contract type
Unknown
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
Raleigh, NC
-
🧠 - Skills detailed
#Deep Learning #BERT #SpaCy #AI (Artificial Intelligence) #NLP (Natural Language Processing) #AWS (Amazon Web Services) #Databases #Distributed Computing #Deployment #Azure #Keras #API (Application Programming Interface) #TensorFlow #Langchain #Spark (Apache Spark) #"ETL (Extract #Transform #Load)" #Scala #ML (Machine Learning) #NoSQL #Code Reviews #Data Modeling #ML Ops (Machine Learning Operations) #Hugging Face #Classification #Python #Libraries #Elasticsearch #Transformers #Cloud #Datasets #Data Science #Clustering #PyTorch #OpenSearch #GCP (Google Cloud Platform)
Role description
Position: Data Scientist Location: Raleigh, NC RESPONSIBILITIES β€’ Develop and implement LLM-based applications tailored for in-house legal β€’ Fine-tune and deploy large language models to enhance their performance on legal text processing tasks β€’ Evaluate and help maintain our data assets and training/evaluation data sets β€’ Design and build pipelines for preprocessing, annotating, and managing legal document datasets β€’ Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs β€’ Conduct experiments and evaluate model performance to drive continuous improvements β€’ Interface with other technical personnel or team members to finalize requirements. β€’ Work closely with other development team members to understand moderately complex product requirements and translate them into software designs. β€’ Successfully implement development processes, coding best practices, and code reviews for production environments. REQUIREMENTS β€’ Formal training in machine learning: dimensionality reduction, clustering, embeddings, and sequence classification algorithms β€’ Experience with deep learning frameworks such as PyTorch, Tensorflow and Hugging Face Transformers. β€’ Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT β€’ Practical experience with large language models, prompt engineering, fine-tuning and benchmarking, using frameworks such as LangChain and LlamaIndex β€’ Strong Python background β€’ Knowledge of AWS, GCP, Azure, or other cloud platform β€’ Understanding of data modeling principles and complex data models. β€’ Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB) β€’ Knowledge of Scala, Spark, Ray, or other distributed computing systems highly preferred β€’ Knowledge of API development, containerization, and machine learning deployment highly preferred β€’ Experience with ML Ops/AI Ops highly preferred Regards Patrick Fernandez Talent Acquisition Group - Strategic Recruitment Manager