

Data Scientist
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist with a 3-year contract in Lexington, MA, offering competitive pay. Requires a Master’s in a relevant field, proficiency in Python, and expertise in NLP, machine learning, and generative AI. US citizenship is mandatory.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
800
-
🗓️ - Date discovered
September 17, 2025
🕒 - Project duration
More than 6 months
-
🏝️ - Location type
On-site
-
📄 - Contract type
W2 Contractor
-
🔒 - Security clearance
Unknown
-
📍 - Location detailed
Lexington, MA
-
🧠 - Skills detailed
#Data Cleaning #GIT #Visualization #Datasets #ML (Machine Learning) #NLU (Natural Language Understanding) #Statistics #Model Deployment #AI (Artificial Intelligence) #Cloud #Scala #Computer Science #Python #Deep Learning #Data Science #Deployment #NLP (Natural Language Processing)
Role description
ONLY US CITIZEN "DEFENSE CLIENT"
Position Title: Data Scientist
Location: Lexington, MA USA
Duration: 03 years Contract on W2 (Possible Extension)
DESCRIPTION
Designs, develops, and implements methods, processes, and systems to consolidate and analyze diverse data sets including structured and unstructured. Develops software programs, algorithms, dashboards, information tools, and queries to clean, model, integrate and evaluate datasets. Keeps abreast of new analytic methodologies and technologies. Collaborates with functional business units to drive business solutions and direction.
DUTIES AND RESPONSIBILITIES:
• Develop and optimize state-of-the-art generative AI models for foreign language proficiency testing and education tasks. Use and extend state of the art NLP techniques including large language model fine-tuning, natural language understanding, and content generation.
• Design and implement scalable algorithms and pipelines to handle extensive text datasets in high-performance computing (HPC)environments.
• Collaborate with software engineers, data scientists, and domain experts to delivered-to-end AI solutions for a variety of text-centric applications.
• Conduct research on novel methodologies, staying current with emerging trends in machine learning, generative AI, and NLP.
• Present technical findings to both expert and non-expert audiences, providing clear explanations of complex concepts and proposed solutions.
• KNOWLEDGE, REQUIRED SKILLS, COMPETENCIES AND EXPERIENCE:
• Master’s degree (or equivalent professional experience) in Computer Science, Math, AI/ML, Data Science, Statistics, Applied Linguistics, or a related field.
• Proficiency in Python, with demonstrable experience developing and implementing algorithms for AI/ML workflows.
• Strong background in natural language processing, machine learning, and generative AI, with a portfolio of relevant coursework, publications, or project work.
• Hands-on experience with data science pipelines, including data cleaning, preparation, and visualization, for large-scale text corpora.
• Familiarity with high-performance computing (e.g., supercomputers, clusters, or cloud infrastructure) and the associated challenges of large-scale AI model development.
NICE TO HAVE
• Experience with deep learning frameworks for building and training complex human language technology systems.
• Familiarity with responsible AI principles, model interpretability, and ethical considerations in large language model deployment.
• Proven experience collaborating on shared code repositories (e.g., Git), contributing to communal software projects or technical communities.
• Enthusiasm for learning new technologies and exploring innovative applications of generative AI.
• (Bonus) Demonstrated interest in learning foreign languages
ONLY US CITIZEN "DEFENSE CLIENT"
Position Title: Data Scientist
Location: Lexington, MA USA
Duration: 03 years Contract on W2 (Possible Extension)
DESCRIPTION
Designs, develops, and implements methods, processes, and systems to consolidate and analyze diverse data sets including structured and unstructured. Develops software programs, algorithms, dashboards, information tools, and queries to clean, model, integrate and evaluate datasets. Keeps abreast of new analytic methodologies and technologies. Collaborates with functional business units to drive business solutions and direction.
DUTIES AND RESPONSIBILITIES:
• Develop and optimize state-of-the-art generative AI models for foreign language proficiency testing and education tasks. Use and extend state of the art NLP techniques including large language model fine-tuning, natural language understanding, and content generation.
• Design and implement scalable algorithms and pipelines to handle extensive text datasets in high-performance computing (HPC)environments.
• Collaborate with software engineers, data scientists, and domain experts to delivered-to-end AI solutions for a variety of text-centric applications.
• Conduct research on novel methodologies, staying current with emerging trends in machine learning, generative AI, and NLP.
• Present technical findings to both expert and non-expert audiences, providing clear explanations of complex concepts and proposed solutions.
• KNOWLEDGE, REQUIRED SKILLS, COMPETENCIES AND EXPERIENCE:
• Master’s degree (or equivalent professional experience) in Computer Science, Math, AI/ML, Data Science, Statistics, Applied Linguistics, or a related field.
• Proficiency in Python, with demonstrable experience developing and implementing algorithms for AI/ML workflows.
• Strong background in natural language processing, machine learning, and generative AI, with a portfolio of relevant coursework, publications, or project work.
• Hands-on experience with data science pipelines, including data cleaning, preparation, and visualization, for large-scale text corpora.
• Familiarity with high-performance computing (e.g., supercomputers, clusters, or cloud infrastructure) and the associated challenges of large-scale AI model development.
NICE TO HAVE
• Experience with deep learning frameworks for building and training complex human language technology systems.
• Familiarity with responsible AI principles, model interpretability, and ethical considerations in large language model deployment.
• Proven experience collaborating on shared code repositories (e.g., Git), contributing to communal software projects or technical communities.
• Enthusiasm for learning new technologies and exploring innovative applications of generative AI.
• (Bonus) Demonstrated interest in learning foreign languages