

FUSTIS LLC
Sr. Data Scientist (NLP / Generative AI / LLMs)
This role is for a Sr. Data Scientist in Woodlawn, MD, offering $80/h C2C for a long-term contract. Requires 15+ years of experience, expertise in NLP, Python, SQL, and Generative AI, and a Bachelor's degree. Public Trust clearance needed.
Country: United States
Currency: $ USD
Day rate: 640
Date: July 2, 2026
Duration: Unknown
Location: On-site
Contract: Unknown
Security: Unknown
Location detailed: Woodlawn, MD
Skills detailed:
#Version Control #Airflow #HTML (Hypertext Markup Language) #Libraries #Deployment #Pandas #Monitoring #NLTK (Natural Language Toolkit) #Python #Database Management #Scala #TensorFlow #Matplotlib #Statistics #Leadership #Anomaly Detection #GIT #Hadoop #Web Services #SpaCy #AI (Artificial Intelligence) #SQL (Structured Query Language) #Langchain #Programming #Computer Science #Data Analysis #ML (Machine Learning) #Spark (Apache Spark) #Impala #EC2 #Azure #Cloud #Mathematics #Oracle #Regular Expressions #NLP (Natural Language Processing) #DevOps #Apache Spark #NumPy #PyTorch #Model Deployment #Data Science #AWS (Amazon Web Services) #Data Engineering #SQL Server #MySQL
Role description
Job Description
Job Title : Sr. Data Scientist
Pay Rate : $80/h C2C
Visa : USC, GC
LOCATION : Woodlawn, MD (5 days per week onsite) (Locals)
Experience : 15+ years
EMPLOYMENT TYPE: Long term Contract; Any work authorization, as long as the candidate has worked and lived in the USA for 3 years. Will need to obtain Public Trust.
INFO REQUIRED TO SUBMIT:
• Standard Submittal Information
Key Required Skills
• Solid experience with Natural Language Processing (NLP), Python, NLP frameworks, SQL, Pandas, NLTK, and spaCy.
• Experience with Generative AI and Large Language Models (LLMs)
• Excellent communication skills
Position Description
• Hands-on experience in Python, NLP frameworks, SQL, Pandas, NLTK, spaCy, and LLMs
• Well versed in SQL and in analyzing trends and transactional data.
• Understand real-world challenges and develop automated data solutions
• Develop, test, and deploy new techniques for NLP understanding
• Scalable development/deployment of ML and Generative AI approaches, such as Large Language Models (LLMs)
• Train and optimize NLP/LLM models and create Python-based pipelines
• Experience building cloud-native solutions on AWS
• Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution.
• Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem.
• Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems.
• Provide accurate, timely, complex, and sophisticated data analysis.
Detailed Skills Requirements
Foundation for Success (Basic Qualifications)
• Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science with industry experience in Python, NLP frameworks, SQL, Pandas, NLTK, spaCy, data science, and AI/ML/LLM engineering.
• Overall 10+ years' experience in the IT industry
Factors To Help You Shine (Required Skills)?
• Selected candidate must be able to obtain and maintain a public trust clearance
• Selected candidate must be willing to work on-site in Woodlawn, MD 5 days a week
• Master's and 10+ years of experience, Bachelor's and 12+ years of experience, or 18+ years in lieu of a degree
• Solid experience with Natural Language Processing (NLP), Python, NLP frameworks, SQL, Pandas, NLTK, and spaCy.
• Experience with Generative AI and Large Language Models (LLMs)
• Evidence of being a true self-starter who operates independently.
• Fluency in Python programming, version control and collaboration with Git, standard Python packages (e.g., Pandas, NumPy, Matplotlib), and ML frameworks
• Knowledge of TensorFlow, PyTorch, Pandas, scikit-learn, NLTK, Azure ML (optional), and Amazon Web Services EC2.
• Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search.
• Expert knowledge in conducting data analysis and applying advanced statistical concepts and ML methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models.
• Experience with ML model deployment and operations practices such as DevOps, MLOps, and LLMOps.
• Experience with NLP and Generative AI libraries (e.g., spaCy, LangChain), regular expressions, text annotation tools, and semantic frameworks.
• Ability to clean and process large amounts of real-world data.
• Experience retrieving and manipulating data from a variety of data sources, including DB2, Oracle, SQL Server, Hadoop, and flat files.
• Excellent communication skills.
• Experience with database management systems (e.g., PostgreSQL, MySQL, SQLite)
• Excellent analytical skills to identify potential risks and propose effective solutions.
• Excellent problem-solving skills, ability to collaborate with cross-functional teams, and proven written and verbal communication with various audiences, including executive leadership.
How To Stand Out From The Crowd (Desired Skills)?
• Prior experience with federal or state government IT projects.
• Industry experience preferred
• Experience with, or the ability and willingness to learn, distributed processing via the Hadoop ecosystem, e.g., Spark, Impala, and Hive.
• Experience working in an analytical research environment.
• Experience in parallel processing such as GPU programming with CUDA
• Experience with Mathematica
• Experience using markup languages such as LaTeX, HTML, etc.
• Experience with Natural Language Processing for anomaly detection






