FUSTIS LLC

Sr. Data Scientist (NLP / Generative AI / LLMs)

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Sr. Data Scientist in Woodlawn, MD, offering $80/h C2C for a long-term contract. Requires 15+ years of experience, expertise in NLP, Python, SQL, and Generative AI, and a Bachelor's degree. Public Trust clearance needed.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
640
-
πŸ—“οΈ - Date
July 2, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
On-site
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Woodlawn, MD
-
🧠 - Skills detailed
#Version Control #Airflow #HTML (Hypertext Markup Language) #Libraries #Deployment #Pandas #Monitoring #NLTK (Natural Language Toolkit) #Python #Database Management #Scala #TensorFlow #Matplotlib #Statistics #Leadership #Anomaly Detection #GIT #Hadoop #Web Services #SpaCy #AI (Artificial Intelligence) #SQL (Structured Query Language) #Langchain #Programming #Computer Science #Data Analysis #ML (Machine Learning) #Spark (Apache Spark) #Impala #EC2 #Azure #Cloud #Mathematics #Oracle #Regular Expressions #NLP (Natural Language Processing) #DevOps #Apache Spark #NumPy #PyTorch #Model Deployment #Data Science #AWS (Amazon Web Services) #Data Engineering #SQL Server #MySQL
Role description
Job Description Job Title : Sr. Data Scientist Pay Rate : $80/h C2C Visa : USC, GC LOCATION : Woodlawn, MD (5 days per week onsite) (Locals) Experience : 15+ years EMPLOYMENT TYPE: Long term Contract; Any work authorization, as long as the candidate has worked and lived in the USA for 3 years. Will need to obtain Public Trust. INFO REQUIRED TO SUBMIT: β€’ Standard Submittal Information Key Required Skills β€’ Solid Experience with Natural Language Processing (NLP), Python, NLP frameworks, SQL, Pandas, NLTK and SPACy. β€’ Experience with Generative AI and Large Language Models (LLM) β€’ Excellent Communication skills Position Description β€’ Hands on experience in Python, NLP frameworks, SQL, Pandas, NLTK, SPACy and LLMs β€’ Well versed in SQL and analyzing trends and transactional data. β€’ Understand real world challenges and develop automated data solutions β€’ Develop, test, and deploy new techniques for NLP understanding β€’ Scalable development/deployment of ML and Generative AI approaches (such as Large Language Models (LLMs) β€’ Train and optimize NLP/LLM models and create Python based pipelines β€’ Experience building cloud native solutions on AWS β€’ Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution. β€’ Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem. β€’ Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems. β€’ Provide accurate, timely, complex, and sophisticated data analysis. Detailed Skills Requirements Foundation for Success (Basic Qualifications) β€’ Bachelor’s degree in Statistics, Applied Mathematics, Computer Science, or Information Science with industry experience on Python, NLP frameworks, SQL, Pandas, NLTK and SPACy, data science, and AI/ML/LLM engineering. β€’ Overall 10+ years’ experience in IT industry Factors To Help You Shine (Required Skills)? β€’ β€’ Selected candidate must be able to obtain and maintain a public trust clearance β€’ β€’ β€’ β€’ Selected candidate must be willing to work on-site in Woodlawn, MD 5 days a week β€’ β€’ β€’ β€’ Master's and 10+ years of experience, Bachelor's and 12+ years of experience or 18+ years in lieu of a degree β€’ β€’ β€’ Solid Experience with Natural Language Processing (NLP), Python, NLP frameworks, SQL, Pandas, NLTK and SPACy. β€’ Experience with Generative AI and Large Language Models (LLM) β€’ Evidence of true self-starter and operating independently. β€’ Fluency in Python Programming, version control and collaboration with GIT, standard Python packages (ex. Pandas, numpy, matplotlib) and ML frameworks β€’ Knowledge of TensorFlow, PyTorch, Pandas, scikit-learn, NLTK, Azure ML (optional), Amazon Web Services EC2. β€’ Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search. β€’ Expert knowledge in conducting data analysis and applying advanced statistical concepts and ML methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models. β€’ Experience with ML model deployment and operations like DevOps, MLOps, LLMOps. β€’ Experience with NLP and Generative AI libraries like regular expressions (e.g., spacy, langchain), text annotation tools and semantic frameworks. β€’ Ability to clean and process large amounts of real-world data. β€’ Experience retrieving and manipulating data from a variety of data sources included DB2, Oracle, SQL Server, Hadoop and flat files. β€’ Excellent Communication skills. β€’ Experience with database management systems (e.g., PostgresSQL, MySQL, SQLite, SQL, etc.) β€’ Excellent analytical skills to identify potential risks and propose effective solutions. β€’ Excellent problem-solving skills, ability to collaborate with cross-functional teams and proven communication in written and verbal formats to various audiences to include executive leadership. How To Stand Out From The Crowd (Desired Skills)? β€’ Prior experience with federal or state governments IT projects. β€’ Industry experience preferred β€’ Experience with, or the ability and willingness to learn distributed processing via the Hadoop ecosystem, i.e., Spark, Impala and Hive. β€’ Experience working in an analytical research environment. β€’ Experience in parallel processing such as GPU programming with CUDA β€’ Experience with Mathematica β€’ Experience using markup languages such as LaTeX, HTML, etc. β€’ Experience with Natural Language Processing for anomaly detection