AI/ML/LLM Data Scientist (PUBLIC TRUST REQUIRED)

This role is for an AI/ML/LLM Data Scientist with a contract length of over 6 months, offering up to $89.00 per hour. Key skills include NLP, Python, and clinical domain experience. Public Trust clearance is required.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
$712
🗓️ - Date discovered
May 10, 2025
🕒 - Project duration
More than 6 months
🏝️ - Location type
On-site
📄 - Contract type
Unknown
🔒 - Security clearance
Yes
📍 - Location detailed
Windsor Mill, MD 21244
🧠 - Skills detailed
#TensorFlow #AI (Artificial Intelligence) #Deployment #Langchain #Oracle #SpaCy #Impala #Apache Spark #ML (Machine Learning) #Matplotlib #Hadoop #Data Analysis #SQL Server #Mathematics #Pandas #Monitoring #Database Management #Regular Expressions #Anomaly Detection #Data Science #Security #NumPy #AWS (Amazon Web Services) #Computer Science #Data Engineering #GIT #NLP (Natural Language Processing) #Statistics #Model Deployment #EC2 #Leadership #Programming #Version Control #PostgreSQL #SQL (Structured Query Language) #Azure #DevOps #Web Services #Airflow #HTML (Hypertext Markup Language) #Scala #Libraries #MySQL #Python #Spark (Apache Spark) #PyTorch #NLTK (Natural Language Toolkit)
Role description
Key Required Skills:
- Strong knowledge of AI, Machine Learning (ML), Large Language Models (LLM), Python, Natural Language Processing (NLP), and experience in the clinical domain.

Position Description:
- Stay updated on new methods in NLP, ML, and Generative AI.
- Understand real-world challenges and develop automated data solutions.
- Develop, test, and deploy new techniques for NLP understanding.
- Achieve scalable development and deployment of ML and Generative AI approaches (such as LLMs).
- Train and optimize NLP/LLM models and create Python-based pipelines.
- Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution.
- Advise on methods and data needed or available to evaluate intelligence or data problems.
- Collaborate with data collectors and analysts to identify and address gaps in complex monitoring problems.
- Provide accurate, timely, and sophisticated data analysis.

Basic Qualifications:
- Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science, along with industry experience in NLP, data science, and AI/ML/LLM engineering.
- A minimum of 8 years of experience as a Data Scientist.
- Must be able to obtain and maintain a Public Trust (contract requirement).

Required Skills:
- Experience with Natural Language Processing (NLP), Generative AI, and Large Language Models (LLM).
- Fluency in Python programming, version control and collaboration using Git, standard Python packages (e.g., Pandas, NumPy, Matplotlib), and ML frameworks.
- Knowledge of TensorFlow, PyTorch, Pandas, scikit-learn, and NLTK, with optional experience in Azure ML and Amazon Web Services EC2.
- Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks like Airflow, as well as experience with semantic search.
- Expertise in conducting data analysis and applying advanced statistical concepts and ML methods to build, train, test, and evaluate various supervised and unsupervised analytic models.
- Experience with ML model deployment and operations, including DevOps, MLOps, and LLMOps.
- Familiarity with NLP and Generative AI tooling, such as regular expressions, libraries (e.g., SpaCy, Langchain), text annotation tools, and semantic frameworks.
- Ability to clean and process large amounts of real-world data.
- Experience retrieving and manipulating data from various sources, including DB2, Oracle, SQL Server, Hadoop, and flat files.
- Proficiency with SQL and database management systems (e.g., PostgreSQL, MySQL, SQLite).
- Excellent analytical skills to identify potential risks and propose effective solutions.
- Strong problem-solving skills and the ability to collaborate with cross-functional teams.
- Proven communication skills, both written and verbal, tailored to various audiences, including executive leadership.

Desired Skills:
- Prior experience working on applications in the clinical domain.
- Experience with federal or state government IT projects.
- Familiarity with distributed processing via the Hadoop ecosystem (e.g., Spark, Impala, Hive).
- Experience in an analytical research environment.
- Knowledge of parallel processing, such as GPU programming with CUDA.
- Familiarity with Mathematica.
- Experience using markup languages such as LaTeX and HTML.
- Experience with Natural Language Processing for anomaly detection.

Education:
- Bachelor's degree with 12+ years of experience.
- Must be able to obtain and maintain a Public Trust (contract requirement).

Job Types: Full-time, Contract
Pay: Up to $89.00 per hour
Education: Bachelor's (Required)

Experience:
- AI: 9 years (Required)
- Machine Learning (ML): 9 years (Required)
- Large Language Models (LLM): 9 years (Required)
- Python: 9 years (Required)
- Natural Language Processing (NLP): 9 years (Required)
- Clinical domain: 9 years (Required)
- Data science: 9 years (Required)
- AI/ML/LLM engineering: 9 years (Required)
- Data Scientist: 8 years (Required)
- Generative AI: 9 years (Required)
- TensorFlow, PyTorch, Pandas, scikit-learn, and NLTK: 9 years (Required)

Security clearance: Confidential (Required)
Ability to Commute: Windsor Mill, MD 21244 (Required)
Work Location: In person