JS Consulting Solution

Data Scientist with PHD, NLP & LLM || Only USC and Onsite

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist with a PhD, requiring onsite work in Washington, DC for a 6+ month contract. Key skills include NLP, machine learning, Python, AWS, and experience with LLMs. USC only.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
October 15, 2025
πŸ•’ - Duration
More than 6 months
-
🏝️ - Location
On-site
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Washington, DC
-
🧠 - Skills detailed
#Regression #NLTK (Natural Language Toolkit) #Scala #Data Normalization #Elasticsearch #NLP (Natural Language Processing) #Keras #Data Science #Hadoop #NER (Named-Entity Recognition) #Programming #Reinforcement Learning #SQL (Structured Query Language) #Langchain #Data Cleansing #GitHub #Deployment #API (Application Programming Interface) #RNN (Recurrent Neural Networks) #Microservices #Oracle #Normalization #Linux #Sentiment Analysis #Spark (Apache Spark) #Visualization #Clustering #R #GIT #Image Processing #Kubernetes #Jenkins #"ETL (Extract #Transform #Load)" #TensorFlow #RDS (Amazon Relational Database Service) #Theano #Docker #MySQL #RDF (Resource Description Framework) #Classification #Automation #PyTorch #SageMaker #AWS (Amazon Web Services) #Jupyter #OpenSearch #NLG (Natural Language Generation) #ML (Machine Learning) #Scripting #SpaCy #Knowledge Graph #Mathematics #AI (Artificial Intelligence) #SciPy #Cloud #NumPy #Computer Science #PySpark #Redshift #Statistics #AWS SageMaker #Python #BERT #Tableau #GitLab #Deep Learning #PostgreSQL
Role description
Job Title- Data Scientist Project Location – Onsite in Washington, District of Columbia Duration- 6+ months contract Visa- USC Must have PHD Minimum Qualifications: β€’ Work or educational background in one or more of the following areas: machine learning, computational linguistics, deep learning, ratification intelligence, data science and/or data analytic, generative AI, symbolic AI, causal AI, operations research, computer science, Mathematics, business analytics, or knowledge management. β€’ Demonstrated experience programming with R/Python, Linux, and Spark in AWS cloud environment, or knowledge and algorithmic design experience in Python (3+ years) β€’ Proficient with Amazon AWS Sagemaker, Jupyter Notebook and Python Scikit, Deep Learning, Machine Learning tools such as TensorFlow β€’ Experience with image processing models such as Coco, CLIP, ResNet or comparable models β€’ Demonstrated experience with machine learning techniques including natural language processing, and Large language Models (GPTv4-o1, o3, OpenAI APIs, Llama, Claude, etc). β€’ Experience developing AI agents and development proficiency using agentic programming β€’ Proficient in Natural language processing (NLP) and Natural language generation (NLG) including prior projects in any of the following categories: top modeling of text, sentiment analysis of text, part of speech tagging, Name Entity Recognition (NER), Bag of Words, text extraction β€’ Experience building and working with any of these components: Vector DB, BERT, RoBERTa (or comparable tools), Spacy, LLM and GenAI tools. Experience with LoRA, LangChain, RAG, LLM Fine TuningΒ and PEFT, Knowledge Graphs. β€’ Strong skills in developing GraphRAG, Chain of Thought (CoT), Tree of Thought (ToT), Reinforcement learning and AI development architectures with Human-in-the-Loop (HITL β€’ Demonstrated experience with SQL and any relational database technologies, such as Oracle, PostgreSQL, MySQL, RDS, Redshift, Hadoop EMR, Hive, etc. β€’ Demonstrated experience processing structured and unstructured data sources, data cleansing, data normalization and prep for analysis β€’ Demonstrated experience with code repositories and build/deployment pipelines, specifically Jenkins and/or Git/GitHub/GitLab. β€’ Demonstrated experience using Tableau, or Kibana, Quicksights or other similar data visualizations tools. β€’ Very comfortable working with ambiguity (e.g. imperfect data, loosely defined concepts, ideas, or goals) Qualifications & Requirements β€’ Education: MS in Computer Science, Statistics, Math, Engineering, or related field, PhD required. β€’ 3+ years of relevant experience in building large scale machine learning or deep learning models and/or systems β€’ 1+ year of experience specifically with deep learning (e.g., CNN, RNN, LSTM) β€’ 1+ year of experience building NLP and NLG tools. β€’ Experience with wide range of LLMs (Llama, Claude, OpenAI, Cohere, etc.), LoRA, LangChain, RAG, LLM Fine Tuning and PEFT are preferred. β€’ Demonstrated skills with Jupyter Notebook, AWS Sagemaker, or Domino Datalab or comparable environments β€’ Passion for solving complex data problems and generating cross-functional solutions in a fast-paced environment β€’ Knowledge in Python and SQL, object oriented programming, service oriented architectures β€’ Strong scripting skills with Shell script and SQL β€’ Strong coding skills and experience with Python (including SciPy, NumPy, and/or PySpark) and/or Scala. β€’ Knowledge and implementation experience with NLP techniques (topic modeling, bag of words, text classification, TF/IDF, Sentiment analysis) and NLP technologies such as Python NLTK, or Spacy or comparable technologies β€’ Knowledge and implementation experience with statistical and machine learning models (regression, classification, clustering, graph models, etc.) Preferred Qualifications β€’ Hands on experience building models with deep learning frameworks like Tensorflow, Keras, Caffe, PyTorch, Theano, H2O, or similar β€’ Experience with LLM Agents, Agentic programming β€’ Experience with search architecture (for instance: Solr, ElasticSearch, AWS OpenSearch) β€’ Experience with building querying ontologies such as Zeno, OWL, RDF, SparQL or comparable are preferred β€’ Knowledge & experience with microservices, service mesh, API development and test automation are preferred β€’ Demonstrated experience using Docker, Kubernetes, and/or other similar container frameworks are preferred Additional Job Qualifications: β€’ Ability to translate business ideas into analytics models that have major business impact. β€’ Demonstrated experience working with multiple stakeholders. β€’ Demonstrated communication skills, e.g. explaining complex technical issues to more junior data scientists, in graphical, verbal, or written formats. β€’ Demonstrated experience developing tested, reusable and reproducible work