Data Scientist - Gen AI

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist - Gen AI, located in Reston, VA, on a 12+ month contract. Key skills include Python, machine learning, NLP, and AWS experience. A Master's degree is required, with a PhD preferred and 3+ years of relevant experience.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
-
πŸ—“οΈ - Date discovered
May 28, 2025
πŸ•’ - Project duration
More than 6 months
-
🏝️ - Location type
Hybrid
-
πŸ“„ - Contract type
Unknown
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
Reston, VA
-
🧠 - Skills detailed
#SpaCy #Hadoop #Image Processing #PySpark #RNN (Recurrent Neural Networks) #AI (Artificial Intelligence) #NER (Named-Entity Recognition) #Statistics #Clustering #Cloud #Data Science #PostgreSQL #Theano #PyTorch #BERT #MySQL #Oracle #Jupyter #Scripting #Mathematics #GitHub #Reinforcement Learning #Knowledge Graph #Langchain #Automation #NLTK (Natural Language Toolkit) #GitLab #Tableau #SciPy #TensorFlow #Deployment #Jenkins #Computer Science #NumPy #OpenSearch #Consul #Redshift #GIT #Classification #Deep Learning #RDS (Amazon Relational Database Service) #Consulting #API (Application Programming Interface) #"ETL (Extract #Transform #Load)" #Microservices #SageMaker #Sentiment Analysis #RDF (Resource Description Framework) #SQL (Structured Query Language) #AWS (Amazon Web Services) #Normalization #Lean #Scala #NLG (Natural Language Generation) #Agile #Data Cleansing #Linux #NLP (Natural Language Processing) #Data Normalization #Visualization #Keras #Elasticsearch #Regression #Python #Kubernetes #Programming #R #Spark (Apache Spark) #ML (Machine Learning) #Docker #AWS SageMaker
Role description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, VLink Inc, is seeking the following. Apply via Dice today! Job Title: Data Scientist - Gen AI Location: Reston, VA- Hybrid Employment Type: Contract Duration: 12+ Months About VLink: Started in 2006 and headquartered in Connecticut, VLink is one of the fastest growing digital technology services and consulting companies. Since its inception, our innovative team members have been solving the most complex business, and IT challenges of our global clients. Job Description: Minimum Qualifications: Work or educational background in one or more of the following areas: machine learning, computational linguistics, deep learning, ratification intelligence, data science and/or data analytic, generative AI, symbolic AI, causal AI, operations research, computer science, Mathematics, business analytics, or knowledge management. Demonstrated experience programming with R/Python, Linux, and Spark in AWS cloud environment, or knowledge and algorithmic design experience in Python (3+ years) Proficient with Amazon AWS Sagemaker, Jupyter Notebook and Python Scikit, Deep Learning, Machine Learning tools such as TensorFlow Experience with image processing models such as Coco, CLIP, ResNet or comparable models Demonstrated experience with machine learning techniques including natural language processing, and large language Models (GPTv4-o1, o3, OpenAI APIs, Llama, Claude, etc). Experience developing AI agents and development proficiency using agentic programming Proficient in Natural language processing (NLP) and Natural language generation (NLG) including prior projects in any of the following categories: top modeling of text, sentiment analysis of text, part of speech tagging, Name Entity Recognition (NER), Bag of Words, text extraction Experience building and working with any of these components: Vector DB, BERT, RoBERTa (or comparable tools), Spacy, LLM and GenAI tools. Experience with LoRA, LangChain, RAG, LLM Fine Tuning and PEFT, Knowledge Graphs. Strong skills in developing GraphRAG, Chain of Thought (CoT), Tree of Thought (ToT), Reinforcement learning and AI development architectures with Human-in-the-Loop (HITL Demonstrated experience with SQL and any relational database technologies, such as Oracle, PostgreSQL, MySQL, RDS, Redshift, Hadoop EMR, Hive, etc. Demonstrated experience processing structured and unstructured data sources, data cleansing, data normalization and prep for analysis Demonstrated experience with code repositories and build/deployment pipelines, specifically Jenkins and/or Git/GitHub/GitLab. Demonstrated experience using Tableau, or Kibana, Quick sights or other similar data visualizations tools. Very comfortable working with ambiguity (e.g. imperfect data, loosely defined concepts, ideas, or goals) Qualifications & Requirements Education: MS in Computer Science, Statistics, Math, Engineering, or related field, PhD preferred 3+ years of relevant experience in building large scale machine learning or deep learning models and/or systems 1+ year of experience specifically with deep learning (e.g., CNN, RNN, LSTM) 1+ year of experience building NLP and NLG tools. Experience with wide range of LLMs (Llama, Claude, OpenAI, Cohere, etc.), LoRA, Lang Chain, RAG, LLM Fine Tuning and PEFT are preferred. Demonstrated skills with Jupyter Notebook, AWS Sagemaker, or Domino Data lab or comparable environments Passion for solving complex data problems and generating cross-functional solutions in a fast-paced environment Knowledge in Python and SQL, object-oriented programming, service-oriented architectures Strong scripting skills with Shell script and SQL Strong coding skills and experience with Python (including SciPy, NumPy, and/or Pyspark) and/or Scala. Knowledge and implementation experience with NLP techniques (topic Modeling, bag of words, text classification, TF/IDF, Sentiment analysis) and NLP technologies such as Python NLTK, or Spacy or comparable technologies Knowledge and implementation experience with statistical and machine learning models (regression, classification, clustering, graph models, etc.) Preferred Qualifications Hands on experience building models with deep learning frameworks like TensorFlow, Keras, Caffe, PyTorch, Theano, H2O, or similar Experience with LLM Agents, Agentic programming Experience with search architecture (for instance: Solr, Elasticsearch, AWS OpenSearch) Experience with building querying ontologies such as Zeno, OWL, RDF, SparQL or comparable are preferred Knowledge & experience with microservices, service mesh, API development and test automation are preferred Demonstrated experience using Docker, Kubernetes, and/or other similar container frameworks are preferred Strongly prefer a PhD in math, computer science stat or comparable field with experience in data science, AI development and deep learning, advanced analytics Additional Job Qualifications: Ability to translate business ideas into analytics models that have major business impact. Demonstrated experience working with multiple stakeholders. Demonstrated communication skills, e.g., explaining complex technical issues to more junior data scientists, in graphical, verbal, or written formats. Demonstrated experience developing tested, reusable, and reproducible work. Transparently documenting code and methodologies. Ability to work in Agile, Lean, and rapid development processes Develop and maintain financial software applications. Work closely with the finance team to understand their needs and translate them into functional software. Test software to ensure responsiveness and efficiency. Identify, prioritize, and execute tasks in the software development life cycle. Collaborate with internal teams and vendors to fix and improve products. 10+years Employment Practices: EEO, ADA, FMLA Compliant VLink is an equal opportunity employer. At VLink, we are committed to embracing diversity, multiculturalism, and inclusion. VLink does not discriminate on the basis of race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. All aspects of employment including the decision to hire, promote, or discharge, will be decided on the basis of qualifications, merit, performance, and business needs.