

Data Scientist/GenAI Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist/GenAI Engineer in Plano, TX, for 18 months at a competitive pay rate. Key skills include Python, Machine Learning, NLP, and experience with Databricks, MongoDB, and Generative AI, preferably with OpenAI models.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
June 6, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
On-site
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Plano, TX
-
π§ - Skills detailed
#ML (Machine Learning) #SQL (Structured Query Language) #Data Processing #React #Scala #Databricks #PySpark #Deployment #API (Application Programming Interface) #Python #Automation #"ETL (Extract #Transform #Load)" #Data Engineering #MongoDB #Data Storage #NoSQL #NLP (Natural Language Processing) #Databases #Data Science #Spark (Apache Spark) #Storage #Model Deployment #AI (Artificial Intelligence) #Data Pipeline #Datasets
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Title- Data Scientist/GenAI Engineer
Location- Plano TX (On-site)
Duration: 18 months
Job Summary:
We are seeking a highly skilled Data Scientist with a strong foundation in Python, Machine Learning, and Generative AI to join a dynamic and innovative team. The ideal candidate will bring hands-on expertise in Natural Language Processing (NLP), Large Language Models (LLMs) (especially OpenAI), and advanced data analytics platforms such as Databricks and MongoDB.
This role requires a balance of technical excellence, creativity in developing AI solutions, and practical implementation skills to build and optimize intelligent systems at scale.
Key Responsibilities:
β’ Design, develop, and deploy LLM-based applications, with a focus on Generative AI and agent-based architectures.
β’ Implement and fine-tune Large Language Models, preferably using OpenAI APIs, for various enterprise use cases.
β’ Apply NLP techniques to extract, analyze, and interpret complex data from unstructured sources.
β’ Use supervised and unsupervised machine learning methods to generate insights and predictions from large-scale datasets.
β’ Build robust data pipelines and workflows using Python, Databricks, Spark, and PySpark.
β’ Leverage MongoDB and other NoSQL databases for data storage, retrieval, and manipulation.
β’ Evaluate model performance, especially LLM evaluation metrics, to ensure efficiency and effectiveness.
β’ Collaborate with full-stack developers and data engineers for seamless integration of ML models with MongoDB, SQL, React, and API-driven platforms.
β’ Stay current with advancements in AI/ML, and bring innovative solutions to enterprise data and AI services.
Required Skills and Qualifications:
β’ Strong proficiency in Python for data science, automation, and ML workflows.
β’ Expertise in Databricks, PySpark, and Spark for scalable data processing.
β’ Solid experience in MongoDB and other modern database technologies.
β’ Practical experience in Generative AI and building intelligent AI agents.
β’ Hands-on knowledge of Natural Language Processing techniques and tools.
β’ Proven track record of LLM tuning and customization, with a preference for OpenAI-based models.
β’ Familiarity with LLM evaluation methodologies and tools (specific to one role).
β’ Deep understanding of machine learning algorithms β both supervised and unsupervised.
β’ Excellent communication and collaboration skills for working in cross-functional teams.
Preferred Qualifications:
β’ Experience in telecom, enterprise AI, or large-scale data environments.
β’ Exposure to full-stack development (React, MongoDB) and API development.
β’ Familiarity with MLOps practices and CI/CD for ML model deployment.