

Themesoft Inc.
Senior Data Scientist
⭐ - Featured Role | Apply direct with Data Freelance Hub
Nothing Found.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
May 19, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Unknown
-
📄 - Contract
W2 Contractor
-
🔒 - Security
Unknown
-
📍 - Location detailed
Los Angeles, CA
-
🧠 - Skills detailed
#Python #Scala #Cloud #AI (Artificial Intelligence) #ML (Machine Learning) #Data Science #Azure #Deployment #"ETL (Extract #Transform #Load)" #MongoDB #Normalization #Databases
Role description
Job Title: Senior Data Scientist
Customer Location: Los Angeles, CA
Hire type: Contract W2
Technical Skills:
· Advanced Python development for ML/AI workloads
· End‑to‑end ML lifecycle: model training, evaluation, fine‑tuning, and labeling/tagging workflows
· Generative AI systems design, including LLM-based application development
· Prompt engineering optimization for large language models
· Document AI pipelines: OCR/extraction, parsing, normalization, and text chunking for structured & unstructured data
· Embedding generation pipelines for semantic search and retrieval
· Vector similarity search implementation using vector databases
· ML model integration with Vector DBs and MongoDB
· Production‑grade ML engineering: scalable, maintainable, and deployment‑ready code
· Knowledge of CI/CD pipelines and cloud deployment (Azure preferred)
· Experience with Vector DBs and/or MongoDB
Python, Large Language Models (LLMs) (via LLM‑based applications), Vector Databases, MongoDB
Roles & Responsibilities
We are seeking a highly skilled Data Science Engineer to design and develop scalable ML and Generative AI solutions. The ideal candidate will have deep expertise in Python, hands-on experience in model training, document processing pipelines, and strong knowledge of vector databases and modern ML/GenAI frameworks.
Strong fit if the candidate:
· Has expert-level Python skills
· Has hands-on experience building ML/GenAI systems, not just theoretical knowledge
· Has worked on end-to-end ML pipelines (data → model → deployment)
· Has experience with document AI, embeddings, and vector search
· Thinks like an engineer (scalable, maintainable, production-ready code)
Key Responsibilities
· Develop and deploy machine learning and GenAI solutions using Python
· Design and optimize prompt engineering strategies for LLM-based applications
· Build document extraction, parsing, and chunking pipelines for structured and unstructured data
· Train, evaluate, and fine-tune ML models; manage tagging and labeling workflows
· Implement embedding generation and vector search solutions
· Integrate ML models with Vector DBs and MongoDB
· Ensure code quality, scalability, and production readiness
Job Title: Senior Data Scientist
Customer Location: Los Angeles, CA
Hire type: Contract W2
Technical Skills:
· Advanced Python development for ML/AI workloads
· End‑to‑end ML lifecycle: model training, evaluation, fine‑tuning, and labeling/tagging workflows
· Generative AI systems design, including LLM-based application development
· Prompt engineering optimization for large language models
· Document AI pipelines: OCR/extraction, parsing, normalization, and text chunking for structured & unstructured data
· Embedding generation pipelines for semantic search and retrieval
· Vector similarity search implementation using vector databases
· ML model integration with Vector DBs and MongoDB
· Production‑grade ML engineering: scalable, maintainable, and deployment‑ready code
· Knowledge of CI/CD pipelines and cloud deployment (Azure preferred)
· Experience with Vector DBs and/or MongoDB
Python, Large Language Models (LLMs) (via LLM‑based applications), Vector Databases, MongoDB
Roles & Responsibilities
We are seeking a highly skilled Data Science Engineer to design and develop scalable ML and Generative AI solutions. The ideal candidate will have deep expertise in Python, hands-on experience in model training, document processing pipelines, and strong knowledge of vector databases and modern ML/GenAI frameworks.
Strong fit if the candidate:
· Has expert-level Python skills
· Has hands-on experience building ML/GenAI systems, not just theoretical knowledge
· Has worked on end-to-end ML pipelines (data → model → deployment)
· Has experience with document AI, embeddings, and vector search
· Thinks like an engineer (scalable, maintainable, production-ready code)
Key Responsibilities
· Develop and deploy machine learning and GenAI solutions using Python
· Design and optimize prompt engineering strategies for LLM-based applications
· Build document extraction, parsing, and chunking pipelines for structured and unstructured data
· Train, evaluate, and fine-tune ML models; manage tagging and labeling workflows
· Implement embedding generation and vector search solutions
· Integrate ML models with Vector DBs and MongoDB
· Ensure code quality, scalability, and production readiness






