Themesoft Inc.

Senior Data Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub

Nothing Found.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date

May 19, 2026

🕒 - Duration

Unknown

🏝️ - Location

Unknown

📄 - Contract

W2 Contractor

🔒 - Security

Unknown

📍 - Location detailed

Los Angeles, CA

🧠 - Skills detailed

#Python #Scala #Cloud #AI (Artificial Intelligence) #ML (Machine Learning) #Data Science #Azure #Deployment #"ETL (Extract #Transform #Load)" #MongoDB #Normalization #Databases

Role description

Job Title: Senior Data Scientist Customer Location: Los Angeles, CA Hire type: Contract W2 Technical Skills: · Advanced Python development for ML/AI workloads · End‑to‑end ML lifecycle: model training, evaluation, fine‑tuning, and labeling/tagging workflows · Generative AI systems design, including LLM-based application development · Prompt engineering optimization for large language models · Document AI pipelines: OCR/extraction, parsing, normalization, and text chunking for structured & unstructured data · Embedding generation pipelines for semantic search and retrieval · Vector similarity search implementation using vector databases · ML model integration with Vector DBs and MongoDB · Production‑grade ML engineering: scalable, maintainable, and deployment‑ready code · Knowledge of CI/CD pipelines and cloud deployment (Azure preferred) · Experience with Vector DBs and/or MongoDB Python, Large Language Models (LLMs) (via LLM‑based applications), Vector Databases, MongoDB Roles & Responsibilities We are seeking a highly skilled Data Science Engineer to design and develop scalable ML and Generative AI solutions. The ideal candidate will have deep expertise in Python, hands-on experience in model training, document processing pipelines, and strong knowledge of vector databases and modern ML/GenAI frameworks. Strong fit if the candidate: · Has expert-level Python skills · Has hands-on experience building ML/GenAI systems, not just theoretical knowledge · Has worked on end-to-end ML pipelines (data → model → deployment) · Has experience with document AI, embeddings, and vector search · Thinks like an engineer (scalable, maintainable, production-ready code) Key Responsibilities · Develop and deploy machine learning and GenAI solutions using Python · Design and optimize prompt engineering strategies for LLM-based applications · Build document extraction, parsing, and chunking pipelines for structured and unstructured data · Train, evaluate, and fine-tune ML models; manage tagging and labeling workflows · Implement embedding generation and vector search solutions · Integrate ML models with Vector DBs and MongoDB · Ensure code quality, scalability, and production readiness

Apply now Apply with DFH

← See all roles