

Celebal Technologies
Lead Data Scientist & Machine Learning Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Lead Data Scientist & Machine Learning Engineer, offering a long-term contract with a pay rate of "unknown." It requires 6+ years of experience, expertise in Databricks, and proficiency in Python, SQL, and ML frameworks. Remote work is available.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
April 20, 2026
π - Duration
Unknown
-
ποΈ - Location
Remote
-
π - Contract
Unknown
-
π - Security
Unknown
-
π - Location detailed
United States
-
π§ - Skills detailed
#Langchain #ML Ops (Machine Learning Operations) #DevOps #Scala #PySpark #TensorFlow #Databricks #Data Science #Statistics #MLflow #Data Engineering #ML (Machine Learning) #SQL (Structured Query Language) #Spark (Apache Spark) #Python #PyTorch #Delta Lake #Kubernetes #AI (Artificial Intelligence) #Leadership #Databases #Computer Science #Mathematics #Docker
Role description
Lead Data Scientist & Machine Learning Engineer
Location: Houston, TX or SFO, CA or Remote
Long Term
About the Role
Are you a visionary in the world of Generative AI and Databricks? We are looking for a Lead Data Scientist & ML Engineer to bridge the gap between cutting-edge research and scalable production systems.
In this role, you wonβt just be building models; youβll be architecting the future of our AI ecosystem. You will lead the charge in leveraging the Databricks Data Intelligence Platform to build, deploy, and monitor sophisticated ML and GenAI solutions that solve complex business problems.
What Youβll Do
β’ Architect GenAI Solutions: Design and implement LLM-based applications using RAG (Retrieval-Augmented Generation), fine-tuning, and prompt engineering.
β’ End-to-End ML Ops: Build and automate robust ML pipelines on Databricks, utilizing Unity Catalog, MLflow, and Model Serving.
β’ Lead & Mentor: Act as the technical North Star for a team of data scientists and engineers, fostering a culture of excellence and rapid experimentation.
β’ Productionize at Scale: Convert POCs into enterprise-grade products, ensuring high performance, low latency, and cost-efficient scaling.
β’ Cross-Functional Collaboration: Partner with Stakeholders, Data Engineers, and DevOps to align AI initiatives with core business objectives.
What Weβre Looking For
β’ Experience: 6+ years in Data Science/ML Engineering, with at least 2 years in a leadership or principal capacity.
β’ Databricks Mastery: Expert-level knowledge of the Databricks ecosystem (Delta Lake, Spark, Mosaic AI, Workflows).
β’ Generative AI Expertise: Hands-on experience with frameworks like LangChain, LlamaIndex, and Vector Databases (e.g., Pinecone, Weaviate, or Databricks Vector Search).
β’ Technical Stack:
β’ Languages: Python (Expert), SQL, PySpark.
β’ ML Frameworks: PyTorch, TensorFlow, or Scikit-learn.
β’ DevOps/MLOps: Experience with CI/CD, MLflow, and containerization (Docker/Kubernetes).
β’ Education: Masterβs or PhD in Computer Science, Mathematics, Statistics, or a related quantitative field (or equivalent experience).
Lead Data Scientist & Machine Learning Engineer
Location: Houston, TX or SFO, CA or Remote
Long Term
About the Role
Are you a visionary in the world of Generative AI and Databricks? We are looking for a Lead Data Scientist & ML Engineer to bridge the gap between cutting-edge research and scalable production systems.
In this role, you wonβt just be building models; youβll be architecting the future of our AI ecosystem. You will lead the charge in leveraging the Databricks Data Intelligence Platform to build, deploy, and monitor sophisticated ML and GenAI solutions that solve complex business problems.
What Youβll Do
β’ Architect GenAI Solutions: Design and implement LLM-based applications using RAG (Retrieval-Augmented Generation), fine-tuning, and prompt engineering.
β’ End-to-End ML Ops: Build and automate robust ML pipelines on Databricks, utilizing Unity Catalog, MLflow, and Model Serving.
β’ Lead & Mentor: Act as the technical North Star for a team of data scientists and engineers, fostering a culture of excellence and rapid experimentation.
β’ Productionize at Scale: Convert POCs into enterprise-grade products, ensuring high performance, low latency, and cost-efficient scaling.
β’ Cross-Functional Collaboration: Partner with Stakeholders, Data Engineers, and DevOps to align AI initiatives with core business objectives.
What Weβre Looking For
β’ Experience: 6+ years in Data Science/ML Engineering, with at least 2 years in a leadership or principal capacity.
β’ Databricks Mastery: Expert-level knowledge of the Databricks ecosystem (Delta Lake, Spark, Mosaic AI, Workflows).
β’ Generative AI Expertise: Hands-on experience with frameworks like LangChain, LlamaIndex, and Vector Databases (e.g., Pinecone, Weaviate, or Databricks Vector Search).
β’ Technical Stack:
β’ Languages: Python (Expert), SQL, PySpark.
β’ ML Frameworks: PyTorch, TensorFlow, or Scikit-learn.
β’ DevOps/MLOps: Experience with CI/CD, MLflow, and containerization (Docker/Kubernetes).
β’ Education: Masterβs or PhD in Computer Science, Mathematics, Statistics, or a related quantitative field (or equivalent experience).




