

Brooksource
Lead Data Engineer - AI Data Products
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Lead Data Engineer - AI Data Products, offering a contract-to-permanent position with a pay rate of "unknown." It requires 10 years of data engineering experience, expertise in Databricks, Python, and vector databases, and is 100% remote.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
October 23, 2025
π - Duration
Unknown
-
ποΈ - Location
Remote
-
π - Contract
W2 Contractor
-
π - Security
Unknown
-
π - Location detailed
United States
-
π§ - Skills detailed
#Leadership #Strategy #AI (Artificial Intelligence) #ML (Machine Learning) #Data Science #DevOps #Cloud #Code Reviews #Computer Science #Databases #Data Engineering #RDS (Amazon Relational Database Service) #Databricks #"ETL (Extract #Transform #Load)" #Data Pipeline #Python #Scala
Role description
Lead Data Engineer β AI Data Products | Databricks | Vector Databases
Contract-to-Permanent Hire (W-2)
100% Remote (CST Work Hours)
Our Fortune 50 healthcare client is seeking a Lead AI Data Engineer to drive the strategy and execution of scalable AI data products. This is a ground-floor opportunity for a leader who thrives on mentoring others, shaping engineering practices, and building robust, cloud-native data pipelines enabling enterprise AI/ML capabilities. The role balances hands-on implementation (~50%) with leadership and strategic oversight as the team and AI product portfolio grow. Youβll guide engineers, conduct code reviews, and collaborate cross-functionally to build AI-ready data products.
Responsibilities:
β’ Own the technical strategy for data engineering and AI product development.
β’ Design and implement scalable data pipelines in Databricks and ETL/ELT workflows, leveraging Medallion Architecture and cloud-native best practices.
β’ Develop data engineering solutions using Python, RDE/RDS, and related core technologies.
β’ Define, document, and evolve engineering standards, architecture, and development workflows.
β’ Mentor and guide a growing team of engineers, including senior and entry-level talent, providing technical oversight and supporting professional growth.
β’ Conduct code reviews to ensure maintainable, secure, and high-quality code delivery.
β’ Balance hands-on implementation (about 50%) with leadership and strategic responsibilities as the team and AI product portfolio scale.
β’ Partner with cross-functional teams to deliver scalable, production-ready AI data products.
Requirements:
β’ Bachelor of Science in Computer Science or Data Science required; MS degree preferred.
β’ 10 years of professional experience in data engineering, including integrating AI/ML solutions.
β’ 2 years of experience as a Lead Engineer, conducting code reviews and mentoring both senior and junior engineers.
β’ Hands-on experience with Databricks and cloud-based data engineering.
β’ Expertise in Python, ETL/ELT pipeline development, and Medallion Architecture.
β’ Experience with vector databases (FAISS, Pinecone, Weaviate, Milvus) and building AI-ready data assets (embeddings, vector stores, feature tables, semantic AI context assets).
β’ Experience driving architectural decisions and technical standards for scalable data & AI solutions.
β’ Ability to define and drive engineering practices in a dynamic, data & AI-focused environment, applying DevOps and code-first principles to data workflows.
β’ US Citizens & Green Card holders only
β’
Lead Data Engineer β AI Data Products | Databricks | Vector Databases
Contract-to-Permanent Hire (W-2)
100% Remote (CST Work Hours)
Our Fortune 50 healthcare client is seeking a Lead AI Data Engineer to drive the strategy and execution of scalable AI data products. This is a ground-floor opportunity for a leader who thrives on mentoring others, shaping engineering practices, and building robust, cloud-native data pipelines enabling enterprise AI/ML capabilities. The role balances hands-on implementation (~50%) with leadership and strategic oversight as the team and AI product portfolio grow. Youβll guide engineers, conduct code reviews, and collaborate cross-functionally to build AI-ready data products.
Responsibilities:
β’ Own the technical strategy for data engineering and AI product development.
β’ Design and implement scalable data pipelines in Databricks and ETL/ELT workflows, leveraging Medallion Architecture and cloud-native best practices.
β’ Develop data engineering solutions using Python, RDE/RDS, and related core technologies.
β’ Define, document, and evolve engineering standards, architecture, and development workflows.
β’ Mentor and guide a growing team of engineers, including senior and entry-level talent, providing technical oversight and supporting professional growth.
β’ Conduct code reviews to ensure maintainable, secure, and high-quality code delivery.
β’ Balance hands-on implementation (about 50%) with leadership and strategic responsibilities as the team and AI product portfolio scale.
β’ Partner with cross-functional teams to deliver scalable, production-ready AI data products.
Requirements:
β’ Bachelor of Science in Computer Science or Data Science required; MS degree preferred.
β’ 10 years of professional experience in data engineering, including integrating AI/ML solutions.
β’ 2 years of experience as a Lead Engineer, conducting code reviews and mentoring both senior and junior engineers.
β’ Hands-on experience with Databricks and cloud-based data engineering.
β’ Expertise in Python, ETL/ELT pipeline development, and Medallion Architecture.
β’ Experience with vector databases (FAISS, Pinecone, Weaviate, Milvus) and building AI-ready data assets (embeddings, vector stores, feature tables, semantic AI context assets).
β’ Experience driving architectural decisions and technical standards for scalable data & AI solutions.
β’ Ability to define and drive engineering practices in a dynamic, data & AI-focused environment, applying DevOps and code-first principles to data workflows.
β’ US Citizens & Green Card holders only
β’






