

Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer with a 6-month contract, based in San Jose (hybrid). Key skills include Databricks, SQL, and data engineering. Experience with generative AI, MDM, and cloud environments (Azure/AWS/GCP) is essential.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
480
-
ποΈ - Date discovered
July 9, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
Hybrid
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
San Jose, CA
-
π§ - Skills detailed
#Data Management #MDM (Master Data Management) #BI (Business Intelligence) #Spark (Apache Spark) #MS Dynamics #Delta Lake #Databricks #Migration #AWS (Amazon Web Services) #Scala #Data Pipeline #Data Quality #AI (Artificial Intelligence) #Microsoft Power BI #Data Governance #Data Engineering #SQL (Structured Query Language) #"ETL (Extract #Transform #Load)" #GCP (Google Cloud Platform) #Data Lake #Azure #Data Migration #Tableau #Data Modeling #CRM (Customer Relationship Management) #Fivetran #Python #Cloud
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Job Title - Data Engineer
Duration - 6 months with possible extension
Location: San Jose, ( Hybrid: Tues, Wed, Thurs- in Office)
Overview:
We are seeking a highly skilled Data Engineer with strong expertise in Databricks to lead the design and execution of scalable data pipelines and architectures. This role sits at the intersection of Customer Analytics, Master Data Management (MDM), and Generative AI/LLMs. The ideal candidate also brings experience building custom applications, supporting BI integration, reverse ETL workflows, and applying user experience design principles to internal tools.
Key Responsibilities:
β’ Design, develop, and optimize advanced data pipelines using Databricks (SQL, Python, Spark, Delta Lake).
β’ Integrate and structure customer and enterprise data across systems to support analytics, AI, and MDM initiatives.
β’ Build and maintain custom internal applications and services that support data workflows and operational teams.
β’ Develop and refine prompts and workflows for LLM-based and generative AI use cases.
β’ Collaborate with MDM teams to support identity resolution and golden record management.
β’ Enable analytics delivery through Power BI, Tableau, and other BI tools.
β’ Replicate data from sources to targets via FIVETRAN
β’ Use reverse ETL tools like Census to replicate enriched data back into CRM and marketing systems.
β’ Apply user experience design principles to internal data tools, dashboards, and interfaces to improve usability and adoption.
β’ Ensure high data quality, reliability, and performance across the data stack.
Required Qualifications:
β’ Extensive hands-on experience with Databricks (Spark, Delta Lake, Lakehouse architecture, notebooks, orchestration).
β’ 3+ years of experience in data engineering, full-stack development, or a related field.
β’ Strong proficiency in SQL and data modeling best practices.
β’ Solid understanding of customer data structures, MDM concepts, and data governance.
β’ Experience working with generative AI and LLMs, including prompt engineering and AI-assisted workflows.
β’ Experience developing and deploying custom applications, APIs, or internal tools.
β’ Proficiency in cloud environments (Azure, AWS, or GCP).
Preferred Qualifications:
β’ Familiarity with Power BI, Tableau, and other business intelligence tools.
β’ Experience using Census or similar reverse ETL platforms.
β’ Background supporting marketing, sales, or customer experience analytics initiatives.
β’ Awareness of UX design best practices, especially when building internal tools or data interfaces.
β’ Strong collaboration and communication skills with both technical and business stakeholders.
Hiring Mgr is looking for candidate with exp-
β’ Data migration from Hana to AZURE using Databricks
β’ MS Dynamics knowledge/ exp.
β’ Knowledge of Hana (SQL db) , at least basics of new generation data house (data lake) eg Databricks .
β’ Good understanding ETL pipelines (ETL Tools / Reverse ETL tools : SNAPLOGIC, FIVETRAN, Sensus, Hana)
β’ Reporting dashboards.
β’ BI tech, Analytics experience in general.
β’ AI workflows (Most of Gen AI - algorithms based on Gen AI). May not necessarily work on AI tech, but need to know basis AI)
β’ Databricks - exp Administration, Build pipelines, high level expertise
β’ Azure experience