Neotech Global

Databrick Architect

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Databricks Architect in Los Angeles, CA or New York, requiring 7+ years in data engineering, 3+ years with Databricks, and expertise in cloud platforms. Contract length and pay rate are unspecified.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
June 27, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
On-site
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Los Angeles, CA
-
🧠 - Skills detailed
#GIT #MLflow #Automation #SQL (Structured Query Language) #Azure Data Factory #PySpark #Automated Testing #Compliance #GCP (Google Cloud Platform) #Data Pipeline #Documentation #DevOps #Lambda (AWS Lambda) #Dataflow #REST (Representational State Transfer) #Data Processing #Observability #S3 (Amazon Simple Storage Service) #"ACID (Atomicity #Consistency #Isolation #Durability)" #Scala #Databricks #Vault #Cloud #Data Engineering #AWS (Amazon Web Services) #Azure DevOps #"ETL (Extract #Transform #Load)" #Storage #Data Quality #Spark SQL #Terraform #Leadership #AWS S3 (Amazon Simple Storage Service) #AWS Glue #Data Ingestion #ADLS (Azure Data Lake Storage) #Azure #BI (Business Intelligence) #Data Science #ADF (Azure Data Factory) #Delta Lake #ML (Machine Learning) #REST API #Monitoring #Security #Spark (Apache Spark)
Role description
Role : Databrick Architect Location : Los Angeles CA or New YorkΒ (onsite) Job Description: The Databricks Architect is responsible for designing, implementing, and optimizing scalable data analytics and data engineering solutions on the Databricks Lakehouse Platform. This role requires deep expertise in cloud platforms (Azure/AWS/GCP), distributed data processing, Delta Lake architectures, and modern data engineering practices. The architect will collaborate with cross-functional teams to define data strategies, ensure platform reliability, and enable advanced analytics, ML, and BI use cases. Key Responsibilities Architecture & Design Design end-to-end Databricks Lakehouse architectures for data ingestion, processing, storage, and consumption. Define and implement Delta Lake patterns, including medallion architecture (Bronze/Silver/Gold). Develop scalable data pipelines using PySpark, Spark SQL, and Databricks workflows. Architect solutions for structured, semi-structured, and unstructured data. Engineering & Implementation Build robust ETL/ELT pipelines with Databricks notebooks, jobs, and workflows. Design and implement high-performance streaming solutions using Structured Streaming. Optimize Spark jobs for cost, performance, and scalability. Implement CI/CD and automation using Databricks Repos, Git, and DevOps pipelines. Cloud & Platform Expertise Architect solutions across Azure/AWS/GCP leveraging native cloud services (e.g., Azure Data Factory, AWS Glue, GCP Dataflow). Ensure security, governance, and compliance through Unity Catalog, RBAC, and encryption. Monitor workloads and optimize cluster configurations for performance and cost. Collaboration & Leadership Work closely with data engineers, data scientists, BI teams, and business stakeholders. Act as a subject matter expert (SME) for Databricks best practices, standards, and patterns. Conduct architectural reviews and guide teams on design decisions. Lead PoCs, evaluate new features, and drive platform adoption. Quality, Governance & Observability Define standards for data quality, lineage, observability, and governance. Implement automated testing frameworks for pipelines and notebooks. Establish performance baselines and monitoring dashboards. Required Skills & Experience Technical Skills 7+ years of experience in data engineering/architecture. 3+ years of hands-on experience with Databricks. Strong expertise in Spark, PySpark, SQL, and distributed data processing. Deep understanding of Delta Lake features: ACID transactions, OPTIMIZE, ZORDER, Auto Loader. Experience with workflow orchestration, jobs, and Databricks REST APIs. Hands-on expertise with at least one cloud platform: Azure (preferred): ADF, ADLS, Key Vault, Event Hub, Azure DevOps AWS: S3, Glue, Lambda, Kinesis GCP: GCS, Dataflow, Pub/Sub Familiarity with CI/CD, Git, DevOps, and Infrastructure-as-Code (Terraform preferred). Soft Skills Strong analytical and problem-solving skills. Excellent communication and stakeholder management. Ability to lead design discussions and guide technical teams. Strong documentation and architectural blueprinting skills. Preferred Qualifications Databricks certifications, such as: Databricks Certified Data Engineer Professional Databricks Certified Machine Learning Professional Databricks Lakehouse Fundamentals Experience with MLflow, Feature Store, or MLOps workflows. Experience working in regulated industries (BFSI, healthcare, etc.). Thanks Suresh Jaami Technical recruiter Email: Suresh.j@neotechusa.com