

Azure Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for an Azure Data Engineer with a contract length of "unknown" and a pay rate of "unknown." Located onsite in Indianapolis, IN, candidates must have 4+ years in Azure Databricks with PySpark and strong experience in Azure Data Factory and ADLS Gen2.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
September 13, 2025
π - Project duration
Unknown
-
ποΈ - Location type
On-site
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Indianapolis, IN
-
π§ - Skills detailed
#Cloud #Monitoring #Azure #Migration #Data Layers #Metadata #NoSQL #SQL (Structured Query Language) #Databricks #Triggers #GitHub #SonarQube #Version Control #Libraries #Azure DevOps #Batch #Data Engineering #Azure Data Factory #Big Data #PySpark #Data Access #SQL Queries #Data Lakehouse #Synapse #Deployment #Azure SQL #DevOps #Spark (Apache Spark) #Pytest #Azure ADLS (Azure Data Lake Storage) #Google Cloud Storage #Python #Programming #Terraform #Automation #ADLS (Azure Data Lake Storage) #Agile #Data Migration #Azure Databricks #GIT #Data Lake #Data Processing #API (Application Programming Interface) #Airflow #Code Reviews #Security #Scrum #Data Management #Azure cloud #Kafka (Apache Kafka) #Spark SQL #Data Transformations #"ETL (Extract #Transform #Load)" #Jira #ADF (Azure Data Factory) #Informatica BDM (Big Data Management) #Vault #Business Analysis #Storage
Role description
Job Title: Azure Data Engineer
Location: Onsite to Indianapolis, IN
Job Type: Contract
Job Description:
Key technical skills :
Data management experience handling Analytics workload covering design, development, and maintainenance of lakehouse solutions sourcing data from platforms such as ERP sources, API sources, Relational stores, NoSQL and on-prem sources using Databricks/PySpark as distributed /big data management service,supporting batch and near-real-time ingestion, transformation, and processing.
Ability to optimize Spark jobs and manage large-scale data processing using RDD/DataFrame APIs.Demonstrated expertise in partitioning strategies, file format optimization (Parquet/Delta), and Spark SQL tuning. Familiarity with Databricks runtime versions, cluster policies, libraries, and workspace management.
Skilled in governing and manage data access for Azure Data lakehouse with Unity Catalog. Experience in configuring data permissions, object lineage, and access policies with Unity Catalog.Understanding of integrating Unity Catalog with Azure AD, external metastores, and audit trails.
Experience in building efficient orchestration solutions using Azure data factory, Databricks Workflows.Ability to design modular, reusable workflows using tasks, triggers, and dependencies. Skilled in using dynamic expressions, parameterized pipelines, custom activities, and triggers.
Familiarity with integration runtime configurations, pipeline performance tuning, and error handling strategies.
Strong experience in implementing secure, hierarchical namespace-based data lake storage for structured/semi-structured data, aligned to bronze-silver-gold layers with ADLS Gen2. Hands-on experience with lifecycle policies, access control (RBAC/ACLs), and folder-level security. Understanding of best practices in file partitioning, retention management, and storage performance optimization.
Capable of developing T-SQL queries, stored procedures, and managing metadata layers on Azure SQL.
Comprehensive experience working across the Azure ecosystem, including networking, security, monitoring, and cost management relevant to data engineering workloads. Understanding of VNets, Private Endpoints, Key Vaults, Managed Identities, and Azure Monitor. Exposure to DevOps tools for deployment automation (e.g., Azure DevOps, ARM/Bicep/Terraform).
Experience in writing modular, testable Python code used in data transformations, utility functions, and packaging reusable components. Familiarity with Python environments, dependency management (pip/Poetry/Conda), and packaging libraries. Ability to write unit tests using PyTest/unittest and integrate with CI/CD pipelines.
Lead solution design discussions, mentor junior engineers, and ensure adherence to coding guidelines, design patterns, and peer review processes. Able to prepare Design documents for development and guiding the team technically. Experience preparing technical design documents, HLD/LLDs, and architecture diagrams. Familiarity with code quality tools (e.g., SonarQube, pylint), and version control workflows (Git).
Demonstrates strong verbal and written communication, proactive stakeholder engagement, and a collaborative attitude in cross-functional teams. Ability to articulate technical concepts clearly to both technical and business audiences. Experience in working with product owners, QA, and business analysts to translate requirements into deliverables.
Soft skills/other skills
Communication Skills:
Communicate effectively with internal and customer stakeholders
Communication approach: verbal, emails and instant messages
Interpersonal Skills:
Strong interpersonal skills to build and maintain productive relationships with team members
Provide constructive feedback during code reviews and be open to receiving feedback on your own code.
Problem-Solving and Analytical Thinking:
Capability to troubleshoot and resolve issues efficiently.
Analytical mindset.
Task/ Work Updates
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure DevOps.
Provides regular updates, proactive and due diligent to carry out responsibilities.
Expected Outcome
We are seeking a highly skilled Data Engineering specialist with above mentioned mentioned Primary Skills to join our dynamic team who are at the forefront of enabling enterprises in Healthcare sectors.
The ideal candidate should be passionate about working on Data Engineering on Azure cloud with strong focus on DevOps practices in building product for our customers.
Effectively Communicate and Collaborate with internal teams and customer to build code leveraging or building low level design documents aligning to standard coding principles and guidelines.
Secondary Skills:
Good to have Azure Entra/AD skills and GitHub Actions.
Good to have orchestration experience using Airflow, Dagster, LogicApp.
Good to have experince working on event-driven architectures using Kafka, Azure Event Hub.
Good to have exposure on Google Cloud Pub/Sub.
Good to have experience developing and maintaining Change Data Capture (CDC) solutions preferrably using Debezium.
Good to have hands-on experience on data migration projects specifically involving Azure Synapse and Databricks Lakehouse.
Good to have experienced in managing cloud storage solutions on Azure Data Lake Storage . Experience with Google Cloud Storage will be an advantage.
Primary (Must have skills)
4+ years of experience in Azure Databricks with PySpark.
2+ years of experience in Databricks workflow & Unity catalog.
3+ years of experience in ADF (Azure Data Factory).
3+ years of experience in ADLS Gen 2.
3+ years of experience in Azure SQL.
5+ years of experience in Azure Cloud platform.
2+ years of experience in Python programming & package builds.
Educational Qualification
Any Bachelor's Degree
Job Title: Azure Data Engineer
Location: Onsite to Indianapolis, IN
Job Type: Contract
Job Description:
Key technical skills :
Data management experience handling Analytics workload covering design, development, and maintainenance of lakehouse solutions sourcing data from platforms such as ERP sources, API sources, Relational stores, NoSQL and on-prem sources using Databricks/PySpark as distributed /big data management service,supporting batch and near-real-time ingestion, transformation, and processing.
Ability to optimize Spark jobs and manage large-scale data processing using RDD/DataFrame APIs.Demonstrated expertise in partitioning strategies, file format optimization (Parquet/Delta), and Spark SQL tuning. Familiarity with Databricks runtime versions, cluster policies, libraries, and workspace management.
Skilled in governing and manage data access for Azure Data lakehouse with Unity Catalog. Experience in configuring data permissions, object lineage, and access policies with Unity Catalog.Understanding of integrating Unity Catalog with Azure AD, external metastores, and audit trails.
Experience in building efficient orchestration solutions using Azure data factory, Databricks Workflows.Ability to design modular, reusable workflows using tasks, triggers, and dependencies. Skilled in using dynamic expressions, parameterized pipelines, custom activities, and triggers.
Familiarity with integration runtime configurations, pipeline performance tuning, and error handling strategies.
Strong experience in implementing secure, hierarchical namespace-based data lake storage for structured/semi-structured data, aligned to bronze-silver-gold layers with ADLS Gen2. Hands-on experience with lifecycle policies, access control (RBAC/ACLs), and folder-level security. Understanding of best practices in file partitioning, retention management, and storage performance optimization.
Capable of developing T-SQL queries, stored procedures, and managing metadata layers on Azure SQL.
Comprehensive experience working across the Azure ecosystem, including networking, security, monitoring, and cost management relevant to data engineering workloads. Understanding of VNets, Private Endpoints, Key Vaults, Managed Identities, and Azure Monitor. Exposure to DevOps tools for deployment automation (e.g., Azure DevOps, ARM/Bicep/Terraform).
Experience in writing modular, testable Python code used in data transformations, utility functions, and packaging reusable components. Familiarity with Python environments, dependency management (pip/Poetry/Conda), and packaging libraries. Ability to write unit tests using PyTest/unittest and integrate with CI/CD pipelines.
Lead solution design discussions, mentor junior engineers, and ensure adherence to coding guidelines, design patterns, and peer review processes. Able to prepare Design documents for development and guiding the team technically. Experience preparing technical design documents, HLD/LLDs, and architecture diagrams. Familiarity with code quality tools (e.g., SonarQube, pylint), and version control workflows (Git).
Demonstrates strong verbal and written communication, proactive stakeholder engagement, and a collaborative attitude in cross-functional teams. Ability to articulate technical concepts clearly to both technical and business audiences. Experience in working with product owners, QA, and business analysts to translate requirements into deliverables.
Soft skills/other skills
Communication Skills:
Communicate effectively with internal and customer stakeholders
Communication approach: verbal, emails and instant messages
Interpersonal Skills:
Strong interpersonal skills to build and maintain productive relationships with team members
Provide constructive feedback during code reviews and be open to receiving feedback on your own code.
Problem-Solving and Analytical Thinking:
Capability to troubleshoot and resolve issues efficiently.
Analytical mindset.
Task/ Work Updates
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure DevOps.
Provides regular updates, proactive and due diligent to carry out responsibilities.
Expected Outcome
We are seeking a highly skilled Data Engineering specialist with above mentioned mentioned Primary Skills to join our dynamic team who are at the forefront of enabling enterprises in Healthcare sectors.
The ideal candidate should be passionate about working on Data Engineering on Azure cloud with strong focus on DevOps practices in building product for our customers.
Effectively Communicate and Collaborate with internal teams and customer to build code leveraging or building low level design documents aligning to standard coding principles and guidelines.
Secondary Skills:
Good to have Azure Entra/AD skills and GitHub Actions.
Good to have orchestration experience using Airflow, Dagster, LogicApp.
Good to have experince working on event-driven architectures using Kafka, Azure Event Hub.
Good to have exposure on Google Cloud Pub/Sub.
Good to have experience developing and maintaining Change Data Capture (CDC) solutions preferrably using Debezium.
Good to have hands-on experience on data migration projects specifically involving Azure Synapse and Databricks Lakehouse.
Good to have experienced in managing cloud storage solutions on Azure Data Lake Storage . Experience with Google Cloud Storage will be an advantage.
Primary (Must have skills)
4+ years of experience in Azure Databricks with PySpark.
2+ years of experience in Databricks workflow & Unity catalog.
3+ years of experience in ADF (Azure Data Factory).
3+ years of experience in ADLS Gen 2.
3+ years of experience in Azure SQL.
5+ years of experience in Azure Cloud platform.
2+ years of experience in Python programming & package builds.
Educational Qualification
Any Bachelor's Degree