Mindlance

Big Data Developer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Big Data Developer on a 6-month contract in Charlotte, NC. Requires expertise in Python, PySpark, Spark SQL, Azure (ADF), and Databricks. Must be a US Citizen or Green Card holder.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
512
-
πŸ—“οΈ - Date
June 26, 2026
πŸ•’ - Duration
More than 6 months
-
🏝️ - Location
On-site
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#Spark (Apache Spark) #SQL (Structured Query Language) #Data Quality #"ETL (Extract #Transform #Load)" #Data Pipeline #Spark SQL #Azure Data Factory #ADF (Azure Data Factory) #Infrastructure as Code (IaC) #Data Engineering #Data Security #Azure #Azure DevOps #Big Data #Data Architecture #Data Storage #Clustering #Data Integration #GIT #Python #Storage #Vault #Code Reviews #Databricks #Scala #ADLS (Azure Data Lake Storage) #SQL Server #Azure ARM Templates (Azure Resource Manager Templates) #Programming #Security #DevOps #Airflow #Data Governance #Terraform #Apache Airflow #UAT (User Acceptance Testing) #Data Processing #PySpark #Delta Lake #AutoScaling
Role description
Job Tittle: Big Data Architect Duration: 06-month contract with possible extension Location: Charlotte, NC 28202 (100% Onsite Role) This role is only open for US Citizens or Green Card (Permanent Residents), Kindly do not apply if you are OPT, EAD, H1B or C2C candidates. Job Description: β€’ As a Big Data Architect Contractor, you will support the project team by designing and implementing large-scale data solutions to meet business needs. β€’ Design and develop scalable big data architectures to handle large volumes of data. Collaborate with stakeholders to understand data requirements and translate them into technical solutions. β€’ Implement data integration, data processing, and data storage solutions using big data technologies. Ensure data security, data quality, and data governance standards are met. β€’ Optimize data architectures for performance, scalability, and cost-efficiency. Responsibilities: β€’ Design, develop and optimize ETL/ELT pipelines using Azure Data Factory (ADF) and Databricks β€’ Write and tune PySpark / Spark SQL notebooks for large-scale data transformation β€’ Architect end-to-end data solutions across dev β†’ UAT β†’ prod environments using Unity Catalog β€’ Lead and drive design discussions with client architects and other counterparts β€’ Collaborate with different teams on data contracts and schema agreements β€’ Lead design and optimization of high-volume data pipeline. β€’ Define and enforce data engineering standards β€” naming conventions, partitioning strategies, cluster configurations, Spark tuning β€’ Drive performance optimization β€” AQE tuning, liquid clustering, broadcast joins, shuffle partition management β€’ Design Databricks cluster policies, autoscaling configurations, and cost optimization strategies β€’ Conduct root cause analysis on production incidents and implement permanent fixes β€’ Mentor junior and mid-level engineers through code reviews and pair programming β€’ Evaluate new technologies and recommend adoption (e.g., DABs, DLT, Auto Loader, Serverless Compute, event hubs) Mandatory Skills: β€’ Python, PySpark, Spark SQL, SQL Server β€’ Azure (ADF, ADLS Gen2, Key Vault, Azure Monitor) β€’ Databricks (Delta Lake, Unity Catalog, Workflows) β€’ Apache Airflow β€’ Git / Azure DevOps β€’ Deep Spark internals (DAG optimization, spill analysis, skew handling) β€’ Delta Lake advanced features (time travel, deletion vectors, predictive I/O) β€’ Unity Catalog governance (row/column security, external locations, system tables) β€’ IaC β€” Terraform, Azure ARM templates EEO: β€œMindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”