E-Solutions

Sr. BIgdata Engineer - Local to NC Only

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Sr. Bigdata Engineer in Charlotte, NC, with a contract length of "unknown" and a pay rate of "unknown." Key skills include Python/PySpark, Hadoop, Airflow, and strong SQL. Experience in ETL migration and regulated environments is preferred.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
May 23, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
On-site
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Charlotte, NC
-
🧠 - Skills detailed
#Apache Airflow #Data Engineering #Logging #Hadoop #Infrastructure as Code (IaC) #PySpark #Clustering #Monitoring #Batch #"ETL (Extract #Transform #Load)" #Storage #Airflow #Python #Data Modeling #Spark (Apache Spark) #Data Mapping #ML (Machine Learning) #Data Storage #Debugging #SQL Server #AI (Artificial Intelligence) #Docker #Metadata #GIT #SSIS (SQL Server Integration Services) #SQL (Structured Query Language) #DevOps #YARN (Yet Another Resource Negotiator) #Data Lake #Migration #REST (Representational State Transfer) #Indexing #HDFS (Hadoop Distributed File System) #MongoDB #Programming #Scala #Databases #Compliance #Big Data #Jenkins
Role description
Role : Sr. BIgdata Engineer Location : Charlotte, NC (Onsite) Required Skills & Experience Programming: Python/PySpark, Scala is a plus Big Data: Hadoop (HDFS, YARN), Hive, Spark (optimization, tuning) Orchestration: Apache Airflow Databases/ETL: MongoDB (indexing, sharding, tuning) SQL Server & SSIS (development, migration) Strong SQL & stored procedures Data Lake: HDFS, Hive, Parquet/ORC, partitioning, compaction APIs: REST-based ingestion Reverse engineering & lineage tools CI/CD & DevOps: Git, Jenkins, Docker, IaC Monitoring: logging, metrics, lineage Key Responsibilities Reverse Engineering & Data Mapping Reverse engineer ETL pipelines (SSIS, Spark, stored procedures) to document data flows, logic, and transformations. Perform detailed source-to-target mappings with field-level transformations and business rules. Build data dictionaries, lineage, and mapping artifacts. Collaborate with SMEs to uncover undocumented logic. Identify data model gaps and recommend remediation. ETL Pipeline Remediation Design and refactor pipelines aligned to new source APIs and data contracts. Re-engineer ETL for 1:1 functional parity during migrations. Implement schema evolution, transformations, and mapping changes (batch & streaming). Eliminate redundancy and optimize legacy logic. Build modular, reusable pipelines using Spark/PySpark/Scala. Modernize SSIS and integrate with orchestration frameworks. Orchestrate workflows in Airflow (DAGs, dependencies, SLAs). Implement logging, error handling, alerting, and metadata capture. Data Storage Optimization Simplify schemas; remove redundant/obsolete data across Hive and MongoDB. Optimize partitioning, clustering, and file formats (Parquet, ORC, Avro). Redesign MongoDB indexing, sharding, and collections. Tune HDFS, Hive, MongoDB, and SQL Server for performance and cost. Implement lifecycle management, archival, and retention. Functional Skills • Experience in ETL migration/remediation projects • Strong reverse engineering of legacy ETL (SSIS, Spark, scripts) • Expertise in STM, transformation specs, and lineage artifacts • Data modeling (dimensional, normalized, denormalized) • Schema evolution and zero-downtime migrations • Performance tuning across compute and storage layers • Strong debugging and problem-solving for distributed systems Preferred Qualifications AI/ML-assisted ETL remediation or code conversion Experience with Wiz or Palo Alto Prisma (APIs, data models, risk metrics) Prior Prisma to Wiz (or similar CSPM/CNAPP) migrations Knowledge of CSPM/CNAPP domains (vulnerabilities, identities, exposures) Experience in regulated, compliance-heavy environments