

Lead PySpark/Iceberg Engineer
Featured Role | Apply directly with Data Freelance Hub
This role is for a Lead PySpark/Iceberg Engineer in the Banking/Financial industry, based in Charlotte, NC. The contract lasts 24+ months with a competitive pay rate. Key skills include 5+ years in SQL and 3+ years in PySpark, Iceberg, and Airflow.
Country: United States
Currency: $ USD
Day rate: 680
Date discovered: August 8, 2025
Project duration: More than 6 months
Location type: Hybrid
Contract type: Unknown
Security clearance: Unknown
Location detailed: Charlotte, NC
Skills detailed: #Web Services #Java #ETL (Extract, Transform, Load) #GIT #Scala #Grafana #Data Modeling #Kafka (Apache Kafka) #SQL (Structured Query Language) #Python #Ab Initio #Batch #Airflow #Compliance #Data Pipeline #Kubernetes #Agile #S3 (Amazon Simple Storage Service) #PySpark #Prometheus #Visual Studio #Docker #Data Engineering #Scrum #Deployment #Documentation #Migration #Monitoring #Data Ingestion #Spark (Apache Spark) #Jira #GitHub #Kanban #Data Governance
Role description
Please find details for this position below:
Client: Banking/Financial Industry
Title: Lead PySpark/Iceberg Engineer / Lead Data Engineer - PySpark/Iceberg
Location: Charlotte, NC - Hybrid role (in-person interview)
Duration: 24+ months, extend or convert based on performance
Job Description:
• We are seeking a highly skilled and adaptable Lead Software Engineer to join our Counterparty Credit Risk organization.
• This role supports Data Services, with a focus on modernizing legacy systems, managing high-volume data pipelines, and contributing to full-stack application development.
• You will be a team member supporting business-as-usual (BAU) processes while also contributing to the development of a new data platform over the next two years. The ideal candidate is a strong communicator, a proactive problem-solver, and comfortable working in a Kanban Agile environment.
Key Responsibilities:
• Lead Agile Development: Guide and support multiple Agile teams focused on data extraction, ingestion, and transformation.
• Modernize Legacy Systems: Migrate data pipelines from Ab Initio and legacy file systems to modern technologies such as PySpark, S3, Airflow, Parquet, and Iceberg (a minimal sketch of this target stack follows this list).
• Full-Stack Engineering: Design and develop scalable backend services using PySpark and Python.
• Data Platform Enablement: Support ingestion of 300+ data feeds into the platform to ensure timely nightly batch processing.
• Cross-Functional Collaboration: Partner with business stakeholders and product owners to understand requirements and deliver effective solutions.
• Agile Execution: Work with both Kanban and Scrum teams, participate in regular check-ins, and manage tasks via Jira.
• Platform Transition Support: Contribute to the migration from legacy systems to a new data platform over the next two years.
• BAU and Strategic Support: Balance business-as-usual responsibilities while contributing to long-term platform modernization.
• Documentation and Data Modeling: Maintain clear technical documentation and demonstrate a strong understanding of columnar data structures.
• Experience with different file formats (Parquet, ORC, Avro).
• Experience with containerized deployments using Docker/Kubernetes.
• Java background would be a plus.
• Good knowledge of large-scale ETL frameworks.
• Experience with the Ab Initio ETL tool.
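For orientation, here is a minimal, illustrative sketch of the PySpark-to-Iceberg pattern this modernization describes. All names (catalog, bucket, table, key column) are assumptions for illustration, not details of the client environment, and the iceberg-spark-runtime package is assumed to be on the Spark classpath:

from pyspark.sql import SparkSession

# Register an Iceberg catalog backed by S3 (bucket and warehouse path are hypothetical).
spark = (
    SparkSession.builder.appName("feed-ingest-sketch")
    .config("spark.sql.catalog.demo", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.demo.type", "hadoop")
    .config("spark.sql.catalog.demo.warehouse", "s3a://example-bucket/warehouse")
    .getOrCreate()
)

# Read one incoming feed from columnar Parquet landing files (path is hypothetical).
feed = spark.read.parquet("s3a://example-bucket/landing/feed_001/")

# Deduplicate on a hypothetical key, then append a new snapshot to an existing
# Iceberg table, so each nightly batch remains auditable and reversible.
feed.dropDuplicates(["record_id"]).writeTo("demo.risk.feed_001").append()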
Top Skills:
• 5 years of experience as an engineer
• 5 years of SQL engineering
• 2-3 years PySpark
• 2-3 years Iceberg
• 2-3 years Parquet
• S3
• Airflow
• The application is legacy Ab Initio, transitioning to Python and PySpark; Autosys is also being replaced with Airflow for job scheduling (see the sketch after this list). The team is scaling up to meet growing demand for data as the asset cap lifts. This person will work on the new project after first learning the existing platform, on a team with 7-8 onshore engineers and 6-7 offshore.
• Will do heads-down engineering but needs to be able to interact with the existing team and coordinate with them when implementing the new system. Will join daily scrum calls, reviewing work orders in order.
• Agile experience is not required but is good to have.
• Banking background is good but not a must.
• Certifications are a plus.
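As a point of reference for the Autosys-to-Airflow transition described above, a minimal nightly DAG might fan one task out per feed. The DAG id, schedule, feed names, and ingest callable are all illustrative assumptions, not details of the client's setup:

from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def ingest_feed(feed_name: str) -> None:
    # Stand-in for launching the real PySpark ingestion job (e.g., via spark-submit).
    print(f"ingesting {feed_name}")


with DAG(
    dag_id="nightly_feed_ingestion",
    start_date=datetime(2025, 1, 1),
    schedule="0 2 * * *",  # assumed nightly batch window (Airflow 2.4+ syntax)
    catchup=False,
):
    # Two placeholder feeds stand in for the 300+ feeds mentioned in the posting.
    for feed_name in ["feed_001", "feed_002"]:
        PythonOperator(
            task_id=f"ingest_{feed_name}",
            python_callable=ingest_feed,
            op_kwargs={"feed_name": feed_name},
        )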
Required Skills & Experience
Top Technical Skills
• 3+ years of experience with PySpark, S3, Iceberg, Git, Python, Airflow, and Parquet
• 5+ years of experience with SQL
• Experience with Agile methodologies and tools like Jira
• Familiarity with Kafka
• Experience with GitHub Copilot, Web Services, Visual Studio, IntelliJ, and Gradle
• Experience with monitoring tools like Grafana or Prometheus
Preferred Qualifications
• Proven experience leading Agile teams and mentoring junior developers
• Strong communication skills and the ability to collaborate with business stakeholders
• Comfortable working in both Scrum and Kanban models with frequent scrum check-ins
• Ability to identify blockers and proactively seek help when needed
• Experience working in a regulated environment with a focus on compliance and data governance
• 2+ years of working with Ab Initio graphs and plans
Team Structure & Projects
• You will be part of a team that handles 300+ data feeds, ensuring timely ingestion for nightly batch processing.
• The role will focus on Data Services, modernizing data ingestion pipelines.
EEO:
Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.