

Persistent Systems
Senior Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer with 12+ years of experience, focusing on data platform modernization in a regulated banking environment. It offers a contract length of FTE/CTH, a pay rate of "$X", and requires expertise in Apache Spark, Cloudera, SQL, and GCP. Location: Onsite in Irving, TX / Wilmington, DE.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
566
-
ποΈ - Date
June 3, 2026
π - Duration
Unknown
-
ποΈ - Location
On-site
-
π - Contract
Unknown
-
π - Security
Unknown
-
π - Location detailed
Irving, TX
-
π§ - Skills detailed
#Data Architecture #Cloudera #SQL (Structured Query Language) #Logging #Spark (Apache Spark) #PySpark #Compliance #Metadata #"ETL (Extract #Transform #Load)" #Jenkins #SQL Server #Monitoring #Hadoop #Apache Spark #Data Quality #Automated Testing #Oracle #Scala #GCP (Google Cloud Platform) #HDFS (Hadoop Distributed File System) #Cloud #YARN (Yet Another Resource Negotiator) #Documentation #Code Reviews #DevOps #Data Engineering #Migration #Batch #RDBMS (Relational Database Management System) #GitHub #AI (Artificial Intelligence) #GIT #MS SQL (Microsoft SQL Server)
Role description
About Persistent
We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate whatβs next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above.
We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.
At Persistent, our values are more than a list of ideals to improve our corporate image. Weβre dedicated to building an inclusive culture that reflects whatβs important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.
About Position: Experienced Senior Data Engineer (12+ Years) to support large scale data platform modernization initiatives within a regulated banking environment.
The role focuses on designing and building enterprise-grade in-house frameworks, supporting high-volume batch and CDC-based incremental processing using Cloudera platform, and enabling ongoing Google Cloud Platform (GCP) modernization efforts
About Position
Role: Senior Data Engineer
Location: Irving, TX / Wilmington, DE (Onsite)
Hire Type : FTE/CTH
What You'll
Apache Spark (PySpark and/or Scala) in large-scale production environments
Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)
Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic
Enterprise RDBMS experience with Oracle and MS SQL Server
Batch ingestion, incremental ingestion, and CDC processing patterns
CDC concepts and tooling (tool-agnostic: Golden Gate, Debezium, or equivalent)
Data merge, deduplication, watermarking, checkpointing, and SCD handling
Google Cloud Platform services including Dataproc , Composer and Dataplex
Hybrid on prem to cloud data architecture and migration patterns
Metadata-driven framework development and data quality validation techniques
CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)
Git-based development workflows, code reviews, and automated testing practices
Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments
Logging, monitoring, alerting, and operational readiness practices
Secure coding, access control, and compliance-aware development
Documentation of design artifacts, runbooks, and operational procedures
Expertise You'll :
Apache Spark (PySpark and/or Scala) in large-scale production environments
Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)
Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic
Enterprise RDBMS experience with Oracle and MS SQL Server
Batch ingestion, incremental ingestion, and CDC processing patterns
CDC concepts and tooling (tool-agnostic: GoldenGate, Debezium, or equivalent)
Data merge, deduplication, watermarking, checkpointing, and SCD handling
Google Cloud Platform services including Dataproc , Composer and Dataplex
Hybrid on prem to cloud data architecture and migration patterns
Metadata-driven framework development and data quality validation techniques
CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)
Git-based development workflows, code reviews, and automated testing practices
Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments
Logging, monitoring, alerting, and operational readiness practices
Secure coding, access control, and compliance-aware development
Documentation of design artifacts, runbooks, and operational procedures
Benefits:
Competitive salary and benefits package Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
Opportunity to work with cutting-edge technologies
Employee engagement initiatives such as project parties, flexible work hours, and βLong Service awards Annual health check-ups as well as insurance
Group term life insurance Personal accident insurance
Mediclaim hospitalization insurance for self, spouse, two children, and parents
Why Persistent is an employer of choice
Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.
Growth and Career Progression: learning opportunities for growth, including quarterly promotion
cycles.
One Persistent Culture: global outlook with diversity and inclusion at it core.
Mental and Physical Wellness: employee health and mindfulness programs
About Persistent
We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate whatβs next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above.
We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.
At Persistent, our values are more than a list of ideals to improve our corporate image. Weβre dedicated to building an inclusive culture that reflects whatβs important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.
About Position: Experienced Senior Data Engineer (12+ Years) to support large scale data platform modernization initiatives within a regulated banking environment.
The role focuses on designing and building enterprise-grade in-house frameworks, supporting high-volume batch and CDC-based incremental processing using Cloudera platform, and enabling ongoing Google Cloud Platform (GCP) modernization efforts
About Position
Role: Senior Data Engineer
Location: Irving, TX / Wilmington, DE (Onsite)
Hire Type : FTE/CTH
What You'll
Apache Spark (PySpark and/or Scala) in large-scale production environments
Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)
Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic
Enterprise RDBMS experience with Oracle and MS SQL Server
Batch ingestion, incremental ingestion, and CDC processing patterns
CDC concepts and tooling (tool-agnostic: Golden Gate, Debezium, or equivalent)
Data merge, deduplication, watermarking, checkpointing, and SCD handling
Google Cloud Platform services including Dataproc , Composer and Dataplex
Hybrid on prem to cloud data architecture and migration patterns
Metadata-driven framework development and data quality validation techniques
CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)
Git-based development workflows, code reviews, and automated testing practices
Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments
Logging, monitoring, alerting, and operational readiness practices
Secure coding, access control, and compliance-aware development
Documentation of design artifacts, runbooks, and operational procedures
Expertise You'll :
Apache Spark (PySpark and/or Scala) in large-scale production environments
Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)
Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic
Enterprise RDBMS experience with Oracle and MS SQL Server
Batch ingestion, incremental ingestion, and CDC processing patterns
CDC concepts and tooling (tool-agnostic: GoldenGate, Debezium, or equivalent)
Data merge, deduplication, watermarking, checkpointing, and SCD handling
Google Cloud Platform services including Dataproc , Composer and Dataplex
Hybrid on prem to cloud data architecture and migration patterns
Metadata-driven framework development and data quality validation techniques
CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)
Git-based development workflows, code reviews, and automated testing practices
Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments
Logging, monitoring, alerting, and operational readiness practices
Secure coding, access control, and compliance-aware development
Documentation of design artifacts, runbooks, and operational procedures
Benefits:
Competitive salary and benefits package Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
Opportunity to work with cutting-edge technologies
Employee engagement initiatives such as project parties, flexible work hours, and βLong Service awards Annual health check-ups as well as insurance
Group term life insurance Personal accident insurance
Mediclaim hospitalization insurance for self, spouse, two children, and parents
Why Persistent is an employer of choice
Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.
Growth and Career Progression: learning opportunities for growth, including quarterly promotion
cycles.
One Persistent Culture: global outlook with diversity and inclusion at it core.
Mental and Physical Wellness: employee health and mindfulness programs






