Persistent Systems

Senior Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer with 12+ years of experience, focusing on data platform modernization in a regulated banking environment. The hire type is FTE/CTH, the pay rate is listed as "$X", and the role requires expertise in Apache Spark, Cloudera, SQL, and GCP. Location: Onsite in Irving, TX / Wilmington, DE.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
566
🗓️ - Date
June 3, 2026
🕒 - Duration
Unknown
🏝️ - Location
On-site
📄 - Contract
Unknown
🔒 - Security
Unknown
📍 - Location detailed
Irving, TX
🧠 - Skills detailed
#Data Architecture #Cloudera #SQL (Structured Query Language) #Logging #Spark (Apache Spark) #PySpark #Compliance #Metadata #ETL (Extract, Transform, Load) #Jenkins #SQL Server #Monitoring #Hadoop #Apache Spark #Data Quality #Automated Testing #Oracle #Scala #GCP (Google Cloud Platform) #HDFS (Hadoop Distributed File System) #Cloud #YARN (Yet Another Resource Negotiator) #Documentation #Code Reviews #DevOps #Data Engineering #Migration #Batch #RDBMS (Relational Database Management System) #GitHub #AI (Artificial Intelligence) #GIT #MS SQL (Microsoft SQL Server)
Role description
About Persistent

We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what's next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above. We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.

At Persistent, our values are more than a list of ideals to improve our corporate image. We're dedicated to building an inclusive culture that reflects what's important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.

About Position:

Experienced Senior Data Engineer (12+ years) to support large-scale data platform modernization initiatives within a regulated banking environment. The role focuses on designing and building enterprise-grade in-house frameworks, supporting high-volume batch and CDC-based incremental processing on the Cloudera platform, and enabling ongoing Google Cloud Platform (GCP) modernization efforts.

Role: Senior Data Engineer
Location: Irving, TX / Wilmington, DE (Onsite)
Hire Type: FTE/CTH

Expertise You'll Bring:

- Apache Spark (PySpark and/or Scala) in large-scale production environments
- Cloudera Hadoop ecosystem (HDFS, Hive, YARN, Spark on Cloudera)
- Strong SQL expertise with complex transformations, performance tuning, and reconciliation logic
- Enterprise RDBMS experience with Oracle and MS SQL Server
- Batch ingestion, incremental ingestion, and CDC processing patterns
- CDC concepts and tooling (tool-agnostic: GoldenGate, Debezium, or equivalent)
- Data merge, deduplication, watermarking, checkpointing, and SCD handling
- Google Cloud Platform services including Dataproc, Composer, and Dataplex
- Hybrid on-prem-to-cloud data architecture and migration patterns
- Metadata-driven framework development and data quality validation techniques
- CI/CD pipeline implementation using enterprise tooling (GitHub Actions, Jenkins, DevOps)
- Git-based development workflows, code reviews, and automated testing practices
- Experience using Copilot or similar AI-assisted development tools safely and effectively in enterprise environments
- Logging, monitoring, alerting, and operational readiness practices
- Secure coding, access control, and compliance-aware development
- Documentation of design artifacts, runbooks, and operational procedures

Benefits:

- Competitive salary and benefits package
- Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
- Annual health check-ups as well as insurance
- Group term life insurance
- Personal accident insurance
- Mediclaim hospitalization insurance for self, spouse, two children, and parents

Why Persistent is an employer of choice

- Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.
- Growth and Career Progression: learning opportunities for growth, including quarterly promotion cycles.
- One Persistent Culture: global outlook with diversity and inclusion at its core.
- Mental and Physical Wellness: employee health and mindfulness programs.