

Data Engineer (Scala/Spark/AWS EMR)
Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer (Scala/Spark/AWS EMR) on a 6-month contract in Durham, NC, requiring 10+ years of experience, strong skills in Scala, Apache Spark, and AWS EMR, and a background in secure financial data aggregation.
Country: United States
Currency: $ USD
Day rate: -
Date discovered: July 9, 2025
Project duration: More than 6 months
Location type: Hybrid
Contract type: Unknown
Security clearance: Unknown
Location detailed: Raleigh-Durham-Chapel Hill Area
Skills detailed: #Computer Science #Cloud #Spark (Apache Spark) #Databases #Java #AWS (Amazon Web Services) #Scala #Data Pipeline #Data Processing #Data Aggregation #Data Engineering #"ETL (Extract, Transform, Load)" #AWS EMR (Amazon Elastic MapReduce) #REST (Representational State Transfer) #NoSQL #Big Data #Datasets #Snowflake #Batch #Kafka (Apache Kafka)
Role description
Job Title: Data Engineer - Leave of Absence Coverage (6-Month Contract)
Location: Durham, NC - Hybrid (2 Weeks Onsite per Month)
We're looking for an experienced Data Engineer to provide 6 months of coverage for a team focused on secure financial data aggregation, powering customer access to their financial data across third-party platforms like Mint and Morningstar.
You'll work with batch systems that collect data from multiple business lines (brokerage, 401(k), stock plans, etc.) and deliver it via secure APIs based on consumer consent. The environment is currently on-premises, but the role involves modernizing with cloud and big data technologies. A rough sketch of this kind of consent-gated batch aggregation appears below.
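The posting itself contains no code, so the following is only a minimal Scala/Spark sketch of the consent-gated batch aggregation described above, not the team's actual implementation. All paths, schemas, and names here (the landing directories, the consents table, the customer_id join key) are illustrative assumptions.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.lit

object ConsentedAggregationJob {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("consented-financial-aggregation")
      .getOrCreate()

    // One landing path per business line (hypothetical locations).
    val sources = Map(
      "brokerage"  -> "/data/landing/brokerage",
      "retirement" -> "/data/landing/retirement",
      "stock_plan" -> "/data/landing/stock_plan"
    )

    // Tag each extract with its business line and union everything by column name.
    val combined: DataFrame = sources
      .map { case (line, path) =>
        spark.read.parquet(path).withColumn("business_line", lit(line))
      }
      .reduce(_ unionByName _)

    // Keep only records for customers who have consented to third-party
    // sharing (the consent table and join key are illustrative assumptions).
    val consents = spark.read.parquet("/data/reference/consents")
    val shareable = combined.join(consents, Seq("customer_id"), "inner")

    shareable.write.mode("overwrite").parquet("/data/curated/aggregated")
    spark.stop()
  }
}
```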
Must-Have Skills
• Strong data engineering background
• Expert-level Scala
• Solid experience with Apache Spark and Amazon EMR
• Experience with ETL pipelines and Control-M job scheduling
• Familiarity with Cassandra or other distributed NoSQL databases
• Some Java knowledge
Nice to Have
• Experience with streaming platforms (especially Kafka)
Responsibilities
• Design and develop scalable data pipelines for aggregating customer financial data
• Build and maintain batch Spark and Spring Batch ETL jobs
• Monitor and troubleshoot ETL workflows using Control-M
• Consume and integrate REST/SOAP APIs
• Work on data processing using Apache Spark on AWS EMR (an illustrative job skeleton follows this list)
• Store large datasets efficiently using Parquet, Cassandra, or similar tools
• Collaborate across teams and advocate for software engineering best practices
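To make the Spark-on-EMR and storage bullets concrete, here is an illustrative skeleton of one batch ETL step: extract from a landing zone, normalize, then write partitioned Parquet plus a Cassandra table. Every path, keyspace, table, and column name is a placeholder, and the spark-cassandra-connector dependency is an assumption rather than a stated part of the stack.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, to_date}

object PositionsEtl {
  def main(args: Array[String]): Unit = {
    // On EMR, a job like this would typically be launched via spark-submit
    // as a cluster step; Control-M would schedule and monitor that step.
    val spark = SparkSession.builder()
      .appName("positions-etl")
      .getOrCreate()

    // Extract: raw daily position records (hypothetical path and schema).
    val raw = spark.read.parquet("/data/landing/positions")

    // Transform: basic cleansing plus a derived partition column.
    val cleaned = raw
      .filter(col("account_id").isNotNull)
      .withColumn("as_of_date", to_date(col("as_of_ts")))

    // Load 1: partitioned Parquet for analytical consumers.
    cleaned.write
      .mode("overwrite")
      .partitionBy("as_of_date")
      .parquet("/data/curated/positions")

    // Load 2: Cassandra for low-latency API reads
    // (assumes the spark-cassandra-connector is on the classpath).
    cleaned.write
      .format("org.apache.spark.sql.cassandra")
      .options(Map("keyspace" -> "aggregation", "table" -> "positions"))
      .mode("append")
      .save()

    spark.stop()
  }
}
```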
Required Experience
• Bachelor's degree in Computer Science or a related field
• 10+ years of hands-on development experience with Spark or Spring Batch
• Experience with cloud data tools such as AWS and Snowflake
• Proven experience working with big data solutions and distributed systems