GroupA

Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is a long-term contract Data Engineer position, remote (EST hours), with a pay rate of "TBD." Requires a GC holder or USC, 5+ years of data engineering experience, strong skills in Databricks, Apache Spark, and advanced SQL.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
600
-
πŸ—“οΈ - Date
May 23, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
Remote
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#Data Access #Data Engineering #Data Pipeline #API (Application Programming Interface) #Monitoring #Batch #"ETL (Extract #Transform #Load)" #Observability #Airflow #Data Cleansing #Spark (Apache Spark) #Cloud #Azure #Databricks #GCP (Google Cloud Platform) #Datasets #SQL (Structured Query Language) #Data Quality #Scala #Databases #Data Ingestion #AWS (Amazon Web Services) #Agile #Apache Spark #Data Science #Data Processing #Data Layers
Role description
β€’ Job Type – Long Term Contract β€’ Location – Remote – EST hours β€’ Requirements - GC holder or USC required Overview We are seeking an experienced Data Engineer to design, develop, and support scalable data solutions that enable analytics, reporting, and operational data services across the organization. This role will focus on building and optimizing ETL/ELT pipelines using Databricks and Apache Spark, developing complex SQL transformations, and enabling secure and scalable data exposure through APIs. The ideal candidate brings strong hands-on technical expertise, a deep understanding of data engineering best practices, and the ability to work collaboratively across technical and business teams in an agile environment. Responsibilities: β€’ Design, develop, and maintain ETL/ELT pipelines using Databricks and Apache Spark for batch and incremental data processing β€’ Implement robust data ingestion patterns from multiple source systems including files, databases, APIs, and streaming sources where applicable β€’ Optimize Spark jobs and data pipelines for performance, scalability, reliability, and cost efficiency β€’ Ensure data quality, reconciliation, monitoring, and observability across pipelines and datasets β€’ Develop advanced SQL transformations for data cleansing, enrichment, aggregation, and reporting β€’ Design and maintain analytical data models including fact and dimension tables, curated data layers, and reporting views β€’ Support downstream reporting, analytics, and data science use cases with well-structured and scalable datasets β€’ Configure and manage data exposure through APIs for internal and external consumers β€’ Partner with application and integration teams to define API contracts and data payloads β€’ Ensure secure, scalable, and performant data access patterns β€’ Support API versioning and backward compatibility for published data services β€’ Collaborate with cross-functional teams to understand business and technical data requirements β€’ Participate in agile delivery processes and contribute to continuous improvement initiatives β€’ Document technical designs, processes, and operational procedures Requirements: β€’ 5+ years of experience in data engineering roles β€’ Strong hands-on experience with Databricks and Apache Spark β€’ Advanced SQL proficiency including complex joins, window functions, and performance tuning β€’ Experience building and managing ETL/ELT pipelines β€’ Experience configuring and supporting data exposure through APIs β€’ Solid understanding of data warehousing and analytics concepts β€’ Experience with cloud data platforms such as AWS, Azure, or GCP preferred β€’ Familiarity with CI/CD practices for data pipelines preferred β€’ Experience with orchestration tools such as Airflow or Databricks Workflows preferred β€’ Knowledge of data quality, monitoring, and reconciliation frameworks preferred β€’ Strong problem-solving and analytical skills β€’ Ability to work independently on complex data challenges β€’ Clear communication skills with both technical and non-technical stakeholders β€’ Comfortable working in an agile and product-oriented environment