Dataworks

Senior Data Engineer - Databricks

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is a Senior Data Engineer Contractor focused on energy assets, offering a 3-month remote engagement. Key skills include Databricks, PySpark, Azure, and SQL. Experience with production-grade data pipelines and data quality frameworks is required.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
February 27, 2026
πŸ•’ - Duration
3 to 6 months
-
🏝️ - Location
Remote
-
πŸ“„ - Contract
W2 Contractor
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#Databricks #Data Quality #Datasets #Spark (Apache Spark) #Azure SQL #Data Accuracy #Regression #Azure #Cloud #PySpark #Data Engineering #SQL (Structured Query Language) #Data Pipeline #Monitoring #Lean #Compliance
Role description
Senior Data Engineer Contractor US Remote | 3-Month Initial Contract | Senior / Staff Level | Energy-Focused Investment Firm ONLY APPLICATIONS VIA LINKEDIN WILL BE CONSIDERED. ALL DIRECT EMAILS WILL NOT BE READ. Tech Stack: Databrick, Pyspark, Azure, SQL Dataworks is supporting a US-based investment platform focused on energy assets in the search for a Senior Data Engineer Contractor to stabilize, harden, and evolve an existing production data platform. This is a high-impact engagement within a lean, senior team. You will not be building from scratch. You will take ownership of reliability, quality, and production discipline across an Azure-hosted data environment powering investor-grade analytics. About the Environment The client operates a cloud-based data platform running on Azure with multiple production pipelines already in place. The objective is to convert complex third-party operational datasets into reliable, versioned time-series and network-style datasets used for analytics and decision support. Primary consumers are business stakeholders and investment teams. Data accuracy, immutability, and traceability are critical. There is no real-time streaming stack. The focus is production stability, deterministic logic, and high data quality. The Role You will own the engineering backbone of the platform. Key responsibilities include: β€’ Productionizing ingestion from external vendors into structured, reliable datasets β€’ Implementing incremental processing, safe backfills, and reproducible historical replays β€’ Building and maintaining curated β€œmodel-ready” datasets with stable keys β€’ Designing and enforcing data quality gates that prevent bad publishes β€’ Ensuring deterministic aggregations and preserving time-series directionality β€’ Establishing run discipline: manifests, run IDs, lineage, rerunnable workflows β€’ Improving orchestration, monitoring, retries, and alerting β€’ Partnering with analytics stakeholders to define stable data contracts This is a senior-level role requiring autonomy. You may be the sole engineer on the project. What β€œGood” Looks Like Within 30–60 days: β€’ Daily pipeline runs are stable with clear manifests and failure alerts 2026 02 Senior data engineer Co… β€’ Data quality gates catch regressions before publishing β€’ Historical replay and backfills are routine and low risk β€’ Performance and cost are optimized as volumes scale By 90 days: β€’ The pipeline supports modeling and scenario analysis without upstream churn 2026 02 Senior data engineer Co… β€’ An operational playbook exists Ideal Profile We are looking for someone who: β€’ Has built and maintained production-grade data pipelines in the cloud β€’ Understands incremental processing, schema evolution, partitioning, and optimization β€’ Has implemented data quality frameworks and publish gates β€’ Is comfortable being highly autonomous β€’ Has experience in energy, commodities, logistics, or flow-based systems (strong plus) β€’ Can operate independently while participating in stand-ups and team alignment East Coast US timezone overlap preferred. Fully remote. Engagement structure: 1099, C2C, or W-2 depending on compliance and setup. Initial contract: 3 months with strong likelihood of extension.