

Dataworks
Senior Data Engineer - Databricks
β - Featured Role | Apply direct with Data Freelance Hub
This role is a Senior Data Engineer Contractor focused on energy assets, offering a 3-month remote engagement. Key skills include Databricks, PySpark, Azure, and SQL. Experience with production-grade data pipelines and data quality frameworks is required.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
February 27, 2026
π - Duration
3 to 6 months
-
ποΈ - Location
Remote
-
π - Contract
W2 Contractor
-
π - Security
Unknown
-
π - Location detailed
United States
-
π§ - Skills detailed
#Databricks #Data Quality #Datasets #Spark (Apache Spark) #Azure SQL #Data Accuracy #Regression #Azure #Cloud #PySpark #Data Engineering #SQL (Structured Query Language) #Data Pipeline #Monitoring #Lean #Compliance
Role description
Senior Data Engineer Contractor
US Remote | 3-Month Initial Contract | Senior / Staff Level | Energy-Focused Investment Firm
ONLY APPLICATIONS VIA LINKEDIN WILL BE CONSIDERED. ALL DIRECT EMAILS WILL NOT BE READ.
Tech Stack: Databrick, Pyspark, Azure, SQL
Dataworks is supporting a US-based investment platform focused on energy assets in the search for a Senior Data Engineer Contractor to stabilize, harden, and evolve an existing production data platform.
This is a high-impact engagement within a lean, senior team. You will not be building from scratch. You will take ownership of reliability, quality, and production discipline across an Azure-hosted data environment powering investor-grade analytics.
About the Environment
The client operates a cloud-based data platform running on Azure with multiple production pipelines already in place. The objective is to convert complex third-party operational datasets into reliable, versioned time-series and network-style datasets used for analytics and decision support.
Primary consumers are business stakeholders and investment teams. Data accuracy, immutability, and traceability are critical.
There is no real-time streaming stack. The focus is production stability, deterministic logic, and high data quality.
The Role
You will own the engineering backbone of the platform. Key responsibilities include:
β’ Productionizing ingestion from external vendors into structured, reliable datasets
β’ Implementing incremental processing, safe backfills, and reproducible historical replays
β’ Building and maintaining curated βmodel-readyβ datasets with stable keys
β’ Designing and enforcing data quality gates that prevent bad publishes
β’ Ensuring deterministic aggregations and preserving time-series directionality
β’ Establishing run discipline: manifests, run IDs, lineage, rerunnable workflows
β’ Improving orchestration, monitoring, retries, and alerting
β’ Partnering with analytics stakeholders to define stable data contracts
This is a senior-level role requiring autonomy. You may be the sole engineer on the project.
What βGoodβ Looks Like
Within 30β60 days:
β’ Daily pipeline runs are stable with clear manifests and failure alerts 2026 02 Senior data engineer Coβ¦
β’ Data quality gates catch regressions before publishing
β’ Historical replay and backfills are routine and low risk
β’ Performance and cost are optimized as volumes scale
By 90 days:
β’ The pipeline supports modeling and scenario analysis without upstream churn 2026 02 Senior data engineer Coβ¦
β’ An operational playbook exists
Ideal Profile
We are looking for someone who:
β’ Has built and maintained production-grade data pipelines in the cloud
β’ Understands incremental processing, schema evolution, partitioning, and optimization
β’ Has implemented data quality frameworks and publish gates
β’ Is comfortable being highly autonomous
β’ Has experience in energy, commodities, logistics, or flow-based systems (strong plus)
β’ Can operate independently while participating in stand-ups and team alignment
East Coast US timezone overlap preferred. Fully remote.
Engagement structure: 1099, C2C, or W-2 depending on compliance and setup.
Initial contract: 3 months with strong likelihood of extension.
Senior Data Engineer Contractor
US Remote | 3-Month Initial Contract | Senior / Staff Level | Energy-Focused Investment Firm
ONLY APPLICATIONS VIA LINKEDIN WILL BE CONSIDERED. ALL DIRECT EMAILS WILL NOT BE READ.
Tech Stack: Databrick, Pyspark, Azure, SQL
Dataworks is supporting a US-based investment platform focused on energy assets in the search for a Senior Data Engineer Contractor to stabilize, harden, and evolve an existing production data platform.
This is a high-impact engagement within a lean, senior team. You will not be building from scratch. You will take ownership of reliability, quality, and production discipline across an Azure-hosted data environment powering investor-grade analytics.
About the Environment
The client operates a cloud-based data platform running on Azure with multiple production pipelines already in place. The objective is to convert complex third-party operational datasets into reliable, versioned time-series and network-style datasets used for analytics and decision support.
Primary consumers are business stakeholders and investment teams. Data accuracy, immutability, and traceability are critical.
There is no real-time streaming stack. The focus is production stability, deterministic logic, and high data quality.
The Role
You will own the engineering backbone of the platform. Key responsibilities include:
β’ Productionizing ingestion from external vendors into structured, reliable datasets
β’ Implementing incremental processing, safe backfills, and reproducible historical replays
β’ Building and maintaining curated βmodel-readyβ datasets with stable keys
β’ Designing and enforcing data quality gates that prevent bad publishes
β’ Ensuring deterministic aggregations and preserving time-series directionality
β’ Establishing run discipline: manifests, run IDs, lineage, rerunnable workflows
β’ Improving orchestration, monitoring, retries, and alerting
β’ Partnering with analytics stakeholders to define stable data contracts
This is a senior-level role requiring autonomy. You may be the sole engineer on the project.
What βGoodβ Looks Like
Within 30β60 days:
β’ Daily pipeline runs are stable with clear manifests and failure alerts 2026 02 Senior data engineer Coβ¦
β’ Data quality gates catch regressions before publishing
β’ Historical replay and backfills are routine and low risk
β’ Performance and cost are optimized as volumes scale
By 90 days:
β’ The pipeline supports modeling and scenario analysis without upstream churn 2026 02 Senior data engineer Coβ¦
β’ An operational playbook exists
Ideal Profile
We are looking for someone who:
β’ Has built and maintained production-grade data pipelines in the cloud
β’ Understands incremental processing, schema evolution, partitioning, and optimization
β’ Has implemented data quality frameworks and publish gates
β’ Is comfortable being highly autonomous
β’ Has experience in energy, commodities, logistics, or flow-based systems (strong plus)
β’ Can operate independently while participating in stand-ups and team alignment
East Coast US timezone overlap preferred. Fully remote.
Engagement structure: 1099, C2C, or W-2 depending on compliance and setup.
Initial contract: 3 months with strong likelihood of extension.






