Cyberobotix

Data Engineer

โญ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer in Berkeley Heights, NJ, with a contract length of "unknown." Pay rate is "unknown." Key skills include AWS, Python, PySpark, SQL, and experience with data modeling and big data technologies.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
๐Ÿ—“๏ธ - Date
April 23, 2026
🕒 - Duration
Unknown
-
๐Ÿ๏ธ - Location
On-site
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
๐Ÿ“ - Location detailed
New Jersey, United States
-
🧠 - Skills detailed
#Scala #SQL (Structured Query Language) #Spark (Apache Spark) #PySpark #SageMaker #Security #Spark SQL #Data Quality #Visualization #Lambda (AWS Lambda) #Big Data #AWS (Amazon Web Services) #Microsoft Power BI #Data Engineering #ML (Machine Learning) #S3 (Amazon Simple Storage Service) #Terraform #BI (Business Intelligence) #TensorFlow #Data Pipeline #Data Lake #AI (Artificial Intelligence) #Infrastructure as Code (IaC) #ETL (Extract, Transform, Load) #Snowflake #Redshift #Data Science #Apache Spark #Python #Data Warehouse #Athena #Data Modeling #AWS S3 (Amazon Simple Storage Service) #Cloud #Hadoop #Batch
Role description
Job Title: Lead Data Engineer / Sr. Data Engineer
Location: Berkeley Heights, NJ

Key Skills Required
· AWS (S3, Redshift, Glue, Lambda, EMR, Athena)
· Data Engineering & Data Modeling (Star Schema, Snowflake, Dimensional Modeling)
· Python, PySpark, SQL
· Big Data Technologies (Hadoop, Spark)
· Infrastructure as Code (Terraform)
· AI/ML integration basics
· Visualization tools (Power BI)

Roles & Responsibilities
· Design, develop, and maintain scalable data pipelines for batch and real-time processing using AWS services
· Build and optimize data lakes and data warehouses using Amazon S3, Redshift, and Glue
· Develop robust ETL/ELT pipelines using Python, PySpark, and SQL
· Implement efficient data modeling techniques such as star schema and dimensional modeling
· Work with large-scale distributed systems using Hadoop and Apache Spark
· Integrate AI/ML models into data pipelines to support advanced analytics
· Automate infrastructure provisioning using Terraform (IaC)
· Ensure data quality, governance, and security across pipelines
· Collaborate with cross-functional teams including data scientists, analysts, and business stakeholders
· Develop dashboards and reports using Power BI for business insights
· Monitor and optimize performance of data pipelines and cloud resources
· Exposure to AI/ML frameworks (SageMaker, TensorFlow, etc.)
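As a rough illustration of the star-schema modeling this role calls for (all table and column names below are hypothetical examples, not taken from the employer), a minimal dimensional model with one fact table and two dimension tables can be sketched in plain SQL, here using Python's built-in sqlite3 so the sketch is self-contained:

```python
import sqlite3

# Minimal star-schema sketch: one additive fact table joined to two
# dimension tables via surrogate keys. All names are illustrative.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

cur.executescript("""
CREATE TABLE dim_date (
    date_key  INTEGER PRIMARY KEY,  -- surrogate key, e.g. 20260423
    full_date TEXT,
    year      INTEGER
);
CREATE TABLE dim_customer (
    customer_key INTEGER PRIMARY KEY,
    name         TEXT,
    region       TEXT
);
CREATE TABLE fact_sales (
    date_key     INTEGER REFERENCES dim_date(date_key),
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    amount       REAL               -- additive measure
);
""")

cur.execute("INSERT INTO dim_date VALUES (20260423, '2026-04-23', 2026)")
cur.executemany("INSERT INTO dim_customer VALUES (?, ?, ?)",
                [(1, "Acme", "NJ"), (2, "Globex", "NY")])
cur.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                [(20260423, 1, 100.0),
                 (20260423, 2, 50.0),
                 (20260423, 1, 25.0)])

# Typical star-schema query: join the fact to a dimension, then aggregate.
cur.execute("""
SELECT c.region, SUM(f.amount)
FROM fact_sales f
JOIN dim_customer c ON f.customer_key = c.customer_key
GROUP BY c.region
ORDER BY c.region
""")
totals = dict(cur.fetchall())
print(totals)  # {'NJ': 125.0, 'NY': 50.0}
```

In production the same shape would typically live in Redshift or Snowflake with loads driven by PySpark or Glue jobs, but the join-fact-to-dimension-and-aggregate pattern is identical.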