

DRS IT Solutions Inc
Data Engineer – Hybrid @ LOS ALTOS, CA.
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is a Data Engineer position on a contract basis in Los Altos, CA, offering a pay rate of "X" for a duration of "Y". Requires 3+ years in data infrastructure, strong Python and SQL skills, and AWS expertise.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
April 11, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Hybrid
-
📄 - Contract
1099 Contractor
-
🔒 - Security
Unknown
-
📍 - Location detailed
Los Altos, CA
-
🧠 - Skills detailed
#Deployment #Data Engineering #Luigi #"ETL (Extract #Transform #Load)" #Python #Model Evaluation #Batch #S3 (Amazon Simple Storage Service) #Data Quality #Athena #EC2 #Storage #AWS (Amazon Web Services) #Metadata #Data Storage #Spark (Apache Spark) #Computer Science #Datasets #ML (Machine Learning) #Scala #Data Pipeline #Airflow #Reinforcement Learning #SageMaker #SQL (Structured Query Language) #Documentation #Distributed Computing
Role description
JOB DESCRIPTION
Employment Type – CONTRACT
Client is looking for CITIZENS/GREEN CARD
ONLY W2/1099 candidates
Hybrid (2 days onsite- Tuesday and Wednesday, 4x10 shift- off every Friday)
As a Data Engineer, you will be a key enabler of this mission—owning the systems that
collect, organize, clean, and deliver the volumes of sensor and simulation data that fuel
our world models, perception systems, and reinforcement learning algorithms. You will
collaborate closely with research scientists and machine learning engineers to ensure
our pipelines are reliable, scalable, and performant—powering breakthroughs in
intelligent driving across simulation and real-world deployments.
Responsibilities
● Design, implement, and maintain robust data pipelines for ingesting, cleaning,
and transforming large-scale autonomous vehicle datasets (camera, LiDAR,
radar, GPS, simulation logs).
● Develop scalable storage and retrieval systems using AWS services (S3, EC2,
SageMaker, Athena, etc.).
● Ensure data quality and consistency through automated validation, deduplication,
and schema enforcement.
● Collaborate with ML researchers and engineers to provide efficient access to
training data, labels, and metadata.
● Optimize data preprocessing and batching pipelines to support large-scale
training and evaluation workflows.
● Build tools to manage and audit dataset versions, experiment tracking, and
feature reproducibility.
● Implement and maintain CI/CD workflows for data and pipeline updates, ensuring
minimal downtime and reproducible outputs.
● Monitor data pipeline performance and respond to bottlenecks or outages
proactively.
Qualifications
● B.S. or M.S. in Computer Science, Data Engineering, or a related field.
● 3+ years of experience building production-grade data infrastructure or ML data
pipelines.
● Strong proficiency with Python and SQL, and experience with data workflow
orchestration tools (e.g., Airflow, Prefect, Luigi).
● Deep experience with AWS services, especially S3 (data storage), EC2
(compute), and SageMaker (model training).
● Familiarity with distributed computing frameworks like Spark, Dask, or Ray.
● Understanding of best practices for dataset documentation, standardization, and
reproducibility in research.
Bonus Qualifications
● Experience with autonomous vehicle datasets or robotics sensor data.
● Familiarity with ML training pipelines and model evaluation workflows.
● Prior experience collaborating with researchers or applied ML teams in
high-throughput environments.
Best Regards.
Bini Skaria,
DRS IT Solutions Inc,
28175 Haggerty Road,
Novi, MI 48377
(C) 248-440-7600 EXT-1
(F) 248-859-4430
Bini Skaria | LinkedIn
Bini@drsitsolutions.com
www.drsitsolutions.com
An E-Verified Company
Certified Women Business Enterprise (WBENC) Certified Women Owned Small Business (WOSB)
JOB DESCRIPTION
Employment Type – CONTRACT
Client is looking for CITIZENS/GREEN CARD
ONLY W2/1099 candidates
Hybrid (2 days onsite- Tuesday and Wednesday, 4x10 shift- off every Friday)
As a Data Engineer, you will be a key enabler of this mission—owning the systems that
collect, organize, clean, and deliver the volumes of sensor and simulation data that fuel
our world models, perception systems, and reinforcement learning algorithms. You will
collaborate closely with research scientists and machine learning engineers to ensure
our pipelines are reliable, scalable, and performant—powering breakthroughs in
intelligent driving across simulation and real-world deployments.
Responsibilities
● Design, implement, and maintain robust data pipelines for ingesting, cleaning,
and transforming large-scale autonomous vehicle datasets (camera, LiDAR,
radar, GPS, simulation logs).
● Develop scalable storage and retrieval systems using AWS services (S3, EC2,
SageMaker, Athena, etc.).
● Ensure data quality and consistency through automated validation, deduplication,
and schema enforcement.
● Collaborate with ML researchers and engineers to provide efficient access to
training data, labels, and metadata.
● Optimize data preprocessing and batching pipelines to support large-scale
training and evaluation workflows.
● Build tools to manage and audit dataset versions, experiment tracking, and
feature reproducibility.
● Implement and maintain CI/CD workflows for data and pipeline updates, ensuring
minimal downtime and reproducible outputs.
● Monitor data pipeline performance and respond to bottlenecks or outages
proactively.
Qualifications
● B.S. or M.S. in Computer Science, Data Engineering, or a related field.
● 3+ years of experience building production-grade data infrastructure or ML data
pipelines.
● Strong proficiency with Python and SQL, and experience with data workflow
orchestration tools (e.g., Airflow, Prefect, Luigi).
● Deep experience with AWS services, especially S3 (data storage), EC2
(compute), and SageMaker (model training).
● Familiarity with distributed computing frameworks like Spark, Dask, or Ray.
● Understanding of best practices for dataset documentation, standardization, and
reproducibility in research.
Bonus Qualifications
● Experience with autonomous vehicle datasets or robotics sensor data.
● Familiarity with ML training pipelines and model evaluation workflows.
● Prior experience collaborating with researchers or applied ML teams in
high-throughput environments.
Best Regards.
Bini Skaria,
DRS IT Solutions Inc,
28175 Haggerty Road,
Novi, MI 48377
(C) 248-440-7600 EXT-1
(F) 248-859-4430
Bini Skaria | LinkedIn
Bini@drsitsolutions.com
www.drsitsolutions.com
An E-Verified Company
Certified Women Business Enterprise (WBENC) Certified Women Owned Small Business (WOSB)





