

DRS IT Solutions Inc
Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer focused on Autonomous Vehicle AI Research, offering a hybrid work arrangement. Contract length and pay rate are unspecified. Requires 3+ years of experience, proficiency in Python, SQL, and AWS, plus familiarity with autonomous vehicle datasets.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
April 25, 2026
π - Duration
Unknown
-
ποΈ - Location
Hybrid
-
π - Contract
1099 Contractor
-
π - Security
Unknown
-
π - Location detailed
Los Altos, CA
-
π§ - Skills detailed
#Scala #ML (Machine Learning) #Data Quality #Reinforcement Learning #Model Evaluation #Deployment #Airflow #Distributed Computing #Athena #Batch #"ETL (Extract #Transform #Load)" #EC2 #Storage #Metadata #Data Engineering #Python #Computer Science #AWS (Amazon Web Services) #Documentation #AI (Artificial Intelligence) #SageMaker #SQL (Structured Query Language) #Spark (Apache Spark) #Data Pipeline #S3 (Amazon Simple Storage Service) #Data Storage #Datasets #Luigi
Role description
Data Engineer β Autonomous Vehicle AI Research Infrastructure
β’
β’ Hybrid work, locals are preferred but also open to remote work
β’
β’
β’
β’ Must be open to W2/1099 ( NO C2C)
β’
β’ As a Data Engineer, you will be a key enabler of this missionβowning the systems that
collect, organize, clean, and deliver the volumes of sensor and simulation data that fuel
our world models, perception systems, and reinforcement learning algorithms. You will
collaborate closely with research scientists and machine learning engineers to ensure
our pipelines are reliable, scalable, and performantβpowering breakthroughs in
intelligent driving across simulation and real-world deployments.
Responsibilities
β Design, implement, and maintain robust data pipelines for ingesting, cleaning,
and transforming large-scale autonomous vehicle datasets (camera, LiDAR,
radar, GPS, simulation logs).
β Develop scalable storage and retrieval systems using AWS services (S3, EC2,
SageMaker, Athena, etc.).
β Ensure data quality and consistency through automated validation, deduplication,
and schema enforcement.
β Collaborate with ML researchers and engineers to provide efficient access to
training data, labels, and metadata.
β Optimize data preprocessing and batching pipelines to support large-scale
training and evaluation workflows.
β Build tools to manage and audit dataset versions, experiment tracking, and
feature reproducibility.
β Implement and maintain CI/CD workflows for data and pipeline updates, ensuring
minimal downtime and reproducible outputs.
β Monitor data pipeline performance and respond to bottlenecks or outages
proactively.
Qualifications
β B.S. or M.S. in Computer Science, Data Engineering, or a related field.
β 3+ years of experience building production-grade data infrastructure or ML data
pipelines.
β Strong proficiency with Python and SQL, and experience with data workflow
orchestration tools (e.g., Airflow, Prefect, Luigi).
β Deep experience with AWS services, especially S3 (data storage), EC2
(compute), and SageMaker (model training).
β Familiarity with distributed computing frameworks like Spark, Dask, or Ray.
β Understanding of best practices for dataset documentation, standardization, and
reproducibility in research.
Bonus Qualifications
β Experience with autonomous vehicle datasets or robotics sensor data.
β Familiarity with ML training pipelines and model evaluation workflows.
β Prior experience collaborating with researchers or applied ML teams in
high-throughput environments.
Best Regards.
Sara RG,
DRS IT Solutions, Inc
28175 Haggerty Road,
Novi, MI 48377
(C) 248-440-7600 EXT -4
sara@drsitsolutions.com
Data Engineer β Autonomous Vehicle AI Research Infrastructure
β’
β’ Hybrid work, locals are preferred but also open to remote work
β’
β’
β’
β’ Must be open to W2/1099 ( NO C2C)
β’
β’ As a Data Engineer, you will be a key enabler of this missionβowning the systems that
collect, organize, clean, and deliver the volumes of sensor and simulation data that fuel
our world models, perception systems, and reinforcement learning algorithms. You will
collaborate closely with research scientists and machine learning engineers to ensure
our pipelines are reliable, scalable, and performantβpowering breakthroughs in
intelligent driving across simulation and real-world deployments.
Responsibilities
β Design, implement, and maintain robust data pipelines for ingesting, cleaning,
and transforming large-scale autonomous vehicle datasets (camera, LiDAR,
radar, GPS, simulation logs).
β Develop scalable storage and retrieval systems using AWS services (S3, EC2,
SageMaker, Athena, etc.).
β Ensure data quality and consistency through automated validation, deduplication,
and schema enforcement.
β Collaborate with ML researchers and engineers to provide efficient access to
training data, labels, and metadata.
β Optimize data preprocessing and batching pipelines to support large-scale
training and evaluation workflows.
β Build tools to manage and audit dataset versions, experiment tracking, and
feature reproducibility.
β Implement and maintain CI/CD workflows for data and pipeline updates, ensuring
minimal downtime and reproducible outputs.
β Monitor data pipeline performance and respond to bottlenecks or outages
proactively.
Qualifications
β B.S. or M.S. in Computer Science, Data Engineering, or a related field.
β 3+ years of experience building production-grade data infrastructure or ML data
pipelines.
β Strong proficiency with Python and SQL, and experience with data workflow
orchestration tools (e.g., Airflow, Prefect, Luigi).
β Deep experience with AWS services, especially S3 (data storage), EC2
(compute), and SageMaker (model training).
β Familiarity with distributed computing frameworks like Spark, Dask, or Ray.
β Understanding of best practices for dataset documentation, standardization, and
reproducibility in research.
Bonus Qualifications
β Experience with autonomous vehicle datasets or robotics sensor data.
β Familiarity with ML training pipelines and model evaluation workflows.
β Prior experience collaborating with researchers or applied ML teams in
high-throughput environments.
Best Regards.
Sara RG,
DRS IT Solutions, Inc
28175 Haggerty Road,
Novi, MI 48377
(C) 248-440-7600 EXT -4
sara@drsitsolutions.com





