

ML Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for an ML Data Engineer in Redmond, WA, for 12 months at a pay rate of "unknown." Requires 5+ years in data engineering/machine learning, strong Python, SQL, and ML workflow knowledge, with familiarity in multimedia data formats.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
840
-
ποΈ - Date discovered
September 25, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
On-site
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Redmond, WA
-
π§ - Skills detailed
#Graph Databases #Security #Anomaly Detection #Linux #NoSQL #Compliance #PyTorch #Data Engineering #Data Governance #Data Pipeline #REST API #"ETL (Extract #Transform #Load)" #Pandas #Shell Scripting #Data Cleaning #Scala #REST (Representational State Transfer) #Data Quality #NumPy #ML (Machine Learning) #Scripting #Datasets #Databases #SciPy #SQL (Structured Query Language) #Python
Role description
Job Title: ML Data Engineer
Location: Redmond WA - Onsite
Duration: 12 Months (possibility of extensions)
Machine Learning Data Engineer to support a cutting-edge research team working on next-generation ML models and intelligent devices. This role sits at the intersection of data engineering and applied machine learning, transforming complex multimedia data into robust, high-quality datasets ready for training.
Must-Have Skills
5+ years in data engineering or machine learning.
Strong Python skills (NumPy, SciPy, Pandas).
Experience with SQL, data cleaning, anomaly detection.
Understanding of ML training workflows and data quality impact (especially with PyTorch).
Proficiency in Linux and shell scripting.
Familiarity with audio, image, and video formats.
Nice-to-Have
Experience with multimodal datasets (Audio/Video, Optitrack, multi-Camera/Sensor).
Basic knowledge of audio processing (DSP, acoustics).
REST APIs, NoSQL/graph databases, and research environments.
Strong mathematical foundation.
Youβll collaborate closely with researchers and data collection teams to ensure the data pipeline is optimized for performance, accuracy, and scalability.
β’ Build scalable pipelines for audio, video, and sensor data (IMU).
β’ Translate stakeholder needs into technical solutions.
β’ Perform advanced data operations: filtering, feature extraction, synchronization, and ML inference.
β’ Interface with internal tools for dataset management, validation, transformation, and QA.
β’ Collaborate with researchers to integrate prototypes into production workflows.
β’ Ensure compliance with data governance and security standards.
Job Title: ML Data Engineer
Location: Redmond WA - Onsite
Duration: 12 Months (possibility of extensions)
Machine Learning Data Engineer to support a cutting-edge research team working on next-generation ML models and intelligent devices. This role sits at the intersection of data engineering and applied machine learning, transforming complex multimedia data into robust, high-quality datasets ready for training.
Must-Have Skills
5+ years in data engineering or machine learning.
Strong Python skills (NumPy, SciPy, Pandas).
Experience with SQL, data cleaning, anomaly detection.
Understanding of ML training workflows and data quality impact (especially with PyTorch).
Proficiency in Linux and shell scripting.
Familiarity with audio, image, and video formats.
Nice-to-Have
Experience with multimodal datasets (Audio/Video, Optitrack, multi-Camera/Sensor).
Basic knowledge of audio processing (DSP, acoustics).
REST APIs, NoSQL/graph databases, and research environments.
Strong mathematical foundation.
Youβll collaborate closely with researchers and data collection teams to ensure the data pipeline is optimized for performance, accuracy, and scalability.
β’ Build scalable pipelines for audio, video, and sensor data (IMU).
β’ Translate stakeholder needs into technical solutions.
β’ Perform advanced data operations: filtering, feature extraction, synchronization, and ML inference.
β’ Interface with internal tools for dataset management, validation, transformation, and QA.
β’ Collaborate with researchers to integrate prototypes into production workflows.
β’ Ensure compliance with data governance and security standards.