

Motion Recruitment
Data Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer focused on full-stack data engineering and AI/ML, with a 6-month remote contract at $50–60/hour. Key skills include Python, SQL, Spark/PySpark, Airflow, and cloud platform experience. A degree in a related field and 2+ years of relevant experience are required.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
480
-
🗓️ - Date
May 21, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Remote
-
📄 - Contract
W2 Contractor
-
🔒 - Security
Unknown
-
📍 - Location detailed
Michigan, United States
-
🧠 - Skills detailed
#Compliance #Libraries #Apache Spark #MySQL #Cloud #Computer Science #Data Analysis #BI (Business Intelligence) #Classification #Databricks #SaaS (Software as a Service) #Tableau #AWS (Amazon Web Services) #Data Science #GCP (Google Cloud Platform) #SciPy #Data Quality #Automation #Apache Kafka #Data Pipeline #Kafka (Apache Kafka) #AI (Artificial Intelligence) #NLP (Natural Language Processing) #Data Governance #Pandas #PySpark #Data Management #Microsoft Power BI #Datasets #Code Reviews #Spark (Apache Spark) #Python #Statistics #Looker #Azure #Data Engineering #SQL (Structured Query Language) #Data Lineage #Scala #PostgreSQL #Apache Airflow #Databases #ML (Machine Learning) #Anomaly Detection #SQL Server #GIT #Metadata #Airflow #NumPy #Informatica #Delta Lake #MDM (Master Data Management)
Role description
Data Engineer – Full Stack – Python, AI/ML
Location: Remote – US
Duration: 6 Months (possibility of renewal or contract-to-hire thereafter)
Rate: $50–60/hour w2
Join a remote, US-based opportunity supporting a data governance-focused engineering team as a hands-on Data Engineer working across full-stack data engineering and AI/ML-enabled workflows. This role is ideal for a mid-level engineer with experience in Python, SQL, Spark/PySpark, and Airflow, contributing to initiatives involving data quality, lineage, metadata, master data management, and analytics-ready datasets. This is a contract opportunity with the potential for renewal or conversion to a full-time position.
This opportunity offers the chance to work at the intersection of modern data engineering and emerging AI/ML-powered governance practices. You will contribute to initiatives such as embedding-based data classification, anomaly detection, LLM-assisted catalog search, and governed data exposure for AI assistants while partnering with technical teams and stakeholders in a collaborative environment. If you enjoy solving complex data problems, building scalable pipelines, and expanding your expertise in data governance and AI, this role provides strong technical growth potential.
Contract Duration: 6 Months (possible extensions)
Required Skills & Experience
· Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, Statistics, or related field
· 2+ years of experience building data pipelines using Python (Pandas, NumPy, SciPy) and SQL
· Experience with Apache Spark or PySpark and workflow orchestration tools such as Apache Airflow
· Experience designing schemas across relational and analytical databases including PostgreSQL, MySQL, and SQL Server
· Experience implementing data quality validation, exploratory data analysis (EDA), and integrity enforcement in production datasets
· Hands-on experience with at least one cloud platform (AWS, Azure, or GCP)
· Working familiarity with Python ML libraries such as Scikit-Learn for feature engineering and exploratory analysis
· Experience producing analytics-ready datasets for BI platforms including Tableau, Power BI, or Looker
· Experience with Git, code reviews, CI/CD practices, and modular engineering workflows
· Strong written and verbal communication skills with collaborative working style
Desired Skills & Experience
· Exposure to data governance tooling including metadata management, data lineage, stewardship workflows, and catalogs
· Experience with MDM platforms, especially Informatica MDM SaaS, C360, or multi-domain environments
· Experience supporting compliance, audit, or regulated-data initiatives
· Experience with Apache Kafka and Spark Structured Streaming
· Exposure to lakehouse technologies including Delta Lake and Databricks
· Familiarity with LLM APIs, RAG architectures, agentic AI patterns, and MCP applied to governance use cases
· NLP and text preprocessing experience for unstructured data
· Power BI certifications
· Attention to detail and ownership of data quality outcomes
· Collaborative, team-first mindset with the ability to operate within established engineering standards
· Clear written and verbal communication skills with technical and non-technical audiences
· Curiosity and willingness to grow within modern AI/ML-assisted governance environments
What You Will Be Doing
Tech Breakdown
· 40% Python, SQL, and Data Pipeline Engineering
· 20% Spark/PySpark, Airflow, and Workflow Automation
· 20% Data Governance, Quality, Metadata, and Lineage
· 10% AI/ML-Assisted Governance and Analytics
· 10% BI Reporting and Cloud-Based Data Platforms
Daily Responsibilities
· 80% Hands On
· 0% Management Duties
· 20% Team Collaboration
Data Engineer – Full Stack – Python, AI/ML
Location: Remote – US
Duration: 6 Months (possibility of renewal or contract-to-hire thereafter)
Rate: $50–60/hour w2
Join a remote, US-based opportunity supporting a data governance-focused engineering team as a hands-on Data Engineer working across full-stack data engineering and AI/ML-enabled workflows. This role is ideal for a mid-level engineer with experience in Python, SQL, Spark/PySpark, and Airflow, contributing to initiatives involving data quality, lineage, metadata, master data management, and analytics-ready datasets. This is a contract opportunity with the potential for renewal or conversion to a full-time position.
This opportunity offers the chance to work at the intersection of modern data engineering and emerging AI/ML-powered governance practices. You will contribute to initiatives such as embedding-based data classification, anomaly detection, LLM-assisted catalog search, and governed data exposure for AI assistants while partnering with technical teams and stakeholders in a collaborative environment. If you enjoy solving complex data problems, building scalable pipelines, and expanding your expertise in data governance and AI, this role provides strong technical growth potential.
Contract Duration: 6 Months (possible extensions)
Required Skills & Experience
· Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, Statistics, or related field
· 2+ years of experience building data pipelines using Python (Pandas, NumPy, SciPy) and SQL
· Experience with Apache Spark or PySpark and workflow orchestration tools such as Apache Airflow
· Experience designing schemas across relational and analytical databases including PostgreSQL, MySQL, and SQL Server
· Experience implementing data quality validation, exploratory data analysis (EDA), and integrity enforcement in production datasets
· Hands-on experience with at least one cloud platform (AWS, Azure, or GCP)
· Working familiarity with Python ML libraries such as Scikit-Learn for feature engineering and exploratory analysis
· Experience producing analytics-ready datasets for BI platforms including Tableau, Power BI, or Looker
· Experience with Git, code reviews, CI/CD practices, and modular engineering workflows
· Strong written and verbal communication skills with collaborative working style
Desired Skills & Experience
· Exposure to data governance tooling including metadata management, data lineage, stewardship workflows, and catalogs
· Experience with MDM platforms, especially Informatica MDM SaaS, C360, or multi-domain environments
· Experience supporting compliance, audit, or regulated-data initiatives
· Experience with Apache Kafka and Spark Structured Streaming
· Exposure to lakehouse technologies including Delta Lake and Databricks
· Familiarity with LLM APIs, RAG architectures, agentic AI patterns, and MCP applied to governance use cases
· NLP and text preprocessing experience for unstructured data
· Power BI certifications
· Attention to detail and ownership of data quality outcomes
· Collaborative, team-first mindset with the ability to operate within established engineering standards
· Clear written and verbal communication skills with technical and non-technical audiences
· Curiosity and willingness to grow within modern AI/ML-assisted governance environments
What You Will Be Doing
Tech Breakdown
· 40% Python, SQL, and Data Pipeline Engineering
· 20% Spark/PySpark, Airflow, and Workflow Automation
· 20% Data Governance, Quality, Metadata, and Lineage
· 10% AI/ML-Assisted Governance and Analytics
· 10% BI Reporting and Cloud-Based Data Platforms
Daily Responsibilities
· 80% Hands On
· 0% Management Duties
· 20% Team Collaboration





