Droisys

AI/ML Data Engineer (Pytest and PySpark, Databricks)

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI/ML Data Engineer in Plano, TX. The contract length is unspecified, and the rate is $40 to $46/hr. Key skills include PySpark, Databricks, AI/ML integration, and Pytest. Experience in cloud environments and with CI/CD tools is required.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
368
-
πŸ—“οΈ - Date
October 10, 2025
🕒 - Duration
Unknown
-
🏝️ - Location
On-site
-
📄 - Contract
W2 Contractor
-
🔒 - Security
Unknown
-
πŸ“ - Location detailed
Plano, TX
-
🧠 - Skills detailed
#Data Science #Automated Testing #Azure #S3 (Amazon Simple Storage Service) #Jenkins #Spark (Apache Spark) #Data Storage #Model Deployment #Scala #GIT #Agile #Python #Pytest #Automation #Cloud #Azure DevOps #AWS S3 (Amazon Simple Storage Service) #ETL (Extract, Transform, Load) #DevOps #Databricks #GCP (Google Cloud Platform) #Deployment #Unit Testing #Data Quality #Data Lake #PySpark #Business Analysis #AI (Artificial Intelligence) #AWS (Amazon Web Services) #Storage #Programming #MLflow #ML (Machine Learning) #Data Engineering #Data Pipeline #Datasets #Delta Lake #SQL (Structured Query Language) #Computer Science #Strategy
Role description
About Company

Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction. Amazing things happen when we work in environments where everyone feels a true sense of belonging and when candidates have the requisite skills and opportunities to succeed. At Droisys, we invest in our talent and support career growth, and we are always on the lookout for amazing talent who can contribute to our growth by delivering top results for our clients. Join us to challenge yourself and accomplish work that matters.

Here are the job details:

AI/ML Data Engineer (Pytest and PySpark, Databricks)
Plano, TX (Onsite)
Interview Mode: Video + In-Person
Rate Range: $40 to $46/hr, W2, All Inc

Job Description:
We are seeking an experienced Data Engineer with a strong background in PySpark, Databricks, AI/ML integration, and automated testing using Pytest. The ideal candidate will design, develop, and optimize scalable data pipelines, ensuring reliability and performance across analytical and machine learning workloads. This role requires hands-on technical expertise, strong analytical thinking, and a collaborative mindset.

Responsibilities:
• Design, develop, and maintain scalable data pipelines using PySpark and Databricks.
• Integrate AI/ML models into existing data workflows for predictive analytics and intelligent automation.
• Implement robust testing frameworks using Pytest to validate data quality, transformations, and pipeline integrity.
• Optimize data storage and processing performance across distributed systems.
• Collaborate with data scientists, AI engineers, and business analysts to enable machine learning workflows.
• Develop CI/CD processes for data pipelines and model deployment within Databricks.
• Troubleshoot and resolve performance issues across large-scale datasets.
• Document and maintain best practices for coding, testing, and deployment.

Required Skills:
• Strong experience with PySpark for large-scale data transformation and ETL.
• Hands-on expertise in the Databricks environment (cluster management, job orchestration, and notebooks).
• Proficient in Pytest for unit testing and data validation automation.
• Experience in AI/ML pipelines: model training, evaluation, and integration with data engineering workflows.
• Strong programming skills in Python and SQL.
• Knowledge of Delta Lake, Azure Data Lake, or AWS S3 environments.
• Familiarity with CI/CD tools (Git, Jenkins, Azure DevOps).
• Excellent analytical and problem-solving skills.

Preferred Qualifications:
• Experience with MLflow or MLOps frameworks.
• Exposure to cloud-based data ecosystems (Azure, AWS, or GCP).
• Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field.

Droisys is an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Droisys believes in diversity, inclusion, and belonging, and we are committed to fostering a diverse work environment.