ALIS Software LLC

MLOps Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an MLOps Engineer in NYC, NY, on a long-term contract. Pay rate is competitive. Key skills include AWS SageMaker, PyTorch, TensorFlow, and experience with ML inference systems. Strongly preferred experience with large-scale datasets and Transformer-based models.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 3, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Hybrid
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
New York, NY
-
🧠 - Skills detailed
#AWS (Amazon Web Services) #Security #SageMaker #Deployment #A/B Testing #AWS SageMaker #Neural Networks #"ETL (Extract #Transform #Load)" #Monitoring #Model Deployment #Observability #PyTorch #NLP (Natural Language Processing) #AutoScaling #BERT #Datasets #Storage #Data Science #ML (Machine Learning) #TensorFlow #DevOps
Role description
Role: MLOps Engineer Location: NYC, NY Hybrid: 3 Days Onsite Duration: Long term Roles & Responsibilities • Design, deploy, and operate end‑to‑end production ML pipelines across Dev, QA, and Prod environments. • Set up and manage AWS SageMaker pipelines, endpoints, and monitoring for large scale inference workloads, including embedding generation, named entity recognition, reranking, and video processing. • Own GPU & CPU infrastructure selection, scaling, & optimization, including instance benchmarking, autoscaling behavior, & load testing. • Deploy, monitor, & operate inference services that support hundreds of thousands of queries per day across text, image, & video pipelines. • Establish standardized ML deployment patterns at AP, including: • Containerization and orchestration strategies • Environment isolation (Dev / QA / Prod) • Versioned promotion, rollback, and recovery mechanisms • Implement monitoring, alerting, drift detection, and evaluation metrics for production ML systems, tracking latency, error rates, throughput, and model/data drift. • Enable A/B testing & controlled rollout strategies for ML models in partnership with engineering & product teams. • Partner closely with ML Engineers, Data Scientists, DevOps, and Platform teams to: • Operationalize new models and pipeline improvements • Promote systems across environments safely • Ensure deployments meet reliability, scale, and cost targets • Manage high-throughput I/O and data movement for large collections of media assets (text, images, video), avoiding CPU, network, and storage bottlenecks. • Reduce operational risk by enforcing reproducibility, observability, security, & cost control across production ML systems. • • This role owns: • Deployment, scaling, and runtime operation of ML systems • ML infrastructure configuration and orchestration • Monitoring, alerting, A/B testing infrastructure, and drift detection • Reliability, cost control, and production governance • • This role does NOT own: • Designing model architecture • Feature engineering or data science outputs • Model accuracy or inference logic (These are owned by ML Engineers and Data Science) • • Required Skills & Experience • Hands‑on experience deploying and operating ML inference systems in production. • Experience with AWS SageMaker, including pipelines, endpoints, monitoring, and multi‑environment deployments. • Expertise deploying ML models using PyTorch and TensorFlow from an operational and serving perspective. • Proven experience with model deployment and orchestration, including containerized inference and autoscaling. • Experience selecting, evaluating, and optimizing compute resources (GPU/CPU) for production ML workloads. • Experience setting up monitoring, evaluation metrics, and A/B testing frameworks for ML systems in production. • Ability to collaborate effectively with ML Engineers, Data Scientists, and platform teams in a shared ownership model. • • Strongly Preferred • Experience running ML workloads over large‑scale text, image, and video datasets. • Operational experience supporting ML systems involving Transformer‑based NLP models (e.g., BERT‑family models), Computer vision models, Ranking & reranking systems • Familiarity operating systems that use common ML model types such as Convolutional & feed‑forward neural networks, Ranking algorithms, Approximate Nearest Neighbor methods (HNSW)