

Jobs via Dice
AI Optimization Engineer || NYC, NY (Onsite) ||
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Optimization Engineer based in NYC, NY, with a contract length of 6 months. Key skills include Python, machine learning algorithms, deep learning frameworks, and experience with GPU-accelerated clusters. Familiarity with model optimization techniques is essential.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
February 17, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
On-site
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
New York, NY
-
🧠 - Skills detailed
#Scripting #Data Analysis #Grafana #JavaScript #Jenkins #Lambda (AWS Lambda) #Linux #ML (Machine Learning) #NLP (Natural Language Processing) #API (Application Programming Interface) #Docker #Kubernetes #Scala #Neural Networks #Matplotlib #Prometheus #Libraries #Clustering #NumPy #MLflow #Data Cleaning #MongoDB #Terraform #Keras #PyTorch #Programming #TensorFlow #Security #SageMaker #Regression #Plotly #Logistic Regression #Deployment #Unsupervised Learning #Deep Learning #Oracle #HTML (Hypertext Markup Language) #Redis #AI (Artificial Intelligence) #AWS SageMaker #"ETL (Extract #Transform #Load)" #Microservices #Model Optimization #Supervised Learning #MySQL #Flask #AWS (Amazon Web Services) #EC2 #Jupyter #REST (Representational State Transfer) #GitHub #Normalization #Angular #SQL (Structured Query Language) #Python
Role description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, IT Minds LLC, is seeking the following. Apply via Dice today!
Title: AI Optimization Engineer
Duration: 6 Months
Location: NYC, NY (Onsite)
Long Term Contract
Qualifications
Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.
Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.
Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.
Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.
Experienced in scalable infrastructure for deploying and managing large language models (LLMs),
HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.
Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.
Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.
Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.
Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.
Exploratory Data Analysis - Plotly, Seaborn, matplotlib
Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP
Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace
Flask API Development and Security
Container Runtimes: Enroot, Pyxis, Podman
Linux (RHEL/CentOS) System Administration
Model Optimization techniques using Triton with TRTLLM
Desired Qualifications:
Experience with data cleaning, feature scaling, and normalization
Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript
Creating vector embeddings
Tools and Platforms like - AWS (SageMaker, Lambda, EC2)
Database Technologies Oracle, MS-SQL, MongoDB, Redis and MySQL
SQL and PL/SQL Scripting
Dice is the leading career destination for tech experts at every stage of their careers. Our client, IT Minds LLC, is seeking the following. Apply via Dice today!
Title: AI Optimization Engineer
Duration: 6 Months
Location: NYC, NY (Onsite)
Long Term Contract
Qualifications
Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.
Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.
Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.
Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.
Experienced in scalable infrastructure for deploying and managing large language models (LLMs),
HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.
Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.
Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.
Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.
Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.
Exploratory Data Analysis - Plotly, Seaborn, matplotlib
Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP
Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace
Flask API Development and Security
Container Runtimes: Enroot, Pyxis, Podman
Linux (RHEL/CentOS) System Administration
Model Optimization techniques using Triton with TRTLLM
Desired Qualifications:
Experience with data cleaning, feature scaling, and normalization
Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript
Creating vector embeddings
Tools and Platforms like - AWS (SageMaker, Lambda, EC2)
Database Technologies Oracle, MS-SQL, MongoDB, Redis and MySQL
SQL and PL/SQL Scripting






