Openkyber

Cloud-Native MLOps Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Cloud-Native MLOps Engineer with a 12+ month contract, offering competitive pay. Key skills include Python, IaC, cloud platforms (AWS, Azure), and SRE practices. Candidates must be U.S. citizens with 10+ years of relevant experience.
🌎 - Country
United States
💱 - Currency
Unknown
-
💰 - Day rate
Unknown
-
🗓️ - Date
January 24, 2026
🕒 - Duration
More than 6 months
-
🏝️ - Location
Remote
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Georgia
-
🧠 - Skills detailed
#CHEF #Continuous Deployment #Data Encryption #"ETL (Extract #Transform #Load)" #Docker #Cloud #Python #AWS (Amazon Web Services) #CircleCI #Firewalls #Bash #GDPR (General Data Protection Regulation) #Load Balancing #Monitoring #Prometheus #Jenkins #Consulting #Documentation #PCI (Payment Card Industry) #Azure #Ruby #Observability #Datadog #Ansible #Containers #GCP (Google Cloud Platform) #Kubernetes #Vault #GitLab #Scala #GitHub #EC2 #Linux #Project Management #Unix #Programming #Automation #Deployment #DevOps #Leadership #Compliance #Infrastructure as Code (IaC) #Terraform #S3 (Amazon Simple Storage Service) #Grafana #Scripting #Security
Role description
Looking only USA Citizens and Cloud Security Engineer - SRE Location: Alpharetta, GA/ Columbus, OH/ Berkeley Heights, NJ/ Frisco, TX (5 days/week onsite) Type: 12+ Months Contract Position Summary: We are seeking a skilled and motivated Cloud Security Engineer SRE to join our dynamic team. The ideal candidate will possess a strong technical background in systems administration, cloud computing, and infrastructure as code, with a particular focus on solution engineering/site reliability. This role will involve collaborating with cross-functional teams to enhance our security posture and streamline processes through automation.1. Technical Skills Programming and Scripting: Strong proficiency in languages like Python, Go, Bash, or Ruby. SREs often need to write automation scripts and build tooling. Systems Administration: Deep understanding of operating systems (Linux/Unix), file systems, processes, and system configurations. Infrastructure as Code (IaC): Experience with IaC tools like Terraform, Ansible, or Chef to manage infrastructure. Cloud Computing: Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform, including services like EC2, S3, Kubernetes, and serverless functions. Containers and Orchestration: Expertise in containerization (Docker) and container orchestration (Kubernetes, OpenShift). Networking: Understanding of networking concepts, including DNS, firewalls, load balancing, and VPNs. Monitoring and Observability: Experience with monitoring and observability tools like Prometheus, Grafana, Datadog, or New Relic. Ability to set up and maintain monitoring dashboards, alerts, and logs. Continuous Integration/Continuous Deployment (CI/CD): Familiarity with CI/CD tools like Jenkins, GitLab CI, GitHub Actions, or CircleCI. A strong understanding of HashiCorp Vault and Terraform will make you stand out. 1. Problem-Solving and Troubleshooting Incident Management: Ability to manage and respond to incidents, perform root cause analysis, and implement post-mortem reviews. Automation: Focus on automating repetitive tasks to improve efficiency and reduce human error. Performance Tuning: Skills in identifying and resolving performance bottlenecks in systems and applications. 1. Collaboration and Communication Teamwork: Ability to work closely with cross-functional teams, including software engineers, product managers, and DevOps teams. Documentation: Skill in creating clear and comprehensive documentation for systems, processes, and incident reports. Communication: Effective communication skills for interacting with stakeholders and explaining technical concepts to non-technical audiences. 1. Reliability and Scalability Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs): Understanding of setting, monitoring, and maintaining SLOs and SLAs for system reliability. Scalability: Knowledge of best practices for designing and scaling systems to handle increased loads and demands. Redundancy and Resilience: Experience in designing systems with redundancy and fault tolerance to minimize downtime. 1. Security and Compliance Security Best Practices: Understanding of security principles, such as access control, data encryption, and secure coding practices. Compliance: Familiarity with compliance standards like GDPR, HIPAA, or PCI-DSS, depending on the industry. Minimum Job Qualifications: Bachelor degree in business or equivalent work experience 10 years of previous program leadership and/or relevant consulting experience Knowledge of and demonstrated experience in program management framework, knowledge groups & life cycle 5+ years' experience in driving large scale data center consolidation efforts Minimum 5 years' experience with matrix management of cross-functional processes and teams Proficient with Project Management tools For applications and inquiries, contact: hirings@openkyber.com