TalentOla

Sr. SRE/DevOps Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Sr. SRE/DevOps Engineer in Sunnyvale, CA, with a contract length of "unknown" and a pay rate of "unknown." Requires 8+ years of experience, expertise in Docker, Kubernetes, Terraform, and proficiency in Java or Python.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date

October 3, 2025

🕒 - Duration

Unknown

🏝️ - Location

On-site

📄 - Contract

Unknown

🔒 - Security

Unknown

📍 - Location detailed

Sunnyvale, CA

🧠 - Skills detailed

#Security #Splunk #Terraform #Linux #Monitoring #Infrastructure as Code (IaC) #Snowflake #"ETL (Extract #Transform #Load)" #DevOps #Java #Scripting #Kubernetes #Scala #GIT #Observability #Docker #Automation #Programming #Python #Cloud #AWS (Amazon Web Services) #Ansible #Deployment

Role description

Title: Sr. SRE / DevOps Engineer Location: Sunnyvale, CA (Only Local candidate) Client Interview – In-Person Job Description: Job Summary – For this role, we are looking for a Sr. SRE / DevOps Engineer at Sunnyvale, California location. As Site Reliability Engineer, the individual will work closely with multi-functional teams, automate operations, optimize infrastructure, implement security and solve issues in an exciting, fast-paced environment. The individual will play a vital role in ensuring that the systems are reliable, scalable, and high performing. Responsibilities – • Ensure system reliability and availability – Monitor system issues, create strategies to detect issues, address those issues, design automated systems to troubleshoot, write and review post-mortems. • Mitigate Operational risks - Collaborate with development teams and other stakeholders to identify potential risks, perform risk assessments, implement risk mitigation strategies, continuously monitor and review the effectiveness of risk strategies. • Monitor system health. • Minimize emergency response (MTTR). • Maintain CI/CD pipelines, etc. • Continuous improvement by collaborating with various teams. • Automation of processes. • Must have/required experience and skills: • 8+ years of experience on DevOps and Site Reliability Engineering. • Hands-on with containerization and orchestration: Docker, Kubernetes/EKS. • Proficiency in infrastructure as code tools: Terraform, Ansible, or CloudFormation. • Experience setting up and managing services running on Kubernetes. • In-depth understanding of SRE principals including monitoring, alerting, error budgets, fault analysis, and automation. • In-depth knowledge of monitoring and observability tools: Apache Splunk • Knowledge of Linux operating system principles, networking fundamentals, and systems management • Demonstrable fluency in at least one of the following languages: Java or Python • Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions. • Building and managing CI/CD pipeline – gatekeeping production deployments, develop and implement GIT branching strategies, branch protection rules, network policies, scale up/ scale down the load on AWS. • Strong problem-solving and analytical skills • Solve performance issues and scalability issues in the system. Technical Skills: • DevOps and SRE • AWS Kubernetes/EKS, Docker • Terraform, Ansible, or CloudFormation • Apache Splunk, Apache Flink • Programming/Scripting using Java or Python • CI/CD • Database – Vertica, Snowflake. Behavioural Skills: • Excellent Communication skills and collaboration skills • Ability to propose and implement improvements in the system • Ability to work with cross-functional stakeholders • Adaptability and a willingness to learn new technologies and techniques. • Proactive approach to issues, ability to provide prompt resolution/work around.

Apply now Apply with DFH Sign up

TalentOla

Sr. SRE/DevOps Engineer

Senior Supply Chain Analyst

Senior Supply Chain Analyst

Business Intelligence Analyst - Data Analytics

Business Analyst

Book a

chat

with us

Company