Sr. SRE/ DevOps Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Sr. SRE/DevOps Engineer in Sunnyvale, CA, with a contract length of "unknown" and a pay rate of "unknown." Requires 12+ years in DevOps/SRE, expertise in Docker, Kubernetes, Terraform, and proficiency in Java or Python.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
-
πŸ—“οΈ - Date discovered
September 4, 2025
πŸ•’ - Project duration
Unknown
-
🏝️ - Location type
On-site
-
πŸ“„ - Contract type
Unknown
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
Sunnyvale, CA
-
🧠 - Skills detailed
#Docker #Python #AWS (Amazon Web Services) #Automation #Ansible #GIT #Scala #DevOps #Monitoring #Observability #Programming #Snowflake #Scripting #Deployment #Infrastructure as Code (IaC) #Cloud #"ETL (Extract #Transform #Load)" #Linux #Splunk #Kubernetes #Terraform #Java
Role description
Title: Sr. SRE / DevOps Engineer Location: Sunnyvale, CA (Local candidate) Responsibilities β€’ Ensure system reliability and availability – Monitor system issues, create strategies to detect issues, address those issues, design automated systems to troubleshoot, write and review post-mortems. β€’ Mitigate Operational risks - Collaborate with development teams and other stakeholders to identify potential risks, perform risk assessments, implement risk mitigation strategies, continuously monitor and review the effectiveness of risk strategies. β€’ Monitor system health. β€’ Minimize emergency response (MTTR). β€’ Maintain CI/CD pipelines, etc. β€’ Continuous improvement by collaborating with various teams. β€’ Automation of processes. Must have/required experience and skills: β€’ 12+ years of experience on DevOps and Site Reliability Engineering. β€’ Hands-on with containerization and orchestration: Docker, Kubernetes/EKS. β€’ Proficiency in infrastructure as code tools: Terraform, Ansible, or CloudFormation. β€’ Experience setting up and managing services running on Kubernetes. β€’ In-depth understanding of SRE principals including monitoring, alerting, error budgets, fault analysis, and automation. β€’ In-depth knowledge of monitoring and observability tools: Apache Splunk β€’ Knowledge of Linux operating system principles, networking fundamentals, and systems management β€’ Demonstrable fluency in at least one of the following languages: Java or Python β€’ Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions. β€’ Building and managing CI/CD pipeline – gatekeeping production deployments, develop and implement GIT branching strategies, branch protection rules, network policies, scale up/ scale down the load on AWS. β€’ Strong problem-solving and analytical skills β€’ Solve performance issues and scalability issues in the system. Technical Skills: β€’ DevOps and SRE β€’ AWS Kubernetes/EKS, Docker β€’ Terraform, Ansible, or CloudFormation β€’ Apache Splunk, Apache Flink β€’ Programming/Scripting using Java or Python β€’ CI/CD β€’ Database – Vertica, Snowflake. Behavioural Skills: β€’ Excellent Communication skills and collaboration skills β€’ Ability to propose and implement improvements in the system β€’ Ability to work with cross-functional stakeholders β€’ Adaptability and a willingness to learn new technologies and techniques. β€’ Proactive approach to issues, ability to provide prompt resolution/work around.