

Senior SRE Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior SRE Engineer with a contract length of "unknown," offering a pay rate of "$X per hour." Key skills include cloud platforms (AWS, Azure, GCP), DevOps methodologies, and monitoring tools (Grafana, Prometheus). Certifications like "SRE Foundation" are preferred.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
July 21, 2025
π - Project duration
Unknown
-
ποΈ - Location type
Unknown
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
New York, United States
-
π§ - Skills detailed
#Jira #Terraform #Docker #Azure #Kubernetes #Monitoring #GCP (Google Cloud Platform) #Python #Automation #DevOps #Security #Infrastructure as Code (IaC) #Leadership #Ansible #Scala #AWS (Amazon Web Services) #Prometheus #Cloud #Scripting #Grafana #Observability #Splunk #Bash
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
We are seeking a highly skilled and experienced Senior Site Reliability Engineering (SRE) Engineer to lead our SRE team in ensuring the reliability, scalability, and performance of our production systems. The ideal candidate will have a strong background in cloud infrastructure, automation, and system monitoring, with excellent leadership and communication skills to collaborate across teams and foster a culture of operational excellence.
Key Responsibilities:
β’ Lead and manage the SRE team to uphold system reliability, availability, and performance standards.
β’ Design, implement, and optimize scalable infrastructure and automation solutions to support business needs.
β’ Build, maintain, and enhance monitoring dashboards and alerting systems utilizing tools such as Grafana and Prometheus.
β’ Develop and continuously improve detailed runbooks detailing operational procedures, incident response, and recovery processes.
β’ Drive Incident, Problem, and Change Management processes to minimize downtime and improve system resilience.
β’ Collaborate closely with development, DevOps, and security teams to design resilient, secure, and scalable systems.
β’ Define, monitor, and refine SLAs, SLOs, and KPIs for critical services to ensure continuous improvement.
β’ Evaluate emerging tools and technologies to enhance operational efficiency and reliability.
β’ Promote a proactive culture of automation, reliability, and problem-solving within the team.
β’ Mentor and develop team members, encouraging continuous learning and technical growth.
β’ Manage stakeholder expectations and communication related to system reliability and operational activities.
Required Skills & Qualifications:
β’ Extensive experience with IT infrastructure, cloud platforms (AWS, Azure, GCP), and modern DevOps/SRE methodologies.
β’ Hands-on expertise with monitoring and observability tools: Grafana, Prometheus, Splunk.
β’ Familiarity with ITSM and operational tools such as ServiceNow and OpsRamp.
β’ Experience with project and incident tracking tools like JIRA.
β’ Proficiency in scripting and automation using Python, Bash, Terraform, Ansible.
β’ Strong understanding of CI/CD pipelines, containerization (Docker), and container orchestration (Kubernetes).
β’ Knowledge of Infrastructure as Code (IaC) principles and tools.
β’ Proven track record managing distributed and large-scale production environments.
β’ Excellent leadership, communication, and problem-solving capabilities.
β’ Ability to adapt quickly to new technologies and evolving operational landscapes.
Preferred Certifications:
β’ SRE Foundation
β’ AWS Certified Solutions Architect / Azure Solutions Architect
β’ ITIL Foundation or higher certifications