

Hire Tech Services
Chaos Engineer
Featured Role | Apply direct with Data Freelance Hub
This role is for a Chaos Engineer in St Louis, MO. The contract length and pay rate are not specified. It requires 6+ years in SRE/DevOps, expertise in chaos engineering tools, and strong knowledge of Kubernetes and cloud platforms. Relocation is mandatory.
Country: United States
Currency: $ USD
Day rate: Unknown
Date: December 18, 2025
Duration: Unknown
Location: On-site
Contract: Unknown
Security: Unknown
Location detailed: St Louis, MO
Skills detailed:
#Cloud #Datadog #Grafana #Kubernetes #AWS (Amazon Web Services) #GCP (Google Cloud Platform) #Microservices #Monitoring #Splunk #Python #Scripting #DevOps #Bash #Azure #Prometheus
Role description
Location: St Louis, MO (3 days per week in office)
Note: Relocation is mandatory.
Responsibilities:
Design and run chaos experiments to test system reliability, fault tolerance, and recovery.
Build automated chaos tests using tools like Gremlin, Litmus, Chaos Mesh, AWS Fault Injection Simulator, etc.
Identify failure points in microservices, APIs, and cloud infrastructure.
Collaborate with SRE, DevOps, and Development teams to improve resilience.
Document findings, create remediation plans, and drive resilience best practices.
Required Skills:
6+ years in SRE/DevOps/Platform Engineering with strong distributed systems knowledge.
Hands-on experience with chaos engineering tools (Gremlin, Litmus, AWS FIS, Chaos Mesh).
Strong knowledge of Kubernetes, microservices, container orchestration, and cloud (AWS/Azure/GCP).
Experience with monitoring tools (Prometheus, Grafana, Datadog, Splunk).
Solid scripting skills: Python, Bash, or Go.






