SRE Dynatrace Lead Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for an SRE Dynatrace Lead Engineer in Austin (Hybrid) with a contract length of "unknown" and a pay rate of "unknown." Requires 10-12 years of SRE experience, expertise in Dynatrace, cloud technologies, and strong skills in automation and monitoring tools.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

🗓️ - Date discovered

August 22, 2025

🕒 - Project duration

Unknown

🏝️ - Location type

Hybrid

📄 - Contract type

Unknown

🔒 - Security clearance

Unknown

📍 - Location detailed

Austin, TX

🧠 - Skills detailed

#Oracle #AI (Artificial Intelligence) #MySQL #AWS (Amazon Web Services) #Monitoring #Kafka (Apache Kafka) #Microservices #RDS (Amazon Relational Database Service) #Security #GitLab #Docker #Scala #"ETL (Extract #Transform #Load)" #Deployment #Observability #Forecasting #Prometheus #Anomaly Detection #Groovy #Automation #Grafana #Dynatrace #Azure #Kubernetes #YAML (YAML Ain't Markup Language) #Ansible #Splunk #DevOps #Terraform #Linux #Compliance #Scripting #Databases #GCP (Google Cloud Platform) #Java #Cloud #Python

Role description

Job Title: SRE Dynatrace Lead Engineer Location: Austin [Hybrid] Job Description: We are currently seeking a highly skilled SRE hands-on Lead Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate/help designing and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach. Responsibilities: · Participate in design, architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance. · Primary skillset to be expertise in Observability as service, Telemetry data collection using Dynatrace APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk) and AIOPS Tools · Configure application performance monitoring (APM), infrastructure monitoring, synthetic monitoring, RUM, and log monitoring. · Integrate Dynatrace with CI/CD pipelines, alerting tools, ITSM systems, and incident automation frameworks. · Tune alert thresholds, baselines, and AI-driven anomaly detection to reduce noise and improve actionable insights. · Deeper understanding of Login authentication mechanisms using Ping, ForgeRock and SiteMinder technologies (session management and cookie management) · Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications. · Evangelize SRE evolution within IT operations and promoting a culture of engineering excellence and best practices. · Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation. · Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind. · Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identifying areas for optimization. · Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents. · Analyze resource utilization patterns and forecasting future capacity needs to ensure optimal performance and cost-efficiency. · Ensure that SRE practices align with security and compliance requirements and implementing measures to protect systems and data. · Operational excellence with focus on automation and developing tools to streamline operational tasks and increase efficiency. · Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice. · Ability to develop close relationship with other operational teams to integrate SRE practices and drive overall operational improvements across enterprise. · Stay up to date on industry trends, new technologies, and best practices in SRE and applying relevant advancements to the organization. · Ability to build strong working relationships across different levels, client focus mindset. Qualifications: · Around 10-12 years of SRE hands on experience with cloud technologies, development, SRE toolsets and automation · Own the design, configuration, deployment, and optimization of Dynatrace for enterprise-wide observability. · Define monitoring standards, best practices, and governance to ensure consistency and scalability. · Strong skills in APM, distributed tracing, synthetic & real user monitoring, log monitoring, and Davis AI configuration. · Experience to deploy and tune OneAgent, build end-to-end PurePath tracing, and leverage Smartscape topology for proactive performance monitoring and root-cause analysis. · Experience integrating Dynatrace with incident management, automation, and cloud platforms (AWS, Azure, GCP). · Strong problem-solving skills and ability to work in cross-functional, fast-paced environments. · Collaborate with application and infrastructure teams to troubleshoot performance issues and implement permanent fixes. · Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications. · Strong hands-on experience with any Cloud Technology (AWS): Control Tower, Project Setup, Creating Accounts, RDS, SSO · Solid understanding and hands on experience with Docker/Kubernetes · Should have good experience with Linux Commands, GitLab CICD Setup and Terraform (state management, etc) · Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK etc. · Good understanding of Observability Framework leveraging programmatic SLI/SLO blueprints to standardize the collection of golden signals. · Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages · Experience with following languages (Groovy-DSL, Java, Python, Yaml and microservices architecture) · Good understanding and hands on experience with MQ, Kafka · Experience with Databases (Oracle, MySQL) Good to have:· Any of the relevant professional certifications – Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, , Google Cloud Professional; DevOps Engineer

Apply now Apply with DFH Sign up

← See all roles

Go to role

SRE Dynatrace Lead Engineer

Premium Members Land Roles Faster—Upgrade today.

Senior Data Engineer(Lakehouse)

Azure AI Service Developer

Data Engineer

Specialized Developer

Premium Members Land Roles Faster—Upgrade today.

Book a

chat

with us

Company