

Lead SRE Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Lead SRE Engineer in Arlington, TX, offering a 6-month contract-to-hire at a competitive pay rate. Key skills include 5-7 years SRE experience, leadership, C#/.NET, Azure, Terraform, and automation expertise.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
600
-
ποΈ - Date discovered
September 12, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
Hybrid
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Arlington, TX
-
π§ - Skills detailed
#Bash #.Net #Scripting #Compliance #Kubernetes #DevOps #Monitoring #SQL (Structured Query Language) #Automation #C# #Azure #Computer Science #Perl #Observability #Python #Infrastructure as Code (IaC) #Scala #Docker #Leadership #Ruby #Terraform #AWS (Amazon Web Services) #WAF (Web Application Firewall) #Cloud #VPC (Virtual Private Cloud) #Spark (Apache Spark) #Linux #API (Application Programming Interface) #ML (Machine Learning) #Azure cloud
Role description
About the Company
Motion Recruitment has partnered with a fincncials service copany and seeking a Lead Site Reliability Engineer for a 6 months contract to hire.
About the Role
The Lead SRE will provide strategic leadership and direction for building and running large-scale software systems. This role involves identifying and delivering automation solutions to ensure high availability and resiliency, leveraging expertise in software development, complexity analysis, and scalable system design. The Lead SRE will work closely with other engineering teams to ensure services and systems are highly stable and performant, meeting the expectations of business partners and end users. This position will be approximately 80% hands-on and 20% leading/mentoring other SREs. Will be working in a C#/.NET/Azure development environment with Python for automation. This is not a match for candidates that only have Production Support experience as this role is primarily involving observability and Terraform, with some development and automation.
Location: Arlington, TX
Duration: 6+ Month Contract to hire
Interview: Onsite.( mandatory)
Term: Hybrid ( 2 days in office)
Applicants must have Lead experince and willing to interview onsite. Locals are given preferance.
Responsibilities
β’ Lead architecture and development teams to ensure applications are highly available, reliable, and performant at a global scale.
β’ Partner with the architecture team to ensure operability, measurability, and manageability are integrated into business features and enablers.
β’ Collaborate with product owners and managers to establish service level objectives (SLOs) for applications and define consequences if objectives are not met.
β’ Work with development team members to identify monitoring gaps, improve application performance, and assist with troubleshooting issues.
β’ Drive Root Cause Analysis (RCA) of production issues and other failures within the product software, pipeline, or other DevOps support processes or technology.
β’ Design, build, and advocate for automated solutions to optimize application/service/platform uptime with minimal human intervention.
β’ Participate in an on-call rotation to support troubleshooting and communication efforts outside of normal business hours.
β’ Create and implement standards and best practices, driving adoption across development teams and external vendors as applicable.
β’ Ensure compliance with all company policies and procedures.
Qualifications
β’ Bachelor of Computer Science or related Engineering field required.
β’ Masterβs Degree preferred.
Required Skills
β’ 5-7 years of hands-on SRE experience.
β’ 1-2 years of leading and mentoring others.
β’ Hands-on experience supporting Linux production environments, hands-on administration on Spark, and hands-on experience with MS Azure Cloud technologies.
β’ 3-5 years hands-on experience with scripting with bash, perl, ruby, or python required.
β’ 3-5 years experience with Docker Datacenter required.
β’ 2-4 years of hands-on administration experience on Machine learning platforms required.
β’ Minimum of 1 year of experience in Mesos, Kubernetes, OpenShift and/or Deis or other such container/platform-as-a-service orchestrator required.
β’ Minimum of 1 year of hands-on experience on CICD tools & Technologies required.
β’ Minimum of 1 year of lead experience of site reliability engineering team required.
β’ Proven leadership skills and the ability to guide and mentor a team.
β’ Strong collaboration and communication skills.
β’ A proactive approach to problem-solving and continuous improvement.
β’ Passion for automation and operational excellence.
β’ Deep expertise in cloud technologies and software development, with a strong technical background.
β’ Significant experience in C#/.NET preferred.
β’ Proficiency in SQL and Powershell.
β’ Expertise in defining, implementing, and evaluating Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and associated consequences.
β’ Strong skills in performing Root Cause Analysis (RCA) and Problem Management.
β’ Extensive experience in cloud native applications Azure/AWS (monitoring, networking, containerization, infrastructure).
β’ Proficiency in containerization technologies such as Azure Kubernetes Service, Kubernetes (open source), and Docker.
β’ Knowledge of metrics and monitoring tools like Azure Application Insights and Azure Monitor.
β’ Familiarity with networking technologies relevant to Azure and AWS, including Azure DNS, Virtual Networks, Azure API Manager, Azure Application Gateway, Akamai WAF/CDN, AWS Route 53, AWS VPC, AWS API Gateway, and AWS CloudFront.
β’ Strong experience with Terraform for infrastructure as code.
β’ Ability to establish and maintain a culture of learning through the development and sharing of skills, knowledge, processes, and tools; combat traditional silos that create "us and them" environments.
Pay range and compensation package
Contract Duration: 6 Months Contract-to-Hire
Equal Opportunity Statement
We are committed to diversity and inclusivity.
\`\`\`
About the Company
Motion Recruitment has partnered with a fincncials service copany and seeking a Lead Site Reliability Engineer for a 6 months contract to hire.
About the Role
The Lead SRE will provide strategic leadership and direction for building and running large-scale software systems. This role involves identifying and delivering automation solutions to ensure high availability and resiliency, leveraging expertise in software development, complexity analysis, and scalable system design. The Lead SRE will work closely with other engineering teams to ensure services and systems are highly stable and performant, meeting the expectations of business partners and end users. This position will be approximately 80% hands-on and 20% leading/mentoring other SREs. Will be working in a C#/.NET/Azure development environment with Python for automation. This is not a match for candidates that only have Production Support experience as this role is primarily involving observability and Terraform, with some development and automation.
Location: Arlington, TX
Duration: 6+ Month Contract to hire
Interview: Onsite.( mandatory)
Term: Hybrid ( 2 days in office)
Applicants must have Lead experince and willing to interview onsite. Locals are given preferance.
Responsibilities
β’ Lead architecture and development teams to ensure applications are highly available, reliable, and performant at a global scale.
β’ Partner with the architecture team to ensure operability, measurability, and manageability are integrated into business features and enablers.
β’ Collaborate with product owners and managers to establish service level objectives (SLOs) for applications and define consequences if objectives are not met.
β’ Work with development team members to identify monitoring gaps, improve application performance, and assist with troubleshooting issues.
β’ Drive Root Cause Analysis (RCA) of production issues and other failures within the product software, pipeline, or other DevOps support processes or technology.
β’ Design, build, and advocate for automated solutions to optimize application/service/platform uptime with minimal human intervention.
β’ Participate in an on-call rotation to support troubleshooting and communication efforts outside of normal business hours.
β’ Create and implement standards and best practices, driving adoption across development teams and external vendors as applicable.
β’ Ensure compliance with all company policies and procedures.
Qualifications
β’ Bachelor of Computer Science or related Engineering field required.
β’ Masterβs Degree preferred.
Required Skills
β’ 5-7 years of hands-on SRE experience.
β’ 1-2 years of leading and mentoring others.
β’ Hands-on experience supporting Linux production environments, hands-on administration on Spark, and hands-on experience with MS Azure Cloud technologies.
β’ 3-5 years hands-on experience with scripting with bash, perl, ruby, or python required.
β’ 3-5 years experience with Docker Datacenter required.
β’ 2-4 years of hands-on administration experience on Machine learning platforms required.
β’ Minimum of 1 year of experience in Mesos, Kubernetes, OpenShift and/or Deis or other such container/platform-as-a-service orchestrator required.
β’ Minimum of 1 year of hands-on experience on CICD tools & Technologies required.
β’ Minimum of 1 year of lead experience of site reliability engineering team required.
β’ Proven leadership skills and the ability to guide and mentor a team.
β’ Strong collaboration and communication skills.
β’ A proactive approach to problem-solving and continuous improvement.
β’ Passion for automation and operational excellence.
β’ Deep expertise in cloud technologies and software development, with a strong technical background.
β’ Significant experience in C#/.NET preferred.
β’ Proficiency in SQL and Powershell.
β’ Expertise in defining, implementing, and evaluating Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and associated consequences.
β’ Strong skills in performing Root Cause Analysis (RCA) and Problem Management.
β’ Extensive experience in cloud native applications Azure/AWS (monitoring, networking, containerization, infrastructure).
β’ Proficiency in containerization technologies such as Azure Kubernetes Service, Kubernetes (open source), and Docker.
β’ Knowledge of metrics and monitoring tools like Azure Application Insights and Azure Monitor.
β’ Familiarity with networking technologies relevant to Azure and AWS, including Azure DNS, Virtual Networks, Azure API Manager, Azure Application Gateway, Akamai WAF/CDN, AWS Route 53, AWS VPC, AWS API Gateway, and AWS CloudFront.
β’ Strong experience with Terraform for infrastructure as code.
β’ Ability to establish and maintain a culture of learning through the development and sharing of skills, knowledge, processes, and tools; combat traditional silos that create "us and them" environments.
Pay range and compensation package
Contract Duration: 6 Months Contract-to-Hire
Equal Opportunity Statement
We are committed to diversity and inclusivity.
\`\`\`