Job Description

Is this your new role in New Zealand? Don't forget to checkout out our specialised category 'Accredited Employers'

Lead Site Reliability Engineer

EROAD

Auckland (Albany HQ) (NZ)

Category IT Jobs

As Lead Site Reliability Engineer in our Azure Platform Team you will be part of the multi-faceted group responsible for designing, building and operating our SaaS solutions cloud infrastructure within the Azure ecosystem. The teams priorities are to maintain and improve the operational stability and flexibility of our platforms, including enabling internal engineering teams to achieve their goals while fostering and improving on EROADs DevOps model. At EROAD we empower and resource all of our engineering teams to do amazing things and the Azure Platform Team is no different.

Our platforms are treated as products and as such we are on a continuous journey of improvement. Responsibilities: You will enjoy working in a small team of positive, supportive, like-minded and pro-active people within a self-managed agile environment. Working with the other Azure Platform Team SREs to iterate on our environments, unlocking additional benefits for engineering development teams.

Contributing to new solution designs as needed both inside and outside the team. Ensure EROAD's multiple systems (including some legacy) are operating at peak efficiency, performance and uptime. Provide root cause analysis of complex faults in a large distributed system, and work with multiple teams to see issues through to resolution within our incident management process.

Develop metric collection and visualisation tools to allow you to perform capacity-planning, troubleshooting and take pre-emptive actions in support of overall system stability. Carry out legacy deployments of new releases of EROAD's SaaS applications to production and other environments with minimal to no impact on customers and refine and enhance the tools used to achieve this. Identify and automate tasks wherever possible to enhance our engineering teams autonomy.

Conduct performance and reliability tests to establish limits, bottlenecks or single points of failure and resolve them. Being called on to work flexible hours to complete tasks that would otherwise disrupt a great customer experience. Keep up to date with the cutting edge of modern web operations, and continually strive to push the EROAD operations practice forward.

Provided day to day support to the engineering team across production and non-production environments. About you: Youre a proven technical leader, able to elevate those around you and contribute to wider technical pieces of work by leveraging your extensive experience in complicated production environments Experience managing large production workloads in AWS or Azure (ideally in a real time and 24/7 environment) Experience operating and troubleshooting container orchestration frameworks like Kubernetes, ECS or AKS Experience working with Infrastructure as code tooling (Terraform and Azure DevOps) Experience operating and managing complex systems in customer-facing production web environments. Operation and architecture of multi-tier distributed systems involving real-time event processin.

...

MAKE YOUR NEW ZEALAND
DREAM A REALITY

Begin Your Journey

CONTACT US

We're not around right now. But you can send us an email and we'll get back to you, asap.

Sending

© Copyright MoveToNZ 2025. All Rights Reserved.

Terms of Use | Terms of trade | Privacy Policy | FAQ's