
Site Reliability Engineer
- Wellington
- Contract
- Full-time
- Apply SRE principles to monitor, measure, and improve service reliability and performance.
- Automate platform operations and embed security and governance controls at scale.
- Manage OpenShift and AWS microservices, ensuring they are resilient, scalable, and well-instrumented.
- Lead incident response and postmortem reviews to drive learning and prevent recurrence.
- Design and maintain monitoring and observability solutions using Dynatrace, Splunk, and ElasticSearch.
- Collaborate with teams to optimise databases, messaging systems, and API security protocols.
- Maintain certificate lifecycles and ensure seamless backend integrations with banking services.
- Experience with OpenShift, AWS, and containerised microservices.
- A collaborative mindset and a proactive approach to problem-solving.
- Comfort working across teams and disciplines, with a focus on shared success.
- A commitment to continuous learning and improvement.
- Experience in application or infrastructure support, with a background in SysOps or DevOps.
- Proficiency in scripting or development, ideally with Java, JavaScript, NodeJS, or Python.
- Strong Linux/Unix skills and familiarity with enterprise-grade platforms.
- Hands-on experience with OpenShift, Docker, and cloud-native technologies (especially AWS).
- Knowledge of monitoring and logging tools such as Dynatrace, Splunk, and ElasticSearch.
- Familiarity with messaging systems (NATS), databases (MongoDB, PostgreSQL), and API security protocols (OAuth2).
- Understanding of ITIL practices, particularly incident and problem management.
- Experience instrumenting applications for monitoring and logging.
- Ability to assess, define, and implement fit-for-purpose, cost-effective engineering solutions and toolsets.