Manager, Site Reliability Engineering (Observability)
Axon
United States
Full Time
Responsibilities:
Driving the execution of your team by leading planning, prioritization, stand-ups, and team retro meetings
Leading the technical direction of the team through architectural discussions and code reviews
Working with technical program managers, senior engineers, and client teams to deliver on foundational observability programs that cut across all product organizations
Partnering with recruiting to hire a top-notch engineering team
Managing engineers and their careers with 1:1s and performance reviews
Delivering new features and improvements to internal customers on a regular basis
Coaching your team for continuous improvements
Tech Stack experience:
Metrics: Grafana, Prometheus, Cortex, and 20+ open source components in that ecosystem
Logs: Splunk Enterprise, self-hosted
Frameworks for managing metric and log assets as code with continuous delivery
In-house libraries (Scala, Golang) for interfacing to the metrics system
Tracing: Jaeger, self-hosted
Requirements:
Bachelor’s Degree in Computer Science, Engineering, Physics, Mathematics or an equivalent highly technical field
4+ years of experience in software or infrastructure engineering
3+ years of experience managing software developers building customer-facing software applications
A track record of planning and delivering on multiple overlapping projects spanning several months
Experience implementing an engineering process that emphasizes security, availability, scalability, and operational discipline
Either experience directly managing observability-focused systems or a demonstrated interest in observability concerns such as metrics, logging, and distributed tracing