-
ID
#49848971 -
Job type
Contract -
Salary
Depends on Experience -
Source
StratG Inc -
Date
2023-05-02 -
Deadline
2023-07-01
Incident manager
California, Sanfrancisco, 94101 Sanfrancisco USAContract
Vacancy expired!
- 5+ years managing and monitoring Incident/Crisis management
- 3+ years’ experience monitoring with various tools like Grafana, NewRelic etc.
- 1+ years’ experience programming in a programming language such as Python and Go
- On call experience
- Attention to detail and ability to manage multiple projects
- Strong analytical skills and ability to present complex data on site reliability and other factors
- Demonstrated ability to work with 3rd parties and collaborate on solutions
- Lead the on-call teams and processes to improve site reliability
- Focus on managing large scale systems with high loads 24/7
- Support our SRE and engineering teams in their day to day
- Build, enhance and maintain runbooks working with various teams cross-functionally
- Infrastructure as Code and Terraform
- Thrive on automating processes as much as possible
- Observability and Monitoring with services like Prometheus, Grafana, New Relic
- Additional other duties and responsibilities, as assigned
- Lead the NOC tools, runbooks, processes and teams
- Automation of runbooks as necessary
- Work with our development teams on improving the system
Vacancy expired!
Report job