|
Qualifications
|
Description
|
|
Level of Education and Discipline
|
- Bachelor’s degree required in Information Technology, Computer Science, Engineering, Information Systems, or a related technical discipline.
- Advanced degree in a related field preferred.
|
|
Certification and/or Licenses
|
- Relevant certifications in IT service management, cloud, reliability engineering, or operational disciplines are preferred, such as ITIL, SRE, or major platform certifications.
|
|
Experience - functional/industry/commercial knowledge, business acumen
|
- 12+ yrs of experience
- Experience in observability, monitoring engineering, production operations, operational reporting, service availability, or technology operations roles in a complex enterprise environment.
- Experience designing, implementing, administering, or optimizing enterprise monitoring and observability solutions across infrastructure, platforms, applications, and user experience layers.
- Experience building dashboards, alerts, health indicators, service reporting, and operational analytics that support both technical operations and leadership visibility.
- Experience supporting incident response, major incident management, command center operations, or service restoration activities through monitoring and operational insight.
- Experience working across enterprise infrastructure, cloud, collaboration platforms, and end user services, with practical understanding of how issues affect employees and business operations.
- Experience working with managed service providers, outsourced support models, or third-party operational teams.
- Experience in ITIL, service management, SIAM, or similarly structured operating environments preferred.
|
|
Interpersonal Skills - leadership, interactions, communication, influence
|
- Strong technical knowledge of monitoring, event management, telemetry collection, alerting, event correlation, dashboarding, and service reporting.
- Strong understanding of enterprise operations, service health measurement, and end user experience considerations.
- Ability to interpret logs, metrics, events, traces, and performance indicators to identify material conditions, trends, and risk.
- Ability to translate technical telemetry into clear operational and executive level reporting.
- Strong analytical and problem-solving skills, with the ability to improve signal to noise ratio and monitoring effectiveness.
- Strong communication skills, especially in incident situations and operational review settings.
- Ability to work across organizational boundaries and influence internal teams and service providers without direct authority.
|
|
Other Skills and HPO Competencies
|
- Deep expertise in observability and monitoring practices, with strong operational judgment.
- Ability to connect technical telemetry to business service impact and employee experience.
- Strong focus on data quality, reporting accuracy, and practical operational insight.
- Drives consistency, accountability, and continual improvement in service visibility and monitoring standards.
- Acts as a credible technical advisor for operational reporting, service health transparency, and monitoring maturity.
|
|