This is a checklist devised by DS of the information a manager of a Network Operations Centre (NOC) needs to have close at hand:
Command and control
- Date and time of current shift including start, finish and handover.
- NOC manager on duty
- Shift leaders
- Service Delivery Manager on duty/standby
- Major Incident Manager on duty/standby
Tiger Team status (refer here for process)
- echo
- whisky
- delta
- romeo
- bravo
- alpha
Red-Amber-Green (RAG) of the trenches
- Security
- Data centre
- Apps
- Support
- Infrastructure
Notifications
- Ongoing Service Level Agreement (SLA) or contract violations
- All Major Incidents
- All failures and outages
- Last 10 maintenance tasks completed
- Next 10 maintenance tasks scheduled
- Planned continuity tests scheduled (inverter/generator tests, network path protection tests, business continuity or application high availability tests)
- Resources available to the NOC
- Resources unavailable to the NOC
- Changes completed during the past week (includes the status on whether they were successful or failed)
- Changes scheduled for the next week
- Emergency changes completed or in progress
- Top 10 most important projects that are ongoing
- Top 10 congested links
- Top 10 devices with temperature alerts
- Top 10 devices with cooling alerts
- Top 10 devices with storage or capacity alerts (including raid failures)
- Systems/devices with known problems or symptoms of degradation
- Top 10 network with path protection faults
Comments
Post a Comment