Kanban for IT Operations | Manage Incidents & SLAs
Apply Kanban to IT operations with GitScrum. Manage incidents, requests, and changes with WIP limits and SLA tracking. Reduce chaos with structured workflows.
5 min read
IT operations teams face constant interruptions from incidents, user requests, and maintenance tasks. GitScrum's Kanban boards with WIP limits, swimlanes for work types, and SLA tracking help ops teams manage flow, maintain service levels, and avoid the chaos of unmanaged work queues.
IT Operations Work Types
| Work Type | Priority | Flow Treatment |
|---|---|---|
| P1 Incidents | Immediate | Expedite lane |
| P2 Incidents | Same day | High priority |
| Service requests | Normal | Standard flow |
| Maintenance | Planned | Scheduled |
| Improvements | Low | When capacity allows |
IT Operations Kanban Board
IT OPERATIONS BOARD STRUCTURE
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β EXPEDITE (P1 Incidents) - No WIP Limit β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β [INC-234] Production database unresponsive β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββ¬βββββββββββ¬βββββββββββ¬βββββββββββ¬βββββββββββ¬ββββββββββββββ
β Backlog β Ready β Working β Review β Deploy β Done β
β (no WIP)β (WIP: 3) β (WIP: 4) β (WIP: 2) β (WIP: 2) β (no WIP) β
βββββββββββΌβββββββββββΌβββββββββββΌβββββββββββΌβββββββββββΌββββββββββββββ€
β [REQ] β [INC] β [INC] β [CHG] β [CHG] β [INC] β β
β [REQ] β [REQ] β [CHG] β β β [REQ] β β
β [CHG] β β [REQ] β β β [CHG] β β
β [CHG] β β β β β β
β [IMP] β β β β β β
βββββββββββ΄βββββββββββ΄βββββββββββ΄βββββββββββ΄βββββββββββ΄ββββββββββββββ
SWIMLANES BY WORK TYPE:
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β π΄ Incidents (P2-P3) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β π Service Requests β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β π§ Changes & Maintenance β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β π Improvements β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Work Item Types
WORK ITEM DEFINITIONS
INCIDENT (INC):
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Unplanned service disruption β
β Goal: Restore service ASAP β
β β
β Fields: β
β βββ Priority (P1-P4) β
β βββ Affected service β
β βββ Impact (users affected) β
β βββ Start time β
β βββ Resolution time β
β β
β SLAs: β
β P1: Response <15 min, Resolve <4 hours β
β P2: Response <1 hour, Resolve <8 hours β
β P3: Response <4 hours, Resolve <24 hours β
β P4: Response <1 day, Resolve <1 week β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
SERVICE REQUEST (REQ):
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Standard user request β
β Goal: Fulfill per SLA β
β β
β Fields: β
β βββ Request type β
β βββ Requester β
β βββ Due date β
β βββ Approval status β
β β
β Examples: Access request, new account, β
β software install, hardware request β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
CHANGE (CHG):
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Planned infrastructure modification β
β Goal: Implement safely β
β β
β Fields: β
β βββ Change type (standard/normal/emergency) β
β βββ Risk level β
β βββ Rollback plan β
β βββ Change window β
β βββ CAB approval β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
IMPROVEMENT (IMP):
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Proactive enhancement β
β Goal: Improve operations β
β β
β Examples: Automation, monitoring, β
β documentation, process improvement β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
Capacity Planning
OPS TEAM CAPACITY MODEL
TEAM: 5 engineers
CAPACITY ALLOCATION:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Incident response: 1-2 engineers (20-40%) β
β (rotates daily) β
β β
β Planned work: 3-4 engineers (60-80%) β
β βββ Service requests β
β βββ Changes β
β βββ Improvements β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
WIP LIMITS RATIONALE:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Ready: 3 β
β βββ Half day's worth of prepared work β
β β
β Working: 4 β
β βββ Less than team size (5) for focus β
β β
β Review: 2 β
β βββ Minimize approval bottleneck β
β β
β Deploy: 2 β
β βββ Control change velocity β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
Flow Metrics
IT OPS METRICS DASHBOARD
FLOW METRICS:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β Cycle Time (avg days): β
β Incidents P2: 0.5 days Target: <1 β β
β Requests: 3.2 days Target: <5 β β
β Changes: 5.1 days Target: <7 β β
β Improvements: 12 days Target: <14 β β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
THROUGHPUT:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β This Week: β
β Incidents resolved: 8 β
β Requests fulfilled: 15 β
β Changes deployed: 6 β
β Improvements done: 2 β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
SERVICE LEVELS:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β SLA Compliance: β
β P1 Incidents: 100% (2/2 met) β β
β P2 Incidents: 95% (19/20 met) β β
β Service Requests: 92% (46/50 met) β β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
Best Practices
Anti-Patterns
β No WIP limits (everything in progress)
β Expedite lane for all incidents
β Hidden work not on board
β No time for improvements
β Ignoring blocked items
β No metrics on flow