GitScrum / Docs
All Best Practices

DevOps and SRE Teams | 30% Faster MTTR

Manage DevOps and SRE with GitScrum incident tracking, infrastructure tasks, and runbooks in NoteVault. Track incidents, coordinate on-call. 30% faster MTTR.

4 min read

How to use GitScrum for DevOps and SRE teams?

GitScrum supports DevOps/SRE teams with columns for infrastructure work, incident tracking, and operational improvements. Configure columns for incidents, infrastructure tasks, and automation projects. Use labels for severity, systems affected, and work type. NoteVault stores runbooks and postmortems. Structured SRE management reduces MTTR by 30% [Source: SRE State Report 2024].

DevOps/SRE project setup:

  • Create project - DevOps or SRE team
  • Configure incident columns - Incident workflow
  • Add infra columns - Planned work
  • Set severity labels - P1, P2, P3, P4
  • Configure auto-assign - On-call routing
  • Create NoteVault docs - Runbooks
  • DevOps/SRE columns

    ColumnWIP LimitPurpose
    Incident Triage5New incidents
    Active Incident3Being investigated
    Mitigated2Stable, needs fix
    ResolvedNoneIncident closed
    Infra Backlog15Planned work
    In Progress3Active infra work
    DoneNoneCompleted

    Incident severity labels

    SeverityDescriptionResponse
    P1 - CriticalService downImmediate
    P2 - HighDegraded service<1 hour
    P3 - MediumMinor impact<4 hours
    P4 - LowNo impactNext sprint

    System labels

    LabelSystem
    awsAWS infrastructure
    gcpGoogle Cloud
    k8sKubernetes
    terraformIaC
    ci-cdPipelines
    monitoringObservability

    Incident workflow

    StageActions
    TriageAssess severity, assign
    ActiveInvestigate, communicate
    MitigatedStable, plan fix
    ResolvedFix deployed
    PostmortemReview, learn

    Runbooks in NoteVault

    RunbookContent
    Service X OutageSteps to diagnose, fix
    Database FailoverFailover procedure
    Deployment RollbackRollback steps
    On-Call HandoffRotation process
    EscalationWhen and how

    Balancing incidents vs planned work

    ApproachImplementation
    WIP limitsLimit active incidents
    Separate columnsVisual separation
    Priority labelsClear priority
    Time allocation70% planned, 30% incidents
    RotationOn-call rotation

    DevOps/SRE Team Standup

    TabContent
    YesterdayIncidents resolved, work done
    TodayOn-call status, planned work
    BlockersDependencies, waiting
    WeeklyIncident trends, SLO status

    Postmortem workflow

    StepGitScrum Action
    Incident resolvedMove to Resolved
    Schedule postmortemCreate task, assign
    Write postmortemDocument in NoteVault
    Action itemsCreate tasks from postmortem
    Track actionsRegular tasks in Infra Backlog

    Automation tracking

    Automation TypeLabel
    Toil reductiontoil
    Monitoringmonitoring
    Deploymentci-cd
    Securitysecurity
    Cost optimizationcost

    SRE metrics to track

    MetricGitScrum Tracking
    MTTRIncident cycle time
    Incident countResolved column
    Toil reductionAutomation tasks
    Error budgetNoteVault SLO doc

    Related articles