
Root Cause Analyses

This section contains Root Cause Analysis (RCA) documents for incidents that occurred during Cortex AI development and operations.

RCA Index

| RCA | Incident | Date | Impact | Resolution |
|-----|----------|------|--------|------------|
| RCA-001 | Lint Pipeline Failure | 2026-02-16 | CI/CD blocked | Fixed linting rules |

RCA Process

When an incident occurs:

  1. Immediate Response - Mitigate the impact
  2. Investigation - Gather data and identify the root cause
  3. Documentation - Write the RCA document using the template below
  4. Action Items - Define preventive measures, each with an owner and a due date
  5. Follow-up - Verify that the fixes are effective
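
If incident progress is tracked in tooling, the five steps above could be modeled as an ordered enumeration. This is only a sketch: `RcaStage`, `next_stage`, and `remaining_stages` are hypothetical names for illustration, not part of any existing Cortex AI tooling.

```python
from enum import IntEnum
from typing import List, Optional


class RcaStage(IntEnum):
    """The five RCA process steps, numbered in the order they are performed."""
    IMMEDIATE_RESPONSE = 1
    INVESTIGATION = 2
    DOCUMENTATION = 3
    ACTION_ITEMS = 4
    FOLLOW_UP = 5


def next_stage(current: RcaStage) -> Optional[RcaStage]:
    """Return the stage that follows `current`, or None after the last stage."""
    if current is RcaStage.FOLLOW_UP:
        return None
    return RcaStage(current + 1)


def remaining_stages(current: RcaStage) -> List[RcaStage]:
    """Return every stage that still has to happen after `current`."""
    return [stage for stage in RcaStage if stage > current]
```

An `IntEnum` keeps the step order explicit, so a tracker can compare stages directly (for example, `stage > RcaStage.INVESTIGATION`).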

RCA Template

# RCA-XXX: Incident Title

**Date:** YYYY-MM-DD
**Duration:** X hours
**Impact:** Description of impact
**Severity:** Critical | High | Medium | Low

## Summary

Brief summary of the incident.

## Timeline

| Time | Event |
|------|-------|
| HH:MM | First alert |
| HH:MM | Root cause identified |
| HH:MM | Fix deployed |

## Root Cause

What caused the incident?

## Resolution

How was it fixed?

## Action Items

| Action | Owner | Due Date | Status |
|--------|-------|----------|--------|
| Preventive measure | Person | YYYY-MM-DD | ⬜ |

## Lessons Learned

What can we do better next time?
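
Filling in the template header by hand invites small inconsistencies, so a helper script may be useful. The sketch below assumes that RCA IDs are sequential and zero-padded to three digits (as in RCA-001) and that severity must be one of the four values listed in the template. `next_rca_id` and `render_rca_header` are hypothetical helper names, not existing project functions.

```python
import re
from datetime import date
from typing import Iterable

# Severity values taken from the template's Severity field.
SEVERITIES = ("Critical", "High", "Medium", "Low")


def next_rca_id(existing_ids: Iterable[str]) -> str:
    """Return the next ID in the RCA-NNN sequence (RCA-001 if none exist)."""
    numbers = [
        int(match.group(1))
        for rca_id in existing_ids
        if (match := re.fullmatch(r"RCA-(\d{3})", rca_id))
    ]
    return f"RCA-{max(numbers, default=0) + 1:03d}"


def render_rca_header(rca_id: str, title: str, incident_date: date,
                      duration_hours: float, impact: str, severity: str) -> str:
    """Render the title and metadata lines that open an RCA document."""
    if severity not in SEVERITIES:
        raise ValueError(f"severity must be one of {SEVERITIES}, got {severity!r}")
    return "\n".join([
        f"# {rca_id}: {title}",
        "",
        f"**Date:** {incident_date.isoformat()}",
        f"**Duration:** {duration_hours:g} hours",
        f"**Impact:** {impact}",
        f"**Severity:** {severity}",
    ])
```

For example, `next_rca_id(["RCA-001"])` yields the ID for the next document, and the rendered header can be written to a new file ahead of the remaining template sections.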