Root Cause Analyses
This section contains Root Cause Analysis (RCA) documents for incidents that occurred during Cortex AI development and operations.
RCA Index
| RCA | Incident | Date | Impact | Resolution |
|---|---|---|---|---|
| RCA-001 | Lint Pipeline Failure | 2026-02-16 | CI/CD blocked | Fixed linting rules |
RCA Process
When an incident occurs, work through these steps in order:
- Immediate Response - Mitigate the impact
- Investigation - Gather data and identify the root cause
- Documentation - Write the RCA document
- Action Items - Define preventive measures and assign an owner to each
- Follow-up - Verify that the fixes are effective
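For teams that track RCA progress in tooling, the five steps above can be modeled as an ordered enumeration. This is only a minimal sketch: `RcaStage` and `next_stage` are hypothetical names chosen for illustration and are not part of any existing Cortex AI code.

```python
from enum import IntEnum
from typing import Optional


class RcaStage(IntEnum):
    """The five RCA process steps, in the order they are carried out."""
    IMMEDIATE_RESPONSE = 1
    INVESTIGATION = 2
    DOCUMENTATION = 3
    ACTION_ITEMS = 4
    FOLLOW_UP = 5


def next_stage(stage: RcaStage) -> Optional[RcaStage]:
    """Return the step that follows `stage`, or None after the final step."""
    if stage is RcaStage.FOLLOW_UP:
        return None
    return RcaStage(stage + 1)


if __name__ == "__main__":
    # Walk the whole process from the first step to the last.
    current: Optional[RcaStage] = RcaStage.IMMEDIATE_RESPONSE
    while current is not None:
        print(current.name)
        current = next_stage(current)
```

Using an integer-backed enum keeps the ordering explicit, so a tracker could compare stages (for example, to check that an incident has reached at least the documentation step) without a separate lookup table.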
RCA Template
# RCA-XXX: Incident Title
**Date:** YYYY-MM-DD
**Duration:** X hours
**Impact:** Description of impact
**Severity:** Critical | High | Medium | Low
## Summary
Brief summary of the incident.
## Timeline
| Time | Event |
|------|-------|
| HH:MM | First alert |
| HH:MM | Root cause identified |
| HH:MM | Fix deployed |
## Root Cause
What caused the incident?
## Resolution
How was it fixed?
## Action Items
| Action | Owner | Due Date | Status |
|--------|-------|----------|--------|
| Preventive measure | Person | YYYY-MM-DD | ⬜ |
## Lessons Learned
What can we do better next time?
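The metadata block at the top of the template above can also be generated rather than typed by hand. The following is a rough sketch under stated assumptions: `render_rca_header` is a hypothetical helper, the three-digit zero padding is inferred from the `RCA-001` identifier in the index, and the values in the usage example are placeholders rather than data from a real incident.

```python
from datetime import date

# Severity levels exactly as listed in the template.
SEVERITIES = ("Critical", "High", "Medium", "Low")


def render_rca_header(number: int, title: str, incident_date: date,
                      duration_hours: float, impact: str, severity: str) -> str:
    """Render the title and metadata lines of an RCA document (hypothetical helper)."""
    if severity not in SEVERITIES:
        raise ValueError(f"severity must be one of {SEVERITIES}, got {severity!r}")
    lines = [
        f"# RCA-{number:03d}: {title}",                 # zero-padded, e.g. RCA-002
        f"**Date:** {incident_date.isoformat()}",       # YYYY-MM-DD
        f"**Duration:** {duration_hours:g} hours",
        f"**Impact:** {impact}",
        f"**Severity:** {severity}",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(render_rca_header(2, "Example Incident", date(2026, 1, 1),
                            1.5, "Example impact", "High"))
```

Rejecting unknown severity values up front keeps the index and the individual documents consistent with the four levels the template allows.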