The problem
Root cause analysis is a discipline, not a conversation. It requires defining the problem precisely, gathering evidence, tracing a specific causal chain, and arriving at a root cause that is both real and actionable. The whiteboard is where the causal chain gets traced — each step connecting the observable problem to the underlying cause that must be addressed.
The RCA board often looks like a chain of boxes and arrows — 'System outage → database connection exhausted → connection pool not tuned → high traffic not anticipated → no capacity planning process.' Each box represents a real event or condition, and each arrow represents a causal relationship that must be supported by evidence.
Without a documented RCA, incidents repeat. The causal chain traced in one post-mortem session exists only in the memory of the people in the room. The next time the same failure mode appears, the team goes through the same investigation again. An RCA report that lives in the team's incident documentation prevents that repetition.