Done Work Item
Failure triage surface + correlated audit trail — telemetry failure investigation with correlationId navigation
Implement the production-grade failure triage surface by extending the telemetry failures view (BF-239) with deep correlation navigation. On /telemetry/failures: each failure event shows its correlationId as a clickable link to the full trace (all events sharing that correlationId). Failure detail view shows: the triggering error event + all preceding events in the same correlationId chain, the user session that initiated the request (from session context in auth audit), the exchange or job record involved (if applicable), suggested remediation based on error type. Also extend audit_event display so every audit_event links to its correlationId chain. This makes production incidents navigable within minutes.
Execution Context
- ID
- wi-BF-245
- Branch
- bf/BF-244-deployments-route-ui
- Validation
- ./scripts/validate-local.ps1
- PR
- https://github.com/SingletonTheory/build-factory-bootstrap/pull/318
Lifecycle Metadata
- Lane
- Done
- Work type
- feature
- Source
- done
- Status
- done
- State
- done
- Done criteria
- 7
Queue Truth
Freshness: Fresh (snapshot age 0s)
Drift: none.
Reconciliation Guidance
- No reconciliation required; item truth signals are consistent.
Prompt Context
./work-items/prompts/wi-BF-245.prompt.md
Implement the production-grade failure triage surface by extending the telemetry failures view (BF-239) with deep correlation navigation. On /telemetry/failures: each failure event shows its correlationId as a clickable link to the full trace (all events sharing that correlationId). Failure detail view shows: the triggering error event + all preceding events in the same correlationId chain, the user session that initiated the request (from session context in auth audit), the exchange or job record involved (if applicable), suggested remediation based on error type. Also extend audit_event display so every audit_event links to its correlationId chain. This makes production incidents navigable within minutes.
Available Actions
- Start executionBlocked
Queue execution handoff for this work item.
Recovery guidance
- Cause
- Done items stay closed; Start execution cannot move them backward.
- Policy context
- Lane transition policy blocks Start execution when the item is already Done.
- Next step
- Create a net-new work item if more delivery is needed instead of reopening this one.
- Safe retry
- Do not retry Start execution on a done item.
identity: start-execution
permission: factory.work-item.execute
policy gate lane-transition: blocked (Done items stay closed; Start execution cannot move them backward.)
POST /api/control-plane/items/wi-BF-245/actions/start-execution
- Request reviewBlocked
Mark this item ready for review lane handoff.
Recovery guidance
- Cause
- Done items stay closed; Request review cannot move them backward.
- Policy context
- Lane transition policy blocks Request review when the item is already Done.
- Next step
- Create a net-new work item if more delivery is needed instead of reopening this one.
- Safe retry
- Do not retry Request review on a done item.
identity: request-review
permission: factory.work-item.request-review
policy gate lane-transition: blocked (Done items stay closed; Request review cannot move them backward.)
POST /api/control-plane/items/wi-BF-245/actions/request-review
- Prepare releaseEnabled
Run release-preparation checks for the work item.
identity: prepare-release
permission: factory.work-item.prepare-release
policy gate lane-eligibility: pass
policy gate pull-request: pass
POST /api/control-plane/items/wi-BF-245/actions/prepare-release
- Record historyEnabled
Capture history snapshots for audit and validation views.
identity: record-history
permission: factory.work-item.record-history
policy gate context-visibility: pass
POST /api/control-plane/items/wi-BF-245/actions/record-history