Skip to main content

Evidence Model

Contexta treats ML observability as durable evidence rather than loose logs. A run should explain what happened, what inputs were involved, what outputs were produced, and what a reviewer should trust or question.

Truth Planes

PlanePurposeExamples
Metadatalifecycle and ownershipproject, run, stage, batch, sample, deployment
Recordsobservations over timemetric, event, span, degraded marker
Artifactsfiles worth reviewingmodel, checkpoint, prompt, report, schema

Evidence Shape

{
"run": "run:fraud.candidate-v2",
"stage": "stage:fraud.candidate-v2.evaluate",
"records": [
{"type": "metric", "key": "f1", "value": 0.91},
{"type": "event", "key": "dataset.loaded", "message": "Loaded fraud.v12"}
],
"artifacts": [
{"kind": "model", "path": "artifacts/fraud-v2.joblib"}
]
}

Why This Matters

Logs say something happened. Evidence helps answer whether the result is reviewable, comparable, reproducible, and recoverable.