Phase 4 · W37–W38

W37–W38: Continuous Updates (content pipeline)

Build a repeatable content pipeline so knowledge stays fresh, tested, and trustworthy instead of slowly rotting.

Suggested time: 4–6 hours/week

Outcomes

  • A clear content lifecycle (draft → reviewed → published → deprecated).
  • A folder structure that supports growth.
  • Automated ingestion on change.
  • Retrieval tests as a quality gate.
  • A stale-content checklist for regular review.

Deliverables

  • Content lifecycle rules document with metadata fields.
  • One command to run end-to-end ingestion consistently.
  • Retrieval quality gate with threshold and non-zero failure exit.
  • Stale content report flagging review/retire and missing metadata.

Prerequisites

  • W35–W36: Runbooks, RCA, and Standard Operating Procedures

What you’re doing

You stop letting your knowledge base rot.

Every KB dies the same way:

  • people write a few docs
  • nobody updates them
  • they become wrong
  • everyone stops trusting the KB

So you build a pipeline that makes updates cheap:

  • new docs are easy to add
  • old docs get reviewed
  • ingestion runs automatically
  • retrieval tests catch breakage

Time: 4–6 hours/week
Output: a content pipeline that keeps the KB fresh: versioning, a review loop, automated ingestion, and quality checks


The promise (what you’ll have by the end)

By the end of W38 you will have:

  • A clear “content lifecycle” (draft → reviewed → published → deprecated)
  • A folder structure that supports growth
  • Automated ingestion on change
  • Retrieval tests as a quality gate
  • A “stale content” checklist (what must be reviewed regularly)

The rule: freshness is part of quality

An outdated runbook is worse than no runbook:
it leads to confident, wrong actions.

So you treat content like code:

  • versioned
  • reviewed
  • tested
  • released

Step-by-step checklist

1) Define content lifecycle

Simple statuses:

  • DRAFT
  • REVIEWED
  • PUBLISHED
  • DEPRECATED

Add this status to every doc’s metadata.
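
To make the transitions enforceable, you can encode them as data that your tooling checks. A minimal sketch (the transition rules here are an assumption; adjust them to your process):

```python
# Hypothetical transition map for the content lifecycle.
# Tooling can reject any status change that is not listed here.
ALLOWED_TRANSITIONS = {
    "DRAFT": {"REVIEWED"},
    "REVIEWED": {"PUBLISHED", "DRAFT"},    # a reviewer can bounce a doc back
    "PUBLISHED": {"DEPRECATED", "DRAFT"},  # retire it, or reopen for major edits
    "DEPRECATED": set(),                   # terminal state
}

def can_transition(current: str, new: str) -> bool:
    return new in ALLOWED_TRANSITIONS.get(current, set())
```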

2) Add ownership

Every doc must have:

  • owner
  • last_reviewed_at
  • review_frequency (e.g., 90 days)

If there is no owner, it will rot.
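
Concretely, the required fields might look like this (a sketch; the field names are suggestions, not a standard):

```python
# Hypothetical per-doc metadata, shown as a Python dict for validation.
REQUIRED_METADATA = {
    "status": "PUBLISHED",             # DRAFT | REVIEWED | PUBLISHED | DEPRECATED
    "owner": "alice@example.com",      # a person, not a team alias
    "last_reviewed_at": "2025-01-15",  # ISO date, bumped on every review
    "review_frequency_days": 90,       # how often the owner must re-check it
}

def missing_fields(meta: dict) -> list[str]:
    """Return the required keys a doc's metadata is missing."""
    return [key for key in REQUIRED_METADATA if key not in meta]
```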

3) Create KB folder conventions

Example:

  • knowledge/runbooks/
  • knowledge/rca/
  • knowledge/sop/
  • knowledge/mappings/
  • knowledge/known-issues/

Keep it boring and consistent.

4) Automate ingestion

On every change to /knowledge:

  • regenerate chunks.jsonl
  • update index data (even if local for now)

If you have GitHub Actions:

  • run ingestion on PR
  • store artifact (optional)

If you don’t, at least create one command:

  • `make ingest` (or similar)
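
A minimal sketch of what that command could wrap, assuming markdown docs under knowledge/ and a naive fixed-size chunker (swap in your real chunker and add your index update):

```python
#!/usr/bin/env python3
"""Rebuild chunks.jsonl from everything under knowledge/ in one command."""
import json
import pathlib

KNOWLEDGE_DIR = pathlib.Path("knowledge")  # assumed repo layout
OUT_FILE = pathlib.Path("chunks.jsonl")

def chunk(text: str, size: int = 800) -> list[str]:
    # Naive fixed-size chunking; replace with your real chunker.
    return [text[i:i + size] for i in range(0, len(text), size)]

def main() -> None:
    with OUT_FILE.open("w", encoding="utf-8") as out:
        for path in sorted(KNOWLEDGE_DIR.rglob("*.md")):
            for i, piece in enumerate(chunk(path.read_text(encoding="utf-8"))):
                record = {"doc": str(path), "chunk_id": i, "text": piece}
                out.write(json.dumps(record, ensure_ascii=False) + "\n")
    print(f"wrote {OUT_FILE}")  # hook your index update in here next

if __name__ == "__main__":
    main()
```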

5) Add retrieval tests as a gate

Use your W31–W32 benchmark.
If the top-5 hit rate drops below your threshold:

  • fail the pipeline / CI
  • force fixes before merge

This prevents the silent failure where docs change and retrieval quietly breaks.
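
The gate itself can be a tiny script that exits non-zero below the threshold. A sketch, assuming your W31–W32 benchmark writes one JSON line per query with a boolean hit_top5 field (the file and field names are assumptions):

```python
#!/usr/bin/env python3
"""Fail CI when the top-5 hit rate drops below the agreed threshold."""
import json
import pathlib
import sys

THRESHOLD = 0.80  # pick a value your current benchmark comfortably clears

lines = pathlib.Path("benchmark_results.jsonl").read_text(encoding="utf-8").splitlines()
results = [json.loads(line) for line in lines if line.strip()]
if not results:
    sys.exit("no benchmark results found")  # exits 1

rate = sum(r["hit_top5"] for r in results) / len(results)
print(f"top-5 hit rate: {rate:.2%} (threshold {THRESHOLD:.0%})")
sys.exit(0 if rate >= THRESHOLD else 1)  # non-zero exit blocks the merge
```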

6) Stale content detection

Create a simple script/report:

  • list docs not reviewed in 90/180 days
  • list docs with deprecated status
  • list docs missing owner or metadata

This keeps the KB alive.
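
A sketch of such a report, assuming simple “key: value” front matter between --- markers and the metadata fields from step 2:

```python
#!/usr/bin/env python3
"""Flag docs that are overdue for review, deprecated, or missing metadata."""
from datetime import date, timedelta
import pathlib

MAX_AGE = timedelta(days=90)  # tighten or relax per review_frequency_days

def front_matter(path: pathlib.Path) -> dict:
    # Minimal "key: value" parser; use a real front-matter library if you have one.
    lines = path.read_text(encoding="utf-8").splitlines()
    meta = {}
    if lines and lines[0].strip() == "---":
        for line in lines[1:]:
            if line.strip() == "---":
                break
            if ":" in line:
                key, value = line.split(":", 1)
                meta[key.strip()] = value.strip()
    return meta

for path in sorted(pathlib.Path("knowledge").rglob("*.md")):
    meta = front_matter(path)
    if not meta.get("owner"):
        print(f"MISSING OWNER   {path}")
    if meta.get("status") == "DEPRECATED":
        print(f"DEPRECATED      {path}")
    reviewed = meta.get("last_reviewed_at")
    if not reviewed or date.fromisoformat(reviewed) < date.today() - MAX_AGE:
        print(f"REVIEW OVERDUE  {path}")
```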


Deliverables (you must ship these)

Deliverable A — Content lifecycle rules

  • a doc describing statuses and transitions
  • metadata fields defined

Deliverable B — Automation command

  • one command to run ingestion end-to-end
  • output generated consistently

Deliverable C — Quality gate

  • retrieval benchmark threshold defined
  • failing threshold produces non-zero exit

Deliverable D — Stale content report

  • report lists docs to review/retire
  • missing metadata is flagged

Common traps (don’t do this)

  • Trap 1: “We’ll remember to update docs.”

No. You won’t. Automate it.

  • Trap 2: “No ownership.”

No owner = dead content.

  • Trap 3: “No tests for KB.”

If you don’t test retrieval, it will degrade silently.

Quick self-check (2 minutes)

Answer yes/no:

  • Does every doc have owner + review date?
  • Can I run ingestion with one command?
  • Do retrieval tests run automatically and block regressions?
  • Do I have a stale content report?
  • Is content lifecycle documented and used?

If any “no” — fix it before moving on.


Next module preview (W39–W40)

Next: Review & Hardening (quality gates).
We’ll make the whole KB system reliable enough to show in a portfolio.