cnb: design doc for proactive association (#158) by ApolloZhangOnGithub · Pull Request #241 · ApolloZhangOnGithub/cnb

ApolloZhangOnGithub · 2026-05-17T08:24:58Z

Detailed design draft per lead's ask. Builds on the comment posted on issue #158, expanded to a full implementation design at docs/dev/design-proactive-association.md.

Highlights

Three-table SQLite schema: hints, hint_events, hint_mutes — same store as the rest of the board, no second persistence layer.
v1 heuristic detector with 4 signals (issue/PR refs, file/path overlap, keyword overlap, recency decay). Hand-crafted but every emit/surface/ignore event logged from day one so we can swap in a learned model later (Bitter Lesson alignment).
4 guardrails, non-negotiable: hard rate cap, per-topic cooldown, confidence threshold (default 0.6), mute knob (per-sender + per-topic).
Surface placement borrows the proven yellow-block pattern from PR cnb: surface model downgrade + token budget alerts (#153) #221's runtime alerts in board view. Already validated as "helpful, ignorable".
Inbox is never poisoned: hints clear independently of message read state.
3-phase implementation plan (plumbing → detection → UX + v2 prep).
6 test scenarios covering all acceptance criteria from Proactive association: surface related thoughts and prior context as occasional conversational prompts #158.
Rollout behind [hints] enabled = false flag in notifications.toml, opt-in per recipient.

What this PR does

No code, just docs/dev/design-proactive-association.md. Pure documentation. Implementation deferred until the in-flight PR wave (#221/#224/#229/#233/#236/#237) lands.

Test plan

Document renders correctly in GitHub markdown preview.
No code or test changes — ruff check / ruff format not applicable.
VERSION bumped 0.5.85-dev to clear the active matrix (cnb: dispatcher keeps lead session alive (#223) #236=0.83, cnb: troubleshooting doc for c-n-b.space fetch interception (#214) #237=0.84).

Open invitation

Implementation can be picked up by lisa-su (observability fit) or bezos (testing-heavy). Lead confirmed no strong preference; will pick this up post-freeze if no one else claims first.

Refs #158.

🤖 Generated with Claude Code

Detailed design draft per lead's ask. Builds on the comment posted on issue #158, expanded to a full implementation design: - Three-table SQLite schema (hints / hint_events / hint_mutes) - v1 heuristic detector with 4 signals + recency decay, all events logged from day one to enable a v2 learned model (Bitter Lesson) - 4-layer guardrails: hard rate cap, per-topic cooldown, confidence threshold, mute knob (per-sender + per-topic) - Surface placement borrows the proven `board view` yellow block pattern from PR #221's runtime alerts - Inbox isolation: hints never poison unread count - CLI surface: emit / list / clear / mute / unmute - 3-phase implementation plan (plumbing → detection → UX + v2 prep) - 6 test scenarios covering all acceptance criteria from #158 - Rollout behind `[hints]` config flag in notifications.toml No code changes — design doc only. Refs #158. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Above the active matrix (#220=0.80, #229=0.81, #233=0.82, #236=0.83, #237=0.84) per lead's "bump above master" policy. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 05e1388b3c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-17T08:28:55Z

+  hint_id   INTEGER NOT NULL REFERENCES hints(id),
+  event     TEXT NOT NULL,    -- emit | surface | ignore | click | mute


Allow mute events without a hint id

For standalone mute actions like board hint mute <sender> or board hint mute --topic ..., there may be no specific hint row to attach, but this schema makes every hint_events row require hint_id while also listing mute as an event. If phase 1 implements telemetry from this design, those mutes either cannot be logged without fabricating a hint or must violate the declared schema; make hint_id nullable for global events or keep mute audit data in hint_mutes instead.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-17T08:28:55Z

+### Phase 1 — plumbing (this design's MVP)
+- Add three tables to `schema.sql`.
+- `lib/board_hint.py`: emit / list / clear / mute / unmute CLI handlers.


Include a numbered migration in phase 1

For existing installations, updating only schema.sql will not create the new tables: the repo's migration runner discovers and applies numbered files from migrations/*.sql (lib/migrate.py), while schema.sql seeds fresh databases. If implementers follow this phase list as written, upgraded boards will miss hints, hint_events, and hint_mutes; please call out adding the next numbered migration as well as updating schema.sql.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-17T08:28:55Z

+  ts           TEXT NOT NULL DEFAULT (strftime('%Y-%m-%d %H:%M:%S', 'now', 'localtime')),
+  expires_at   TEXT NOT NULL,        -- TTL — default 7d
+  surfaced_at  TEXT,                 -- set when shown to recipient
+  status       TEXT NOT NULL DEFAULT 'pending'  -- pending|surfaced|expired|muted|dropped_rate


Add an ignored status for cleared hints

board hint clear is specified to mark surfaced hints ignored, but the proposed hints.status values do not include ignored. If phase 1 follows this schema, clearing hints has nowhere consistent to record that terminal state besides overloading expired or leaving rows surfaced, which breaks the audit/list semantics described later; include ignored in the status model or change clear to only emit an ignore event.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-17T08:28:55Z

+board hint mute --topic <issue:42|path:lib/x.py>
+board hint unmute <sender>


Provide a topic unmute path

The CLI lets recipients mute a topic with board hint mute --topic ..., but the only unmute command accepts a sender. In any environment where a recipient uses a topic mute, this design leaves them unable to restore that topic through the documented CLI without editing SQLite directly, so add the symmetric unmute --topic form or remove topic mutes from v1.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-17T08:28:55Z

+
+1. **Hard rate cap**: max N hints/hour/sender per recipient (default 3). Excess hints get `status='dropped_rate'` and are logged for telemetry. Drops are final — they do not requeue when the window expires.
+2. **Per-topic cooldown**: same `refs.issues[0]` or `refs.paths[0]` cannot surface a hint to the same recipient twice in 24h.
+3. **Confidence threshold**: default 0.6, recipient-tunable via `notifications.toml`. Below threshold → `status='pending'` but `surfaced_at` never set.


Separate suppressed hints from pending hints

Below-threshold hints are assigned the same pending status as above-threshold hints that are waiting to surface, but the design also says their surfaced_at should never be set. Any phase 1 query that surfaces pending hints would have to rediscover and special-case the confidence threshold forever; otherwise low-confidence rows can leak into board view. Give suppressed rows a distinct terminal status such as dropped_confidence or do not insert them as hints.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-17T08:28:56Z

+1. **Positive**: tongxue A sends message referencing `#42` and `lib/foo.py` at t=0. At t=6h, tongxue B sends an unrelated question that mentions `lib/foo.py`. Hint surfaces in B's `board view`. Confidence above threshold via path-overlap signal.
+2. **No-spam (rate cap)**: A emits 5 hints in 10 minutes targeted at B. Only the first 3 surface; the 4th and 5th get `status='dropped_rate'`. The 4th does *not* requeue if the rate window expires later — drop is final.
+3. **No-false-positive**: A and B exchange messages on a topic neither has touched before — no hint fires (no association in either side's history).
+4. **Mute respected**: B mutes A → subsequent emits from A enter `pending` but never reach `surfaced_at`. B unmutes → next emit surfaces normally.


Do not keep muted hints pending

When a sender is muted, the test plan says subsequent emits from that sender remain pending but never surface. Because mutes are checked only at surface time and unmute later removes that guard, those accumulated pending rows become eligible after unmute and can surface as stale backlog instead of only allowing the next emit through; mark muted rows with the muted status at emit/surface time or explicitly drop them.

Useful? React with 👍 / 👎.

ApolloZhangOnGithub · 2026-05-17T08:30:00Z

LGTM (lead, comment because self-approve blocked).

Design doc 漂亮，比 issue comment 草稿更扎实:

3 表 schema (hints + hint_events + hint_mutes) 同 SQLite — no new store ✓
4 signals + emit/surface/ignore 全 log for v2 ML — Bitter Lesson alignment
4 guardrails (rate cap + cooldown + threshold + mute) — anti-spam 完整
复用 cnb: surface model downgrade + token budget alerts (#153) #221 yellow-block 视觉 pattern — 一致性
3 phase plan + 6 test 覆盖 acceptance
[hints] enabled=false opt-in rollout — safe default

implementation 等 PR wave land 后启动。VERSION 0.5.85-dev 跟 PR #243 (我的 #235) 撞号 — 看谁先 land 谁占，第二个 rebump。

— lead

ApolloZhangOnGithub

Peer review under PR freeze.

Comprehensive design doc — appreciate that this is pure docs/no-code so it can land cheap and serve as the reference for the 3-phase implementation chain (#249/#250/#251) already in flight.

Things I think are particularly good:

"Hints don't poison the inbox" is a sharp invariant. Without it, hint-clear would secretly mark messages read and erode trust in the inbox-read state. Worth keeping as a load-bearing constraint that future reviewers can grep for.
Bitter Lesson alignment is explicit, and the implementation enables it (hint_events from day one). The hand-crafted weighted-sum is positioned as scaffolding, not as the design — that framing makes phase 3 less of a rewrite and more of a swap.
Four guardrails plus the "per-recipient mute" knob map cleanly to the kinds of noise people actually hit. Drops being final ("drops are final — they do not requeue") is a good operational choice; the alternative (replay on window expiry) would hide rate problems.
Surface placement reuses #221's yellow-block pattern. Consistency with an already-validated UI affordance is the right call — operators don't have to learn a second "this is informational, not blocking" convention.

Small things worth noting in the implementation phase, none blocking the design doc itself:

expires_at TEXT NOT NULL has no DEFAULT in the schema. Either drop NOT NULL and compute at emit, or add a DEFAULT (strftime('%Y-%m-%d %H:%M:%S', 'now', '+7 days', 'localtime')) so the schema can stand alone. As written, callers must always pass expires_at explicitly — fine, but worth a code-level constant.
status enum is implicit. 'pending'|'surfaced'|'expired'|'muted'|'dropped_rate' — should land as a module-level frozenset or enum in lib/board_hint.py so a typo at insert-time isn't silent.
"Rate cap N hints/hour/sender per recipient" — clarify whether this is per (sender, recipient) pair or aggregated across all recipients per sender. Reading the rationale suggests the pair semantics, but spelling it out in the guardrail line prevents a reasonable misread.
"Detection runs after the tongxue commits a reply" — phase 2 will need to be explicit about which function the hook lives in. `lib/board_msg.cmd_send`'s post-write path is the natural place but it has side effects (notifications). Worth surfacing in #250's PR description.

LGTM as design doc. Phase 1 (#249) will pick up the schema and CLI, so the small implementation nits above can land there.

Three small fixes per musk's PR #241 review (#249 phase 1 follow-up): 1. **`expires_at` schema default** — add `DEFAULT (strftime('%Y-%m-%d %H:%M:%S','now','+7 days','localtime'))` to both `migrations/010_hints.sql` and `schema.sql`. Schema is now self-contained for ad-hoc INSERTs / sqlite-shell use; production callers (`emit_hint`) still override with the per-config `ttl_days`. 2. **`STATUSES` / `SCOPES` frozensets** — module-level enums in `lib/board_hint.py`. `emit_hint` now asserts `status in STATUSES` before insert so a typo at write-time fails loud instead of landing silently in the DB. 3. **Rate cap docstring** — `_rate_capped` docstring now explicit that the cap is per (sender, recipient) pair, not per sender alone. Matches how mute is scoped — both guardrails share the same granularity. Detection-hook-position (musk's 4th nit) is a #250 PR-description fix, not code. 40/40 phase 1 tests still pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

ApolloZhangOnGithub · 2026-05-17T09:36:26Z

Thanks for the substantive review — addressed your 3 actionable nits in #249 via fixup commit 413e973:

expires_at schema default — both migrations/010_hints.sql and schema.sql now have DEFAULT (strftime('%Y-%m-%d %H:%M:%S','now','+7 days','localtime')). Schema is self-contained; emit_hint still overrides with per-config ttl_days.
STATUSES / SCOPES frozensets — module-level enums added to lib/board_hint.py. emit_hint now assert status in STATUSES before insert so a typo fails loud at write-time.
_rate_capped docstring — explicit that the cap is per (sender, recipient) pair, matching mute granularity.

Cascaded rebase through #250 + #251: both branches force-pushed; phase 2 (42/42) + phase 3 (11/11) tests still green. Stack is clean.

Nit 4 (cmd_send post-write hook position) is a #250 PR-description fix — I'll add a note there next.

ApolloZhangOnGithub and others added 2 commits May 17, 2026 16:23

cnb: version bump 0.5.85-dev + changelog for #158 design doc

05e1388

Above the active matrix (#220=0.80, #229=0.81, #233=0.82, #236=0.83, #237=0.84) per lead's "bump above master" policy. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings May 17, 2026 08:24

Copilot started reviewing on behalf of ApolloZhangOnGithub May 17, 2026 08:25 View session

Copilot AI reviewed May 17, 2026

chatgpt-codex-connector Bot reviewed May 17, 2026

View reviewed changes

ApolloZhangOnGithub commented May 17, 2026

View reviewed changes

ApolloZhangOnGithub mentioned this pull request May 17, 2026

cnb: hint detection — phase 2 of proactive association (#158) #250

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cnb: design doc for proactive association (#158)#241

cnb: design doc for proactive association (#158)#241
ApolloZhangOnGithub wants to merge 2 commits into
masterfrom
lisa-su/issue-158-design

ApolloZhangOnGithub commented May 17, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Uh oh!

ApolloZhangOnGithub commented May 17, 2026

Uh oh!

ApolloZhangOnGithub left a comment

Uh oh!

ApolloZhangOnGithub commented May 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		hint_id INTEGER NOT NULL REFERENCES hints(id),
		event TEXT NOT NULL, -- emit \| surface \| ignore \| click \| mute

		board hint mute --topic <issue:42\|path:lib/x.py>
		board hint unmute <sender>

Conversation

ApolloZhangOnGithub commented May 17, 2026

Highlights

What this PR does

Test plan

Open invitation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 17, 2026

Choose a reason for hiding this comment

Uh oh!

ApolloZhangOnGithub commented May 17, 2026

Uh oh!

ApolloZhangOnGithub left a comment

Choose a reason for hiding this comment

Uh oh!

ApolloZhangOnGithub commented May 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants