Wire-contract documentation and the CI check that keeps it honest. * cli/README.md gains a verbatim --help block (top-level + subtrees), an Exit codes table covering 0/1/2/3/4/5/6/7/10/124/130, a "Status vs check" verb-pair subtable, and a "doc wait" paragraph spelling out the four exit codes (0 / 1 / 124 / 130). The api passthrough note trims storage provider out of the deep-config list now that kb create --storage-provider is a polished flag. * cli/AGENTS.md becomes the contributor guide: build/test, CRUD flag conventions, the status/check verb pattern, long-poll wait commands, the SetAgentHelp pattern, and a full Error code reference with 35 typed codes mapped to namespaces, exit codes, retryable / hint guidance. Reference section is bracketed by HTML markers so a CI parity test can keep it in sync with AllCodes(). * cli/internal/cmdutil/errors_doc_test.go enforces parity: every code in AllCodes() must appear in AGENTS.md inside the markers, and AGENTS.md must not reference codes that no longer exist. Fails CI if a new typed code is added without documentation. * CHANGELOG.md gets the v0.6 entry: BREAKING (--json / --no-stream / WEKNORA_SDK_DEBUG / kb create --name), Added (--format / --jq / doc wait / --log-level / kb-and-agent status & check / multi-id delete / api --paginate / MCP schema extension / SetAgentHelp / signal-aware ctx / kb create --storage-provider / new operation.* namespace), Changed (multi-id partial-failure exit code, doc upload FlagError, --log-level FlagError, multi-id stdout cleanup, README / AGENTS.md changes), with a Migration from v0.5 section walking every BREAKING through its v0.6 replacement.
18 KiB
AGENTS.md
This is the WeKnora CLI (weknora), a command-line client for the WeKnora RAG server. The module path is github.com/Tencent/WeKnora/cli.
The wire contract for AI agents consuming weknora output (JSON shape, exit codes, error format) lives in README.md — read that if you're integrating with the CLI binary, not modifying it.
Build, Test, and Lint
go build -o weknora . # build (from cli/)
go test -count=1 ./... # unit + contract tests
go test -run TestFoo ./internal/format/ # single test
go test ./acceptance/contract/ -args -update # refresh wire goldens
go test -tags acceptance_e2e ./acceptance/e2e/... # live-server e2e (gated by env)
go vet ./...
Both go test -count=1 ./... and go vet ./... must pass before committing.
Architecture
Entry point: cmd/main.go → cmd.Execute() → cmd.NewRootCmd(cmdutil.New()).
Key packages:
cmd/<name>/— cobra command implementations, one subdir per top-level commandinternal/cmdutil/—Factory,FormatOptions, typedError, exit-code mapping, destructive-write confirm, KB id-or-name resolveinternal/format/— bare JSON emitter (WriteJSON/WriteJSONFiltered)internal/iostreams/— global IO singleton + TTY detection +SetForTestswapinternal/secrets/—Storeinterface;KeyringStoreprimary,FileStore0600 fallback,MemStorefor testsinternal/prompt/—TTYPrompter(password no-echo) +AgentPrompter(non-TTY no-prompt sentinel)internal/sse/—Accumulatorfor chat / agent invoke SSE streamsinternal/mcp/— curated 10-tool stdio MCP server (wired bycmd/mcp/serve.go); see MCP tool surface for the curation rationale and inventoryclient/(parent module) — generated SDK
Command Structure
A command weknora foo bar lives in cmd/foo/bar.go with bar_test.go.
Canonical Examples
- Command + tests:
cmd/kb/list.goandlist_test.go - Destructive write + confirm protocol:
cmd/kb/delete.go - SSE streaming command:
cmd/chat/chat.go - Factory wiring:
internal/cmdutil/factory.go
The Options + Narrow Service Pattern
Every command follows this structure (see cmd/kb/list.go):
Optionsstruct with flag-bound fieldsServiceinterface declaring only the SDK methods this command calls.*sdk.Clientsatisfies it implicitly via duck typing.NewCmd<Verb>(f *cmdutil.Factory) *cobra.Commandconstructor — flag registration +cmdutil.AddFormatFlag- Separate
run<Verb>(ctx, opts, fopts, svc, args...)with the business logic — the test injection point
Key rules:
- Each command owns its own
Serviceinterface; do NOT share interfaces acrosscmd/*packages. Per-file dependency graph is the goal. - Lazy-init
f.Client()/f.Secrets()/f.Prompter()insideRunE, not the constructor (else--helpforces auth). - Required flags:
_ = cmd.MarkFlagRequired("name")— cobra returns the error only on registration-time typo. - New subtrees register in
cmd/root.go NewRootCmd. Verb subtrees register their leaves in the subtree's ownNewCmd.
Command Examples and Help Text
Use a Go raw string with weknora as the example prefix. Keep one-line Short ≤ 70 chars; Long may run multi-paragraph; Example always includes weknora so copy-paste works:
Example: ` weknora kb view <id>
weknora kb view kb_abc --format json
weknora kb view kb_abc --format json --jq '{id, name}'`,
JSON Output
Add --format / --jq via cmdutil.AddFormatFlag(cmd, fieldNames...). In RunE:
fopts, err := cmdutil.CheckFormatFlag(c)
if err != nil { return err }
fopts.ResolveDefault(iostreams.IO.IsStdoutTTY())
// ...
if fopts.WantsJSON() {
return fopts.Emit(iostreams.IO.Out, result)
}
Emit is the single source for the bare-JSON contract — it honors --format json|ndjson and --jq <expr> filtering. Never call format.WriteJSON* directly from a command. See cmd/kb/list.go.
Destructive Writes
Commands that delete / empty / overwrite call cmdutil.ConfirmDestructive(p, opts.Yes, fopts.WantsJSON(), what, id) before mutation. In non-TTY OR JSON-output mode without -y, it returns CodeInputConfirmationRequired → exit 10. See internal/cmdutil/confirm.go.
Testing
Narrow Service Fakes
Each command's runX(ctx, opts, fopts, svc, ...) takes its interface, not *sdk.Client. Tests inject plain-struct fakes:
type fakeBarSvc struct {
gotID string
resp *sdk.Bar
err error
}
func (f *fakeBarSvc) GetBar(_ context.Context, id string) (*sdk.Bar, error) {
f.gotID = id
return f.resp, f.err
}
No mocking library; the narrow-interface design makes fakes 5 lines each.
IOStreams in Tests
out, errBuf := iostreams.SetForTest(t) // bytes.Buffer sinks, non-TTY
ios, _ := iostreams.SetForTestWithTTY(t) // simulate terminal
Confirm Prompts
Use testutil.ConfirmPrompter{Answer: bool, Err: error} from internal/testutil/. Single source for the prompt double — do NOT re-define confirmPrompter per package.
Assertions
Use testify. Prefer require (not assert) for error checks so the test halts immediately, and assert for value comparisons:
require.NoError(t, err)
require.ErrorAs(t, err, &typed)
assert.Equal(t, "expected", actual)
Acceptance: Wire-Shape Goldens
acceptance/contract/wire_test.go drives the in-process cobra tree against httptest.Server fixtures and compares stdout to acceptance/testdata/wire/<case>.json. Error-path cases also assert stderr contains the typed code substring (e.g. auth.unauthenticated). Update goldens with go test ./acceptance/contract/ -args -update.
Table-Driven Tests
Use for flag validation, error classification, parser edge cases. See internal/cmdutil/exit_test.go and cmd/kb/list_test.go.
tests := []struct{ name string; ...}{
{name: "descriptive case", ...},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) { /* arrange, act, assert */ })
}
Code Style
- Add godoc to every exported function, type, and constant. Explain why, not what — the name already says what.
- Don't comment to restate the code. Delete comments that narrate the next line.
- Don't reference task numbers, commit SHAs, or version tags in inline comments — they belong in CHANGELOG or git log.
- Never paste em-dashes (—) into Go source; use ASCII
-or rewrite. (Markdown docs may use em-dashes.) - Don't add a helper for a single caller — inline.
Error Handling
Typed error helpers in internal/cmdutil/errors.go:
cmdutil.NewError(code, msg)— fresh typed errorcmdutil.WrapHTTP(err, format, args...)— wrap an SDK error + classify from HTTP status (404 →resource.not_found, 401 →auth.unauthenticated, …). Use at every SDK call site.cmdutil.Wrapf(code, err, format, args...)— explicit wrap with a chosen codecmdutil.NewFlagError(err)— flag / argument problem → exit 2cmdutil.SilentError— exit 1 without printing (when output already emitted)cmd.MarkFlagsMutuallyExclusive("a", "b")— cobra-level mutex
Errors print to STDERR via cmdutil.PrintError(w, err) as code: msg\nhint: .... STDOUT stays bare JSON or empty on failure, so --json | jq pipelines never have to filter error shapes.
User-facing exit-code mapping lives in README.md "Exit codes". When adding a new ErrorCode constant, also append to AllCodes() so the acceptance contract picks it up.
Error code reference
Audience: AI agents and scripted callers parsing
weknorastderr. Code authors writing new error sites — see## Error Handlingabove.
When weknora exits non-zero, stderr carries a structured triplet:
<code>: <message>
hint: <actionable next step>
Agents parse the first colon to extract the typed code. The exit code class (see README.md "Exit codes") controls retry / surface decisions; the typed code disambiguates within a class.
| Code | Exit | Retryable | Default hint |
|---|---|---|---|
auth.unauthenticated |
3 | no (run auth login) |
run weknora auth login |
auth.token_expired |
3 | yes (after refresh) | your session expired; run weknora auth login to re-authenticate |
auth.bad_credential |
3 | no (re-login) | run weknora auth login |
auth.forbidden |
3 | no | active context lacks permission for this resource |
auth.cross_tenant_blocked |
3 | no | verify tenant context with weknora auth status |
auth.tenant_mismatch |
3 | no | verify tenant context with weknora auth status |
input.invalid_argument |
5 | no | see weknora <command> --help for valid usage |
input.missing_flag |
5 | no | see weknora <command> --help for valid usage |
input.confirmation_required |
10 | NO automatic retry | high-risk write - re-run with -y/--yes after the user explicitly approves |
resource.not_found |
4 | no | verify the resource ID and try again |
resource.already_exists |
1 | no | use a different name or fetch the existing resource |
resource.locked |
1 | maybe (transient lock) | (no canonical hint; check resource state) |
server.error |
7 | yes (with backoff for 5xx) | (no canonical hint) |
server.timeout |
7 | yes (with backoff) | request timed out; retry, or run weknora doctor to check connectivity |
server.rate_limited |
6 | yes (back off, then retry) | rate-limited; retry after a few seconds |
server.session_create_failed |
1 | yes (with backoff) | could not create a chat session; pass --session to reuse an existing session |
server.incompatible_version |
7 | no (upgrade required) | run weknora doctor to see version skew details |
network.error |
7 | yes (with backoff) | check base URL reachability with weknora doctor |
operation.timeout |
124 | yes (raise --timeout) |
wait timed out; raise --timeout or check the underlying job |
operation.failed |
1 | no (target reached terminal failure) | one or more targets reached a terminal failure (e.g. doc parse_status=failed) |
operation.cancelled |
1 (main overrides to 130) | no | command interrupted by SIGINT / SIGTERM. The typed code maps to exit 1, but main raises the exit to 130 when the root context was signal-cancelled so the user-visible exit follows Unix signal convention. |
local.config_corrupt |
1 | no (manual fix) | remove ~/.config/weknora/config.yaml and re-run weknora auth login |
local.context_not_found |
1 | no | (no canonical hint; check weknora context list) |
local.file_io |
1 | no | check file permissions under $XDG_CONFIG_HOME/weknora/ |
local.kb_id_required |
1 | no | run weknora link to bind this directory to a knowledge base, or pass --kb |
local.kb_not_found |
1 | no | list available with weknora kb list |
local.keychain_denied |
1 | no (system-level) | verify keyring access; falls back to file storage |
local.project_link_corrupt |
1 | no | remove .weknora/project.yaml and run weknora link again |
local.sse_stream_aborted |
1 | yes (rerun chat / agent invoke) | the streaming answer was cut off mid-flight; retry, or pass --format json to buffer the full response |
local.unimplemented |
1 | no | (planned in a future release) |
local.upload_file_not_found |
1 | no | verify the path is correct and readable |
local.user_aborted |
1 | no (user said no) | no action taken; pass -y/--yes to skip the confirmation prompt |
mcp.readonly_mode |
1 | no | MCP tool surface is read-only; mutations not exposed in this mode |
mcp.schema_unknown_command |
1 | no | (no canonical hint) |
mcp.tool_not_allowed |
1 | no | MCP tool not in the curated allowlist |
Agent decision shortcuts
For common retry patterns, agents can hardcode:
network.*→ retry with exponential backoffauth.token_expired→ runweknora auth refresh, then retry onceserver.rate_limited→ back off (Retry-After if present) then retryoperation.timeout→ raise--timeoutand retry, or surface to userinput.confirmation_required→ NEVER auto-pass-ywithout explicit user authorization*.invalid_argument/*.missing_flag→ surface to user (don't retry)
MCP Tool Surface
WeKnora's MCP server exposes a curated read-only tool surface. Many MCP servers in the wild ship write / mutation operations on by default and rely on credential-scope or sandbox restrictions for safety. WeKnora opts for curation instead: the server side doesn't yet enforce per-token scope, so an agent holding a user's token has full write access. Until server-side scope ships, the CLI keeps mutation tools out of the MCP surface as a belt-and-braces second line of defense. When server scope arrives this stance can loosen.
The curated 10 tools (cli/internal/mcp/tools.go):
| Tool | Purpose |
|---|---|
kb_list |
list knowledge bases |
kb_view |
fetch a knowledge base by id |
doc_list |
list documents in a kb (paginated, status-filterable) |
doc_view |
fetch a document by id |
doc_download |
download raw bytes (1 MiB cap, base64 for binary) |
chunk_list |
list chunks of a document for RAG retrieval debug |
search_chunks |
hybrid (vector + keyword) retrieval |
chat |
stream a RAG answer; auto-creates a session if absent |
agent_list |
list custom agents |
agent_invoke |
run a query through a custom agent |
Adding a tool is a deliberate API expansion — the agent-callable surface is the reason this CLI ships an MCP server, not its CLI command list, so the registration list in registerTools is maintained by hand.
Command surface design SOP
Before specifying any CLI command, do this in order:
grep -A 50 "type Foo struct" client/foo.go— dump SDK request/response schemas.- List every field with type and source line.
- For each field, decide: hot-path flag / config-file only / hidden / never-expose.
- Cross-check pagination signatures: an SDK
(ctx, id, page, pageSize)shape demands--limit+--all-pages+--page-sizeon the CLI side. - ONLY THEN consult mainstream CLI conventions to choose flag names, positionals, mutex, and confirm semantics.
- Decide which fields are "top use case" (flag) / "advanced" (
--config-fileor escape hatch viaweknora api). Don't try to flag-cover every SDK field — mature CLIs that curate ship a tighter surface; CLIs that 1:1 mirror their API pay the UX cost.
Rationale: earlier drafts produced three categories of schema errors — fields that didn't exist on the underlying SDK, wrong field counts in user-facing docs, and missing pagination flags — that all stemmed from "design from convention, not from SDK." The fix is canonical: the SDK schema is the ground truth; convention decides names and shapes around that ground truth.
CRUD command flag conventions
CRUD commands follow the hard-required-flags pattern: every required input is a flag or positional, and a missing one yields an immediate input.invalid_argument exit. The contrast is TTY-prompts-fill, where missing input opens an interactive prompt; that pattern is reserved for auth login (the one command where a human must be at the terminal).
Required-input idioms in this codebase:
- Positional required:
cobra.ExactArgs(N)orcobra.MinimumNArgs(1) - Flag required:
cmd.MarkFlagRequired("flag") - Custom required (e.g.,
agent editneeds at-least-one-edit-flag): RunE-level validation that returnsinput.invalid_argument - Mutex:
cmd.MarkFlagsMutuallyExclusive("a", "b")
Reasons hard-required-flags is the v0.5+ default:
-
Admin / debug commands have no natural human-interactive prompt to lean on.
-
Agent-friendly: MCP callers do not stall waiting for stdin prompts.
-
Consistent with every existing non-auth WeKnora command.
-
Agent help blob (v0.6, partial): Commands MAY call
cmdutil.SetAgentHelp(cmd, cmdutil.AgentHelp{...})to expose a stable JSON used_for / required_flags / examples / output shape. Activated byWEKNORA_AGENT_HELP=1at--helptime. Currently applied tochatandkb listonly — extending to another command requires touching only that command'sNewCmd.
Status / check verb pair pattern
When a resource has both a cheap "is it alive?" probe and a deeper "verify its dependencies / aggregate state" probe, expose them as two verbs so the verb itself communicates cost:
status <id>— single HTTP, returns reachable + cheap fields.check <id>— 1 + N HTTP, adds derived state that needs follow-up calls (e.g., aggregatingfailed_countvia doc-list page-walk, probing every KB in an agent's scope).
Current pairs: kb status / kb check, agent status / agent check.
The deep verb's Long help text must enumerate the extra HTTP calls so
cost is predictable.
Long-poll wait commands
doc wait <doc-id> [<doc-id>...] is the model for any future
wait command:
- Always wait-all on multi-target (no fail-fast flag); compose in shell
(
wait id1 && wait id2) when fail-fast is needed. - Exponential backoff with jitter (initial
--interval, cap 15s). - Concurrency capped (5 in flight); large fan-out via
xargs -P. - Exit-code priority: failed (1) > timeout (124) > completed (0). The
failed bucket is
operation.failed, notserver.error— a target's own terminal failure is not a transient transport issue. - Validate
--format/--jqbefore polling so an invalid flag does not cost the caller a multi-minute poll.