WeKnora/internal at 3ae3ea97c5dce13e159b6cfcc11e33eb3ad08f87 - WeKnora - Gitea: Git with a cup of tea

pub_soft/WeKnora

mirror of https://github.com/Tencent/WeKnora.git synced 2026-06-04 13:30:32 +08:00

Files

History

wizardchen 3ae3ea97c5 fix(knowledge): prevent documents from getting stuck in "processing"

Several failure modes left Knowledge.parse_status pinned at "processing"
forever, with no signal to users beyond a permanent spinner. This commit
addresses the root causes and adds a safety net.

- Asynq worker pool: explicit Concurrency (default 16, env-tunable via
  WEKNORA_ASYNQ_CONCURRENCY) so batch uploads don't queue behind a
  CPU-count-sized worker pool. Redis op timeouts raised to 500ms/1000ms
  (WEKNORA_REDIS_OP_TIMEOUT_MS) to absorb bursty multimodal counter ops.

- DocReader RPC: cap each call with WEKNORA_DOCREADER_CALL_TIMEOUT
  (default 30m). Without this, a hung docreader pinned a worker for the
  full DocumentProcessTimeout window.

- ImageMultimodal: finalize-on-last-attempt semantics. A permanently
  failing single image no longer strands the parent — the asynq retry
  is allowed to run, but on the final attempt we count the image
  regardless of outcome. Redis DECR errors fall back to enqueuing the
  post-process task instead of returning silently.

- Dead-letter callback: when DocumentProcess / KnowledgePostProcess /
  ManualProcess exhausts retries, immediately mark the corresponding
  Knowledge as failed with the last error. This surfaces the failure
  in the UI without waiting for the housekeeping sweep.

- HousekeepingService: 5-minute cron that flips knowledge rows stuck
  in "processing" past DocumentProcessTimeout + 10m to failed, plus
  summary rows stuck > 1h. Catches anything the other safety nets
  miss (worker SIGKILL mid-handler, etc.). Disable with
  WEKNORA_HOUSEKEEPING_ENABLED=false.

- Distributed startup recovery: previously the post-restart sweep was
  skipped whenever REDIS_ADDR was set, even though Asynq does not
  reschedule the task that was actively running on the dead instance.
  Now the sweep runs in distributed mode too, but only against rows
  older than 30 minutes to avoid racing peer instances.

2026-05-28 15:14:45 +08:00

..

fix(agent): harden tool parameter parsing against LLM type mismatches (#1505 )

2026-05-28 07:50:26 +08:00

fix(knowledge): prevent documents from getting stuck in "processing"

2026-05-28 15:14:45 +08:00

feat(assets): add ASR test audio file and embed it in the application

2026-04-02 21:27:27 +08:00

feat: Implement deadlock retry mechanism for chunk creation

2026-04-22 21:17:21 +08:00

fix(knowledge): prevent documents from getting stuck in "processing"

2026-05-28 15:14:45 +08:00

fix(knowledge): prevent documents from getting stuck in "processing"

2026-05-28 15:14:45 +08:00

feat(system-info): surface DB migration errors with troubleshooting links

2026-05-14 16:34:50 +08:00

fix(datasource): support Yuque team token in connector

2026-05-26 20:46:22 +08:00

feat(tenant): implement tenant creation limit and error handling

2026-05-18 17:28:58 +08:00

feat(agent): human-in-the-loop approval for MCP tool calls (#1173 )

2026-05-10 22:57:12 +08:00

feat(rbac): add multi-use share-link invitations for invite_only mode (#1504 )

2026-05-27 19:42:01 +08:00

fix(im): make presigned URL flow diagnosable end-to-end

2026-05-28 08:03:57 +08:00

feat(retriever): add OpenSearch driver skeleton + interface stubs (PR 2a of 3)

2026-05-26 20:54:58 +08:00

refactor(logger): support LOG_FORMAT template and harden level coloring

2026-05-22 20:31:54 +08:00

feat(mcp): implement reconnection logic for MCP tool calls and tool listing

2026-03-31 11:57:15 +08:00

fix(knowledge): prevent documents from getting stuck in "processing"

2026-05-28 15:14:45 +08:00

docs(chat): clarify cached-token semantics for explicit-cache providers

2026-05-25 16:47:14 +08:00

fix(knowledge): prevent documents from getting stuck in "processing"

2026-05-28 15:14:45 +08:00

chore(runtime): silence gin per-route logs and emit env config banner at startup

2026-05-17 15:27:52 +08:00

增加sandbox对windows编译支持，现在默认是linux的实现，windows直接编译报错

2026-05-25 16:57:56 +08:00

fix(summary): preserve image caption/OCR text in document summaries

2026-05-22 17:25:39 +08:00

feat(redis): add REDIS_USERNAME support for Redis ACL

2026-02-04 19:38:40 +08:00

feat(observability): extend Langfuse tracing across asynq pipeline

2026-04-24 13:16:47 +08:00

feat(rbac): add multi-use share-link invitations for invite_only mode (#1504 )

2026-05-27 19:42:01 +08:00

feat(system): consolidate system admin and settings into one Settings panel

2026-05-26 21:13:56 +08:00