WeKnora/internal at fd3d9f547c373ba4c73638adcf5b52c7543fe0d2 - WeKnora - Gitea: Git with a cup of tea

pub_soft/WeKnora

mirror of https://github.com/Tencent/WeKnora.git synced 2026-06-04 13:30:32 +08:00

Files

History

ochan.kwon fd3d9f547c feat(search): fan-out KB retrieval across bound vector stores

Multi-KB hybrid search now groups KBs by their bound VectorStore (partition
key (storeID, owner_tenant_id)), retrieves in parallel via errgroup with a
SetLimit(4) cap and a per-group timeout (MULTI_STORE_RETRIEVE_TIMEOUT_SEC,
default 30s), and merges results. When the collected results span more than
one engine type, an EngineAwareNormalizer rescales vector scores to [0, 1];
keyword (BM25) scores pass through to the existing RRF fusion. Single-group
calls take the fast path with zero fan-out overhead, preserving today's
behavior for deployments where every KB has vector_store_id = NULL.

Embedding-model consistency is now enforced explicitly via
ResolveEmbeddingModelKeys. Multi-KB searches across KBs whose resolved
model identities differ return BadRequest instead of silently producing
incomparable scores. Cross-tenant Organization-shared KBs are preserved by
partitioning on KB.TenantID so the factory's ownership lookup runs against
the source tenant. Foreign-tenant KB UUIDs injected via the request body
are rejected via kbShareService.HasTenantKBPermission (Plan 3 of #1303,
3-D capped) before any retrieval; rejected scopes surface as 404 to avoid
leaking foreign KB existence.

Service-layer typed AppErrors (ErrVectorStoreBindingInvalid 2200 /
ErrVectorStoreUnavailable 2201) are mapped from PR2 sentinel hierarchy and
preserved end-to-end: the iterative FAQ path returns them rather than
swallowing, and the HybridSearch handler routes typed AppErrors to the
client unchanged instead of downgrading to 500.

Part of #993 (Phase 2: Per-KB VectorStore Binding).
Phase 2 roadmap item: PR 4 (Multi-store fan-out search).
Depends on #994, #1310, #1372.

2026-05-20 22:25:39 +08:00

..

refactor(agent): simplify grep_chunks tool to a single regex query

2026-05-20 19:34:27 +08:00

feat(search): fan-out KB retrieval across bound vector stores

2026-05-20 22:25:39 +08:00

feat(assets): add ASR test audio file and embed it in the application

2026-04-02 21:27:27 +08:00

feat: Implement deadlock retry mechanism for chunk creation

2026-04-22 21:17:21 +08:00

chore(rbac): update default behavior for tenant RBAC configuration

2026-05-18 21:24:28 +08:00

feat(obs): 支持华为云obs存储

2026-05-18 19:38:23 +08:00

feat(system-info): surface DB migration errors with troubleshooting links

2026-05-14 16:34:50 +08:00

fix(feishu): tolerate partial wiki node listing failures

2026-05-12 17:40:59 +08:00

feat(tenant): implement tenant creation limit and error handling

2026-05-18 17:28:58 +08:00

feat(agent): human-in-the-loop approval for MCP tool calls (#1173 )

2026-05-10 22:57:12 +08:00

feat(search): fan-out KB retrieval across bound vector stores

2026-05-20 22:25:39 +08:00

fix(agent): exclude wiki-only KBs from quick-answer (RAG) mode

2026-05-12 16:27:28 +08:00

feat: add vLLM server URL configuration for MinerU

2026-05-20 16:33:12 +08:00

fix(rbac): propagate TenantRole through logger.CloneContext

2026-05-18 17:28:58 +08:00

feat(mcp): implement reconnection logic for MCP tool calls and tool listing

2026-03-31 11:57:15 +08:00

chore(rbac): update default behavior for tenant RBAC configuration

2026-05-18 21:24:28 +08:00

fix(moonshot): pin temperature=1 for models that reject other values moonshot-v1-* and kimi-k2.5/k2.6 reject any temperature ≠ 1 with HTTP 400. Detect these models in BuildChatCompletionRequest and force Temperature=1 while leaving kimi-k2/k2-turbo/k2-thinking unaffected.

2026-05-18 21:28:40 +08:00

feat(knowledge-base): implement per-user pinning for knowledge bases

2026-05-18 17:28:58 +08:00

chore(runtime): silence gin per-route logs and emit env config banner at startup

2026-05-17 15:27:52 +08:00

feat: optimize security and deployment of agent skills

2026-02-04 20:08:49 +08:00

feat(prompt): enhance question rewriting and intent classification in prompt templates

2026-04-16 23:15:02 +08:00

feat(redis): add REDIS_USERNAME support for Redis ACL

2026-02-04 19:38:40 +08:00

feat(observability): extend Langfuse tracing across asynq pipeline

2026-04-24 13:16:47 +08:00

feat: add vLLM server URL configuration for MinerU

2026-05-20 16:33:12 +08:00

fix(initialization): surface upstream and SSRF errors verbatim in test-connection responses

2026-05-17 15:27:52 +08:00