WeKnora

mirror of https://github.com/Tencent/WeKnora.git synced 2026-06-04 13:30:32 +08:00

Author	SHA1	Message	Date
wizardchen	15845f4e62	fix(frontend): poll knowledge list after single-item delete DeleteKnowledge is now an async pipeline on the backend, so refreshing the list immediately after a successful response can still show the deleted entry. Mirror the polling loop already used for batch delete: re-list up to ~12s (30 polls * 400ms) until the deleted id disappears, then refresh tags. Without this the UI shows "delete success" but the list visibly contains the entry until the user manually refreshes.	2026-05-25 17:11:20 +08:00
wizardchen	d12fb42e60	refactor(knowledge): route single delete through async pipeline Reuse enqueueKnowledgeListDelete inside DeleteKnowledge so that single-item delete shares the same hardening as BatchDeleteKnowledge / ClearKnowledgeBase: asynq retries, business-aware queue routing, and marking-as-deleting inside the worker. The endpoint now returns 200 once the delete task has been enqueued; the response body carries the asynq task_id and the message is updated to "Delete task submitted". Swagger annotations, generated docs and the Go client SDK comment are updated to reflect the new asynchronous semantics. Note: this is a behavior change. Callers that previously assumed the knowledge was already gone on a 200 response should poll the task status or accept eventual consistency, matching the existing BatchDeleteKnowledge contract.	2026-05-25 17:11:20 +08:00
chenwenhui	af1f2b469a	增加sandbox对windows编译支持，现在默认是linux的实现，windows直接编译报错	2026-05-25 16:57:56 +08:00
wolfkill	6331cce23d	fix: skip empty milvus enabled status groups	2026-05-25 16:54:56 +08:00
cn-kali-team	fdb53ff92a	fix type	2026-05-25 16:50:17 +08:00
young1lin	29820e4cac	docs(chat): clarify cached-token semantics for explicit-cache providers `cached_tokens` is reported by every OpenAI-compatible provider that supports prompt caching, but how it becomes non-zero differs by mode: - Implicit caching (OpenAI, Azure OpenAI, DeepSeek, …) populates the field automatically whenever a prompt prefix matches a previous request within the provider's cache TTL. No client-side opt-in. - Explicit caching (Qwen on Aliyun, Anthropic Claude, …) only populates the field after the caller attaches `cache_control: {"type": "ephemeral"}` to the relevant message / content block. Until that opt-in is applied upstream of the request, the field stays zero even when the prefix is otherwise byte-stable. Without this distinction documented, the previous commit reads as if `TokenUsage.CachedTokens` will show non-zero values for Qwen / Claude once this PR lands — which is not the case. The plumbing here is a prerequisite (stable prefix via sorted tools) and a meter (visibility of the field), but the explicit-cache opt-in itself is out of scope and lives elsewhere. Document this on `TokenUsage.CachedTokens` and the `cachedTokens` helper so callers do not mistake observability for activation.	2026-05-25 16:47:14 +08:00
young1lin	582f7b3056	feat(chat): surface cached prompt tokens from upstream usage OpenAI-compatible providers (Qwen, DeepSeek, OpenAI, Azure, etc.) report prompt-cache hits in `usage.prompt_tokens_details.cached_tokens`. This value was being read by go-openai but dropped at the WeKnora boundary, so there was no way to tell whether prompt caching was actually working. This change plumbs the field end-to-end: - `types.TokenUsage.CachedTokens` (json:"cached_tokens,omitempty") — zero-values are omitted so payloads stay quiet for providers that never report cache hits. - `cachedTokens` helper in remote_api.go guards against nil `PromptTokensDetails` (Ollama and older OpenAI-compat backends omit the details block entirely). - All three response-parsing paths populate it: * `parseCompletionResponse` (non-streaming) * `processStream` (SDK streaming) * `processRawHTTPStream` (raw-HTTP streaming, used when callers need to inject custom fields like `cache_control`) - The five `[LLM Usage]` log lines now print `cached_tokens=%d` so cache hit rate is visible in `journalctl` / log tail without going through metrics. Together with the deterministic tool ordering from the previous commit, this makes Qwen explicit caching observable: a warmed prefix should show `cached_tokens` ≈ system + tools token count (typically several thousand) on subsequent requests within the 5-minute TTL. Tests: - `TestCachedTokensHelper` — nil safety + round-trip - `TestParseCompletionResponse_CachedTokens` — populated + missing details paths through `parseCompletionResponse` - `TestTokenUsage_CachedTokensJSONOmitempty` — zero is omitted, non-zero is emitted	2026-05-25 16:47:14 +08:00
young1lin	29bec4204a	fix(agent/tools): sort function definitions for deterministic ordering `ToolRegistry.GetFunctionDefinitions` and `ListTools` previously ranged over the internal map directly. Go map iteration is intentionally randomized, so the resulting `tools` array reshuffled on every request. That reshuffling silently breaks provider-side prompt caches that key on a byte-level prefix match — most visibly Qwen explicit caching, which requires the messages (system + tools + history) to be byte-identical up to the `cache_control` marker. With random ordering the serialized tools block changes every call, so the cache prefix never matches and the hit rate stays at 0%. Sort by tool name in both functions. Output is now byte-stable across calls and `cache_control: ephemeral` can actually take effect. Tests in registry_test.go cover: - Deterministic ordering across 50 iterations - JSON byte-stability across 20 iterations (the real motivation) - Field projection (Name / Description / Parameters) - Empty registry returns `[]` not `null` - ListTools sorting - First-wins duplicate registration policy (GHSA-67q9-58vj-32qx)	2026-05-25 16:47:14 +08:00
wizardchen	30332cca99	fix(frontend): correct floating UI positioning under root `zoom` The user font-size preference applies CSS `zoom` on `<html>` (see `composables/useFont.ts`). This breaks every popover that anchors itself with `getBoundingClientRect()` + `position: fixed/absolute`, because the rect is in visual pixels while CSS lengths are then multiplied by zoom again — shifting popovers toward the lower-right and miscomputing maxHeight / width / viewport-edge clamping. PR #1459 patched only `AgentSelector.vue` (and left its width/maxHeight unfixed). This PR introduces a shared helper `frontend/src/utils/zoom.ts` (`getRootZoom`, `rectToCssPx`, `cssViewportSize`) and applies it to every affected popover: - `AgentSelector.vue` — refactored to use the helper; also normalizes `width` and `maxHeight` (previously still in visual px). - `Input-field.vue` — model dropdown, mention popup (typed @ and button- triggered), and agent-mode dropdown. - `KnowledgeBaseSelector.vue` — primary anchor path and fallback path. - `UserMenu.vue` — tenant and IM floating submenus + viewport clamp. - `FAQTagTooltip.vue` — fixed tooltip positioning + edge clamp. - `AgentEditorModal.vue` — placeholder popups for system_prompt, context_template, and rewrite/fallback prompts. - `AgentStreamDisplay.vue` — `kb-float-popup` (absolute under body, also under the zoom containing block). Verified with `npm --prefix frontend run build`.	2026-05-24 23:03:26 +08:00
shendong	a02d6d06b9	fix: correct agent selector position under zoom	2026-05-24 22:44:18 +08:00
wizardchen	a469ee6a36	feat(wiki): add "View in Graph" entry on wiki page Add a link in the wiki page header that navigates to the graph tab with the page slug pre-selected, so users can jump from a wiki page to its corresponding graph node in one click.	2026-05-22 21:02:01 +08:00
ochan.kwon	744a367f16	feat(retriever): add OpenSearch type prep ahead of Phase 3 driver Lays type-system groundwork for the upcoming OpenSearch k-NN driver (Phase 3 PR 2, see Tencent/WeKnora#1440), with strict feature-gate: this PR ships only inert constants, schema extensions, and an unreachable normalizer case. No path activates an OpenSearch VectorStore yet — creation continues to fail with "not a valid engine type" until the activation switch lands in Phase 3 PR 3. Changes: - OpenSearchRetrieverEngineType constant ("opensearch") in internal/types/retriever.go. Not added to validEngineTypes / GetVectorStoreTypes / retrieverEngineMapping yet (gated). - ConnectionConfig.InsecureSkipVerify (bool, default false) in internal/types/vectorstore.go, placed inside the // Common section because it is a cross-driver TLS option. Distinct from the Qdrant-specific UseTLS, which enables TLS on gRPC — InsecureSkipVerify only controls verification of an already-TLS HTTPS connection. AES-GCM Value/Scan round-trips the field as plaintext (it travels alongside the encrypted Password / APIKey but is not sensitive itself). - VectorStoreFieldInfo gains four optional fields: Immutable (bool), Min/Max (float64), Enum ([]string). All omitempty so existing UI schema entries serialize identically. The fields will drive the UI in Phase 3 PR 3 (read-only on Edit + range/enum constraints). - Six new AuditAction constants under vector_store. and opensearch.* namespaces. Definitions only — emission lands in Phase 3 PR 3. - EngineAwareNormalizer.Normalize is restructured to group engines by the effective score range observed by the normalizer (not the theoretical raw cosine range): Range [-1, 1] (raw cosine, mapped via (score + 1) / 2): - Milvus (COSINE metric mode). Milvus docs explicitly state the COSINE metric range is [-1, 1]. Range [0, 1] (passthrough — already on the target scale): - Elasticsearch v8 / ElasticFaiss. The driver issues a cosineSimilarity(...) script_score script, and Lucene rejects negative final scores ("Final relevance scores from the script_score query cannot be negative" — ES docs); the effective range observed by the normalizer is therefore [0, 1] for IR-normalized embeddings. ES was previously grouped with Milvus and over-corrected via (score + 1) / 2, which inflated every ES result by 50% in mixed-engine RRF fusion. - OpenSearch. The k-NN plugin's SpaceType.COSINESIMIL.scoreTranslation maps the underlying Lucene/Faiss distance (1 - cosine) to (1 + cosine) / 2 ∈ [0, 1] before the score reaches us. Source: github.com/opensearch-project/k-NN at src/main/java/org/opensearch/knn/index/SpaceType.java (COSINESIMIL enum, scoreTranslation method). - Weaviate. The driver requests `certainty`, defined by Weaviate as (2 - distance) / 2 = (1 + cosine) / 2, intrinsically [0, 1]. - Postgres pgvector / SQLite sqlite-vec / Qdrant / TencentVectorDB / Doris. The driver computes (1 - cosine_distance) or normalized inner_product, whose theoretical range is [-1, 1]. The IR-normalized positive-component unit vectors WeKnora targets (BGE / OpenAI text-embedding-3 / Cohere / sentence-transformers) keep the observed range in [0, 1]; negative-cosine embedding models would silently clamp to 0 downstream — explicitly documented as the IR-normalization caveat in the struct godoc. Dead enum references (ElasticFaiss, Infinity) are flagged in the godoc with a pointer to internal/types/vectorstore.go's existing "legacy/experimental, no standalone deployable instance" annotation. Their case labels are kept for switch exhaustiveness. Test coverage: - OpenSearch constant wire value + collision check against existing 10 engines + gated invariant (NOT in validEngineTypes). - ConnectionConfig.InsecureSkipVerify backward-compat (missing JSON field deserializes as false) + round-trip + AES-GCM coexistence with encrypted Password / APIKey. - VectorStoreFieldInfo omitempty preservation + new-field serialization + float64 pointer distinction (min=0 vs nil). - AuditAction dot-namespace convention enforcement, prefix invariants for vector_store. and opensearch.*, no wire-string collisions, exact wire-value pins for the six new constants. - EngineAwareNormalizer: - CosineRange retains its (score + 1) / 2 coverage but now only for Milvus. - UnitInterval now covers the full passthrough group (ES / ElasticFaiss / OpenSearch / Weaviate / Postgres / SQLite / Qdrant / Infinity / TencentVectorDB / Doris). - New TestEngineAwareNormalizer_ElasticsearchCosinePassthrough is an explicit regression guard for the score-range correction: cos=0.5 maps to 0.5 (not (0.5 + 1) / 2 = 0.75 as an earlier draft assumed). - OpenSearch passthrough across (0 / 0.5 / 0.75 / 1) + engine drift (1.0001) + defensive negative + ±Inf / NaN edges + keyword passthrough + nil-ctx safety. Backward compatibility is preserved at every layer: - All new struct fields are omitempty / pointer-tagged so existing rows and existing wire formats remain unchanged. - Normalizer's new OpenSearch case is unreachable until the driver lands. The ES regrouping changes the post-normalization value for every ES vector search result (a correctness fix, not a feature) — ES vector retrieval is currently the only production path affected. - AuditAction constants emit no audit_log rows in this PR. - engine_type=opensearch VectorStore creation still rejected (gated).	2026-05-22 20:43:50 +08:00
yuheng.huang	3d5b4c16fe	refactor(logger): support LOG_FORMAT template and harden level coloring	2026-05-22 20:31:54 +08:00
yuheng.huang	39c9985c3b	refactor(logger): support LOG_FORMAT template and harden level coloring - Add CustomFormatter.Template driven by LOG_FORMAT env var with placeholders %d/%level/%thread/%logger/%traceId/%msg; default format unchanged for backward compatibility. - Replace chained ReplaceAll with strings.NewReplacer for single-pass substitution, avoiding accidental re-substitution when a field value contains a literal placeholder string. - Inject ANSI color at the %level substitution step; removes the old whole-line ReplaceAll(line, "INFO", ...) which would mis-color literal level tokens appearing inside messages. - Cache threadNeeded on the formatter so runtime.Stack only runs when the template references %thread.	2026-05-22 20:31:54 +08:00
wizardchen	bccc27b162	fix(frontend): keep X-Tenant-ID override when switching back to home After re-login, the JWT is minted with the user's last-active tenant (see userService.resolveLoginTenantID). Both TenantSelector and UserMenu used to clear weknora_selected_tenant_id when "switching to home", which made request.ts stop attaching X-Tenant-ID. Without the header, the auth middleware fell back to the JWT-encoded tenant id — i.e. the peer tenant the user just tried to leave — so the switch was silently a no-op until the user manually wiped localStorage and re-logged in. Always write the active tenant id into selectedTenantId so the "always attach X-Tenant-ID" invariant request.ts relies on stays true. The server-side last_active_tenant_id preference is still cleared when switching to home, so a clean re-login still lands the user on home. Also fix TenantSelector.defaultTenantId to read user.tenant_id (the immutable home id) instead of authStore.tenant.id (the active tenant, which is overwritten by /auth/me). This matches the contract spelled out in useHomeTenant() and prevents switchingToHome from misfiring when the active tenant differs from home.	2026-05-22 18:20:23 +08:00
ochan.kwon	8b7d00b260	feat: expose KB ↔ vector store binding in list, editor, and detail UI Backend ------- ListKnowledgeBases now enriches each row with the resolved vector_store metadata (name / source / engine_type / status) via a new buildKBListResponse helper. Store views are batch-resolved once per request through BatchResolveStoreView so an N-KB list costs one vector-store service call rather than N — closing the N+1 limitation flagged in #1372's known-limitations section. Cross-tenant shared KBs continue to render via SharedStoreDisplay so the owning tenant's store inventory cannot be correlated across rows; the underlying vector_store_id UUID is stripped from those responses. Resolver failures degrade gracefully: bound KBs render as unavailable instead of breaking the list. Test coverage pins the env / bound / shared distinction, the batch-call-count invariant, and the graceful-failure path. Frontend -------- KB editor modal gains a new "Vector Store" section. Create mode shows a dropdown that combines the system default (env store) and the tenant's configured user stores, fetched once at mount via the existing listVectorStores API. Edit mode shows the bound store read-only via a new VectorStoreBadge component with an explicit immutability hint — matching the backend's `<-:create` GORM tag and the service-layer UpdateKnowledgeBaseRequest DTO that already omit the field. KB list cards surface a small engine-type badge for own-tenant bound KBs, and a warning badge when the bound store is unavailable. Env-bound and shared KBs render no badge (visual noise control). KB detail header shows the bound store via the same VectorStoreBadge component; shared KBs fall through to the badge's internal "shared" branch with no name / engine / id rendered. The KB editor's create-time error handler translates the typed ErrVectorStoreBindingInvalid (2200) and ErrVectorStoreUnavailable (2201) into localized messages and jumps the user back to the VectorStore section so they can pick a different store or fall back to the system default. The KB row type gains five optional fields (vector_store_id / name / engine_type / source / status). i18n: 18 new keys added to en-US, ko-KR, zh-CN; ru-RU receives English placeholders pending translation (consistent with prior PRs in this locale). Part of #993 (Phase 2: Per-KB VectorStore Binding). Phase 2 roadmap item: PR 5 (KB binding UI + list-response enrichment). Depends on #994, #1310, #1372, #1386 (all backend in the Phase 2 series).	2026-05-22 17:40:10 +08:00
Phuong Tran	0f57bf3ff8	fix (chat history): #1431 user's multilines query formatting was lost	2026-05-22 17:26:05 +08:00
wizardchen	c0e4a1d2f1	fix(summary): preserve image caption/OCR text in document summaries Documents whose only payload is an embedded image (e.g. a docx with a single picture) intermittently produced the refusal line "No textual content was extractable from this document." even though the vision model had successfully extracted a caption. Three coordinated fixes: - Clarify the summary prompt that text inside `<image_caption>` and `<image_ocr>` is first-class extracted content, not an image reference, so the model only triggers the empty-content branch when the body is genuinely textless. - For image-dominated documents (real text < 200 runes after stripping image markup) include OCR alongside captions so screenshots and scanned figures contribute their actual content; text-heavy documents continue to use caption-only enrichment to avoid OCR noise from incidental figures. - Add `EnrichContentCaptionAndOCR` which embeds caption + OCR text inline next to the original Markdown image link, deliberately omitting the `<image url=...>` and `<image_original>` wrapper blocks. Those wrappers carry only opaque export hashes that consume tokens and have been observed to retrigger the LLM's "image reference with no extracted text" heuristic.	2026-05-22 17:25:39 +08:00
wizardchen	72e52f7258	fix(frontend): improve tenant settings and member list UX Keep the member search input visible during empty/loading states, avoid layout jitter on search, simplify role select display, widen the role column, and use icon-only edit buttons for tenant name/description.	2026-05-21 23:28:48 +08:00
ChenRussell	d3832edea4	fix(repository): qualify tenant_id with table name to resolve ambiguous column reference	2026-05-21 21:22:41 +08:00
ChenRussell	09f0f5b8a9	fix(swagger): Fix some Swagger API endpoints returning 404 errors and regenerate the Swagger documentation	2026-05-21 21:13:40 +08:00
MarionMa	0a9a920460	perf(repository): exclude embedding field from Elasticsearch search response to reduce size	2026-05-21 21:11:19 +08:00
wizardchen	b0094ff479	docs(readme): flatten Latest Updates into single-line per-version list Condense v0.6.0 highlights into one line and remove the collapsible <details> wrapper so the full release history is visible by default across all four READMEs (EN/CN/JA/KO). v0.6.0	2026-05-21 17:17:39 +08:00
wizardchen	cdfc9ce23a	chore(release): v0.6.0 Tenant RBAC headline release: 4-tier role matrix (Owner/Admin/ Contributor/Viewer), per-KB resource ownership, per-tenant audit log, tenant member management, self-service workspaces. Also: CLI v0.3/v0.4 GA, KB retrieval fan-out across vector stores, AES-256-GCM credential at-rest, docreader gRPC TLS+Token, Zhipu embedding, Huawei OBS, vLLM URL for MinerU, Apache Doris compat modes, server-side user preferences, Go 1.26.0. See CHANGELOG.md for the full list. docs(rbac): wire RBAC screenshots into READMEs and RBAC guide - README.md / README_CN.md / README_JA.md / README_KO.md: replace the single member-management thumbnail under the v0.6.0 RBAC highlight with a 2×2 showcase (member management, workspace switcher, self-service workspace creation, pending invitations). - docs/RBAC说明.md: add the member-management screenshot to the existing 前端实际界面 showcase so the guide is self-contained and no longer cross-references README for it. feat(rbac-ui): link tenant member page to RBAC guide Add an inline doc-link in the Tenant Members settings page that opens docs/RBAC说明.md on GitHub in a new tab, complementing the existing in-app role-matrix popover. New i18n key tenantMember.learnRbacGuide covered for zh-CN / en-US / ko-KR / ru-RU.	2026-05-21 16:56:19 +08:00
Thinh Tran	e1ce4270d0	fix(repository): correct query syntax to get correctly built-in models	2026-05-21 16:11:22 +08:00
Thinh Tran	3db5b0d126	fix(chat): adjust user message container to support pre-wrapped text	2026-05-21 16:11:22 +08:00
孙常熙	0dd6126826	fix(frontend): guard knowledge list against stale updates	2026-05-21 16:10:32 +08:00
wizardchen	7ebb29cd3e	docs: add Chinese RBAC guide and link with shared space docs Replace the English docs/rbac.md with a comprehensive Chinese docs/RBAC说明.md and a wiki-style summary under docs/wiki/安全认证/. Explain how tenant RBAC relates to the shared space feature (they are orthogonal: tenant RBAC is the vertical defense, shared space is the horizontal collaboration channel) and cross-link the two docs in both the flat and wiki trees. Update inbound references in .env.example, docker-compose.yml, and the auth legacy env test to point at the new file name.	2026-05-21 12:28:31 +08:00
wizardchen	7aca1017db	fix(docparser): address review feedback on PR #1404 Three follow-up fixes on top of the MinerU markdown preservation work: - Stop applying normalizeMinerUMarkdown inside ResolveAndStore. The helper is already called by MinerUReader.Read, and ResolveAndStore is shared by every parser (docreader, session attachments, ...). Running the heading/image unescape regexes globally would silently rewrite content (including inside fenced code blocks) for non-MinerU sources. - Recognize MinerU image references whose path contains spaces, e.g. "images/第 1 页.jpg". The previous regex used in extractImageRefsFromContent disallowed whitespace in the URL group, so such images were never matched and never persisted. Use a whitespace-tolerant pattern aligned with ResolveAndStore's own imgPattern. - Deduplicate uploads when the same MinerU image is referenced under multiple path forms (e.g. "images/foo.png" vs "./images/foo.png"). saveReferencedImage now caches by ref.Filename in addition to the raw ref path, so the second variant reuses the previously stored ServingURL instead of writing the same bytes to object storage again. Tests added: - TestProcessImagesMatchesPathsWithSpaces - TestResolveAndStoreDedupsSameImageRefVariants	2026-05-21 11:42:56 +08:00
MidnightSun	6210f44fb6	fix(docparser): preserve MinerU markdown and persist relative images MinerU already returns markdown with embedded HTML blocks, but the current\nreader runs the whole document back through html-to-markdown. That\nsecond conversion escapes valid headings and image syntax, so chunk\nprofiling sees plain text instead of markdown structure and relative\nimage references stop matching the storage pipeline.\n\nKeep MinerU output in its original markdown form and only apply narrow\ncompatibility normalization for the specific over-escaped patterns we\nactually need to recover. The converter now matches image refs by the\npaths that are really present in markdown or embedded HTML instead of\nassuming a single images/<name> form.\n\nExtend ImageResolver so relative HTML <img src=...> references share the\nsame storage rewrite path as markdown images, deduplicate repeated saves,\nand keep the frontend sanitizer compatible with MinerU's details/summary\nblocks. Add focused docparser tests that cover escaped markdown repair,\nvariant image path matching, and relative HTML image persistence.	2026-05-21 11:39:57 +08:00
wizardchen	9f6148784a	fix(frontend): improve offline and legacy browser support	2026-05-21 11:19:44 +08:00
wizardchen	8f4626158d	fix(chat): improve history rendering stability	2026-05-21 11:19:44 +08:00
wolfkill	5bdaf58d45	fix(session): scope wiki fixer to shared KB tenant	2026-05-21 11:19:19 +08:00
qingfhuang	6a6513caba	fix(client): align UpdateAgent request types with internal API Sync AgentConfig, UpdateAgentRequest, and CreateAgentRequest JSON fields with internal/types.CustomAgentConfig and handler request structs so SDK clients can send and receive the full agent configuration. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-20 22:41:21 +08:00
wolfkill	bb95488ac8	fix(knowledge): complete indexed documents immediately	2026-05-20 22:40:48 +08:00
ochan.kwon	fd3d9f547c	feat(search): fan-out KB retrieval across bound vector stores Multi-KB hybrid search now groups KBs by their bound VectorStore (partition key (storeID, owner_tenant_id)), retrieves in parallel via errgroup with a SetLimit(4) cap and a per-group timeout (MULTI_STORE_RETRIEVE_TIMEOUT_SEC, default 30s), and merges results. When the collected results span more than one engine type, an EngineAwareNormalizer rescales vector scores to [0, 1]; keyword (BM25) scores pass through to the existing RRF fusion. Single-group calls take the fast path with zero fan-out overhead, preserving today's behavior for deployments where every KB has vector_store_id = NULL. Embedding-model consistency is now enforced explicitly via ResolveEmbeddingModelKeys. Multi-KB searches across KBs whose resolved model identities differ return BadRequest instead of silently producing incomparable scores. Cross-tenant Organization-shared KBs are preserved by partitioning on KB.TenantID so the factory's ownership lookup runs against the source tenant. Foreign-tenant KB UUIDs injected via the request body are rejected via kbShareService.HasTenantKBPermission (Plan 3 of #1303, 3-D capped) before any retrieval; rejected scopes surface as 404 to avoid leaking foreign KB existence. Service-layer typed AppErrors (ErrVectorStoreBindingInvalid 2200 / ErrVectorStoreUnavailable 2201) are mapped from PR2 sentinel hierarchy and preserved end-to-end: the iterative FAQ path returns them rather than swallowing, and the HybridSearch handler routes typed AppErrors to the client unchanged instead of downgrading to 500. Part of #993 (Phase 2: Per-KB VectorStore Binding). Phase 2 roadmap item: PR 4 (Multi-store fan-out search). Depends on #994, #1310, #1372.	2026-05-20 22:25:39 +08:00
wizardchen	07afec1499	fix(chat): stabilize history pagination and keep message order	2026-05-20 21:03:39 +08:00
wizardchen	3475af1707	feat(frontend): configure API proxy target for development environment Updated the Vite configuration to allow dynamic setting of the API proxy target based on environment variables. The default target is now configurable via VITE_DEV_PROXY_TARGET or FRONTEND_BACKEND_URL, enhancing flexibility for different development setups. Additionally, the development script logs the current API proxy target for better visibility during startup.	2026-05-20 21:00:16 +08:00
wizardchen	f3c7281f47	refactor(agent): simplify grep_chunks tool to a single regex query The grep_chunks tool previously accepted an array of regex queries (1-5) and an optional knowledge_base_ids filter and limit. In practice the LLM either fired multiple near-duplicate calls or split synonyms across entries instead of using POSIX alternation, and KB scoping plus result limit are server-side concerns the model should not control. Reshape the contract to match `grep -E -i` semantics: - Schema accepts a single required `query` string. Combine concepts with `\|` alternation in one regex instead of multiple calls. - Drop `knowledge_base_ids` and `limit` from the schema; the tool now always searches the full agent scope and uses a fixed internal cap. - Legacy `pattern`, `queries`, `patterns`, `max_results` keys are still accepted and joined into a single alternation regex so older callers and in-flight model outputs keep working. - Update the agent system prompt template to document the new single `query` field. - Frontend tool title now reads `query`/`queries`/`pattern`/`patterns` in that order so the search text is shown again under the new schema. - Add a dedicated `grepSearch` / `grepSearchFailed` tool status (zh-CN, en-US, ko-KR, ru-RU) and rename the zh-CN tool label to "搜索关键词" so the UI no longer prefixes the call with a generic "调用 ..." label.	2026-05-20 19:34:27 +08:00
wizardchen	31f560ecf1	fix(frontend): mirror cross-tenant superuser Admin role in UI gates When a CanAccessAllTenants user switches into a tenant without a tenant_members row, the backend grants temporary Admin via resolveTenantRole step2 but currentTenantRole stayed empty, hiding all mutation controls. Fall back to admin for UI gating only; Owner surfaces remain hidden to match the server cap.	2026-05-20 16:55:39 +08:00
wizardchen	5fcdc30914	chore: update Go version to 1.26.0 in go.mod	2026-05-20 16:55:18 +08:00
knight	513e589494	feat: add vLLM server URL configuration for MinerU Add support for configuring vLLM server URL when using vlm-http-client or hybrid-http-client backend in MinerU. Changes: - internal/types/tenant.go: Add MinerUVLMServerURL field to ParserEngineConfig - internal/infrastructure/docparser/mineru_converter.go: Pass server_url to MinerU API when backend is vlm-http-client or hybrid-http-client - frontend/src/api/system/index.ts: Add TypeScript type definition - frontend/src/views/settings/ParserEngineSettings.vue: Add vLLM server URL input field - frontend/src/i18n/locales/*.ts: Add translations (zh-CN, en-US, ko-KR)	2026-05-20 16:33:12 +08:00
Miles Lai	e62c0563aa	doris: add configurable compatibility modes and guard mode switches Problem: The hard-coded Doris vector function implementation (cosine_distance_approximate with UNIQUE KEY ANN tables) fails on SelectDB 4.0.2-rc01 and other Doris builds lacking that specific function support. Users had no way to adapt without code changes. Root cause: There is assumption all Doris deployments support the same vector function API, but different builds (Doris OSS, SelectDB, Doris Cloud) ship with different function variants and table key constraints. No capability detection or user configuration existed. Solution: Implement DORIS_COMPAT_MODE environment variable with three modes: * auto (default/recommended): probe Doris server on first use to detect available vector functions; prefer inner_product_duplicate (modern Doris 4.0+), fall back to legacy (older builds lacking inner_product_approximate) * legacy: hard-set to cosine_distance_approximate + UNIQUE KEY (for older Doris/SelectDB builds without inner_product_approximate support) * inner_product_duplicate: hard-set to inner_product_approximate + DUPLICATE KEY (for modern Doris 4.0+ and current SelectDB with normalized embeddings) Implementation details: - add compat.go with one-time mode resolution (sync.Once) and capability probing - inspect existing weknora_embeddings_* table DDL via SHOW CREATE TABLE to detect and enforce schema compatibility; prevents silent mismatches - fail fast with clear error message when configured mode does not match existing tables, with explicit remediation steps (recreate tables or change env var) - branch all query paths (inner_product_approximate vs cosine_distance_approximate), DDL generation (DUPLICATE KEY vs UNIQUE KEY), write paths (embed normalization), and chunk updates (Stream Load vs read-modify-write) by resolved compat mode - add comprehensive repository tests for mode selection, auto-detection, and mismatch scenarios; all tests pass - expose DORIS_COMPAT_MODE in docker-compose.yml with auto as default - document in .env.example with clear mode decision guidance - log all mode decisions (requested, detected, probed, final) at INFO/WARN level Key guarantee: ⚠️ DORIS_COMPAT_MODE is NOT interchangeable after embedding tables are created. App will reject mode switches that conflict with existing table layout, preventing silent data mismatches and query failures.	2026-05-19 17:15:59 +08:00
wolfkill	eb52caf033	fix(chunker): keep top-level heading chunks separate	2026-05-19 17:08:56 +08:00
wizardchen	1cb522e621	fix(i18n): escape '@' in invite email placeholder vue-i18n treats a bare '@' as the start of a linked-message reference, so the placeholder "invitee@example.com" failed to compile with "Invalid linked format". The whole email t-form-item then failed to render, leaving the invite-member popup with only the role row. Escape the literal '@' as {'@'} across all four locales so vue-i18n emits the original text and the email input renders again.	2026-05-18 22:56:11 +08:00
wizardchen	ee74dcb545	Revert "style(tenant-members): unify role identity with a colored chip across surfaces" This reverts commit `a6949155c9`.	2026-05-18 22:31:20 +08:00
wizardchen	7bb38f81b6	feat(session): persist last request state for UI restoration Sessions now record the input-bar state used for the most recent QA request (agent, model, KB scope, web search). The chat UI hydrates those settings on session reopen so users see the same configuration they used last time, instead of the global default. The state is stored in the existing sessions.agent_config JSONB column to avoid a new migration. Frontend snapshots the user's global defaults on session enter and restores them on session leave, so opening an old session does not pollute new-chat defaults.	2026-05-18 22:28:00 +08:00
wizardchen	8dd9b67df9	fix(menu): stop truncating session titles when extra space is available The session list items in the sidebar forced the title's max-width to 155px on hover and on the active item, while .menu-more-wrap was pushed to the right edge with margin-left:auto. This left a large gap between the truncated title and the kebab button on rows whose title actually fit within the row. Switch the title to a flex layout (flex: 1; min-width: 0) so it grows to fill the available space and only shows an ellipsis when the text genuinely overflows. The kebab wrapper keeps flex-shrink: 0 so its opacity-controlled visibility no longer needs to reshape the layout.	2026-05-18 22:25:11 +08:00
wizardchen	6f00056572	fix(menu): avoid refreshing session list when opening an existing chat The route watcher in the sidebar treated any navigation away from the creatChat pages as "a new session was created" and called getMessageList(), which clears menuArr and re-fetches page 1. As a result, clicking an existing session from /platform/creatChat caused the whole session list to flicker. Only refresh when the target chat id is not already in the list, which is the real signal that a brand-new session was just created.	2026-05-18 21:55:47 +08:00
wizardchen	4845955e9a	style(menu): tighten session list density in sidebar Reduce vertical spacing in the chat session list so more sessions fit on screen without feeling cramped: row height 38→34px, item height 32→30px, and slimmer timeline group headers.	2026-05-18 21:47:43 +08:00

1 2 3 4 5 ...

1833 Commits