anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm synced 2026-04-25 17:15:37 +02:00

Author	SHA1	Message	Date
Timothy Carambat	b96988aae9	Dell model capability patch (#4719 )	2025-12-05 13:31:15 -08:00
Sean Hatfield	f910609825	Fix similarity score not showing in citation UI for Astra, Qdrant, Weaviate (#4718 ) * fix astra not showing similarity score in citations ui * fix qdrant not showing similarity score in citations ui * fix weaviate not showing similarity score in citations ui	2025-12-05 13:05:43 -08:00
Sean Hatfield	1e96acab9d	Fix broken citations in Zilliz Cloud (#4716 ) fix broken citations in zilliz cloud vector db	2025-12-05 12:12:29 -08:00
Timothy Carambat	92752a2a15	Improve MCP functionality (#4709 ) * Improve MCP functionality * apply ANSI strip path * attempt ARM build * improve dockerfile IO build time and migrate to ARM build * fix comment * add ability to disable MCP cooldown feature * update devbuild name * move chromium arm build patch to CDN	2025-12-05 11:01:20 -08:00
Timothy Carambat	b54ac2de58	Modify ChromaCloud provider to check base limitations of chroma cloud before upsert to prevent 422 (#4705 ) * Modify ChromaCloud provider to check base limitations of chroma cloud before upsert to prevent 422 * update comments	2025-12-03 10:41:44 -08:00
Timothy Carambat	155900eae7	dev build with new `epub2` build target and remove patch work (#4694 )	2025-11-26 17:36:34 -08:00
方程	90e474abcb	Support Gitee AI(LLM Provider) (#3361 ) * Support Gitee AI(LLM Provider) * refactor(server): 重构 GiteeAI 模型窗口限制功能,暂时将窗口限制硬编码,计划使用外部 API 数据和缓存 * updates for Gitee AI * use legacy lookup since gitee does not enable getting token context windows * add more missing records * reorder imports --------- Co-authored-by: 方程 <fangcheng@oschina.cn> Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 14:19:32 -08:00
Sean Hatfield	66e44f65b4	Add batch embedding support for Ollama embedding provider (#4553 ) * add batch embedding support for ollama embedding provider * lint * simplify ollama embedder input	2025-11-25 13:03:47 -08:00
Colin Perry	157e3e4b38	Feat/add openrouter embedding models (#4682 ) * implemented openrouter embedding model support * ran yarn lint * data handling entry --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 11:16:16 -08:00
Neha Prasad	340709504c	feat: add base64 document attachment support to API chat endpoints (#4672 ) * feat: add base64 document attachment support to API chat endpoints * remove parsed documents from api chat process linting for simplicity fix logic issues with flow of data through the pipeline documentiation * change var scope --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-24 15:48:44 -08:00
Marcello Fitton	5716ac5ed5	Custom Default System Prompt (#4487 ) * Add Default System Prompt Management - Introduced a new route for fetching and updating the default system prompt in the backend. - Added a new Admin page for managing the default system prompt, including a form for editing and saving changes. - Updated the SettingsSidebar to include a link to the new Default System Prompt page. - Implemented fetching of available system prompt variables for use in the prompt editor. - Enhanced the ChatSettings and ChatPromptSettings components to support the new default system prompt functionality. This commit lays the groundwork for improved management of system prompts across workspaces. * Remove validation for system prompt in ChatSettings component * Add comment for system prompt in workspaces model * linting, simplify logic for default assumption * dev build --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-24 13:24:10 -08:00
timothycarambat	de11a06622	Fix directOutput returning nothing for streaming provider agentic calls	2025-11-24 12:15:33 -08:00
Sean Hatfield	1f45a9ee34	Fix undefined result in llm-instruction agent flow executor (#4676 ) fix typo in llm-instruction agent flow executor	2025-11-24 09:31:00 -08:00
timothycarambat	3b4f07cdbd	add longer HTTP ttl on forward extension requests resolves #4605	2025-11-20 23:00:18 -08:00
timothycarambat	4ec85418c4	Solve theoretical bug in forwardRequestSigner resolves #4611	2025-11-20 18:36:10 -08:00
Sean Hatfield	c913a2d68c	Prompt caching for Anthropic LLM and Agent providers (#4488 ) * prompt caching for anthropic llm and agent providers * add UI for control of ENV simplify implementation --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 17:17:03 -08:00
Timothy Carambat	f0b3dab4c1	Simplify cache condition for LMStudio and Ollama to prevent race condition (#4669 ) closes #4597 resolves #4572 closes #4600 resolves #4599	2025-11-20 16:32:02 -08:00
Sean Hatfield	49c29fb968	Z.ai LLM & agent provider (#4573 ) * wip zai llm provider * cleanup + add zai agent provider * lint * change how caching works for failed models --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 15:57:03 -08:00
Marcello Fitton	7a7ec969d7	Update Ollama AI Provider to Support Parsing "Thinking" Content From New Message Schema (#4587 ) * add className prop to OllamaAILLM * Enhance `OllamaAILLM.handleStream` to support parsing thinking content from the `message.thinking` property. * refactor thinking property handler patched ollama `@agent` flow calls --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-20 15:39:17 -08:00
Chetan Sarva	c169193fc4	feature: Support for AWS Bedrock API Keys (#4651 ) * feat: add AWS Bedrock API Key option to settings panel * feat: Bedrock API key auth method * fix: hide IAM note when using bedrock api key * move to camcelCase identifier for bedrock api key use linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-20 15:38:45 -08:00
Alex Barron	2eb5384e27	Add SerpApi web search (#4623 ) Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 15:12:15 -08:00
jonathanortega2023	7a0c149d2e	fix: Use eval_duration for output TPS calculations in Ollama LLM provider (#4568 ) * fix: Use eval_duration for output TPS calculations and add as a metric field * refactor usage of eval_duration from ollama metrics * move eval_duration to usage * overwrite duration in ollama provider wip measureAsyncFunction optional param * allow for overloaded duration in measureAsyncFunction * simplify flow for duration tracking --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 13:02:47 -08:00
Timothy Carambat	cf76bad452	Implement full chat and `@agent` chat `user` indentificiation for OpenRouter (#4668 ) Implmenet chat and agentic chat user-id for OpenRouter resolves #4553 closes #4482	2025-11-20 12:38:43 -08:00
Sean Hatfield	05df4ac72b	Paperless ngx data connector (#4121 ) * paperless ngx data connector * wip resync paperless ngx * fix generateChunkSource for resyncing paperless ngx * lint * Refactor Paperless-NGX connector Fix issue with date rendering in tooltip + extended width Move tooltip details to be column for more space --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 11:27:38 -08:00
Sean Hatfield	2e1c1ff891	Disable swagger docs endpoint with environment variable (#4665 ) * add option to disable swagger docs via env * update .example.env for docker and server --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-19 19:32:10 -08:00
Timothy Carambat	6b1b8bbc94	4595 refactor PWA (#4664 ) * feat: add web app manifest and mobile PWA meta tags * feat: serve dynamic manifest.json with custom branding for pwa * feat: add ios status bar theming for pwa * fix: prevent overscroll behavior for mobile * fix: prevent ios safari auto-zoom on chat input * fix: remove theme-color meta tags conflicting with ios status bar * fix: add missing apple-mobile-web-app-capable meta tag for ios pwa * fix: move catch-all route after manifest endpoint to prevent interception * feat: add pwa detection helper and conditional styling for standalone mode * PWA refactor * undo changes to native CSS * class fix * proper response obj * fix patch for import * fix manifest errors --------- Co-authored-by: Christian De Santis <christian.constantino98@gmail.com>	2025-11-19 16:08:09 -08:00
Neha Prasad	ea336de3c3	feat: Allow email addresses as usernames (#4619 ) Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-19 13:43:28 -08:00
Neha Prasad	b86aca765b	fix: unhelpful error message for invite link user creation (#4621 ) * remove the hardcoded generic message instead of the actual validation error * lint --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-19 13:37:37 -08:00
Timothy Carambat	22c619586b	Failover invalid vector db identifier to lanceDB (#4661 ) resolves #4640 closes #4626	2025-11-19 13:36:19 -08:00
Sean Hatfield	f5f8fb1edb	Agent workspace system prompt with variable expansion (#4526 ) * agent workspace system prompt with variable expansion * cleanup --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-10-15 13:32:50 -07:00
Zacharia Maina	985527c3eb	fix(server): correct Qdrant batching logic for large uploads (#4545 ) * fix(server): correct Qdrant batching logic for large uploads Closes #4544 * modify O(3N) process to O(N) --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-10-15 12:28:39 -07:00
Sean Hatfield	6270a0a189	Fix KoboldCPP agent provider max tokens (#4519 ) add max tokens to koboldcpp agent provider	2025-10-09 15:28:50 -07:00
timothycarambat	71cd46ce1b	1.9.0 tag	2025-10-09 15:11:59 -07:00
jaynedoezy-web	89a01492b5	Update .gitignore (#4507 )	2025-10-07 16:32:32 -07:00
Timothy Carambat	0ee0a96506	Migrate gemini agents away from `Untooled` (#4505 ) * Migrate gemini agents away from `Untooled` * disable agents for gemma models as they are not supported for tool calling * Dev build resolve #4452 via function name prefix and then stripping within provider	2025-10-07 11:40:00 -07:00
Timothy Carambat	cf3fbcbf0f	Improve URL handler for collector processes (#4504 ) * Improve URL handler for collector processes * dev build	2025-10-07 11:03:27 -07:00
timothycarambat	0a1a5a216a	patch ollama context window error when unreachable	2025-10-06 16:25:06 -07:00
Timothy Carambat	c2e7ccc00f	Reimplement Cohere models for basic chat (#4489 ) * Reimplement Cohere models - Redo LLM implementation to grab models from endpoint and pre-filter - Migrate embedding models to also grab from remote - Add records for easy context window lookup' * fix comment	2025-10-03 18:28:20 -07:00
Timothy Carambat	8cdadd8cb3	Sync models from remote for FireworksAI (#4475 ) resolves #4474	2025-10-02 12:34:05 -07:00
Sean Hatfield	0b18ac6577	Model context limit auto-detection for LM Studio and Ollama LLM Providers (#4468 ) * auto model context limit detection for ollama llm provider * auto model context limit detection for lmstudio llm provider * Patch Ollama to function and sync context windows like Foundry * normalize how model context windows are cached from endpoint service todo: move this into global utility class with MODEL_MAP eager load models on boot to pre-cache them add performance model improvements into ollama agent as well as apply n_ctx * remove debug log --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-02 11:54:19 -07:00
Sean Hatfield	599a3fd8b8	Microsoft Foundry Local LLM provider & agent provider (#4435 ) * add microsoft foundry local llm and agent providers * minor change to fix early stop token + overloading of context window always use user defined window _unless_ it is larger than the models real contenxt window cache the context windows when we can from the API (0.7.)+ Unload model forcefully on model change to prevent resource hogging add back token preference since some models have very large windows and can crash a machine normalize cases --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-01 20:04:13 -07:00
Timothy Carambat	8f0f9df4fc	Migrate OpenAI Agent to use ResponsesAPI (#4467 ) resolves #4465'	2025-10-01 16:07:30 -07:00
Marcello Fitton	004327264a	Add stream options to Gemini LLM for usage tracking (#4466 ) * Add stream options to Gemini LLM for usage tracking * Update Gemini LLM to disable prompt token calculation --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-01 14:00:26 -07:00
Timothy Carambat	d6f0d305ab	Enable real-time agent tool call streaming for all providers (#4279 ) * WIP agentic tool call streaming - OpenAI - Anthropic - Azure OpenAI * WIP rest of providers EXCLUDES Bedrock and GenericOpenAI * patch untooled complete/streaming to use chatCallback provider from provider class and not assume OpenAI client struct example: Ollama * modify ollama to function with its own overrides normalize completion/stream outputs across providers/untooled * dev build * fix message sanization for anthropic agent streaming * wip fix anthropic agentic streaming sanitization * patch gemini, webgenui, generic aibitat providers + disable providers unable to test * refactor anthropic aibitat provider for empty message and tool call formatting * Add frontend missing prop check update Azure for streaming support update Gemini to streamting support on gemini-* models generic OpenAI disable streaming verify localAI support verify NVIDIA Nim support * DPAIS, remove temp from call, support streaming' * remove 0 temp to remove possibility of bad temp error/500s/400s * Patch condition where model is non-streamable and no tools are present or called resulting in the provider `handleFunctionCallChat` being called - which returns a string. This would then fail in Untooled.complete since response would be a string and not the expected `response.choices?.[0]?.message` Modified this line to handle both conditions for stream/non-streaming and tool presence or lack thereof * Allow generic Openai to be streamable since using untooled it should work fine honor disabled streaming for provider where that concern may apply for regular chats * rename function and more gemini-specific function to gemini provider * add comments for readability .complete on azure should be non-streaming as this is the sync response * migrate CometAPI, but disable as we cannot test --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2025-10-01 10:17:18 -07:00
Timothy Carambat	cd34063111	Patch OpenAI metrics (#4458 ) resolves #4457	2025-09-30 15:19:34 -07:00
timothycarambat	d800f8a073	Merge branch 'master' of github.com:Mintplex-Labs/anything-llm	2025-09-30 14:35:41 -07:00
timothycarambat	d1e39e1753	Fix issue where filenames with spaces could be marked as orphaned and pruned from threads they are attached to - resulting in the model not seeing a file during chat after the backend boots	2025-09-30 14:35:14 -07:00
Marcello Fitton	96bf127696	New Default System Prompt Variables (User ID, Workspace ID, & Workspace Name) (#4414 ) * Fix system prompt variable color logic by removing unused variable type from switch statement and adding new types. * Add workspace id, name and user id as default system prompt variables * Combine user and workspace variable evaluations into a single if statment, reducing redundant code. * minor refactor * add systemPromptVariable expandSystemPromptVariables test cases --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-09-29 14:32:56 -07:00
Marcello Fitton	7ca2753c24	Sanitize Metadata Before PG Vector Database Insertion (#4434 ) * Fix JSDOC for updateOrCreateCollection * Add sanitizeForJsonb method to PGVector for safe JSONB handling This new method recursively sanitizes values intended for JSONB storage, removing disallowed control characters and ensuring safe insertion into PostgreSQL. The method is integrated into the vector insertion process to sanitize metadata before database operations. * Add unit tests for PGVector.sanitizeForJsonb method This commit introduces a comprehensive test suite for the PGVector.sanitizeForJsonb method, ensuring it correctly handles various input types, including null, undefined, strings with disallowed control characters, objects, arrays, and Date objects. The tests verify that the method sanitizes inputs without mutating the original data structures. --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-09-29 13:49:45 -07:00
Marcello Fitton	eb77876127	Add HTTP request/response logging middleware for development mode (#4425 ) * Add HTTP request logging middleware for development mode - Introduced httpLogger middleware to log HTTP requests and responses. - Enabled logging only in development mode to assist with debugging. * Update httpLogger middleware to disable time logging by default * Add httpLogger middleware for development mode in collector service * Refactor httpLogger middleware to rename timeLogs parameter to enableTimestamps for clarity * Make HTTP Logger only mount in development and environment flag is enabled. * Update .env.example to clarify HTTP Logger configuration comments --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-09-29 13:33:15 -07:00

... 2 3 4 5 6 ...

968 Commits