anything-llm

mirror of https://github.com/Mintplex-Labs/anything-llm synced 2026-04-25 17:15:37 +02:00

Author	SHA1	Message	Date
Timothy Carambat	ff7cb17e34	Improved DMR support (#4863 ) * Improve DMR support - Autodetect models installed - Grab all models from hub.docker to show available - UI to handle render,search, install, and management of models - Support functionality for chat, stream, and agentic calls * forgot files * fix loader circle being too large fix tooltip width command adjust location of docker installer open for web platform * adjust imports	2026-01-14 15:55:26 -08:00
方程	90e474abcb	Support Gitee AI(LLM Provider) (#3361 ) * Support Gitee AI(LLM Provider) * refactor(server): 重构 GiteeAI 模型窗口限制功能,暂时将窗口限制硬编码,计划使用外部 API 数据和缓存 * updates for Gitee AI * use legacy lookup since gitee does not enable getting token context windows * add more missing records * reorder imports --------- Co-authored-by: 方程 <fangcheng@oschina.cn> Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 14:19:32 -08:00
Timothy Carambat	8cdadd8cb3	Sync models from remote for FireworksAI (#4475 ) resolves #4474	2025-10-02 12:34:05 -07:00
TensorNull	5922349bb7	feat: Implement CometAPI integration for chat completions and model m… (#4379 ) * feat: Implement CometAPI integration for chat completions and model management - Added CometApiLLM class for handling chat completions using CometAPI. - Implemented model synchronization and caching mechanisms. - Introduced streaming support for chat responses with timeout handling. - Created CometApiProvider class for agent interactions with CometAPI. - Enhanced error handling and logging throughout the integration. - Established a structure for managing function calls and completions. * linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-09-16 14:38:49 -07:00
Timothy Carambat	2c19dd09ed	Native Embedder model selection (incl: Multilingual support) (#3835 ) * WIP on embedder selection TODO: apply splitting and query prefixes (if applicable) * wip on upsert * Support base model support nomic-text-embed-v1 support multilingual-e5-small Add prefixing for both embedding and query for RAG tasks Add chunking prefix to all vector dbs to apply prefix when possible Show dropdown and auto-pull on new selection * norm translations * move supported models to constants handle null seelction or invalid selection on dropdown update comments * dev * patch text splitter maximums for now * normalize translations * add tests for splitter functionality * normalize --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2025-07-22 10:07:20 -07:00
Timothy Carambat	e80492606a	Automatic Context window detection (#3817 ) * Add context window finder from litellm maintained list apply to all cloud providers, have client cache for 3 days * linting	2025-05-14 11:03:19 -07:00
cnJasonZ	2aeb4c2961	Add new model provider PPIO (#3211 ) * feat: add new model provider PPIO * fix: fix ppio model fetching * fix: code lint * reorder LLM update interface for streaming and chats to use valid keys linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-02-27 10:53:00 -08:00
Sushanth Srivatsa	3fd0fe8fc5	2749 ollama client auth token (#3005 ) * ollama auth token provision * auth token provision * ollama auth provision * ollama auth token * ollama auth provision * token input field css fix * Fix provider handler not using key sensible fallback to not break existing installs re-order of input fields null-check for API key and header optional insert on request linting * apply header and auth to agent invocations * upgrading to ollama 5.10 for passing headers to constructor * rename Auth systemSetting key to be more descriptive linting and copy * remove untracked files + update gitignore * remove debug * patch lockfile --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-02-18 16:00:17 -08:00
Timothy Carambat	2a9066e83a	OCR PDFs as fallback during upload (#3204 ) * OCR PDFs as fallback in spawn thread * wip * build our own worker fanout and wrapper * norm pkgs * bump dev	2025-02-14 11:57:31 -08:00
Sean Hatfield	48dcb22b25	Dynamic fetching of TogetherAI models (#3017 ) * implement dynamic fetching of togetherai models * implement caching for togetherai models * update gitignore for togetherai model caching * Remove models.json from git tracking * Remove .cached_at from git tracking * lint * revert unneeded change --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-01-24 11:06:59 -08:00
Timothy Carambat	21af81085a	Add caching to Gemini /models (#2969 ) rename file typo	2025-01-13 13:12:03 -08:00
Timothy Carambat	ad01df8790	Reranker option for RAG (#2929 ) * Reranker WIP * add cacheing and singleton loading * Add field to workspaces for vectorSearchMode Add UI for lancedb to change mode update all search endpoints to pass in reranker prop if provider can use it * update hint text * When reranking, swap score to rerank score * update optchain	2025-01-02 14:27:52 -08:00
Timothy Carambat	80565d79e0	2488 novita ai llm integration (#2582 ) * feat: add new model provider: Novita AI * feat: finished novita AI * fix: code lint * remove unneeded logging * add back log for novita stream not self closing * Clarify ENV vars for LLM/embedder seperation for future Patch ENV check for workspace/agent provider --------- Co-authored-by: Jason <ggbbddjm@gmail.com> Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-11-04 11:34:29 -08:00
Timothy Carambat	bce7988683	Integrate Apipie support directly (#2470 ) resolves #2464 resolves #989 Note: Streaming not supported	2024-10-15 12:36:06 -07:00
Timothy Carambat	ac6ca13f60	1173 dynamic cache openrouter (#1176 ) * patch agent invocation rule * Add dynamic model cache from OpenRouter API for context length and available models	2024-04-23 11:10:54 -07:00
Timothy Carambat	0ada882991	Support external transcription providers (#909 ) * Support External Transcription providers * patch files * update docs * fix return data	2024-03-14 15:43:26 -07:00
Timothy Carambat	61db981017	feat: Embed on-instance Whisper model for audio/mp4 transcribing (#449 ) * feat: Embed on-instance Whisper model for audio/mp4 transcribing resolves #329 * additional logging * add placeholder for tmp folder in collector storage Add cleanup of hotdir and tmp on collector boot to prevent hanging files split loading of model and file conversion into concurrency * update README * update model size * update supported filetypes	2023-12-15 11:20:13 -08:00
Timothy Carambat	1e98da07bc	docs: placeholder for model downloads folder (#446 )	2023-12-14 10:31:14 -08:00
Timothy Carambat	655ebd9479	[Feature] AnythingLLM use locally hosted Llama.cpp and GGUF files for inferencing (#413 ) * Implement use of native embedder (all-Mini-L6-v2) stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * add built-in LLM support (expiermental) * Update to progress output for embedder * move embedder selection options to component * saftey checks for modelfile * update ref * Hide selection when on hosted subdomain * update documentation hide localLlama when on hosted * saftey checks for storage of models * update dockerfile to pre-build Llama.cpp bindings * update lockfile * add langchain doc comment * remove extraneous --no-metal option * Show data handling for private LLM * persist model in memory for N+1 chats * update import update dev comment on token model size * update primary README * chore: more readme updates and remove screenshots - too much to maintain, just use the app! * remove screeshot link	2023-12-07 14:48:27 -08:00
Timothy Carambat	88cdd8c872	Add built-in embedding engine into AnythingLLM (#411 ) * Implement use of native embedder (all-Mini-L6-v2) stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * Update to progress output for embedder * move embedder selection options to component * forgot import * add Data privacy alert updates for local embedder	2023-12-06 10:36:22 -08:00

20 Commits