Jyong 0c1307b083 add jina rerank http timout parameter (#10476) hai 1 ano
..
__base e61752bd3a feat/enhance the multi-modal support (#8818) hai 1 ano
anthropic 1e8457441d fix(model_runtime): remove vision from features for Claude 3.5 Haiku (#10360) hai 1 ano
azure_ai_studio 574c4a264f chore(lint): Use logging.exception instead of logging.error (#10415) hai 1 ano
azure_openai f6fecb957e fix azure chatgpt o1 parameter error (#10067) hai 1 ano
baichuan b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
bedrock 05d43a4074 Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) hai 1 ano
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) hai 1 ano
cohere b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
deepseek 153807f243 fix: response_format label (#8326) hai 1 ano
fireworks b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
fishaudio 62051d5171 Corrected type annotation to "Any" from "any" all files in "model_providers" folder (#9135) hai 1 ano
gitee_ai 2aa171c348 Using a dedicated interface to obtain the token credential for the gitee.ai provider (#10243) hai 1 ano
google 12adcf8925 fix: gemini model use some tools raise error (#9993) hai 1 ano
gpustack 76b0328eb1 feat: add gpustack model provider (#10158) hai 1 ano
groq b92504bebc Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) hai 1 ano
huggingface_hub b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
huggingface_tei 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
hunyuan 92a3898540 fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) hai 1 ano
jina 0c1307b083 add jina rerank http timout parameter (#10476) hai 1 ano
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) hai 1 ano
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
minimax b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
mistralai 5ddb601e43 add MixtralAI Model (#8517) hai 1 ano
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
moonshot 1b5adf40da fix: moonshot response_format raise error (#9847) hai 1 ano
nomic b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) hai 1 ano
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) hai 1 ano
oci b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
ollama b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
openai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
openai_api_compatible 70ddc0ce43 openai compatiable api usage and id (#9800) hai 1 ano
openllm 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
openrouter 5a9448245b fix: remove unsupported vision in OpenRouter Haiku 3.5 (#10364) hai 1 ano
perfxcloud b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
replicate b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
sagemaker d45d90e8ae chore: lazy import sagemaker (#10342) hai 1 ano
siliconflow 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
spark d0e0111f88 fix:Spark's large language model token calculation error #7911 (#8755) hai 1 ano
stepfun 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) hai 1 ano
togetherai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) hai 1 ano
tongyi 033ab5490b feat: support LLM understand video (#9828) hai 1 ano
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
upstage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
vertex_ai 05d43a4074 Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) hai 1 ano
vessl_ai aa895cfa9b fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) hai 1 ano
volcengine_maas 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
voyage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) hai 1 ano
wenxin 4d5546953a add llm: ernie-4.0-turbo-128k of wenxin (#10135) hai 1 ano
x bf9349c4dc feat: add xAI model provider (#10272) hai 1 ano
xinference 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) hai 1 ano
yi e0846792d2 feat: add yi custom llm intergration (#9482) hai 1 ano
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) hai 1 ano
zhipuai 033ab5490b feat: support LLM understand video (#9828) hai 1 ano
__init__.py d069c668f8 Model Runtime (#1858) hai 1 ano
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) hai 1 ano
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) hai 1 ano