|  | 1 年間 前 | |
|---|---|---|
| .. | ||
| callbacks | 1 年間 前 | |
| docs | 1 年間 前 | |
| entities | 1 年間 前 | |
| errors | 1 年間 前 | |
| model_providers | 1 年間 前 | |
| schema_validators | 1 年間 前 | |
| utils | 1 年間 前 | |
| README.md | 1 年間 前 | |
| README_CN.md | 1 年間 前 | |
| __init__.py | 1 年間 前 | |
This module provides the interface for invoking and authenticating various models, and offers Dify a unified information and credentials form rule for model providers.
Supports capability invocation for 5 types of models
LLM - LLM text completion, dialogue, pre-computed tokens capabilityText Embedding Model - Text Embedding, pre-computed tokens capabilityRerank Model - Segment Rerank capabilitySpeech-to-text Model - Speech to text capabilityModeration - Moderation capabilityModel provider display
Displays a list of all supported providers, including provider names, icons, supported model types list, predefined model list, configuration method, and credentials form rules, etc. For detailed rule design, see: Schema.
After configuring provider/model credentials, the dropdown (application orchestration interface/default model) allows viewing of the available LLM list. Greyed out items represent predefined model lists from providers without configured credentials, facilitating user review of supported models.
In addition, this list also returns configurable parameter information and rules for LLM, as shown below:
These parameters are all defined in the backend, allowing different settings for various parameters supported by different models, as detailed in: Schema.
The provider list returns configuration information for the credentials form, which can be authenticated through Runtime's interface. The first image above is a provider credential DEMO, and the second is a model credential DEMO.
Model Runtime is divided into three layers:
It provides methods for obtaining all providers, all model lists, getting provider instances, and authenticating provider/model credentials.
It provides the current provider's model list, model instance obtaining, provider credential authentication, and provider configuration rule information, allowing horizontal expansion to support different providers.
It offers direct invocation of various model types, predefined model configuration information, getting predefined/remote model lists, model credential authentication methods. Different models provide additional special methods, like LLM's pre-computed tokens method, cost information obtaining method, etc., allowing horizontal expansion for different models under the same provider (within supported model types).