Name | Commit | Message | Updated
_assets | d069c668f8 | Model Runtime (#1858) | 1 year ago
llm | 2b080b5cfc | feature: Add presence_penalty and frequency_penalty parameters to the … (#5637) | 10 months ago
rerank | 4365843c20 | enhance:speedup xinference embedding & rerank (#3587) | 1 year ago
speech2text | f76ac8bdee | enhance:speedup xinference audio transcription (#3636) | 1 year ago
text_embedding | 4365843c20 | enhance:speedup xinference embedding & rerank (#3587) | 1 year ago
__init__.py | d069c668f8 | Model Runtime (#1858) | 1 year ago
xinference.py | d069c668f8 | Model Runtime (#1858) | 1 year ago
xinference.yaml | ec181649ae | Update model provider configuration for Triton Inference Server and X… (#6274) | 9 months ago
xinference_helper.py | f361c7004d | feat: support vision models from xinference (#4094) | 1 year ago