Benjamin ec181649ae Update model provider configuration for Triton Inference Server and X… (#6274) пре 1 година
..
_assets d069c668f8 Model Runtime (#1858) пре 1 година
llm 2b080b5cfc feature: Add presence_penalty and frequency_penalty parameters to the … (#5637) пре 1 година
rerank 4365843c20 enhance:speedup xinference embedding & rerank (#3587) пре 1 година
speech2text f76ac8bdee enhance:speedup xinference audio transcription (#3636) пре 1 година
text_embedding 4365843c20 enhance:speedup xinference embedding & rerank (#3587) пре 1 година
__init__.py d069c668f8 Model Runtime (#1858) пре 1 година
xinference.py d069c668f8 Model Runtime (#1858) пре 1 година
xinference.yaml ec181649ae Update model provider configuration for Triton Inference Server and X… (#6274) пре 1 година
xinference_helper.py f361c7004d feat: support vision models from xinference (#4094) пре 1 година