Faster Ollama alternative

RandomlyRight@sh.itjust.works · 5 months ago

Faster Ollama alternative

hendrik@palaver.p3x.de · 5 months ago

I’m also aware of LocalAI with automatic model swapping and OpenAI compatible API.

But unless I’m mistaken, they all use ggml behind the scenes? So you might want to look for something that uses vllm or exllama or something if you want a completely different backend.

CaptnBook@feddit.org · 5 months ago

Vllm unfortunately doesn’t support switching the model without a restart.

Daughter3546@lemmy.world · 5 months ago

I would not recommend LocalAI. There documentation is somewhat lacking and it’s an all in one utility with many moving parts. The parts also tend to break, quite often.