jundot/omlx · gitaskhub

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Stars · 17,526

Language · Python

License · Apache-2.0

Ask anything about this repo to start.

Full explanation on explaingit →