LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
By chatting or signing in you agree to the Terms and chat-message logging (revocable in History).