Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
By chatting or signing in you agree to the Terms and chat-message logging (revocable in History).