Performance graph: https://raw.githubusercontent.com/openai/whisper/main/language-breakdown.svg
Tags: https://github.com/guillaumekln/faster-whisper/blob/master/faster_whisper/tokenizer.py#L177
https://github.com/openai/whisper/blob/main/whisper/tokenizer.py