Compare with state-of-the-art tokenizers (e.g., Llama’s)
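
One way such a comparison could be set up is sketched below. The note does not say which metric to use, so this assumes compression efficiency (UTF-8 bytes per token, higher meaning fewer tokens for the same text) measured on a shared set of texts. The names `bytes_per_token` and `compare_tokenizers` and the two stand-in tokenizers are hypothetical, for illustration only. A real tokenizer such as Llama's would be plugged in by wrapping its encoding function in a callable that takes a string and returns a list of tokens.

```python
from typing import Callable, Dict, Iterable, List

# Assumption: a tokenizer is any callable mapping a string to a list of tokens.
Tokenize = Callable[[str], List]


def bytes_per_token(tokenize: Tokenize, texts: Iterable[str]) -> float:
    """Average number of UTF-8 bytes covered by one token over all texts."""
    total_bytes = 0
    total_tokens = 0
    for text in texts:
        total_bytes += len(text.encode("utf-8"))
        total_tokens += len(tokenize(text))
    if total_tokens == 0:
        raise ValueError("tokenizer produced no tokens for the given texts")
    return total_bytes / total_tokens


def compare_tokenizers(
    tokenizers: Dict[str, Tokenize], texts: Iterable[str]
) -> Dict[str, float]:
    """Score every tokenizer on the same texts so the numbers are comparable."""
    corpus = list(texts)  # materialize once; an iterator would be consumed
    return {name: bytes_per_token(fn, corpus) for name, fn in tokenizers.items()}


# Hypothetical stand-ins so the sketch runs without external dependencies.
def whitespace_tokenize(text: str) -> List[str]:
    return text.split()


def byte_tokenize(text: str) -> List[int]:
    return list(text.encode("utf-8"))


if __name__ == "__main__":
    sample = ["hello world", "tokenizers split text into pieces"]
    scores = compare_tokenizers(
        {"whitespace": whitespace_tokenize, "bytes": byte_tokenize}, sample
    )
    for name, score in scores.items():
        print(f"{name}: {score:.2f} bytes/token")
```

Measuring in bytes instead of characters keeps the score comparable across scripts, since a character-based ratio would favor tokenizers evaluated on multi-byte text. Bytes per token captures only compression; vocabulary size and downstream model quality would need separate measurements.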