Tokenization

Breaking text into smaller units.

Glossary Term Updated September 12, 2025

Tokenization splits text into words, subwords, or characters, making it suitable for model input.