Transformer Model

Neural architecture for sequential data.

Glossary Term Updated September 12, 2025

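Transformers use attention mechanisms to process sequences such as text. Attention relates every position in a sequence directly to every other position, so the whole sequence can be processed in parallel rather than one step at a time, as in recurrent networks. Transformers underpin models such as GPT and BERT.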
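
As a rough illustration of the attention mechanism, the sketch below implements scaled dot-product attention in plain Python. It is a minimal sketch, not how any particular model is implemented. The function names `softmax` and `attention` and the toy token vectors are assumptions made for this example. Real transformers also apply learned projections to produce the queries, keys, and values, and run several attention heads in parallel; both are omitted here.

```python
import math


def softmax(scores):
    """Turn a list of raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of vectors (hypothetical helper)."""
    scale = math.sqrt(len(keys[0]))  # sqrt of the key dimension
    dim = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        # Each output is the weighted average of the value vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
        )
    return outputs


# Toy self-attention: three 2-dimensional token vectors (made-up numbers).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
contextual = attention(tokens, tokens, tokens)
```

Each row of `contextual` is a blend of all three token vectors, weighted by how strongly that token's query matches each key. No row depends on the result for another row, so the rows can be computed in parallel.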