
Transformer

Also known as: Transformer Architecture, Transformer Model

In one sentence

A neural network architecture that uses attention mechanisms to model how every word (token) in a sequence relates to every other, and that underlies modern large language models (LLMs).

Explain like I'm 12

A type of AI brain design that can pay attention to all the words in a sentence at once and figure out how they relate to each other, like reading the whole page instead of one word at a time.

In context

The foundation of GPT, Claude, BERT, and most modern language models. Introduced in Google's 2017 'Attention Is All You Need' paper.
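
To make "attention" concrete, here is a minimal sketch of scaled dot-product attention, the core operation described in that paper, written in plain Python. It is a simplification under stated assumptions: the token vectors are made-up 2-dimensional values, the function names are illustrative, and the same vectors are used directly as queries, keys, and values. A real transformer first passes them through learned projections and runs several attention heads in parallel.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of vectors."""
    d_k = len(keys[0])
    width = len(values[0])
    outputs = []
    for q in queries:
        # Score this query against every key at once, scaled by sqrt(d_k).
        scores = [dot(q, k) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(width)]
        outputs.append(out)
    return outputs

# Hypothetical 2-dimensional vectors standing in for a three-token sentence.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mixed = attention(tokens, tokens, tokens)
```

Each row of `mixed` is a blend of all three token vectors, weighted by how strongly that token's query matches each key. This is the sense in which every word looks at every other word at once.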
