What is an example of a decoder-based transformer?

Explanation:
Decoder-based transformers are autoregressive models: they generate text one token at a time, using masked (causal) self-attention so that each position attends only to itself and earlier tokens. Llama from Meta is a decoder-only model. It consists of a stack of decoder layers that predict each next token directly from the preceding tokens, with no separate encoder stage, which makes it a classic example of a decoder-based transformer.

BERT from Google is an encoder-only model designed for understanding tasks rather than generation, so it doesn’t fit the decoder-only pattern. BART from Facebook and T5 from Google use encoder-decoder architectures, in which an encoder processes the input and a separate decoder produces the output, and that separates them from a decoder-only setup.
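To make the masked self-attention idea concrete, here is a minimal sketch in plain Python. It is not how Llama or any library implements attention; the helper names `causal_mask` and `masked_softmax` and the score values are made up for illustration. It only shows how a causal mask stops each position from putting weight on later tokens.

```python
import math

def causal_mask(n):
    """Return an n x n boolean matrix; entry [i][j] is True when
    position i is allowed to attend to position j (that is, j <= i)."""
    return [[j <= i for j in range(n)] for i in range(n)]

def masked_softmax(scores, mask):
    """Row-wise softmax over `scores`, with zero weight wherever `mask` is False."""
    weights = []
    for row, allowed in zip(scores, mask):
        visible = [s for s, ok in zip(row, allowed) if ok]
        peak = max(visible)  # subtract the row max for numerical stability
        exps = [math.exp(s - peak) if ok else 0.0 for s, ok in zip(row, allowed)]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

# Toy attention scores for a 3-token sequence (illustrative numbers only).
scores = [[0.5, 1.2, -0.3],
          [0.1, 0.4, 2.0],
          [1.0, 0.0, 0.7]]

weights = masked_softmax(scores, causal_mask(3))
for row in weights:
    print([round(w, 3) for w in row])
```

Every entry above the diagonal comes out as zero, so the first token attends only to itself, the second to the first two tokens, and so on. An encoder-only model such as BERT would skip the mask and let every position attend to the whole sequence.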
