The 2017 Transformer paper is now more than five years old. Every state-of-the-art LLM, including GPT, BERT, T5, PaLM, and LLaMA, is a Transformer descendant.