Transformer

The transformer is a deep learning architecture introduced in the 2017 paper “Attention Is All You Need” by Vaswani et al. It revolutionized natural language processing (NLP) and became the foundation for many advanced artificial intelligence (AI) models, including BERT, GPT, and T5. Unlike earlier recurrent models, transformers rely entirely on a mechanism called self-attention, which lets them weigh the importance of every token in a sequence against every other token, regardless of position. Because transformers process all input tokens in parallel rather than sequentially, training is highly parallelizable and scales efficiently to large datasets. They are widely used not only in NLP but also in computer vision, audio processing, and multimodal AI, enabling breakthroughs in tasks such as translation, summarization, image captioning, and content generation.
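
To make the self-attention idea concrete, the sketch below implements single-head scaled dot-product attention, the core operation from the paper: Attention(Q, K, V) = softmax(QKᵀ/√d_k)V. This is a minimal NumPy illustration, not the paper's full multi-head implementation; the function name, matrix shapes, and toy dimensions are arbitrary choices for the example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max for numerical stability before exponentiating.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention.

    X:          (seq_len, d_model) token embeddings
    Wq, Wk, Wv: (d_model, d_k) learned projection matrices
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv    # project every token into query/key/value spaces
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # pairwise similarity between all token positions
    weights = softmax(scores, axis=-1)  # each row is a distribution over the whole sequence
    return weights @ V                  # each output token is a weighted mix of all values

# Toy usage: 4 tokens with 8-dimensional embeddings (dimensions chosen arbitrarily).
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (4, 8)
```

Because the attention weights are computed for all token pairs at once as a single matrix product, every output position can be computed in parallel; this is the property that distinguishes transformers from recurrent models, which must process tokens one step at a time.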