Question 1

What is the attention mechanism?

Accepted Answer

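The attention mechanism is the main reason modern language models can handle long, complex text without losing track of what matters. Earlier sequence models, such as recurrent networks, read text one token at a time and compressed everything seen so far into a fixed-size state, so early context tended to fade. With attention, every token in a passage can weigh its relevance to every other token directly. That is why a model like Claude can read a 100-page document and still answer a question about something on page 3: the earlier text stays directly accessible, as long as it fits in the context window, rather than being squeezed through a lossy summary. The "attention" is the set of weights that determines which parts of the context the model focuses on when generating each token of its response.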
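
The weighting described above can be illustrated with a minimal sketch of scaled dot-product attention in plain Python. The function names (`softmax`, `dot`, `attention`) and the toy 2-dimensional token vectors are assumptions made up for this example. A real model would also apply learned projections to produce separate queries, keys, and values, and would run many attention heads in parallel; here the same vectors are reused for all three roles to keep the idea visible.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors.

    Returns (outputs, weights), where weights[i][j] says how strongly
    position i attends to position j.
    """
    scale = math.sqrt(len(keys[0]))
    outputs, all_weights = [], []
    for q in queries:
        # Score this query against every key, then normalize the scores.
        weights = softmax([dot(q, k) / scale for k in keys])
        # Each output is a weighted average of the value vectors.
        mixed = [
            sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical toy vectors for three tokens (made-up numbers).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Self-attention: queries, keys, and values all come from the same sequence.
outputs, weights = attention(tokens, tokens, tokens)

for i, row in enumerate(weights):
    print(f"token {i} attends with weights {[round(w, 3) for w in row]}")
```

Each row of `weights` sums to 1, and a token places more weight on positions whose vectors point in a similar direction to its own. In this toy setup the first token weights itself and the third token equally, and weights the second token least.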

Question 2

What is another name for the attention mechanism?

Accepted Answer

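The attention mechanism is often referred to as self-attention. Strictly speaking, self-attention is the specific form used in transformer models, where a sequence attends to itself; the original Transformer paper calls the underlying operation scaled dot-product attention.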