Attention Dilution
Attention dilution (also called context dilution) is one of the fundamental limitations of transformer-based LLMs when dealing with long contexts or extended agent memory. Because softmax attention distributes a fixed budget of weight (summing to 1) across every token in the context, adding more tokens thins out the weight available for the ones that actually matter.
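A minimal numerical sketch of the effect, assuming a simplified single-query softmax attention where one "relevant" token scores higher than a sea of identical distractors (the scores 2.0 and 0.0 are illustrative, not from any real model):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

# One relevant token (score 2.0) among n-1 distractors (score 0.0).
# Its softmax weight is e^2 / (e^2 + (n - 1)): as the context grows,
# the distractors' combined mass dilutes the weight it receives.
for n in (8, 128, 2048, 32768):
    scores = [2.0] + [0.0] * (n - 1)
    w = softmax(scores)[0]
    print(f"context {n:>6}: weight on relevant token = {w:.4f}")
```

Even though the relevant token's score never changes, its share of attention collapses from roughly half at 8 tokens to a fraction of a percent at 32k tokens, which is the dilution the paragraph above describes.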