Despite sparse attention being a known approach for years, DeepSeek claims its version achieves "fine-grained sparse ...