DeepSeek Introducing NSA: A Hardware-Aligned, Natively Trainable Sparse Attention Mechanism for Ultra-Fast Long-Context Training and Inference by DeepSeek AI ByDeepMind April 30, 2025 4:18 pm