DeepSeek Debuts New AI Models to Rival Google and OpenAI China’s DeepSeek unveiled two new versions of an experimental artificial-intelligence model it released weeks ago, adding fresh capabilities the startup said would help with combining reasoning
DeepSeek-V3. 2: Pushing the Frontier of Open Large Language Models We introduce DeepSeek-V3 2, a model that harmonizes high computational efficiency with superior reasoning and agent performance The key technical breakthroughs of DeepSeek-V3 2 are as follows: (1) DeepSeek Sparse Attention (DSA): We introduce DSA, an efficient attention mechanism that substantially reduces computational complexity while preserving model performance in long-context scenarios