On April 24, 2026, the highly anticipated DeepSeek-V4 model was officially released and open-sourced, with Huawei Cloud announcing its premier adaptation. DeepSeek-V4 demonstrates leading capabilities in the domestic and open-source AI landscape, particularly featuring an ultra-long context window of one million tokens, alongside significant advancements in AI agent capabilities, world knowledge, and reasoning performance.
Among the DeepSeek-V4 series, the DeepSeek-V4-Flash version is notable. With its parameter count optimized to 284B, it not only significantly reduces inference costs but also offers faster and more economical API services due to smaller model parameters and activation. This development democratizes access to million-token context capabilities.
Huawei Cloud's MaaS (Model as a Service) platform now provides developers with easy, deployment-free, one-click access to DeepSeek-V4-Flash API services. To ensure rapid adaptation and high-performance deployment of the new model, Huawei Cloud collaborated extensively across its system, operator, and cluster layers, optimizing scheduling efficiency, computational efficiency, and data transfer efficiency.
Specifically for DeepSeek-V4, Huawei Cloud was the first to adapt a hierarchical attention compression mechanism, enabling efficient KVCache allocation and management under the V4 attention mechanism. Furthermore, Huawei Cloud offers over 10 high-performance Ascend-based fused operators, including TopK, SWA, and CFA. These are combined with framework optimizations such as asynchronous scheduling and MTP (multi-step speculative) multi-step speculative execution, ensuring high-performance inference for native million-token long contexts.
Huawei Cloud is committed to building an AI infrastructure ecosystem, termed the “silicon-based black land,” which supports both proprietary and third-party large models and numerous intelligent agents. This initiative aims to help enterprises solve complex problems and significantly boost production efficiency. Companies like Kingsoft Office and 360 have already integrated the new DeepSeek models via Huawei Cloud. Additionally, the DeepSeek-V4-Pro version is expected to launch soon, promising further enhancements.