MiniCPM is a family of highly efficient open-source models built for on-device AI. The models deliver significant speed-ups on edge chips while maintaining robust performance, and the family includes heavily quantized BitCPM versions for further efficiency.
The recently launched MiniCPM5-1B is the eighth iteration in the MiniCPM series. It is positioned as the new state of the art (SOTA) among compact open models running on the edge.
MiniCPM5-1B is a dense, 1B-parameter open model engineered for on-device and local deployment. It supports a 131K context window, offers 'Think / No Think' modes, provides tool calling, and is compatible with major formats such as GGUF and MLX. It also supports various inference backends and can even power an offline desktop pet, which illustrates the range of applications it targets.
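To give a rough sense of why a dense 1B-parameter model suits on-device use, the sketch below estimates the memory needed for the weights alone at a few bit widths. It is a back-of-the-envelope calculation, not a measurement: the parameter count is assumed to be exactly one billion, the bit widths are illustrative and do not correspond to specific GGUF, MLX, or BitCPM formats, and the function name `weight_memory_gib` is made up for this example. Activations, the KV cache for long contexts, and file-format overhead are not included.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / (1024 ** 3)


# Assumption: "1B" is taken as exactly one billion parameters.
NOMINAL_PARAMS = 1e9

# Illustrative bit widths only; real quantized formats add metadata overhead.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    size = weight_memory_gib(NOMINAL_PARAMS, bits)
    print(f"{label}: ~{size:.2f} GiB for weights")
```

Under these assumptions, halving the bits per weight halves the weight footprint, which is the basic reason quantized variants matter on memory-constrained edge devices.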