MNN is an efficient, lightweight deep learning framework that supports both inference and training of deep learning models. The project describes its on-device inference and training performance as industry-leading. MNN is currently integrated into more than 30 Alibaba applications, including Taobao, Tmall, Youku, DingTalk, and Xianyu, and covers more than 70 use cases such as live streaming, short video capture, search recommendation, image search, interactive marketing, benefit distribution, and security risk control.
GitHub Address: https://github.com/alibaba/MNN
| Architecture / Precision | Normal | FP16 | BF16 | Int8 |
| --- | --- | --- | --- | --- |
| CPU | | | | |
| Native | B | C | B | B |
| x86/x64-SSE4.1 | A | B | B | A |
| x86/x64-AVX2 | S | B | B | A |
| x86/x64-AVX512 | S | B | B | S |
| ARMv7a | S | S (ARMv8.2) | S | S |
| ARMv8 | S | S (ARMv8.2) | S (ARMv8.6) | S |
| GPU | | | | |
| OpenCL | A | S | C | S |
| Vulkan | A | A | C | A |
| Metal | A | S | C | S |
| CUDA | A | S | C | A |
| NPU | | | | |
| CoreML | A | C | C | C |
| HIAI | A | C | C | C |
| NNAPI | B | B | C | B |
Legend: S - Strongly Recommended | A - Well Supported | B - Supported but with Issues | C - Not Supported
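To show how the ratings might guide backend selection, here is a minimal Python sketch that transcribes the table and filters it by precision. The names `SUPPORT` and `rank_backends` are hypothetical helpers for illustration and are not part of MNN; ISA qualifiers such as "(ARMv8.2)" are dropped for brevity.

```python
# Ratings from the legend, best first.
RATING_ORDER = ["S", "A", "B", "C"]
PRECISIONS = ["Normal", "FP16", "BF16", "Int8"]

# One row per backend, one letter per precision column, transcribed
# from the support table. ISA qualifiers like "(ARMv8.2)" are omitted.
_ROWS = [
    ("Native", "BCBB"),
    ("x86/x64-SSE4.1", "ABBA"),
    ("x86/x64-AVX2", "SBBA"),
    ("x86/x64-AVX512", "SBBS"),
    ("ARMv7a", "SSSS"),
    ("ARMv8", "SSSS"),
    ("OpenCL", "ASCS"),
    ("Vulkan", "AACA"),
    ("Metal", "ASCS"),
    ("CUDA", "ASCA"),
    ("CoreML", "ACCC"),
    ("HIAI", "ACCC"),
    ("NNAPI", "BBCB"),
]
SUPPORT = {name: dict(zip(PRECISIONS, ratings)) for name, ratings in _ROWS}


def rank_backends(precision, min_rating="A"):
    """Return backends rated at least `min_rating` for `precision`, best first."""
    cutoff = RATING_ORDER.index(min_rating)
    hits = []
    for name, ratings in SUPPORT.items():
        rank = RATING_ORDER.index(ratings[precision])
        if rank <= cutoff:
            hits.append((rank, name))
    # Stable sort: backends with equal ratings keep their table order.
    hits.sort(key=lambda hit: hit[0])
    return [name for _, name in hits]
```

For example, `rank_backends("BF16", "S")` returns `["ARMv7a", "ARMv8"]`, matching the table's indication that BF16 is strongly recommended only on ARM CPUs.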
MNN-Converter: a model conversion tool that converts models from other frameworks into MNN models.
MNN-Compress: a model compression tool that reduces model size and improves performance.
MNN-Express: supports running models with control flow and using MNN operators for general-purpose computation.
MNN-CV: a lightweight image processing library, similar to OpenCV but implemented on top of MNN.
MNN-Train: supports training MNN models.
MNN-LLM: a large language model runtime built on the MNN engine. It aims to deploy LLMs locally on everyday platforms (mobile phones, PCs, and IoT devices).
MNN-Diffusion: a Stable Diffusion runtime built on the MNN engine that supports local deployment of Stable Diffusion models on various platforms.
MNN-related research has been published at two systems conferences, OSDI '22 and MLSys 2020, which reflects its reach in both academia and industry.
Available for download from the MNN official website, providing:
MNN provides easy-to-use Python interfaces that let machine learning engineers run inference, training, and image processing without writing C++ code.
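As a rough illustration of Python-only inference, the sketch below runs a single forward pass through MNN's session-style Python API (`MNN.Interpreter`, `createSession`, `runSession`). The helper name `run_mnn_model`, the single-input/single-output assumption, and the NCHW float32 layout are assumptions made for this example. The exact API surface can differ between MNN releases, so treat this as a starting point rather than a reference implementation.

```python
def run_mnn_model(model_path, input_array, output_shape):
    """Run one forward pass of an MNN model (hypothetical helper).

    Assumes a single float32 input in NCHW layout and a single output
    whose shape the caller already knows.
    """
    # Imports are deferred so this file can be loaded without MNN installed.
    import numpy as np
    import MNN

    interpreter = MNN.Interpreter(model_path)
    session = interpreter.createSession()
    input_tensor = interpreter.getSessionInput(session)

    # Wrap the host data in an MNN tensor, then copy it into the session input.
    host_input = MNN.Tensor(
        tuple(input_array.shape),
        MNN.Halide_Type_Float,
        input_array.astype(np.float32),
        MNN.Tensor_DimensionType_Caffe,  # NCHW layout
    )
    input_tensor.copyFrom(host_input)

    interpreter.runSession(session)

    # Copy the session output back into a host tensor of the expected shape.
    output_tensor = interpreter.getSessionOutput(session)
    host_output = MNN.Tensor(
        tuple(output_shape),
        MNN.Halide_Type_Float,
        np.zeros(output_shape, dtype=np.float32),
        MNN.Tensor_DimensionType_Caffe,
    )
    output_tensor.copyToHostTensor(host_output)
    return np.array(host_output.getData(), dtype=np.float32).reshape(output_shape)
```

A call might look like `run_mnn_model("model.mnn", image, (1, 1000))`, where `model.mnn` and the shapes are placeholders for your own converted model.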
As an open-source deep learning framework from Alibaba, MNN is a strong choice for mobile and embedded AI deployment thanks to its lightweight design, high performance, and cross-platform support. It covers workloads ranging from traditional CNN inference to large language model deployment.