# mindnlp **Repository Path**: mindspore-lab/mindnlp ## Basic Information - **Project Name**: mindnlp - **Description**: MindNLP is an open source NLP library based on MindSpore. - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 38 - **Forks**: 20 - **Created**: 2022-11-15 - **Last Updated**: 2025-12-31 ## Categories & Tags **Categories**: nature-language **Tags**: None ## README #
MindNLP

docs GitHub PRs Welcome open issues ci

**MindNLP** stands for **MindSpore + Natural Language Processing**, representing seamless compatibility with the HuggingFace ecosystem. MindNLP enables you to leverage the best of both worlds: the rich HuggingFace model ecosystem and MindSpore's powerful acceleration capabilities. ## Table of Contents - [ MindNLP](#-mindnlp) - [Table of Contents](#table-of-contents) - [Features ✨](#features-) - [Installation](#installation) - [Install from Pypi](#install-from-pypi) - [Daily build](#daily-build) - [Install from source](#install-from-source) - [Version Compatibility](#version-compatibility) - [Introduction](#introduction) - [Major Features](#major-features) - [Supported models](#supported-models) - [License](#license) - [Feedbacks and Contact](#feedbacks-and-contact) - [MindSpore NLP SIG](#mindspore-nlp-sig) - [Acknowledgement](#acknowledgement) - [Citation](#citation) ## Features ✨ ### 1. 🤗 Full HuggingFace Compatibility MindNLP provides seamless compatibility with the HuggingFace ecosystem, enabling you to run any Transformers/Diffusers models on MindSpore across all hardware platforms (GPU/Ascend/CPU) without code modifications. #### Direct HuggingFace Library Usage You can directly use native HuggingFace libraries (transformers, diffusers, etc.) with MindSpore acceleration: **For HuggingFace Transformers:** ```python import mindspore import mindnlp from transformers import pipeline chat = [ {"role": "system", "content": "You are a sassy, wise-cracking robot as imagined by Hollywood circa 1986."}, {"role": "user", "content": "Hey, can you tell me any fun things to do in New York?"} ] pipeline = pipeline(task="text-generation", model="Qwen/Qwen3-8B", ms_dtype=mindspore.bfloat16, device_map="auto") response = pipeline(chat, max_new_tokens=512) print(response[0]["generated_text"][-1]["content"]) ``` **For HuggingFace Diffusers:** ```python import mindspore import mindnlp from diffusers import DiffusionPipeline pipeline = DiffusionPipeline.from_pretrained("stable-diffusion-v1-5/stable-diffusion-v1-5", ms_dtype=mindspore.float16, device_map='cuda') pipeline("An image of a squirrel in Picasso style").images[0] ``` #### MindNLP Native Interface You can also use MindNLP's native interface for better integration: ```python from mindnlp.transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased") model = AutoModel.from_pretrained("bert-base-uncased") inputs = tokenizer("Hello world!", return_tensors='ms') outputs = model(**inputs) ``` > **Note**: Due to differences in autograd and parallel execution mechanisms, any training or distributed execution code must utilize the interfaces provided by MindNLP. ### 2. ⚡ High-Performance Features Powered by MindSpore MindNLP leverages MindSpore's powerful capabilities to deliver exceptional performance and unique features: #### PyTorch-Compatible API with MindSpore Acceleration MindNLP provides `mindtorch` (accessible via `mindnlp.core`) for PyTorch-compatible interfaces, enabling seamless migration from PyTorch code while benefiting from MindSpore's acceleration on Ascend hardware: ```python import mindnlp # Automatically enables proxy for torch APIs import torch from torch import nn # All torch.xx APIs are automatically mapped to mindnlp.core.xx (via mindtorch) net = nn.Linear(10, 5) x = torch.randn(3, 10) out = net(x) print(out.shape) # core.Size([3, 5]) ``` #### Advanced Features Beyond Standard MindSpore MindNLP extends MindSpore with several advanced features for better model development: 1. **Dispatch Mechanism**: Operators are automatically dispatched to the appropriate backend based on `Tensor.device`, enabling seamless multi-device execution. 2. **Meta Device Support**: Perform shape inference and memory planning without actual computations, significantly speeding up model development and debugging. 3. **NumPy as CPU Backend**: Use NumPy as a CPU backend for acceleration, providing better compatibility and performance on CPU devices. 4. **Heterogeneous Data Movement**: Enhanced `Tensor.to()` for efficient data movement across different devices (CPU/GPU/Ascend). These features enable better support for model serialization, heterogeneous computing, and complex deployment scenarios. ## Installation #### Install from Pypi You can install the official version of MindNLP which is uploaded to pypi. ```bash pip install mindnlp ``` #### Daily build You can download MindNLP daily wheel from [here](https://repo.mindspore.cn/mindspore-lab/mindnlp/newest/any/). #### Install from source To install MindNLP from source, please run: ```bash pip install git+https://github.com/mindspore-lab/mindnlp.git # or git clone https://github.com/mindspore-lab/mindnlp.git cd mindnlp bash scripts/build_and_reinstall.sh ``` #### Version Compatibility | MindNLP version | MindSpore version | Supported Python version | |-----------------|-------------------|--------------------------| | master | daily build | >=3.7.5, <=3.9 | | 0.1.1 | >=1.8.1, <=2.0.0 | >=3.7.5, <=3.9 | | 0.2.x | >=2.1.0 | >=3.8, <=3.9 | | 0.3.x | >=2.1.0, <=2.3.1 | >=3.8, <=3.9 | | 0.4.x | >=2.2.x, <=2.5.0 | >=3.9, <=3.11 | | 0.5.x | >=2.5.0, <=2.7.0 | >=3.10, <=3.11 | | MindNLP version | MindSpore version | Supported Python version | |-----------------|-------------------|--------------------------| | 0.6.x | >=2.7.1. | >=3.10, <=3.11 | ## Supported models Since there are too many supported models, please check [here](https://mindnlp.cqu.ai/supported_models) ## License This project is released under the [Apache 2.0 license](LICENSE). ## Feedbacks and Contact The dynamic version is still under development, if you find any issue or have an idea on new features, please don't hesitate to contact us via [Github Issues](https://github.com/mindspore-lab/mindnlp/issues). ## MindSpore NLP SIG MindSpore NLP SIG (Natural Language Processing Special Interest Group) is the main development team of the MindNLP framework. It aims to collaborate with developers from both industry and academia who are interested in research, application development, and the practical implementation of natural language processing. Our goal is to create the best NLP framework based on the domestic framework MindSpore. Additionally, we regularly hold NLP technology sharing sessions and offline events. Interested developers can join our SIG group using the QR code below.
## Acknowledgement MindSpore is an open source project that welcomes any contribution and feedback. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible as well as standardized toolkit to re-implement existing methods and develop their own new semantic segmentation methods. ## Citation If you find this project useful in your research, please consider citing: ```latex @misc{mindnlp2022, title={{MindNLP}: Easy-to-use and high-performance NLP and LLM framework based on MindSpore}, author={MindNLP Contributors}, howpublished = {\url{https://github.com/mindspore-lab/mindnlp}}, year={2022} } ```