# sherpa-ncnn

**Repository Path**: create_future/sherpa-ncnn

## Basic Information

- **Project Name**: sherpa-ncnn
- **Description**: A copy of sherpa-ncnn; copyright belongs to the original authors
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2022-11-16
- **Last Updated**: 2024-05-31

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# Introduction

**Documentation**:

Try it in colab:
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1zdNAdWgV5rh1hLbLDqvLjxTa5tjU7cPa?usp=sharing)

We provide two YouTube videos that demonstrate real-time speech recognition
from a microphone with `sherpa-ncnn`:

- `English`:
- `Chinese`:

**Note**: If you don't have access to YouTube, the same videos are available on bilibili:

- `English`:
- `Chinese`:

See

This repo uses [ncnn](https://github.com/tencent/ncnn) to run the neural network model and does not depend on libtorch.

Please read if you are interested in how the model is trained.

We provide exported models in ncnn format. They can be downloaded using the following links:

- English:
- Chinese:

## Build for Linux/macOS

```bash
git clone https://github.com/k2-fsa/sherpa-ncnn
cd sherpa-ncnn
mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=Release ..
make -j6
cd ..
```

## Download the pretrained model (Chinese)

**Caution**: You have to run `git lfs install` before cloning. Otherwise, the clone contains only small Git LFS pointer files instead of the model weights, and you will be **SAD** later.

```bash
git lfs install
git clone https://huggingface.co/csukuangfj/sherpa-ncnn-2022-09-30

./build/bin/sherpa-ncnn \
  ./sherpa-ncnn-2022-09-30/tokens.txt \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/test_wavs/0.wav

# You will find executables in ./build/bin/
```

## Build for Windows

```bash
git clone https://github.com/k2-fsa/sherpa-ncnn
cd sherpa-ncnn
mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=Release ..
cmake --build . --config Release
cd ..

# You will find executables in ./build/bin/Release
```

## Download the pretrained model (Chinese)

**Caution**: You have to run `git lfs install` before cloning. Otherwise, the clone contains only small Git LFS pointer files instead of the model weights, and you will be **SAD** later.

```bash
git lfs install
git clone https://huggingface.co/csukuangfj/sherpa-ncnn-2022-09-30

./build/bin/sherpa-ncnn \
  ./sherpa-ncnn-2022-09-30/tokens.txt \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/test_wavs/0.wav

# If you are using Windows, please use ./build/bin/Release/sherpa-ncnn.exe
# If you get encoding issues on Windows, please run
#   CHCP 65001
# in your command-line window.
```

To do speech recognition in real-time with a microphone, run:

```bash
./build/bin/sherpa-ncnn-microphone \
  ./sherpa-ncnn-2022-09-30/tokens.txt \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/encoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/decoder_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-30/joiner_jit_trace-epoch-11-avg-2-pnnx.ncnn.bin

# If you are using Windows, please use ./build/bin/Release/sherpa-ncnn-microphone.exe
```

## Download the pretrained model (English)

```bash
git lfs install
git clone https://huggingface.co/csukuangfj/sherpa-ncnn-2022-09-05

./build/bin/sherpa-ncnn \
  ./sherpa-ncnn-2022-09-05/tokens.txt \
  ./sherpa-ncnn-2022-09-05/bar/encoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/encoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-05/bar/decoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/decoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-05/bar/joiner_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/joiner_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-05/test_wavs/1089-134686-0001.wav

# If you are using Windows, please use ./build/bin/Release/sherpa-ncnn.exe
```

To do speech recognition in real-time with a microphone, run:

```bash
./build/bin/sherpa-ncnn-microphone \
  ./sherpa-ncnn-2022-09-05/tokens.txt \
  ./sherpa-ncnn-2022-09-05/bar/encoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/encoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-05/bar/decoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/decoder_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin \
  ./sherpa-ncnn-2022-09-05/bar/joiner_jit_trace-iter-468000-avg-16-pnnx.ncnn.param \
  ./sherpa-ncnn-2022-09-05/bar/joiner_jit_trace-iter-468000-avg-16-pnnx.ncnn.bin

# If you are using Windows, please use ./build/bin/Release/sherpa-ncnn-microphone.exe
# If you get encoding issues on Windows, please run
#   CHCP 65001
# in your command-line window.
```

[ncnn]: https://github.com/tencent/ncnn
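
Every command above passes the same leading arguments in the same order: `tokens.txt`, then the `.param` and `.bin` files of the encoder, decoder, and joiner. The sketch below wraps that pattern in a small shell function. `run_with_model` is a hypothetical helper and not part of sherpa-ncnn. It assumes the `<part>_jit_trace-<suffix>-pnnx.ncnn.<ext>` naming seen in the two model repositories above, and that any extra arguments are file paths such as a wave file. It also refuses to run if a file is still a Git LFS pointer, which is what a clone contains when `git lfs install` was skipped.

```bash
# Hypothetical helper, not part of sherpa-ncnn: run BINARY with tokens.txt and
# the six model files, in the order used by the commands in this README.
# usage: run_with_model BINARY TOKENS MODEL_DIR SUFFIX [EXTRA_FILES...]
run_with_model() {
  if [ "$#" -lt 4 ]; then
    echo "usage: run_with_model BINARY TOKENS MODEL_DIR SUFFIX [EXTRA_FILES...]" >&2
    return 2
  fi
  bin=$1 tokens=$2 dir=$3 suffix=$4
  shift 4   # whatever is left (e.g. a wave file) is passed through unchanged

  # Rebuild the argument list: tokens, model files, then the extra files.
  set -- "$tokens" \
    "$dir/encoder_jit_trace-${suffix}-pnnx.ncnn.param" \
    "$dir/encoder_jit_trace-${suffix}-pnnx.ncnn.bin" \
    "$dir/decoder_jit_trace-${suffix}-pnnx.ncnn.param" \
    "$dir/decoder_jit_trace-${suffix}-pnnx.ncnn.bin" \
    "$dir/joiner_jit_trace-${suffix}-pnnx.ncnn.param" \
    "$dir/joiner_jit_trace-${suffix}-pnnx.ncnn.bin" \
    "$@"

  for f in "$@"; do
    if [ ! -f "$f" ]; then
      echo "missing file: $f" >&2
      return 1
    fi
    # Without Git LFS, a clone holds small text pointers that start with
    # this line instead of the real file contents.
    if head -c 64 "$f" | grep -q '^version https://git-lfs'; then
      echo "Git LFS pointer, not real data: $f" >&2
      return 1
    fi
  done

  "$bin" "$@"
}

# Chinese model, decoding a test wave:
#   run_with_model ./build/bin/sherpa-ncnn \
#     ./sherpa-ncnn-2022-09-30/tokens.txt ./sherpa-ncnn-2022-09-30 epoch-11-avg-2 \
#     ./sherpa-ncnn-2022-09-30/test_wavs/0.wav
#
# English model, microphone input:
#   run_with_model ./build/bin/sherpa-ncnn-microphone \
#     ./sherpa-ncnn-2022-09-05/tokens.txt ./sherpa-ncnn-2022-09-05/bar iter-468000-avg-16
```

The tokens file and the model directory are separate parameters because the English model keeps its model files in a `bar/` subdirectory while `tokens.txt` sits at the top level of the repository.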