mirror of
https://github.com/shirayu/whispering.git
synced 2024-11-25 10:21:00 +00:00
Streaming transcriber with whisper
.github | ||
whisper_streaming | ||
.gitignore | ||
.markdownlint.json | ||
LICENSE | ||
LICENSE.whisper | ||
Makefile | ||
package-lock.json | ||
package.json | ||
poetry.lock | ||
pyproject.toml | ||
README.md | ||
setup.cfg |
whisper_streaming
Streaming transcriber with whisper. Transcribing in real time, enough machine power is needed.
Example
# Setup
git clone https://github.com/shirayu/whisper_streaming.git
cd whisper_streaming
poetry install --only main
# If you use GPU with CUDA 11.6
poetry run pip install 'torch==1.12.0+cu116' -f https://download.pytorch.org/whl/torch_stable.html
# Run in English
poetry run whisper_streaming --language en --model base -n 20
--help
shows full options--language
sets the language to transcribe. The list of languages are shown withpoetry run whisper_streaming -h
-n
sets interval of parsing. Larger values can improve accuracy but consume more memory.--debug
outputs logs for debug
Tips
If you get OSError: PortAudio library not found
: Install portaudio
# Ubuntu
sudo apt-get install portaudio19-dev
License
- MIT License
- Some codes are ported from the original whisper. Its license is also MIT License