mirror of https://github.com/shirayu/whispering.git synced 2025-02-22 13:26:19 +00:00

Streaming transcriber with whisper

Yuta Hayashibe 9cd80ab047 Fix language in DecodingOptions		2022-09-24 09:42:10 +09:00
.github	Initial commit	2022-09-23 19:20:29 +09:00
whisper_streaming	Fix language in DecodingOptions	2022-09-24 09:42:10 +09:00
.gitignore	Initial commit	2022-09-23 19:20:29 +09:00
.markdownlint.json	Initial commit	2022-09-23 19:20:29 +09:00
LICENSE	Initial commit	2022-09-23 19:20:29 +09:00
LICENSE.whisper	Initial commit	2022-09-23 19:20:29 +09:00
Makefile	Fix setting for isort	2022-09-23 20:05:33 +09:00
package-lock.json	Initial commit	2022-09-23 19:20:29 +09:00
package.json	Initial commit	2022-09-23 19:20:29 +09:00
poetry.lock	Initial commit	2022-09-23 19:20:29 +09:00
pyproject.toml	Initial commit	2022-09-23 19:20:29 +09:00
README.md	Updated README	2022-09-24 09:38:40 +09:00
setup.cfg	Fix setting for isort	2022-09-23 20:05:33 +09:00

README.md

whisper_streaming

Streaming transcriber with whisper. Real-time transcription requires sufficient machine power.

Example

# Setup
git clone https://github.com/shirayu/whisper_streaming.git
cd whisper_streaming
poetry install --only main

# Run in English
poetry run whisper_streaming --language en --model base -n 20

--help shows all options
--language sets the language to transcribe. The list of supported languages is shown with poetry run whisper_streaming -h
-n sets the parsing interval. Larger values can improve accuracy but consume more memory.
--debug outputs debug logs
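
To illustrate why a larger -n costs more memory, here is a minimal sketch. It assumes that -n counts fixed-size audio blocks buffered before each parse; that is an assumption, not something taken from the whisper_streaming source, and group_blocks is a hypothetical helper rather than part of the package.

```python
from typing import Iterable, Iterator, List


def group_blocks(blocks: Iterable[bytes], n: int) -> Iterator[bytes]:
    """Yield concatenated audio every n blocks (hypothetical helper).

    Assumption: the parsing interval is a count of audio blocks.
    Up to n blocks are held in memory at once, so a larger n means
    more context per parse but a larger buffer.
    """
    if n <= 0:
        raise ValueError("n must be a positive integer")
    pending: List[bytes] = []
    for block in blocks:
        pending.append(block)
        if len(pending) == n:
            # Hand the buffered audio to the transcriber in one piece.
            yield b"".join(pending)
            pending.clear()
    if pending:
        # Flush whatever remains when the stream ends.
        yield b"".join(pending)
```

With n = 2, three incoming blocks are handed off as one two-block segment followed by the one-block remainder.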

Tips

If you get OSError: PortAudio library not found, install PortAudio:

# Ubuntu
sudo apt-get install portaudio19-dev
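
To check whether the install worked before launching the transcriber, a small sketch like the following may help. It uses only the Python standard library and assumes the shared library is registered under the name portaudio, as the Ubuntu package does; portaudio_available is a hypothetical helper, not part of whisper_streaming.

```python
import ctypes.util


def portaudio_available() -> bool:
    """Return True if the system library loader can locate PortAudio.

    Assumption: the shared library is discoverable under the name
    "portaudio" (true for the Ubuntu portaudio19-dev package).
    """
    return ctypes.util.find_library("portaudio") is not None


if __name__ == "__main__":
    if portaudio_available():
        print("PortAudio found")
    else:
        print("PortAudio not found; install it first")
```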

License

MIT License
Some code is ported from the original whisper, which is also released under the MIT License.