whispering/README.md


# whisper_streaming (beta version)

[![CI](https://github.com/shirayu/whisper_streaming/actions/workflows/ci.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/ci.yml)
[![CodeQL](https://github.com/shirayu/whisper_streaming/actions/workflows/codeql-analysis.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/codeql-analysis.yml)
[![Typos](https://github.com/shirayu/whisper_streaming/actions/workflows/typos.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/typos.yml)

Streaming transcriber with [whisper](https://github.com/openai/whisper).
Transcribing in real time, enough machine power is needed.

## Setup

```bash
git clone https://github.com/shirayu/whisper_streaming.git
cd whisper_streaming
poetry install --only main

# If you use GPU, install proper torch and torchaudio with "poetry run pip install -U"
# Example : torch for CUDA 11.6
poetry run pip install -U torch torchaudio --extra-index-url https://download.pytorch.org/whl/cu116
```

## Example of microphone

```bash
# Run in English
poetry run whisper_streaming --language en --model base
```

- ``--help`` shows full options
- ``--language`` sets the language to transcribe. The list of languages are shown with ``poetry run whisper_streaming -h``
- ``-t`` sets temperatures to decode. You can set several like (``-t 0.0 -t 0.1 -t 0.5``), but too many temperatures exhaust decoding time
- ``--debug`` outputs logs for debug

### Parse interval

If you want quick response, set small ``-n`` and add ``--allow-padding``.
However, this may be at the sacrifice of accuracy.

```bash
poetry run whisper_streaming --language en --model base -n 20 --allow-padding
```

## Example of web socket

⚠  **No security mechanism. Please make secure with your responsibility.**

### Host

Run with ``--host`` and ```--port``. You can set``-n`` and other options as use with microphone.

```bash
poetry run whisper_streaming --language en --model base --host 0.0.0.0 --port 8000
```

### Client

```bash
poetry run python -m whisper_streaming.websocket_client --host ADDRESS_OF_HOST --port 8000 
```

## Tips

## PortAudio Error

If you get ``OSError: PortAudio library not found``: Install ``portaudio``

```bash
# Ubuntu
sudo apt-get install portaudio19-dev
```

## License

- [MIT License](LICENSE)
- Some codes are ported from the original whisper. Its license is also [MIT License](LICENSE.whisper)
Initial commit 2022-09-23 10:20:11 +00:00
Add beta version 2022-09-24 05:34:03 +00:00			`# whisper_streaming (beta version)`
Initial commit 2022-09-23 10:20:11 +00:00
			`[![CI](https://github.com/shirayu/whisper_streaming/actions/workflows/ci.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/ci.yml)`
			`[![CodeQL](https://github.com/shirayu/whisper_streaming/actions/workflows/codeql-analysis.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/codeql-analysis.yml)`
			`[![Typos](https://github.com/shirayu/whisper_streaming/actions/workflows/typos.yml/badge.svg)](https://github.com/shirayu/whisper_streaming/actions/workflows/typos.yml)`

Updated README 2022-09-24 00:38:40 +00:00			`Streaming transcriber with [whisper](https://github.com/openai/whisper).`
			`Transcribing in real time, enough machine power is needed.`
Initial commit 2022-09-23 10:20:11 +00:00
Updated README 2022-09-24 15:46:37 +00:00			`## Setup`
Initial commit 2022-09-23 10:20:11 +00:00
			```bash
			`git clone https://github.com/shirayu/whisper_streaming.git`
			`cd whisper_streaming`
			`poetry install --only main`
Add description 2022-09-23 13:19:53 +00:00
Fix a typo 2022-09-24 03:51:07 +00:00			`# If you use GPU, install proper torch and torchaudio with "poetry run pip install -U"`
Fix instruction 2022-09-24 03:38:37 +00:00			`# Example : torch for CUDA 11.6`
Updated 2022-09-24 03:44:14 +00:00			`poetry run pip install -U torch torchaudio --extra-index-url https://download.pytorch.org/whl/cu116`
Updated README 2022-09-24 15:46:37 +00:00			```

			`## Example of microphone`
Add a setup step 2022-09-24 02:02:40 +00:00
Updated README 2022-09-24 15:46:37 +00:00			```bash
Add --language description 2022-09-24 00:26:37 +00:00			`# Run in English`
Fix -n (Resolve #3) 2022-09-24 06:39:41 +00:00			`poetry run whisper_streaming --language en --model base`
Initial commit 2022-09-23 10:20:11 +00:00			```

Updated README 2022-09-24 00:38:40 +00:00			- ``--help`` shows full options
Fix a typo 2022-09-24 00:31:13 +00:00			- ``--language`` sets the language to transcribe. The list of languages are shown with ``poetry run whisper_streaming -h``
Add README 2022-09-24 04:58:39 +00:00			- ``-t`` sets temperatures to decode. You can set several like (``-t 0.0 -t 0.1 -t 0.5``), but too many temperatures exhaust decoding time
Add message 2022-09-23 13:46:27 +00:00			- ``--debug`` outputs logs for debug
Add description 2022-09-23 13:19:53 +00:00
Updated README 2022-09-24 15:46:37 +00:00			`### Parse interval`

			If you want quick response, set small ``-n`` and add ``--allow-padding``.
			`However, this may be at the sacrifice of accuracy.`

			```bash
			`poetry run whisper_streaming --language en --model base -n 20 --allow-padding`
			```

Add README (Resolve #7) 2022-09-24 12:54:25 +00:00			`## Example of web socket`

Updated 2022-09-24 12:57:46 +00:00			`⚠ No security mechanism. Please make secure with your responsibility.`
Add README (Resolve #7) 2022-09-24 12:54:25 +00:00
Updated 2022-09-24 15:48:27 +00:00			`### Host`

			Run with ``--host`` and ```--port``. You can set``-n`` and other options as use with microphone.

Add README (Resolve #7) 2022-09-24 12:54:25 +00:00			```bash
			`poetry run whisper_streaming --language en --model base --host 0.0.0.0 --port 8000`
			```

Updated 2022-09-24 15:48:27 +00:00			`### Client`

Add README (Resolve #7) 2022-09-24 12:54:25 +00:00			```bash
Fix a typo 2022-09-24 12:56:17 +00:00			`poetry run python -m whisper_streaming.websocket_client --host ADDRESS_OF_HOST --port 8000`
Add README (Resolve #7) 2022-09-24 12:54:25 +00:00			```

Initial commit 2022-09-23 10:20:11 +00:00			`## Tips`

Add README 2022-09-24 04:58:39 +00:00			`## PortAudio Error`

Initial commit 2022-09-23 10:20:11 +00:00			If you get ``OSError: PortAudio library not found``: Install ``portaudio``

			```bash
			`# Ubuntu`
			`sudo apt-get install portaudio19-dev`
			```

			`## License`

			`- [MIT License](LICENSE)`
			`- Some codes are ported from the original whisper. Its license is also [MIT License](LICENSE.whisper)`