mirror of https://github.com/shirayu/whispering.git, synced 2024-11-25 10:21:00 +00:00
Add description
parent 74ed23a9e3
commit 8b464ff85d
1 changed file with 2 additions and 1 deletion
@@ -52,7 +52,8 @@ This interval is determined by the value of ``-n`` and its default is ``20``.
 When an interval is predicted as "silence", it will not be passed to whisper.
 If you want to disable VAD, please make VAD threshold 0 by adding ``--vad 0``.
 
-By default, Whisper does not perform analysis until the total length of the segments determined by VAD to have speech exceeds 30 seconds.
+By default, whispering does not perform analysis until the total length of the segments determined by VAD to have speech exceeds 30 seconds.
+This is because the original Whisper assumes that the inputs are 30 seconds segments.
 However, if silence segments appear 16 times (the default value of ``--max_nospeech_skip``) after speech is detected, the analysis is performed.
 
 ## Example of web socket
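The buffering rule described in the changed README lines can be sketched roughly as follows. This is a minimal illustration only, not whispering's actual implementation: the class `SpeechBuffer`, its method `push`, and the per-interval `duration_sec` argument are hypothetical names. It also assumes that silence intervals are counted cumulatively (not necessarily consecutively) once speech has been seen, and that "exceeds" means strictly greater than 30 seconds; the README text does not settle either point.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SpeechBuffer:
    """Hypothetical sketch of the README's buffering rule (not whispering's code)."""

    min_speech_sec: float = 30.0  # Whisper assumes 30-second inputs
    max_nospeech_skip: int = 16   # README default of --max_nospeech_skip
    _segments: List[float] = field(default_factory=list)
    _nospeech_count: int = 0

    def push(self, duration_sec: float, is_speech: bool) -> Optional[List[float]]:
        """Feed one VAD-classified interval.

        Returns the buffered speech durations when analysis should run,
        otherwise None.
        """
        if is_speech:
            self._segments.append(duration_sec)
            # Trigger once the accumulated speech exceeds the threshold.
            if sum(self._segments) > self.min_speech_sec:
                return self._flush()
            return None
        # Silence is never buffered; it only counts after speech was detected.
        if self._segments:
            self._nospeech_count += 1
            if self._nospeech_count >= self.max_nospeech_skip:
                return self._flush()
        return None

    def _flush(self) -> List[float]:
        # Hand back the buffered segments and reset all state.
        out, self._segments = self._segments, []
        self._nospeech_count = 0
        return out


# Usage: one short speech interval followed by 16 silent ones triggers analysis
# even though far less than 30 seconds of speech has been collected.
buf = SpeechBuffer()
buf.push(2.0, True)
ready = None
for _ in range(16):
    ready = buf.push(2.0, False)
```

In this sketch, `ready` stays `None` until the sixteenth silent interval, at which point the single buffered speech interval is handed back for analysis.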