Updated document

2025-03-28 04:55:28 +00:00 · 2022-10-15 12:56:08 +09:00 · 2022-10-15 12:56:08 +09:00 · ae1dbd721c
commit ae1dbd721c
parent 91231811e7
1 changed files with 11 additions and 5 deletions
--- a/README.md
+++ b/README.md
@ -46,17 +46,23 @@ whispering --language en --model tiny

 ### Parse interval

-Without ``--allow-padding``, whispering just performs VAD for the period,
-and when it is predicted as "silence", it will not be passed to whisper.
-If you want to change the VAD interval, change ``-n``.
+By default, whispering performs VAD for every 3.75 second.
+This interval is determined by the value of ``-n`` and its default is ``20``.
+When an interval is predicted as "silence", it will not be passed to whisper.
+If you want to disable VAD, please use ``--no-vad`` option.

-If you want quick response, set small ``-n`` and add ``--allow-padding``.
-However, this may sacrifice the accuracy.
+By default, Whisper does not perform analysis until the total length of the segments determined by VAD to have speech exceeds 30 seconds.
+This is because Whisper is trained to make predictions for 30-second intervals.
+Nevertheless, if you want to force Whisper to perform analysis even if a segment is less than 30 seconds, please use ``--allow-padding`` option like this.

 ```bash
 whispering --language en --model tiny -n 20 --allow-padding
 ```

+This forces Whisper to analyze every 3.75 seconds speech segment.
+Using ``--allow-padding`` may sacrifice the accuracy, while you can get quick response.
+The smaller value of ``-n`` with ``--allow-padding`` is, the worse the accuracy becomes.
+
 ## Example of web socket

 ⚠  **No security mechanism. Please make secure with your responsibility.**