From 8b464ff85d3393172578a4340100994adbcee6fe Mon Sep 17 00:00:00 2001 From: Yuta Hayashibe Date: Tue, 8 Nov 2022 23:19:37 +0900 Subject: [PATCH] Add description --- README.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 0ed169c..f7c8dd2 100644 --- a/README.md +++ b/README.md @@ -52,7 +52,8 @@ This interval is determined by the value of ``-n`` and its default is ``20``. When an interval is predicted as "silence", it will not be passed to whisper. If you want to disable VAD, please make VAD threshold 0 by adding ``--vad 0``. -By default, Whisper does not perform analysis until the total length of the segments determined by VAD to have speech exceeds 30 seconds. +By default, whispering does not perform analysis until the total length of the segments determined by VAD to have speech exceeds 30 seconds. +This is because the original Whisper assumes that the inputs are 30 seconds segments. However, if silence segments appear 16 times (the default value of ``--max_nospeech_skip``) after speech is detected, the analysis is performed. ## Example of web socket