kotoba-tech
/

kotoba-whisper-v1.1

@@ -6,9 +6,6 @@ tags:
 - audio
 - automatic-speech-recognition
 - hf-asr-leaderboard
-metrics:
-- wer
-- cer
 widget:
 - example_title: CommonVoice 8.0 (Test Split)
   src: >-
@@ -20,45 +17,6 @@ widget:
   src: >-
     https://huggingface.co/datasets/japanese-asr/ja_asr.reazonspeech_test/resolve/main/sample.flac
 pipeline_tag: automatic-speech-recognition
-model-index:
-- name: kotoba-tech/kotoba-whisper-v1.1
-  results:
-  - task:
-      type: automatic-speech-recognition
-    dataset:
-      name: CommonVoice_8.0 (Japanese)
-      type: japanese-asr/ja_asr.common_voice_8_0
-    metrics:
-    - type: WER
-      value: 59.27
-      name: WER
-    - type: CER
-      value: 9.44
-      name: CER
-  - task:
-      type: automatic-speech-recognition
-    dataset:
-      name: ReazonSpeech (Test)
-      type: japanese-asr/ja_asr.reazonspeech_test
-    metrics:
-    - type: WER
-      value: 56.62
-      name: WER
-    - type: CER
-      value: 12.6
-      name: CER
-  - task:
-      type: automatic-speech-recognition
-    dataset:
-      name: JSUT Basic5000
-      type: japanese-asr/ja_asr.jsut_basic5000
-    metrics:
-    - type: WER
-      value: 64.36
-      name: WER
-    - type: CER
-      value: 8.48
-      name: CER
 datasets:
 - japanese-asr/whisper_transcriptions.reazonspeech.large
 - japanese-asr/whisper_transcriptions.reazonspeech.large.wer_10.0
@@ -77,13 +35,25 @@ Following table presents the raw CER (unlike usual CER where the punctuations ar
 along with the.
-| model                                                    |   CommonVoice 8.0 (Japanese) |   JSUT Basic 5000 |  ReazonSpeech Test |
-|:---------------------------------------------------------|---------------------------------------:|-------------------------------------:|----------------------------------------:|
-| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (punctuator + stable-ts) |                                   13.7 |                                 11.2 |                                    17.4 |
-| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (punctuator)             |                                   13.9 |                                 11.4 |                                    18   |
-| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (stable-ts)              |                                   15.7 |                                 15   |                                    17.7 |
-| [kotoba-tech/kotoba-whisper-v1.0](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0)                          |                                   15.6 |                                 15.2 |                                    17.8 |
-| [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3)                                  |                                   12.9 |                                 13.4 |                                    20.6 |
 Regarding to the normalized CER, since those update from v1.1 will be removed by the normalization, kotoba-tech/kotoba-whisper-v1.1 marks the same CER values as [kotoba-tech/kotoba-whisper-v1.0](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0).

 - audio
 - automatic-speech-recognition
 - hf-asr-leaderboard
 widget:
 - example_title: CommonVoice 8.0 (Test Split)
   src: >-
   src: >-
     https://huggingface.co/datasets/japanese-asr/ja_asr.reazonspeech_test/resolve/main/sample.flac
 pipeline_tag: automatic-speech-recognition
 datasets:
 - japanese-asr/whisper_transcriptions.reazonspeech.large
 - japanese-asr/whisper_transcriptions.reazonspeech.large.wer_10.0
 along with the.
+| model                                                                                                                                             |   [CommonVoice 8 (Japanese test set)](https://huggingface.co/datasets/japanese-asr/ja_asr.common_voice_8_0) |   [JSUT Basic 5000](https://huggingface.co/datasets/japanese-asr/ja_asr.jsut_basic5000) |   [ReazonSpeech (held out test set)](https://huggingface.co/datasets/japanese-asr/ja_asr.reazonspeech_test) |
+|:--------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------:|----------------------------------------------------------------------------------------:|------------------------------------------------------------------------------------------------------------:|
+| [kotoba-tech/kotoba-whisper-v2.0](https://huggingface.co/kotoba-tech/kotoba-whisper-v2.0)                                                         |                                                                                                        17.6 |                                                                                    15.4 |                                                                                                        17.4 |
+| [kotoba-tech/kotoba-whisper-v2.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v2.1)                                                         |                                                                                                        17.7 |                                                                                    15.4 |                                                                                                        17   |
+| [kotoba-tech/kotoba-whisper-v2.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v2.1) (punctuator + stable-ts)                                |                                                                                                        17.7 |                                                                                    15.4 |                                                                                                        17   |
+| [kotoba-tech/kotoba-whisper-v2.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v2.1) (punctuator)                                            |                                                                                                        17.7 |                                                                                    15.4 |                                                                                                        17   |
+| [kotoba-tech/kotoba-whisper-v2.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v2.1) (stable-ts)                                             |                                                                                                        17.7 |                                                                                    15.4 |                                                                                                        17   |
+| [kotoba-tech/kotoba-whisper-v1.0](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0)                                                         |                                                                                                        17.8 |                                                                                    15.2 |                                                                                                        17.8 |
+| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1)                                                         |                                                                                                        17.9 |                                                                                    15   |                                                                                                        17.8 |
+| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (punctuator + stable-ts)                                |                                                                                                        17.9 |                                                                                    15   |                                                                                                        17.8 |
+| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (punctuator)                                            |                                                                                                        17.9 |                                                                                    15   |                                                                                                        17.8 |
+| [kotoba-tech/kotoba-whisper-v1.1](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.1) (stable-ts)                                             |                                                                                                        17.9 |                                                                                    15   |                                                                                                        17.8 |
+| [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3)                                                                         |                                                                                                        15.3 |                                                                                    13.4 |                                                                                                        20.5 |
+| [openai/whisper-large-v2](https://huggingface.co/openai/whisper-large-v2)                                                                         |                                                                                                        15.9 |                                                                                    10.6 |                                                                                                        34.6 |
+| [openai/whisper-large](https://huggingface.co/openai/whisper-large)                                                                               |                                                                                                        16.6 |                                                                                    11.3 |                                                                                                        40.7 |
+| [openai/whisper-medium](https://huggingface.co/openai/whisper-medium)                                                                             |                                                                                                        17.9 |                                                                                    13.1 |                                                                                                        39.3 |
+| [openai/whisper-base](https://huggingface.co/openai/whisper-base)                                                                                 |                                                                                                        34.5 |                                                                                    26.4 |                                                                                                        76   |
+| [openai/whisper-small](https://huggingface.co/openai/whisper-small)                                                                               |                                                                                                        21.5 |                                                                                    18.9 |                                                                                                        48.1 |
+| [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny)                                                                                 |                                                                                                        58.8 |                                                                                    38.3 |                                                                                                       153.3 |
 Regarding to the normalized CER, since those update from v1.1 will be removed by the normalization, kotoba-tech/kotoba-whisper-v1.1 marks the same CER values as [kotoba-tech/kotoba-whisper-v1.0](https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0).