From fe1526b32e17a9006d28a22b27afd266aae91c14 Mon Sep 17 00:00:00 2001 From: laubonghaudoi Date: Fri, 7 Feb 2025 14:12:57 -0800 Subject: [PATCH] Update index.html --- index.html | 62 ++++++++++++++++++++++++++++++++++++++++++------------ 1 file changed, 48 insertions(+), 14 deletions(-) diff --git a/index.html b/index.html index 2a89bfb..3e3a64f 100644 --- a/index.html +++ b/index.html @@ -98,7 +98,7 @@

Total Duration

- 104.64 個鐘 hours
(6278.20 分鐘 minutes) + 110.10 個鐘 hours
(6605.83 分鐘 minutes)

@@ -106,7 +106,7 @@

總字數(含標點)
Total # Characters (including punctuation)

-

1,561,789

+

1,642,902

@@ -121,15 +121,16 @@

介紹 Introduction

本數據集由廣州最出名嘅話劇演員、説書藝人(講古佬)張悦楷喺 1980 - 年代電台播講《三國演義》嘅錄音製成。數據集所有文本均由人工轉寫,並根據《三國演義》原文校對嚟確保準確性。 + 年代電台播講《三國演義》《水滸傳》《走進毛澤東的最後歲月》嘅錄音製成。數據集所有文本均由人工轉寫,並根據原文校對嚟確保準確性。

This dataset was made from recordings of Zoeng Jyut Gaai, the most famous drama actor and storyteller in Canton, storytelling - Romance of the Three Kingdoms during the 1980s. All texts - in the dataset were transcribed manually and proofread according to - the original text of Romance of the Three Kingdoms to - ensure accuracy. + Romance of the Three Kingdoms, Water Margin and + The Final Days of Mao Zedong during the 1980s. All texts in + the dataset were transcribed manually and proofread according to the + original text of Romance of the Three Kingdoms to ensure + accuracy.

本數據集可用於各種用途,例如語音合成(TTS)、語音識別(ASR)、語言模型(LLM)、語言學分析等等。數據統計 Statistics - 全集 Full + 全集 Total 三國演義 saamgwokjinji @@ -298,6 +299,9 @@

數據統計 Statistics

水滸傳 seoiwuzyun + + 走進毛澤東的最後歲月 mouzaakdung + @@ -306,7 +310,7 @@

數據統計 Statistics

總時長 Total Duration (個鐘 hours | 分鐘 minutes) - 104.64 | 6278.20 + 110.10 | 6605.83 66.01 | 3960.73 @@ -314,13 +318,16 @@

數據統計 Statistics

38.62 | 2317.43 + + 5.46 | 327.62 + 平均音頻時長 Average Clip Duration (秒 seconds) - 5.893 + 5.899 6.067 @@ -328,13 +335,16 @@

數據統計 Statistics

5.619 + + 6.004 + 中位音頻時長 Median Clip Duration (秒 seconds) - 5.436 + 5.441 5.607 @@ -342,6 +352,9 @@

數據統計 Statistics

5.198 + + 5.546 + @@ -356,6 +369,9 @@

數據統計 Statistics

0.322 + + 0.925 + @@ -370,6 +386,9 @@

數據統計 Statistics

33.144 + + 19.040 + @@ -377,7 +396,7 @@

數據統計 Statistics

punctuation - 24.33 + 24.35 24.00 @@ -385,6 +404,9 @@

數據統計 Statistics

24.86 + + 24.77 + @@ -400,13 +422,16 @@

數據統計 Statistics

23 + + 23 + 文本總字數,含標點 Total Characters, including punctuation - 1561789 + 1642902 952427 @@ -414,13 +439,16 @@

數據統計 Statistics

621682 + + 81113 + 覆蓋漢字數 Unique Chinese Characters Coverage - 4406 + 4562 3993 @@ -428,6 +456,9 @@

數據統計 Statistics

3520 + + 2538 + @@ -443,6 +474,9 @@

數據統計 Statistics

4.47 + + 4.13 +