graviti logoProductOpen DatasetsAbout
Request DemoSign in
201
0
0
THCHS-30
General
Discussion
Code
Activities
c77a92cd-8cd1-11eb-88ae-0e1f58d5e9a9
bc9b1f2·
Jun 20, 2021 7:38 AM
·1Commits

Overview

Speech data is crucially important for speech recognition research. There are quite some speech databases that can be purchased at prices that are reasonable for most research institutes. However, for young people who just start research activities or those who just gain initial interest in this direction, the cost for data is still an annoying barrier. We support the `free data' movement in speech recognition: research institutes (particularly supported by public funds) publish their data freely so that new researchers can obtain sufficient data to kick of their career.Here, we follow this trend and release a free Chinese speech database THCHS-30 that can be used to build a full- edged Chinese speech recognition system.

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/WangZ15e,
  author    = {Dong Wang and
               Xuewei Zhang},
  title     = {{THCHS-30} : {A} Free Chinese Speech Corpus},
  journal   = {CoRR},
  volume    = {abs/1512.01882},
  year      = {2015},
  url       = {http://arxiv.org/abs/1512.01882},
  archivePrefix = {arXiv},
  eprint    = {1512.01882},
  timestamp = {Mon, 13 Aug 2018 16:46:59 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/WangZ15e.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}
🎉Many thanks to Graviti Open Datasets for contributing the dataset
Basic Information
Application ScenariosNot Available
AnnotationsNot Available
TasksNot Available
LicenseCustom
Updated on2021-01-20 05:27:42
Metadata
Data TypeNot Available
Data Volume0
Annotation Amount0
File Size0B
Copyright Owner
CSLT at Tsinghua University
Annotator
Unknown
More Support Options
Start building your AI now
Get StartedContact