Some Podcasts

The podcasts are taken from the PodcastFillers dataset, which consists of 199 full-length podcast episodes in English with manually annotated filler words and automatically generated transcripts. The audio recordings, sourced from SoundCloud, are CC-licensed and gender-balanced, and total 145 hours of audio from over 350 speakers.

This dataset does not include the PodcastFillers annotations, which are under a non-commercial license. See here for more details.

Length by license type

CC_BY 3.0: Total length: 73.6 h. Mean length: 44.2 min

CC_BY SA 3.0: Total length: 54.9 h. Mean length: 41.7 min

CC_BY ND 3.0: Total length: 16.7 h. Mean length: 50 min
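
As a quick sanity check on the figures above, the per-license totals can be summed and divided by the mean lengths to estimate how many episodes fall under each license. This is only a sketch: the per-license episode counts are not given in this card, so they are inferred here from rounded values, and the names `LICENSE_STATS` and `estimate_episode_counts` are made up for illustration.

```python
# Figures copied from the list above: (total hours, mean minutes per episode).
LICENSE_STATS = {
    "CC_BY 3.0": (73.6, 44.2),
    "CC_BY SA 3.0": (54.9, 41.7),
    "CC_BY ND 3.0": (16.7, 50.0),
}


def estimate_episode_counts(stats):
    """Estimate episodes per license as total minutes / mean minutes, rounded.

    The inputs are rounded published figures, so the result is an estimate.
    """
    return {
        name: round(total_h * 60 / mean_min)
        for name, (total_h, mean_min) in stats.items()
    }


total_hours = sum(total_h for total_h, _ in LICENSE_STATS.values())
episode_counts = estimate_episode_counts(LICENSE_STATS)

print(f"Total audio: {total_hours:.1f} h")
for name, count in episode_counts.items():
    print(f"{name}: about {count} episodes")
print(f"Estimated episodes overall: {sum(episode_counts.values())}")
```

Under these assumptions the estimate comes to roughly 100, 79, and 20 episodes, which adds up to the 199 episodes and about 145 hours stated in the description.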

License

See here for more details. The licenses are also in the metadata.

Citation Information

@inproceedings{Zhu:FillerWords:INTERSPEECH:22,
 title = {Filler Word Detection and Classification: A Dataset and Benchmark},
 booktitle = {23rd Annual Cong.~of the Int.~Speech Communication Association (INTERSPEECH)},
 address = {Incheon, Korea}, 
 month = {Sep.},
 url = {https://arxiv.org/abs/2203.15135},
 author = {Zhu, Ge and Caceres, Juan-Pablo and Salamon, Justin},
 year = {2022},
}

Contributions

Thanks to @ylacombe for adding this dataset.
