Version 34 (modified by kmaclean, 16 years ago)
Here is a list of possible sources of spoken audio files that might be used to create GPL acoustic models.
- Audio Source list:
- Gutenberg audio project
- Wikipedia Spoken Articles
- CMU:
- Festvox:
- Festvox databases
- CMU ARCTIC (no restrictions)
- CMU_FAF (Facts and Fables) database
- CMU_SIN (Speech in Noise) database
- CMU Chaplain (for research only)
- Diphone Databases
- ldom Databases (Limited Domain)
- Festvox databases
- ISIP/CAVS Switchboard
- American Rhetoric
- TalkBank Audio Files (GNU license)
- Hansard Canada (Audio feeds on day of debate)
- MICASE Michigan Corpus of Academic Spoken English
- AUE - alt-usage-english
- Voxeo Telephony Audio Files
- Internet Archive's collection of audio recordings
- Links
Other possible sources, but with licensing issues:
- Buckeye Corpus
- CSLU Speech Synthesis Research Group:
- OGIresLPC 2.1.0 voices (voice data not released yet - only for research/personal use ...)
- CMU
- Let's Go Speech Dialog Data (license for research only)
- Festvox
- CSTR US KED Timit (for research, educational and individual use only)
(Also see Ticket #22)