VoxForge Dev
wiki: LanguageModelSources
Version 6 (modified by kmaclean, 16 years ago)
Possible sources of written data (written corpora) for the creation of Language Models
U.S. Government Printing Office
Project Gutenberg
Wikipedia Spoken Articles
Hansard Canada (House of Commons, Senate)
Google Research word n-gram models and training corpora
Complete Works of William Shakespeare
Moby Project
Google Books