Changes between Version 4 and Version 5 of LanguageModelSources


Timestamp:
08/04/06 14:00:53 (15 years ago)
Author:
kmaclean
Comment:

--

  • LanguageModelSources

    v4 v5  
    1 1 == Possible sources of written data (written corpora) for the creation of Language Models ==
    2 2   * [http://www.gpoaccess.gov U.S. Government Printing Office]
    3    * [http://www.gutenberg.org Project Gutenberg]
    4    * [http://en.wikipedia.org Wikipedia Spoken Articles ] 
    5    * Hansard Canada 
    6      * [http://www.parl.gc.ca/common/Chamber_House_Debates.asp?Language=E&Parl=39&Ses=1 House of Commons] 
    7      * [http://www.parl.gc.ca/common/Chamber_Senate_Debates.asp?Language=E&Parl=39&Ses=1 Senate] 
    8    * [http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html Google Research word n-gram models and training corpora ] 
     3  * [http://www.gutenberg.org Project Gutenberg]
     4  * [http://en.wikipedia.org Wikipedia Spoken Articles ] 
     5  * Hansard Canada 
     6    * [http://www.parl.gc.ca/common/Chamber_House_Debates.asp?Language=E&Parl=39&Ses=1 House of Commons] 
     7    * [http://www.parl.gc.ca/common/Chamber_Senate_Debates.asp?Language=E&Parl=39&Ses=1 Senate] 
     8  * [http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html Google Research word n-gram models and training corpora ] 
     9  * [http://www-tech.mit.edu/Shakespeare/ Complete Works of William Shakespeare] 
     10  * [http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Project]