Changes between Version 11 and Version 12 of AcousticModels


Ignore:
Timestamp:
01/01/07 20:01:20 (15 years ago)
Author:
kmaclean
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • AcousticModels

    v11 v12  
    44    * [http://www.inference.phy.cam.ac.uk/kv227/htk/acoustic_models.html Trained acoustic models for HTK] 
    55    * [http://www.inference.phy.cam.ac.uk/kv227/htk/acoustic_models.html Trained acoustic models for Sphinx] 
    6  
    7 == Acoustic Model Notes == 
    8  
    9 [http://www.speech.cs.cmu.edu/sphinx/models/hub4opensrc_jan2002/INFO_ABOUT_MODELS Sphinx Acoustic Models ] were trained using 140 hours of 1996 and 1997 hub4 training data.  VoxForge's goal for release 1.0 is to collect 140 hours of speech audio for the creation of Open Source Acoustic Models. 
    10  
    11 details from LDC site: 
    12   * [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC97S44 1996 English Broadcast News Speech (Hub-4)] - 104 hours of broadcasts 
    13   * [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC98S71 1997 English Broadcast News Speech (Hub-4)] - 97 hours of news broadcasts  
    14  
    15  
    16  
    17  
    18 == Estimating Storage requirements: == 
    19  
    20   * for 48kHz:16bit audio, 5 seconds of audio takes 500k. 
    21   * therefore about 6 meg per minute! 
    22   * if we want 140 hours of speech, we will need 50400 Meg or around 50.4Gig (assumes a 1000k per Meg), for Original data. 
    23   * Will likely need at least double that space with the propagation of audio (downsampling, noise reduction, etc.) through version control to create Acoustic Models - therefore need at least '''100Gig''' of storage to meet our stated objective. 
    24   * !VoxForge server currently holds 200 Gig, and, if needed, can easily add additional storage. 
    25   * Bandwidth is a greater issue, therefore we will require Peer-to-Peer sharing of audio files (i.e. Bittorrent) - see ticket #11. 
    26