Ticket #376 (new defect)
Nightly Build Acoustic Model Performance Decrease
Reported by: | kmaclean | Owned by: | kmaclean |
---|---|---|---|
Priority: | major | Milestone: | Acoustic Model 0.1.2 |
Component: | Acoustic Model | Version: | Acoustic Model 0.1.1 |
Keywords: | Cc: |
Description (last modified by kmaclean) (diff)
See this thread - segmentation scripts using the most current version of the nightly acoustic model builds not very accurate - had to go back to one of the formal releases of the VoxForge? acoustic model.
Likely caused by submissions that contain noise or are improperly transcribed... these need to be fixed.
Change History
comment:2 Changed 14 years ago by kmaclean
More updates from Dano
douglaid-20080219: incorrect prompt lines (the prompt 5 is skipped) 5= 6 6 = 7 until douglaid-20080219/mfc/vf11-16 THE ADDED WEIGHT HAD A VELOCITY OF FIFTEEN MILES PER HOUR (15 and 16 are equal)) G-20080425-itf/wav/b0002 a little tap in the beginning xaviergonz-20080419-uje a0398 seems good, record of a0404 begins too late (the p of PERRAULT is not recorded.) ductapeguy-20070308b/wav/bab.0023 seems good. peterwhy-20080503-win/mfc/win0151 seems good, but I think they are two phrases, so he stops a while after lunch. (peterwhy-20080503-win/mfc/win0150 NOR YOU EITHER IF YOU'VE GOT ANY SENSE AT ALL DON'T EVER REFER TO IT AGAIN PLEASE peterwhy-20080503-win/mfc/win0151 NOW THEN HERE'S OUR BACKWATER AT LAST WHERE WE'RE GOING TO LUNCH LEAVING THE MAIN STREAM peterwhy-20080503-win/mfc/win0152 THEY NOW PASSED INTO WHAT SEEMED AT FIRST SIGHT LIKE A LITTLE LAND LOCKED LAKE) anonymous-20080204-hnl (sounds like breathing in in the first part) anonymous-20080716 (little tap in sound) anonymous-20080630-lhi (blows in microphone)
more:
douglaid-20080219 is very serious as 5 6 7 8 9 10 11 12 13 14 15 are wrong. --- (Edited on 06-09-2008 4:08 pm [GMT+0200] by dano) --- some additional files. anonymous-20080630-lhi wav/a0285 blows in microphone gilrim-20080120-vgs (all) very noisy, but is comprehendable rjmunro-20080517-winwav/a0236 big tap Toyo-20080229-ogz.zip very bad: noisy and can not speak English mjmm-20080526-hca VERY noisy nestea247-20080301-sbn begins with tap corno1979-10102006-NR seems good, but isn't it required to have capitals instead of normal sentences? (I don't know, but the other prompts did have.) Mark_Reynolds-20070531-cc/mfc/cc-27 AND LAID HER ON HER RIGHT SIDE THEN SARAH CONFIRMED THE VET'S DIAGNOSIS instead of cc-27 AND LAID HER ON HER RIGHT SIDE THEN SARAH CONFIRMED THE VET'S DIAGNOSIS ? all prompts in this file cebidae-20080522-ns also previous thing, but says 'that' instead of 'last' and the last words are not good spoken.
comment:3 Changed 14 years ago by kmaclean
more from nsh:
Here is another list of suspicious prompts: douglaid-20080219/wav/vf11-07, douglaid-20080219/wav/vf11-08, knotyouraveragejo-20080426-adv/wav/adv0190, knotyouraveragejo-20080426-adv/wav/adv0308, kayray-20070611-leo/wav/leo0210, knotyouraveragejo-20080502-adv/wav/adv0280, Toyo-20080229-ogz.zip/wav/a0111, mjmm-20080526-hca/wav/b0074, mjmm-20080526-hca/wav/b0075, mjmm-20080526-hca/wav/b0076, mjmm-20080526-hca/wav/b0078, mjmm-20080526-hca/wav/b0079, mjmm-20080526-hca/wav/b0080, mjmm-20080526-hca/wav/b0081, mjmm-20080526-hca/wav/b0082, leonMire-20080526-lev/wav/lev0063, corno1979-10102006-NR/wav/cc020, corno1979-10102006-NR/wav/cc029, Mark_Reynolds-20070531-cc/wav/cc-27, kayray-20070608-rhi/wav/rhi0094, safi-20071118-swr/wav/b0216, starlite-20070605-che/wav/che0142, kayray-20070611-ele/wav/ele0262, robertburrelldonkin-200709011-vf11/wav/vf11-26, KnitGirl-20071113-dil/wav/b0274, gilrim-20080120-uxi/wav/a0093, gilrim-20080120-uxi/wav/a0096, gilrim-20080120-uxi/wav/a0101, ttm-20071024-poe/wav/js0002, topherfangio-20080604-jvb/wav/a0105, ductapeguy-20080423-ang/wav/sto0020, tis-20080416-tou/wav/voy0155, knotyouraveragejo-20080525-mt2/wav/mtn0261, vikramjb-20080416-cls/wav/a0398, vikramjb-20080416-cls/wav/a0399, vikramjb-20080416-cls/wav/a0400, vikramjb-20080416-cls/wav/a0402, vikramjb-20080416-cls/wav/a0403, vikramjb-20080416-cls/wav/a0404, vikramjb-20080416-cls/wav/a0405, vikramjb-20080416-cls/wav/a0406, CptOatmeal-20080721-vnh/wav/a0426, Joel-20080716-qoz/wav/b0074, Joel-20080716-qoz/wav/b0075, Joel-20080716-qoz/wav/b0076, Joel-20080716-qoz/wav/b0077, Joel-20080716-qoz/wav/b0078, Joel-20080716-qoz/wav/b0080, Joel-20080716-qoz/wav/b0081, Joel-20080716-qoz/wav/b0082, Joel-20080716-qoz/wav/b0083, anonymous-20071127-rln/wav/a0575, anonymous-20080318-eaq/wav/b0073, anonymous-20080318-eaq/wav/b0078, anonymous-20080318-eaq/wav/b0079, jaiger-20061231-vf7/wav/vf7-25,
Note: See
TracTickets for help on using
tickets.
Problems with corpus (identified by nsh);