Ticket #93 (closed defect: wontfix)
Update docs to clarify dialect and accent
Reported by: | kmaclean | Owned by: | kmaclean |
---|---|---|---|
Priority: | minor | Milestone: | Unassigned |
Component: | Web Site | Version: | 0.1-alpha |
Keywords: | Cc: |
Description
See discussion entry re:dialect
The real purpose of categorizing speech by dialect is so we can created specialized Acoustic Models targeted to certain dialects or regions. We do this to reduce the size of the Acoustic Model, and to improve recognition accuracy because there is less variation in the sounds that the Acoustic Model was trained to recognize.
In your case, I think we need to make a distinction between 'dialect' and 'accent'. From Wikipedia, a dialect is a variety of language differing in vocabulary and grammar as well as pronunciation. Dialects are usually spoken by a group united by geography or class.
An accent may be any pronunciation that deviates from a standard language and pronunciation are defined by a group. Groups sharing an identifiable accent may be defined by any of a wide variety of common traits. An accent may be associated with the region in which its speakers reside (a geographical accent), the socio-economic status of its speakers, their ethnicity, their caste or social class, their first language (relative to the person hearing the accent - i.e. you may think I have an accent, and I may think you have an accent ...), and so on.
Correct me if I am wrong, but in your case, I think you speak in the European English dialect (the standard language in this case) with a Danish Accent. So when we are looking at creating an Acoustic Model for English speakers in Europe, we would include your voice because it is representative of the European English dialect. This will likely change as we get feedback on the performance of such an acoustic model.
I will update the docs to reflect this.
Hope that clarifies things, and thanks for the input.
Ken