Archive for the 'Notes' Category



As it comes with the Sphinx-4 package, the HelloWorld.jar example only recognizes the following words:
(Good morning | Hello)
( Bhiksha | Evandro | Paul | Philip | Rita | Will )
To expand this vocabulary, we need to modify a grammar file that the JSGFGrammar class imports, and then we need to rebuild the HelloWorld.jar using ant.
In […]

Sphinx 4 Architecture

( diagram taken from Sphinx-4: A Flexible Open Source Framework
for Speech Recognition )
NOTE: This post is not complete.
The beauty of the Sphinx 4 architecture is its modularity and pluggability. Previously, speech recognition programs were built to fulfill specific roles: continuous speech vs. non-continuous, large vocabulary vs. smaller vocabulary, etc. Now with Sphinx 4, […]

Bluetooth Headset results

Bluetooth headsets could turn out to be the most widely available interface for speaking to the computer, so we wanted to do some tests to see if the demos that came with Sphinx-4 would work with a headset.
One useful thing to note is that I had to jump through some hoops to get […]

I found one useful summary of what speech recognition is here. It details the types of speech recognizers, including speaker-independent, speaker-dependent, continuous speech recognition, isolated speech recognition, and vocabulary constrained system. As the technology exists now, it seems that one has to figure out a compromise between vocabulary size that the computer can […]




About

You are currently browsing the voicerecog weblog archives for the 'Notes' category.

Longer entries are truncated. Click the headline of an entry to read it in its entirety.
"

Categories