HTK3.0 batch and streaming recognition demo

01/08/2002: first.cvl.iis.u-tokyo.ac.jp (tested on Linux-2.4.16 for Japanese version)
03/16/2002: octopus.ius.cs.cmu.edu (tested on Linux-2.2.20 for English version)


Preface

The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. It is primarily used for speech recognition, but it can be used for other applications as well. In the following example, I modified HVite.c slightly and used the OpenExtBuffer() function to demonstrate recognition from streaming keyboard input. Please keep in mind that I deliberately kept this demo simple and minimal. If you are considering using the HTK, I strongly recommend that you go over The HTK Book before applying what's on this page to your problem domain.

Preparation


Training/Recognizing a user-defined dataset

I basically followed Chapter 3, "A Tutorial Example of Using HTK", of The HTK Book and trimmed it down for this simple training/recognition task.


Extension

You should be able to extend this by adding differential (delta) components of the data to the features used for training/recognition, by using a bigram/n-gram language model, and so on. A good place to start would be Chapter 3 of The HTK Book.
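
To make the "differential component" idea concrete, here is a minimal sketch in Python. It is not part of the demo and not HTK code. It assumes the differential component means the delta coefficients that HTK appends when the _D qualifier is added to the parameter kind, computed with the regression formula given in The HTK Book, with the first and last frames replicated at the edges. The function name delta and the window argument are my own choices for illustration.

```python
def delta(frames, window=2):
    """Regression-based delta coefficients for a sequence of feature vectors.

    frames: list of equal-length lists of floats, one list per time step.
    window: number of frames looked at on each side (hypothetical parameter
            name; it plays the role of the delta window size).
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    # Normalising term: 2 * sum of k^2 for k = 1..window
    denom = 2.0 * sum(k * k for k in range(1, window + 1))
    out = []
    for t in range(n):
        d = [0.0] * dim
        for k in range(1, window + 1):
            # Replicate the first/last frame when the window runs off the ends
            ahead = frames[min(t + k, n - 1)]
            behind = frames[max(t - k, 0)]
            for i in range(dim):
                d[i] += k * (ahead[i] - behind[i])
        out.append([x / denom for x in d])
    return out


def append_deltas(frames, window=2):
    """Return each frame concatenated with its delta coefficients."""
    return [f + d for f, d in zip(frames, delta(frames, window))]
```

Calling append_deltas() on a sequence of 2-dimensional keyboard features would give 4-dimensional observations, so the vector size in the HMM prototype would have to be doubled to match.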

- soshi


Soshi Iba <iba+@cmu.edu>
Last modified: Wed May 8 05:48:21 EDT 2002