acmod_s Struct Reference

Acoustic model structure. More...

#include <acmod.h>


Data Fields

cmd_ln_t * config
 Configuration.
logmath_t * lmath
 Log-math computation.
glist_t strings
 Temporary acoustic model filenames.
fe_t * fe
 Acoustic feature computation.
feat_t * fcb
 Dynamic feature computation.
bin_mdef_tmdef
 Model definition.
tmat_ttmat
 Transition matrices.
ps_mgau_tmgau
 Model parameters.
ps_mllr_tmllr
 Speaker transformation.
int16 * senone_scores
 GMM scores for current frame.
bitvec_t * senone_active_vec
 Active GMMs in current frame.
uint8 * senone_active
 Array of deltas to active GMMs.
int senscr_frame
 Frame index for senone_scores.
int n_senone_active
 Number of active GMMs.
int log_zero
 Zero log-probability value.
mfcc_t ** mfc_buf
 Temporary buffer of acoustic features.
mfcc_t *** feat_buf
 Temporary buffer of dynamic features.
FILE * rawfh
 File for writing raw audio data.
FILE * mfcfh
 File for writing acoustic feature data.
uint8 state
 State of utterance processing.
uint8 compallsen
 Compute all senones?
uint8 grow_feat
 Whether to grow feat_buf.
uint8 reserved
int16 output_frame
 Index of next frame of dynamic features.
int16 n_mfc_alloc
 Number of frames allocated in mfc_buf.
int16 n_mfc_frame
 Number of frames active in mfc_buf.
int16 mfc_outidx
 Start of active frames in mfc_buf.
int16 n_feat_alloc
 Number of frames allocated in feat_buf.
int16 n_feat_frame
 Number of frames active in feat_buf.
int16 feat_outidx
 Start of active frames in feat_buf.


Detailed Description

Acoustic model structure.

This object encapsulates all stages of acoustic processing, from raw audio input to acoustic score output. The reason for grouping all of these modules together is that they all have to "agree" in their parameterizations, and the configuration of the acoustic and dynamic feature computation is completely dependent on the parameters used to build the original acoustic model (which should by now always be specified in a feat.params file).

Because there is not a one-to-one correspondence from blocks of input audio or frames of input features to frames of acoustic scores (due to dynamic feature calculation), results may not be immediately available after input, and the output results will not correspond to the last piece of data input.

TODO: In addition, this structure serves the purpose of queueing frames of features (and potentially also scores in the future) for asynchronous passes of recognition operating in parallel.

Definition at line 142 of file acmod.h.


The documentation for this struct was generated from the following file:
Generated on Mon Jan 24 21:50:16 2011 for PocketSphinx by  doxygen 1.4.7