The Kaldi* Statistical Language Model (SLM) Conversion Tool is a command line tool that converts Kaldi language model resources to the format supported by the OpenVINO Speech Recognition Demos.
kaldi_slm_convertion_tool HCLG.const.fst transitions.txt words.txt slm.fst labels.bin
HCLG.const.fst parameter is the input weighted finite-state tranducer (WFST) file in OpenFST const format.
Most example scripts create the language model file in that format. If you have a WFST in a different OpenFST format, it can be converted with the call:
The source Kaldi language model file
HCLG.fst can be found in directories like
tri2b is the name of the model used for speech recognition tests.
transitions.txt The WFST transitions file describes the relations between WFST transitions and neural acoustic model outputs. This file is usually not generated by Kaldi example scripts, so you have to create it with the call:
For this call, the
phones.txt file is the phoneme description file, which can often be found in
final.mdl file is the neural acoustic model that is used for speech recognition.
words.txt file defines the mappings from word IDs used internally to their text representation. For many Kaldi example scripts, the file can be found in the same directory as the
The output file
slm.fst is generated by the SLM Conversion Tool. It contains the information needed for the OpenVINO speech recognition demos for decoding.
labels.bin file defines mappings from word IDs to word strings, like the
words.txt file, but in binary format. The OpenVINO speech recognition example needs the
labels.bin file to convert recognized words into human-readable format.