锐英源软件
第一信赖

精通

英语

开源

擅长

开发

培训

胸怀四海 

第一信赖

当前位置:锐英源 / 开源技术 / 语音识别开源 / 单声道音素解码零权重问题

服务方向

人工智能数据处理
人工智能培训
kaldi数据准备
小语种语音识别
语音识别标注
语音识别系统
语音识别转文字
kaldi开发技术服务
软件开发
运动控制卡上位机
机械加工软件
软件开发培训
Java 安卓移动开发
VC++
C#软件
汇编和破解
驱动开发

联系方式

固话:0371-63888850
手机:138-0381-0136
Q Q:396806883
微信:ryysoft

单声道音素解码零权重问题


I'm trying to develop a recipe for a corpus using the WSJ-s5 egs as a template. During monophone decoding, however, I keep on running into zero weight issues. 我正在尝试使用WSJ-s5 egs作为模板为语料库制定脚本。但是,在单声道音素解码期间,我一直遇到零权重问题。

Below is an excerpt from decode.1.log: 以下是decode.1.log的摘录:

"gmm-latgen-faster --max-active=7000 --beam=13.0 --lattice-beam=6.0 --acoustic-scale=0.083333 --allow-partial=true --word-symbol-table=exp/mono0a/graph_bg/words.txt exp/mono0a/final.mdl exp/mono0a/graph_bg/HCLG.fst 'ark,s,cs:apply-cmvn --utt2spk=ark:data/test/split8/1/utt2spk scp:data/test/split8/1/cmvn.scp scp:data/test/split8/1/feats.scp ark:- | add-deltas ark:- ark:- |' 'ark:|gzip -c > exp/mono0a/decode_bg/lat.1.gz'
add-deltas ark:- ark:-
apply-cmvn --utt2spk=ark:data/test/split8/1/utt2spk scp:data/test/split8/1/cmvn.scp scp:data/test/split8/1/feats.scp ark:-
nchlt-eng-500m-0001 !SIL rebellion !SIL
WARNING (gmm-latgen-faster:ComputeBackwardWeight():determinize-lattice-pruned.cc:1047) Total weight of input latice is zero.
[empty subset]
WARNING (gmm-latgen-faster:InitialToStateId():determinize-lattice-pruned.cc:587) Zero weight!
LOG (gmm-latgen-faster:DecodeUtteranceLatticeFaster():lattice-faster-decoder.cc:981) Log-like per frame for utterance nchlt-eng-500m-0001 is -7.68534 over 226 frames. ..."

A few lines from best_path.10.log: 来自best_path.10.log的几行内容:

"lattice-scale --inv-acoustic-scale=10 'ark:gunzip -c exp/mono0a/decode_bg/lat.*.gz|' ark:-
lattice-best-path --word-symbol-table=exp/mono0a/graph_bg/words.txt ark:- ark,t:exp/mono0a/decode_bg/scoring/10.tra
lattice-add-penalty --word-ins-penalty=0.0 ark:- ark:-
WARNING (lattice-best-path:main():lattice-best-path.cc:92) Best-path failed for key nchlt-eng-500m-0001 ... "

I initially trained a bigram ARPA model using different toolkits (MIT-LM, SRILM and IRSTLM), but all gave the same error. I have settled on a unigram FST (used the RM egs approach found in rm_prepare_grammar_ug.sh).I have most probably made a mistake somewhere in the data preparation phase but can't find where it is. 最初,我使用不同的工具包(MIT-LM,SRILM和IRSTLM)训练了bigram ARPA模型,但都给出了相同的错误。我已经确定了一个unigram FST(使用rm_prepare_grammar_ug.sh中的RM egs方法)。我很可能在数据准备阶段的某个地方犯了一个错误,但是找不到它的位置。

 

 

I think this may actually be a bug in the determinization code. Could you please add to that the command line the option
--determinize-lattice=false, and send the output file exp/mono0a/decode_bg/lat.1.gz and your final.mdl to me at dpovey@gmail.com?
Then I'll be able to reproduce the problem and fix it (if it is a bug). 您能否在命令行中添加--determinize-lattice = false选项,然后通过dpovey@gmail.com将输出文件exp/mono0a/decode_bg/lat.1.gz和您的final.mdl发送给我?然后,我将能够重现该问题并解决(如果是错误)。

 

 

Adding --determinize-lattice=false helped, no errors. 添加--determinize-lattice =false有帮助,没有错误。

 

 

Yes, no errors are expected in that case, I would run another program on the output (determinize-lattice-phone-pruned) which is expected to produce the error. 按提示改没事了,我将在输出(确定删节的音素)运行另外一个程序。

 

 

Just to follow up on this- it looks like the problem went away after updating the code and recompiling and rerunning. If it recurs I'll look at
this again. 只是为了跟进此事-更新代码并重新编译并重新运行后,问题似乎消失了。如果再次出现,我会再次查看。

友情链接
版权所有 Copyright(c)2004-2021 锐英源软件
公司注册号:410105000449586 豫ICP备08007559号 最佳分辨率 1024*768
地址:郑州大学北校区院(文化路97号院)内