精通
英语
和
开源
,
擅长
开发
与
培训
,
胸怀四海
第一信赖
服务方向
联系方式
dear povey, 亲爱的波维,
now i have a problem about the fst.i read fst.30.gz and read it in fstcopy. i am not sure the first and second rows is the trans-id.and the forth is the word-id.i do not understand the third row is what.if the first and second rows is state id,i am sure the third row is trans-id and the forth is word-id.but in our timit recipe in kaldi,it only have 48 phones and i see the si2022 have 20 phones,and the hmm trans-id is 6,so maybe trans-id.so i do not what means.thank you for your help. 现在我有一个关于fst.i的问题,我读取了fst.30.gz并在fstcopy中读取了它。我不确定第一列和第二列是“ trans-id”,第四列是单词“ id”。我不了解第三列是什么。如果第一列和第二列是状态id,我确定第三列是是trans-id,第四个是word-id。但是在我们的kaldi timit 脚本中,它只有48个音节,我看到si2022有20个音节,而hmm的trans-id是6,所以也许是对trans-id.so不理解。谢谢您的帮助。
best wishes, 最好的祝愿,
ben 本
faem0_si2022 0 1 2 0 1 2 4 0 1 1 1 0 2 3 6 0 2 2 3 0 3 4 2 38 0.693359 3 117 266 38 0.693359 3 3 5 0 4 115 4 0 4 4 1 0 5 6 268 0 6 7 270 0 6 6 267 0 7 8 2 0 0.693359 7 9 20 3 0.693359 7 7 269 0 8 113 4 0 8 8 1 0 9 10 22 0 9 9 19 0 10 11 24 0 10 10 21 0 11 12 2 0 0.693359 11 13 80 13 0.693359 11 11 23 0 12 111 4 0 12 12 1 0 13 14 82 0 13 13 79 0 14 15 84 0 14 14 81 0 15 16 2 0 0.693359 15 17 32 5 0.693359 15 15 83 0 16 109 4 0 16 16 1 0 17 18 34 0 17 17 31 0 18 19 36 0 18 18 33 0 19 20 2 0 0.693359 19 21 122 20 0.693359 19 19 35 0 20 107 4 0 20 20 1 0 21 22 124 0 21 21 121 0 22 23 126 0 22 22 123 0 23 24 2 0 0.693359 23 25 146 24 0.693359 23 23 125 0 24 105 4 0 24 24 1 0 25 26 148 0 25 25 145 0 26 27 150 0 26 26 147 0 27 28 2 0 0.693359 27 29 62 10 0.693359 27 27 149 0 28 103 4 0 28 28 1 0 29 30 64 0 29 29 61 0 30 31 66 0 30 30 63 0 31 32 2 0 0.693359 31 33 68 11 0.693359 31 31 65 0 32 101 4 0 32 32 1 0 33 34 70 0 33 33 67 0 34 35 72 0 34 34 69 0 35 36 2 0 0.693359 35 37 242 41 0.693359 35 35 71 0 36 99 4 0 36 36 1 0 37 38 244 0 37 37 241 0 38 39 246 0 38 38 243 0 39 40 2 0 0.693359 39 41 224 37 0.693359 39 39 245 0 40 97 4 0 40 40 1 0 41 42 226 0 41 41 223 0 42 43 228 0 42 42 225 0 43 44 2 0 0.693359 43 45 152 25 0.693359 43 43 227 0 44 95 4 0 44 44 1 0 45 46 154 0 45 45 151 0 46 47 156 0 46 46 153 0 47 48 2 0 0.693359 47 49 260 44 0.693359 47 47 155 0 48 93 4 0 48 48 1 0 49 50 262 0 49 49 259 0 50 51 264 0 50 50 261 0 51 52 2 0 0.693359 51 53 68 11 0.693359 51 51 263 0 52 91 4 0 52 52 1 0 53 54 70 0 53 53 67 0 54 55 72 0 54 54 69 0 55 56 2 0 0.693359 55 57 212 35 0.693359 55 55 71 0 56 89 4 0 56 56 1 0 57 58 214 0 57 57 211 0 58 59 216 0 58 58 213 0 59 60 2 0 0.693359 59 61 44 7 0.693359 59 59 215 0 60 87 4 0 60 60 1 0 61 62 46 0 61 61 43 0 62 63 48 0 62 62 45 0 63 64 2 0 0.693359 63 65 254 43 0.693359 63 63 47 0 64 85 4 0 64 64 1 0 65 66 256 0 65 65 253 0 66 67 258 0 66 66 255 0 67 68 2 0 0.693359 67 69 122 20 0.693359 67 67 257 0 68 83 4 0 68 68 1 0 69 70 124 0 69 69 121 0 70 71 126 0 70 70 123 0 71 72 2 0 0.693359 71 73 26 4 0.693359 71 71 125 0 72 81 4 0 72 72 1 0 73 74 28 0 73 73 25 0 74 75 30 0 74 74 27 0 75 76 2 0 75 75 29 0 76 77 4 0 76 76 1 0 77 78 6 0 77 77 3 0 78 79 2 38 0.693359 78 118 0 38 0.693359 78 78 5 0 79 80 4 0 79 79 1 0 80 119 6 0 80 80 3 0 81 82 6 0 81 81 3 0 82 73 26 4 82 82 5 0 83 84 6 0 83 83 3 0 84 69 122 20 84 84 5 0 85 86 6 0 85 85 3 0 86 65 254 43 86 86 5 0 87 88 6 0 87 87 3 0 88 61 44 7 88 88 5 0 89 90 6 0 89 89 3 0 90 57 212 35 90 90 5 0 91 92 6 0 91 91 3 0 92 53 68 11 92 92 5 0 93 94 6 0 93 93 3 0 94 49 260 44 94 94 5 0 95 96 6 0 95 95 3 0 96 45 152 25 96 96 5 0 97 98 6 0 97 97 3 0 98 41 224 37 98 98 5 0 99 100 6 0 99 99 3 0 100 37 242 41 100 100 5 0 101 102 6 0 101 101 3 0 102 33 68 11 102 102 5 0 103 104 6 0 103 103 3 0 104 29 62 10 104 104 5 0 105 106 6 0 105 105 3 0 106 25 146 24 106 106 5 0 107 108 6 0 107 107 3 0 108 21 122 20 108 108 5 0 109 110 6 0 109 109 3 0 110 17 32 5 110 110 5 0 111 112 6 0 111 111 3 0 112 13 80 13 112 112 5 0 113 114 6 0 113 113 3 0 114 9 20 3 114 114 5 0 115 116 6 0 115 115 3 0 116 120 266 45 116 116 5 0 117 5 0 45 117 117 265 0 118 119 118 0 0 119 119 5 0 120 5 0 0 120 120 265 0
It looks like this is the acceptor format of OpenFst. The 3rd field
is the word-id and the last field is the cost (negated
log-likelihood), coming from the lexicon (pron-prob of
silence/not-silence).
看起来这是OpenFst的接受者格式。第三列是word-id且最后一列是成本(否定对数似然性),来自lexicon(沉默/不沉默的问题概率)
dear povey, 亲爱的波维,
thank you for your reply.but it have five rows.the fifth rows is cost,but always
empty.the forth is word-id.because in timit,the word.txt is the same as the lexcious.txt and phone.txt,and it only no more than 50.so i am not sure what is it.
thank you for your reply again. 谢谢您的答复。但它有五列。第五列是费用,但总是空的。第四个是word-id。因为很简单,word.txt与lexcious.txt和phone.txt相同,并且最多不超过50个。所以我不确定这是什么。
best wishes, 再次感谢您的回复。
ben 最好的祝愿,
Oh, I see, this is a per-utterance decoding graph. The inputs are transition-ids. 哦,我知道了,这是每个发音的解码图。输入是transition-id。
yes,it is a per-utterance decoding graph.you means the third rows is transition-id?and the first and second rows is also transition-id? 是的,这是一个基于语音的解码图。您的意思是第三列是transition-id?而第一列和第二列也是transition-id?
ben 本
First and second rows are begin/end state in the FST; see www.openfst.org to understand the FST format. 第一和第二列是FST中的开始/结束状态;看
i know it .but in timit,every HMM is have 3 states.and this utterance is 20 phones.so the first and second rows should be no more than 60,but the first and second rows is 120 now.so i do not know that. 我知道。但是在timit中,每个HMM都有3个状态。这种话语是20个音节。所以第一和第二列应该不超过60,但是现在第一和第二列是120。所以我不知道那。