锐英源软件
第一信赖

精通

英语

开源

擅长

开发

培训

胸怀四海 

第一信赖

当前位置:锐英源 / 开源技术 / espeak / espeak音素表-第一部分

服务方向

人工智能数据处理
人工智能培训
kaldi数据准备
小语种语音识别
语音识别标注
语音识别系统
语音识别转文字
kaldi开发技术服务
软件开发
运动控制卡上位机
机械加工软件
软件开发培训
Java 安卓移动开发
VC++
C#软件
汇编和破解
驱动开发

联系方式

固话:0371-63888850
手机:138-0381-0136
Q Q:396806883
微信:ryysoft

espeak音素表-第一部分

重点

phsource目录

espeakedit函数

音素表可以继承

分元音辅音

音素属性,重要wav

音素定义描述

音素指令

为什么国外动不动就出个平台,美国搞IT真是基础好,氛围好,不服不行。

 

PHONEME TABLES 音素表

A phoneme table defines all the phonemes which are used by a language, together with their properties and the data for their production as sounds. 音素表定义了语言使用的所有音素,以及它们的属性和作为声音产生的数据。
Generally each language has its own phoneme table, although additional phoneme tables can be used for different voices within the language. These alternatives are referenced from Voice files. 通常,每种语言都有自己的音素表,尽管其他音素表可用于该语言中的不同语音。从语音文件中引用了这些替代方法。
A phoneme table does not need to define all the phonemes used by a language. It can inherit the phonemes from a previously defined phoneme table. For example, a phoneme table may redefine (or add) some of the vowels that it uses, but inherit most of its consonants from a standard set. 音素表不需要定义语言使用的所有音素。它可以从先前定义的音素表继承音素。例如,一个音素表可能会重新定义(或添加)它使用的某些元音,但会从标准集中继承其大部分辅音。
The source files for the phoneme data are in the "phsource" directory in the espeakedit download package. "Vowel files", which are referenced in FMT(), VowelStart(), and VowelEnding() instructions are made using the espeakedit program. 音素数据的源文件位于espeakedit下载包的“ phsource”目录中。使用espeakedit程序可以在FMT(),VowelStart()和VowelEnding()指令中引用“ Vowel文件”。

Phoneme files 音素文件
The phoneme tables are defined in a master phoneme file, named phonemes. This starts with the base phoneme table followed by phoneme tables for other languages and voices. These inherit phonemes from the base table or previously defined tables. 音素表在主音素文件(音素)中定义。这从基本音素表开始,然后是其他语言和语音的音素表。这些从基表或先前定义的表继承音素。
In addition to phoneme definitions, the phoneme file can contain the following: 除了音素定义之外,音素文件还可以包含以下内容:
include <filename>
Includes the text of the specified file at this point. This allows different phoneme tables to be kept in different text files, for convenience. <filename> is a relative path. The included file can itself contain include statements. 此时包括指定文件的文本。为了方便,这允许将不同的音素表保存在不同的文本文件中。<filename>是相对路径。包含文件本身可以包含包含语句。
phonemetable <name> <parent>
Starts a new phoneme table, and ends the previous table. 开始一个新的音素表,并结束上一个表。
<name> Is the name of this phoneme table. This name is used in Voice files. <name>是此音素表的名称。此名称在语音文件中使用。
<parent> Is the name of a previously defined phoneme table whose phoneme definitions are inherited by this one. The name base indicates the first (base) phoneme table. <parent>是先前定义的音素表的名称,其音素定义被此表继承。基名称表示第一个(基)音素表。

Phoneme definitions 音素定义
Note: These new Phoneme definitions apply to eSpeak version 1.42.20 and later. 注意:这些新的音素定义适用于eSpeak 1.42.20和更高版本。
A phoneme table contains a list of phoneme definitions. Each starts with the keyword phoneme and the phoneme name (this is the name used in the pronunciation rules in a language's *_rules and *_list files), and ends with the keyword endphoneme. For example: 音素表包含音素定义列表。每一个都以关键字音素和音素名称(这是一种语言的* _rules和* _list文件中的发音规则中使用的名称)开头,并以关键字endphoneme结尾。例如:

phoneme aI
    vowel
    starttype #a endtype #i
    length 230
    FMT(vowels/ai)
  endphoneme

  phoneme s
    vls alv frc sibilant
    voicingswitch z
    lengthmod 3
    Vowelin  f1=0  f2=1700 -300 300  f3=-100 80
    Vowelout f1=0  f2=1700 -300 250  f3=-100 80  rms=20

    IF nextPh(isPause) THEN
      WAV(ufric/s_)
    ELIF nextPh(p) OR nextPh(t) OR nextPh(k) THEN
      WAV(ufric/s!)
    ENDIF
    WAV(ufric/s)
  endphoneme

 

A phoneme definition contains both static properties and executed instructions. The instructions may contain conditional statements, so that the effect of the phoneme may be different depending on adjacent phonemes, whether the syllable is stressed, etc. 音素定义包含静态属性和执行的指令。指令可能包含条件语句,因此音素的效果可能会因相邻音素,是否重读音节等而有所不同。
The instructions of a phoneme are interpreted in two different phases. In the first phase, the instructions may change the phoneme and replace it by a different phoneme. In the second phase, instructions are used to produce the sound for the phoneme. 音素的指令分为两个不同的阶段。在第一阶段,指令可以更改音素,并用其他音素代替。在第二阶段,使用指令为音素产生声音。
The import_phoneme statement can be used to copy a previously defined phoneme from a specified phoneme table. For example: import_phoneme语句可用于从指定音素表中复制先前定义的音素。例如:

 phoneme t
 import_phoneme base/t[
 endphoneme 


means: phoneme t in this phoneme table is a copy of phoneme t[ from phoneme table "base". A length instruction can be used after import_phoneme to vary the length from the original. 表示:此音素表中的音素t是音素t [来自音素表“ base”的副本。可以在import_phoneme之后使用长度指令,以更改原始长度。

Phoneme Properties 音素属性
Within the phoneme definition the following lines may occur: ( (V) indicates only for vowels, (C) only for consonants) 在音素定义中,可能会出现以下几行:((V)仅表示元音,(C)仅表示辅音)
Type. One of these must be present. 类型。其中之一必须存在。

元音
液体 半元音,例如:  r, l, j, w
鼻例如:  m, n, N
停止例如:  p, b, t, d, k, g
frc 擦音,例如:  f, v, T, D, s, z, S, Z, C, x
非洲 附属例如:  tS, dZ
暂停
强调 用于应力符号,例如:',=%
虚拟 用于表示一类音素。

Properties: 特性:

vls (C)清音 p, t, k, f, s
VCD (三)发声例如。 b, d, g, v, z
稳定 (C)例如: s, z, S, Z, tS, dZ
lat (C)lat或or的辅音。
隆隆的 (C)“ r”型辅音。
不受压力 (V)除非明确标记,否则该元音始终不受力。
无链接 阻止与上一个音素的任何链接。
停顿 在liquid或nasal音素中使用,以防止eSpeak在此音素开头的单词和前一个单词以元音结尾的情况下插入简短的停顿。
颤音 (C)将颤音应用于发声。

Place of Articulation (C): 衔接地点(C):

blb   双唇 数据库   牙齿的 dnt   牙科的
阿尔夫 肺泡 射频 逆向反射 解放军 肺泡
朋友 lat vel 绒毛 体重 唇膜
紫外线 小卵 phr 咽部 t 声门

starttype starttype<phoneme> 
Allocates this phoneme to a group so that conditions such as nextPh(#e) can test for any of a group of phonemes. Pre-defined groups for use for vowels are: #@ #a #e #i #o #u. Additional groups can be defined as phonemes with type "virtual". 将此音素分配给一组,以便诸如nextPh(#e)之类的条件可以测试一组音素。用于元音的预定义组是:#@ #a #e #i #o #u。可以将其他组定义为类型为“虚拟”的音素。
endtype <phoneme>
Allocates this phoneme to a group so that conditions such as prevPh(#e) can test for any of a group of phonemes. Pre-defined groups for use for vowels are: #@ #a #e #i #o #u. Additional groups can be defined as phonemes with type "virtual". 将此音素分配给一组,以便诸如prevPh(#e)之类的条件可以测试一组音素中的任何一个。用于元音的预定义组是:#@ #a #e #i #o #u。可以将其他组定义为类型为“虚拟”的音素。
lengthmod <integer>
(C) Determines how this consonant affects the length of the previous vowel. This value is used as index into the length_mods table in the CalcLengths() function in the eSpeak program.(C)确定此辅音如何影响前一个元音的长度。此值用作eSpeak程序CalcLengths()函数中length_mods表的索引。


voicingswitch <phoneme>
This is used for some languages to change between voiced and unvoiced phonemes.用于某些语言在有声和无声音素之间切换。

Phoneme Instructions 音素说明
Phoneme Instructions may be included within conditional statements. 音素说明可能包含在条件语句中。
During the first phase of phoneme interpretation, an instruction which causes a change to a different phoneme will terminate the instructions. During the second phase, FMT() and WAV() instructions will terminate the instructions. 在音素解释的第一阶段,导致更改其他音素的指令将终止该指令。在第二阶段,FMT()和WAV()指令将终止指令。
length <length>
The relative length of the phoneme, typically about 140 for a short vowel and from 200 to 300 for a long vowel or diphong. A length() instruction is needed for vowels. It is optional for consonants. 音素的相对长度,对于短元音通常为约140,对于长元音或双音为200至300。元音需要length()指令。辅音是可选的。
ipa <ipa string>
In many cases, eSpeak makes IPA (International Phonetic Alpbabet) phoneme names automatically from eSpeak phoneme names. If this is not correct, then the phoneme definition can include an ipa instruction to specify the correct IPA name. IPA strings may include non-ascii characters. They may also include characters specified by their character codes in the form U+ followed by 4 hexadecimal digits. For example a string: aU+0303 indicates 'a' with a 'combining tilde'. 在许多情况下,eSpeak会根据eSpeak音素名称自动生成IPA(国际音标Alpbabet)音素名称。如果这不正确,则音素定义可以包含一个IPA指令以指定正确的IPA名称。 IPA字符串可以包含非ASCII字符。它们还可以包括由其字符代码指定的字符,格式为U +,后跟4个十六进制数字。例如,字符串:aU + 0303表示带有组合波浪号的“ a”。
WAV(<wav file>, <amplitude>)
 <wav file> is a path to a WAV file (22 kHz, 16 bits, mono) within phsource/ which will be played to produce the sound. This method is used for unvoiced consonants. <wavefile> does not include a .WAV filename extension, although the file to which it refers may or may not have one. <wav文件>是phsource /中WAV文件(22 kHz,16位,单声道)的路径,将播放该声音以产生声音。此方法用于清音辅音。<wavefile>不包括.WAV文件扩展名,尽管它引用的文件可能具有也可能没有。
<amplitude> is optional. It is a percentage change to the amplitude of the WAV file. So, WAV(ufric/s, 50) means: play file 'ufric/s.wav' at 50% amplitude. <amplitude>是可选的。它是WAV文件幅度的百分比变化。因此,WAV(ufric / s,50)表示:以50%的幅度播放文件ufric / s.wav。
FMT(<vowel file>, <amplitude>)
<vowel file> is a path to a file (within phsource/) which defines how to generate the sound (a vowel or voiced consonant) from a sequence of formant values. Vowel files are made using the espeakedit program. <元音文件>是文件的路径(在phsource /中),该文件定义了如何从共振峰值序列中生成声音(元音或浊辅音)。元音文件是使用espeakedit程序制作的。
<amplitude> is optional. It is a percentage change to the amplitude of the sound which is synthesized from the FMT() instruction. <amplitude>是可选的。它是根据FMT()指令合成的声音振幅的百分比变化。
FMT(<vowel file>, <amplitude>) addWav(<wav file>, <amplitude>)
For voiced consonants, a FMT() instruction may be followed by an addWav() instruction. addWav() has the same format as a WAV() instruction, but the WAV file is mixed with the sound which is synthesized from the FMT() instruction. 对于浊辅音,可以在FMT()指令后跟addWav()指令。 addWav()与WAV()指令具有相同的格式,但是WAV文件与从FMT()指令合成的声音混合在一起。
VowelStart(<vowel file>, <length adjust>)
This is used to modify the start of a vowel when it follows a sonorant consonant (such as [l] or [j]). It replaces the first frame of the <vowel file> which is specified in a FMT() instruction by this <vowel file>, and adjusts the length of the original by a signed value <length adjust>. The VowelStart() instruction may be specified either in the phoneme definition of the vowel, or in the phoneme definition of the sonorant consonant which precedes the vowel. The former takes precedence. 当元音跟随辅音(例如[l]或[j])时,用于修改元音的开头。它用此<vowel file>替换在FMT()指令中指定的<vowel file>的第一帧,并通过带符号的值<length adjust>调整原稿的长度。可以在元音的音素定义中或在元音之前的共振音辅音的音素定义中指定VowelStart()指令。前者优先。
VowelEnding(<vowel file>, <length adjust>)
This is used to modify the end of a vowel when it is followed by a sonorant consonant (such as [l] or [j]). It is appended to the <vowel file> which is specified in a FMT() instruction by this <vowel file>, and adjusts the length of the original by a signed value <length adjust>. The VowelEnding() instruction may be specified either in the phoneme definition of the vowel, or in the phoneme definition of the sonorant consonant which follows the vowel. The former takes precedence. 当元音后面跟随有发音辅音(例如[l]或[j])时,可用于修改元音的结尾。它被附加到该<vowel file> 在FMT()指令中指定的<vowel file> 中,并通过带符号的值<length adjust>调整原稿的长度。 VowelEnding()指令既可以在元音的音素定义中指定,也可以在元音之后的发音辅音的音素定义中指定。前者优先。
Vowelin <vowel transition data>
(C) Specifies the effects of this consonant on the formants of a following vowel. See "vowel transitions", below. (C)指定此辅音对随后元音的共振峰的影响。请参阅下面的“元音过渡”。
Vowelout <vowel transition data>
(C) Specifies the effects of this consonant on the formants of a preceding vowel. See "vowel transitions", below. (C)指定此辅音对前一个元音的共振峰的影响。请参阅下面的“元音过渡”。
ChangePhoneme(<phoneme>)
Change to the specified phoneme. 更改为指定的音素。
ChangeIfDiminished(<phoneme>)
Change to the specified phoneme (such as schwa, @) if this syllable has "diminished" stress. 如果此音节的重音已“减少”,请更改为指定的音素(例如schwa,@)。
ChangeIfUnstressed(<phoneme>)
Change to the specified phoneme if this syllable has "diminished" or "unstressed" stress. 如果该音节的重音已“减小”或“未加重”,则更改为指定的音素。
ChangeIfNotStressed(<phoneme>)
Change to the specified phoneme if this syllable does not have "primary" stress. 如果此音节没有“主要”重音,请更改为指定的音素。
ChangeIfStressed(<phoneme>)
Change to the specified phoneme if this syllable has "primary" stress. 如果该音节具有“主要”重音,请更改为指定的音素。
IfNextVowelAppend(<phoneme>)
If the following phoneme is a vowel then this additional phoneme will be inserted before it. 如果以下音素是元音,则将在此音素之前插入其他音素。
RETURN 返回
Ends executions of instructions. 结束指令的执行。
CALL <phoneme table>/<phoneme>
Executes the instructions of the specified phoneme. 执行指定音素的指令。

友情链接
版权所有 Copyright(c)2004-2021 锐英源软件
公司注册号:410105000449586 豫ICP备08007559号 最佳分辨率 1024*768
地址:郑州大学北校区院(文化路97号院)内