Tsinghua University: Making Full Use of Chinese Speech Corpora (PPT lecture slides)

Topics covered: purpose of speech corpora; factors to be considered in data creation; data creation; data transcription; learning from corpora; the Chinese Corpus Consortium (CCC).

得意音通技术: Your Partner in the Century of Speech

Slide 11. Outline
❑ Purpose of speech corpora
❑ Factors to be considered in data creation
❑ Data creation
❑ Data transcription
❑ Learning from corpora
❑ Chinese Corpus Consortium (CCC)

Slide 12. Data Creation
❑ Purposes of an ASR corpus:
  ❖ Acoustic training: the training set
  ❖ Speech recognition evaluation (testing): the testing set
❑ Categories of ASR corpus:
  ❖ Read speech
  ❖ Spontaneous/conversational speech
❑ Design of a speech database before creation:
  ❖ Aspects mentioned above:
    ▪ Language, speaking style, recording channel, sampling rate and precision, and corpus size: chosen according to the application/task background
    ▪ SNR levels, number of speakers, and speaker balance: chosen for diversity
  ❖ Speaking content balance, for content diversity and to provide a good training set:
    ▪ For read speech, the balance could be based on
      - phone, di-phone, tri-phone, and so on;
      - IF (initial/final), di-IF, tri-IF, syllable, di-syllable, tri-syllable for Chinese
    ▪ For spontaneous speech, topic design

Slide 13. Data Creation: Read Speech (1)
❑ Although spontaneous ASR is becoming a research focus, collecting read speech databases is still necessary.
❑ A high-quality read speech corpus helps train a good initial acoustic model.
❑ In spontaneous ASR, pronunciation modelling techniques and pronunciation lexicons are then adopted to obtain a practically good final acoustic model.

Slide 14. Data Creation: Read Speech (2)
❑ The goal of read speech corpus design is often to balance the speech recognition units (modelling units), so as to cover as many co-articulations as possible with as small a set of sentences as possible.
❑ Such a minimal sentence set can be used not only for training acoustic models but also for speaker adaptation.
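
The slide does not say how such a minimal sentence set is found. One common way to approximate it is greedy set cover, sketched below under that assumption: repeatedly pick the sentence that covers the most still-uncovered adjacent unit pairs. The function names, the toy corpus, and the choice of adjacent pairs as a stand-in for co-articulation contexts are all illustrative and not taken from the slides.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[str, str]


def di_units(units: List[str]) -> Set[Pair]:
    """Return the set of adjacent unit pairs (di-units) in one sentence."""
    return set(zip(units, units[1:]))


def greedy_cover(corpus: Dict[str, List[str]]) -> List[str]:
    """Pick sentence ids until every di-unit seen in the corpus is covered.

    Greedy set cover: an approximation, not a guaranteed minimum.
    """
    remaining: Set[Pair] = set().union(*(di_units(u) for u in corpus.values()))
    chosen: List[str] = []
    while remaining:
        # Take the sentence covering the most still-uncovered di-units
        # (ties go to the earliest sentence in the dict).
        best = max(corpus, key=lambda s: len(di_units(corpus[s]) & remaining))
        chosen.append(best)
        remaining -= di_units(corpus[best])
    return chosen


# Hypothetical toy corpus: each sentence is a sequence of modelling units.
TOY_CORPUS = {
    "s1": ["n", "i", "h", "ao"],
    "s2": ["n", "i"],
    "s3": ["h", "ao", "m", "a"],
}
# The same corpus plus one long sentence that covers every pair on its own.
TOY_CORPUS_WITH_LONG = dict(TOY_CORPUS, s4=["n", "i", "h", "ao", "m", "a"])

print(greedy_cover(TOY_CORPUS))            # ['s1', 's3']
print(greedy_cover(TOY_CORPUS_WITH_LONG))  # ['s4']
```

In the first toy corpus, "s2" is never chosen because its only pair is already covered by "s1"; this is the sense in which a small set can still cover every context.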

Slide 15. Data Creation: Read Speech (3)
❑ Sentence design example:
  ❖ The goal is to choose 6,000 sentences (about 0.75%) from 800,000 sentences taken from the People's Daily, with a balanced di-IF distribution.
  ❖ Several criteria:
    ▪ Natural selection: sentences are chosen randomly, giving an almost natural di-IF distribution.
    ▪ Restraining high-frequency di-IFs (RHF): high-frequency di-IFs are restrained from occurring more often, so that each di-IF occurs almost equally often and the acoustic model is well trained.
    ▪ Encouraging low-frequency di-IFs (ELF): as an alternative, low-frequency di-IFs are encouraged to occur as often as possible; the number of occurrences of any di-IF should be greater than a feasible pre-defined threshold.
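
The slide names the three criteria but not the procedures behind them. The sketch below shows one plausible reading of each, as an assumption rather than the authors' method: natural selection as a uniform random sample, RHF as a per-di-IF occurrence cap, and ELF as a greedy pick of sentences containing di-IFs still below a threshold. All function names, parameters (`cap`, `threshold`, `seed`), and the toy sentences are hypothetical.

```python
import random
from collections import Counter
from typing import List, Sequence, Tuple

Pair = Tuple[str, str]

# Ratio stated on the slide: 6,000 of 800,000 sentences, i.e. 0.75%.
SELECTION_RATIO = 6_000 / 800_000


def di_ifs(units: Sequence[str]) -> List[Pair]:
    """Adjacent IF pairs of one sentence, repeats included."""
    return list(zip(units, units[1:]))


def select_natural(sentences: List[List[str]], k: int, seed: int = 0) -> List[int]:
    """Natural selection: a uniform random sample of k sentence indices."""
    return random.Random(seed).sample(range(len(sentences)), k)


def select_rhf(sentences: List[List[str]], k: int, cap: int) -> List[int]:
    """RHF (assumed form): skip a sentence if it would push any di-IF above `cap`."""
    counts: Counter = Counter()
    chosen: List[int] = []
    for i, units in enumerate(sentences):
        if len(chosen) == k:
            break
        new = Counter(di_ifs(units))
        if all(counts[p] + n <= cap for p, n in new.items()):
            counts.update(new)
            chosen.append(i)
    return chosen


def select_elf(sentences: List[List[str]], k: int, threshold: int) -> List[int]:
    """ELF (assumed form): greedily add the sentence with the most distinct
    di-IFs whose running count is still below `threshold`."""
    counts: Counter = Counter()
    chosen: List[int] = []
    pool = list(range(len(sentences)))

    def gain(i: int) -> int:
        return sum(1 for p in set(di_ifs(sentences[i])) if counts[p] < threshold)

    while pool and len(chosen) < k:
        best = max(pool, key=gain)  # ties go to the lowest index
        if gain(best) == 0:
            break  # every reachable di-IF has met the threshold
        counts.update(di_ifs(sentences[best]))
        chosen.append(best)
        pool.remove(best)
    return chosen


# Hypothetical toy data: each sentence is a sequence of initials and finals.
TOY_SENTENCES = [
    ["d", "e", "d", "e"],
    ["d", "e"],
    ["zh", "ong", "g", "uo"],
    ["d", "e", "g", "uo"],
]

print(select_rhf(TOY_SENTENCES, 3, cap=2))        # [0, 2]
print(select_elf(TOY_SENTENCES, 2, threshold=1))  # [2, 0]
```

On the toy data, RHF rejects sentences 1 and 3 because the pair ("d", "e") has already reached the cap, while ELF starts with sentence 2 because it contributes the most unseen pairs.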
