当前位置：和泉文库 > 计算机 > 浏览文档

山东大学：语音识别技术（PPT课件讲稿）自动语音识别 Automatic Speech Recognition

Introduction Speech recognition based on HMM • Acoustic processing • Acoustic modeling: Hidden Markov Model • Language modeling

文件格式：PPTX，文件大小：2.11MB，售价：10.57元

共44页，可试读15页，点击往前阅读 ↑↑

文档详细内容（约44页）

How might computers do it? Digitization Acoustic analysis of the speech signal Linguistic interpretation Acoustic waveform Acoustic signal 静中解学需 an maris e va neri n a n :i rout u s even Speech recognition HUMAN COMPUTER INTERACTION

How might computers do it? Digitization Acoustic analysis of the speech signal Linguistic interpretation 1/28/2021 HUMAN COMPUTER INTERACTION 6 Acoustic waveform Acoustic signal Speech recognition

Outline Introduction Speech recognition based on HMm Acoustic processing Acoustic modeling: Hidden Markov Model anguage modeling Statistical approach HUMAN COMPUTER INTERACTION

Outline Introduction Speech recognition based on HMM • Acoustic processing • Acoustic modeling: Hidden Markov Model • Language modeling • Statistical approach 1/28/2021 HUMAN COMPUTER INTERACTION 7

Acoustic processing A wave for the words " speech lab"looks like p ee a 10000 1.20□ “to“a transition 0w个 Graphs from Simon Arnfield' s web tutorial on speech, Sheffield http://lethe.leedsac.uk/research/cogn/speech/tutoriall HUMAN COMPUTER INTERACTION

Acoustic processing A wave for the words “speech lab” looks like: 1/28/2021 HUMAN COMPUTER INTERACTION 8 s p ee ch l a b Graphs from Simon Arnfield’s web tutorial on speech, Sheffield: http://lethe.leeds.ac.uk/research/cogn/speech/tutorial/ “l” to “a” transition:

Acoustic sampling 10 ms frame( ms= millisecond =1/1000 second C25 ms window around frame to smooth signal processing 体体和个 I ms 10ms Result Acoustic Feature vectors -986,-792,-692,-614,-429,-286,-134,-57,-41,-169,-456,-450,-541,-761,-1067,-1231,-1847,-952,-645,-489,-448 -212,193,114,-17,-110,128,261,198,390,461,772,948,1451,1974,2624,3793,4968,5939,6057,6581,7302,7649,7223,6119,5461 4353,3611,2740,204,1349,1178,1085,901,301,-262,-499,-488,-707,-1406,-1997,-2377,-2494,-2605,-2675,-2627,-2500,-2148, 1648,-970,-364,13,260,494,788,1011,938,717,507,323,324,325,350,103,-113,64,176,93,-249,-461,-606,-909,-1159,-1397,-1544 HUMAN COMPUTER INTERACTION 9

Acoustic sampling 10 ms frame (ms = millisecond = 1/1000 second) ~25 ms window around frame to smooth signal processing 1/28/2021 HUMAN COMPUTER INTERACTION 9 25 ms 10ms . . . a1 a2 a3 Result: Acoustic Feature Vectors

Spectral analysis Frequency gives pitch; amplitude gives volume sampling at -8 kHz phone, -16 kHz mic(kHz=1000 cycles/sec) p ee ch 10000 10000 Fourier transform of wave yields a spectrogram darkness indicates energy at each frequency hundreds to thousands of frequency samples HUMAN COMPUTER INTERACTION

Spectral analysis Frequency gives pitch; amplitude gives volume • sampling at ~8 kHz phone, ~16 kHz mic (kHz=1000 cycles/sec) Fourier transform of wave yields a spectrogram • darkness indicates energy at each frequency • hundreds to thousands of frequency samples 1/28/2021 HUMAN COMPUTER INTERACTION 10 s p ee ch l a b

点击进入文档下载页（PPTX格式）

共44页，可试读15页，点击继续阅读 ↓↓

您可能感兴趣的文档

数据集成 Data Integration（PPT讲稿）成就与展望 Achievements and Perspectives
北京师范大学：拓扑序及其量子相变（PPT课件讲稿）Topological Order and its Quantum Phase Transition
计算机系教学资源（PPT课件讲稿）信息安全与保密技术
汤姆森 Thomson：利用Web of Knowledge对课题进行检索、分析、跟踪、管理
西安电子科技大学：《微机原理与接口技术》课程教学资源（PPT课件讲稿）第九章定时/计数器8253
同济大学：聚类分析（PPT课件讲稿）Cluster Analysis
《数字图像处理学》课程教学资源（PPT课件讲稿）第2章图像、图像系统与视觉系统
四川大学：《软件测试与维护基础教程》课程教学资源（PPT课件讲稿）软件测试工具 Software Testing Tool
B-树、散列技术、散列表的概念、散列函数的构造方法、处理冲突的方法、散列表上的运算
南京大学：《面向对象技术 OOT》课程教学资源（PPT课件讲稿）对象序列化和持久化 Object Serialization and Persistence
电子工业出版社：《计算机网络》课程教学资源（第五版，PPT课件讲稿）第十章下一代因特网
《网络编程实用教程（第三版）Network Application Programming》课程教学资源（PPT课件讲稿）第1章概述
电子科技大学：《计算机操作系统》课程教学资源（PPT课件讲稿）第五章设备管理
《计算机文化基础》课程教学资源（PPT课件讲稿）第二章 Windows XP操作系统
香港科技大学：《软件开发》教学资源（PPT课件讲稿）Functions
南京大学：复杂系统学习（PPT课件讲稿）佩特里网 Petri Nets
《3ds Max》教学资源（PPT课件）第4章基本三维模型的创建
上海交通大学：《程序设计》课程教学资源（PPT课件讲稿）第6章过程封装——函数
四川大学：《操作系统 Operating System》课程教学资源（PPT课件讲稿）Chapter 5 互斥与同步（Mutual Exclusion and Synchronization）5.4 Monitors 5.5 Message Passing 5.6 Readers/Writers Problem
清华大学：An Efficient Trie-based Method for Approximate Entity Extraction with Edit-Distance Constraints
东南大学：《数据结构》课程教学资源（PPT课件讲稿）第三章栈与队列
《计算机网络与因特网 Computer Networks and Internets》课程教学资源（PPT课件讲稿）Part II 物理层（信号、媒介、数据传输）
合肥工业大学：《网络安全概论》课程教学资源（PPT课件讲稿）第2讲密码学简介（主讲：苏兆品）
长春大学：《计算机应用基础》课程教学资源（PPT课件讲稿）第一章计算机基础知识（崔天明）

点击购买下载（PPTX）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录