Training procedure
• We train the JFA matrices in the following order [Kenny et al., 2007a] (sketched in code below):
1. Train the eigenvoice matrix V, assuming that U and D are zero
2. Train the eigenchannel matrix U given the estimate of V, assuming that D is zero
3. Train the residual matrix D given the estimates of V and U
• Using these matrices, we compute the speaker factors y, the channel factors x, and the residual factors z
• We compute the final score using these matrices and factors
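The full EM updates for V, U, and D are given in [Kenny et al., 2007a]; the Python sketch below only makes the three-stage order and data flow concrete, substituting crude PCA/least-squares stand-ins for the real maximum-likelihood updates. All dimensions, data, and helper names are illustrative, not part of the original recipe.

```python
# A minimal sketch of the three-stage JFA training order described above.
# Each stage uses a crude principal-directions stand-in for the EM updates,
# only to show what is trained on what, and in which order.
import numpy as np

rng = np.random.default_rng(0)
n_spk, n_sess, sv_dim = 20, 5, 100      # speakers, sessions per speaker, supervector dim
rv, rc = 10, 5                          # ranks of V (eigenvoice) and U (eigenchannel)

# Centered GMM supervectors, one per (speaker, session)
supervecs = rng.normal(size=(n_spk, n_sess, sv_dim))

def top_directions(X, rank):
    """Leading principal directions of the rows of X: a stand-in for EM."""
    _, _, Vt = np.linalg.svd(X - X.mean(axis=0), full_matrices=False)
    return Vt[:rank].T                   # (sv_dim, rank)

# 1. Train V on speaker averages, with U and D assumed zero.
spk_means = supervecs.mean(axis=1)                     # (n_spk, sv_dim)
V = top_directions(spk_means, rv)

# 2. Train U on within-speaker (channel) variation, given V, with D zero.
y = spk_means @ V                                      # speaker factors y
speaker_part = y @ V.T                                 # (n_spk, sv_dim)
channel_resid = (supervecs - speaker_part[:, None, :]).reshape(-1, sv_dim)
U = top_directions(channel_resid, rc)

# 3. Train the diagonal residual D on what V and U leave unexplained.
x = channel_resid @ U                                  # channel factors x
leftover = channel_resid - x @ U.T
D = np.sqrt(leftover.var(axis=0))                      # diagonal of D

print(V.shape, U.shape, D.shape)                       # (100, 10) (100, 5) (100,)
```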
Total variability
• Subspaces U and V are not completely independent
• A combined total variability space was used [Dehak et al., 2011]
[Figure: the JFA supervector model u = m + Vy + Ux + Dz (separate speaker and channel factors) versus the total variability model u = m + Tw, where T is the total variability matrix and w is the i-vector]
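To make the contrast concrete, here is a minimal numpy sketch of the two decompositions side by side; the matrices are random placeholders used purely to illustrate the shapes involved.

```python
# JFA keeps speaker and channel in separate subspaces; the total variability
# model folds both into the single matrix T.  Values are illustrative only.
import numpy as np

rng = np.random.default_rng(1)
sv_dim, rv, rc, rt = 100, 10, 5, 15
m = rng.normal(size=sv_dim)                  # UBM mean supervector

# JFA: u = m + V y + U x + D z
V, U = rng.normal(size=(sv_dim, rv)), rng.normal(size=(sv_dim, rc))
D = rng.normal(size=sv_dim)                  # diagonal of D
y, x, z = rng.normal(size=rv), rng.normal(size=rc), rng.normal(size=sv_dim)
u_jfa = m + V @ y + U @ x + D * z

# Total variability: u = m + T w   (w is the i-vector)
T = rng.normal(size=(sv_dim, rt))
w = rng.normal(size=rt)
u_tv = m + T @ w
```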
i-vector
• An i-vector system uses a set of low-dimensional total variability factors (w) to represent each conversation side. Each factor controls an eigen-dimension of the total variability matrix (T), and the resulting factor vectors are known as i-vectors.
• Unlike JFA or other FA methods, the i-vector approach does not make a distinction between speaker and channel
• It defines a total variability space that contains speaker and channel variabilities simultaneously
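Because each conversation side becomes a single fixed-length vector, a verification trial reduces to comparing two i-vectors; cosine similarity is the scoring commonly used with i-vectors [Dehak et al., 2011]. A minimal sketch, with random vectors standing in for real i-vectors:

```python
# Scoring a trial as the cosine similarity of two i-vectors.
import numpy as np

def cosine_score(w1, w2):
    """Cosine similarity between two i-vectors."""
    return float(w1 @ w2 / (np.linalg.norm(w1) * np.linalg.norm(w2)))

rng = np.random.default_rng(2)
w_enrol, w_test = rng.normal(size=400), rng.normal(size=400)
print(cosine_score(w_enrol, w_test))
```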
Training total variability space
• The rank of T is set prior to training
• T and w are latent variables; the EM algorithm is used, with random initialization for T
• Training the total variability matrix T is similar to training V, except that all utterances from a given speaker are treated as if they were produced by different speakers
• A UBM diagonal covariance matrix Σ (MD×MD) is introduced to model the residual variability not captured by T
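Below is a compact sketch of one EM iteration for T, assuming the zeroth-order (N) and centralized first-order (F) Baum-Welch statistics introduced on the next slide have already been collected per utterance. It follows the usual posterior and update formulas for the total variability model, but with synthetic statistics and no convergence loop; treat it as an illustration of the structure, not a tested recipe.

```python
# One EM iteration for T.  Every utterance is its own class (the point made
# above), so the statistics are simply indexed per utterance.
import numpy as np

rng = np.random.default_rng(3)
C, D_feat, R, n_utt = 8, 4, 5, 50        # mixtures, feature dim, rank of T, utterances
MD = C * D_feat                          # supervector dimension

Sigma = np.abs(rng.normal(size=MD)) + .5 # diagonal of the UBM covariance Σ
T = rng.normal(size=(MD, R)) * .1        # random initialization for T
N = rng.uniform(1, 20, size=(n_utt, C))  # zeroth-order stats (synthetic)
F = rng.normal(size=(n_utt, MD))         # centralized first-order stats (synthetic)

def e_step(T, Sigma, N_u, F_u):
    """Posterior of w for one utterance: mean E[w] and correlation E[w w^T]."""
    NN = np.repeat(N_u, D_feat)                        # expand N_c to MD dims
    L = np.eye(R) + T.T @ ((NN / Sigma)[:, None] * T)  # precision of w
    Cov = np.linalg.inv(L)
    Ew = Cov @ T.T @ (F_u / Sigma)
    return Ew, Cov + np.outer(Ew, Ew)

# Accumulate over utterances, then solve per mixture for the rows of T.
A = np.zeros((C, R, R))                  # sum_u N_c(u) E[w w^T]
B = np.zeros((MD, R))                    # sum_u F(u) E[w]^T
for u in range(n_utt):
    Ew, Eww = e_step(T, Sigma, N[u], F[u])
    A += N[u][:, None, None] * Eww
    B += np.outer(F[u], Ew)
for c in range(C):
    rows = slice(c * D_feat, (c + 1) * D_feat)
    T[rows] = B[rows] @ np.linalg.inv(A[c])            # M-step update
```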
i-vector extraction
• 0th-order statistics: N_c(u) = Σ_t γ_c(o_t), where γ_c(o_t) = p(c | o_t, UBM)
• 1st-order statistics: F_c(u) = Σ_t γ_c(o_t) o_t
• 2nd-order statistics: S_c(u) = diag(Σ_t γ_c(o_t) o_t o_tᵀ)
• Centralized 1st- and 2nd-order statistics: F̃_c(u) = Σ_t γ_c(o_t)(o_t − m_c) and S̃_c(u) = diag(Σ_t γ_c(o_t)(o_t − m_c)(o_t − m_c)ᵀ), where m_c is the subvector of the UBM mean supervector corresponding to mixture component c
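As an illustration of how these statistics are gathered, the sketch below uses scikit-learn's GaussianMixture as a stand-in UBM (trained here on random data); gamma[t, c] is p(c | o_t, UBM) from the bullets above. This is illustrative, not an optimized front end.

```python
# Collecting per-utterance Baum-Welch statistics under a (stand-in) UBM.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(4)
frames = rng.normal(size=(200, 13))              # o_1..o_T, e.g. MFCC frames

ubm = GaussianMixture(n_components=8, covariance_type="diag", random_state=0)
ubm.fit(rng.normal(size=(2000, 13)))             # stand-in background data

gamma = ubm.predict_proba(frames)                # (T, C) frame posteriors
N = gamma.sum(axis=0)                            # 0th order: N_c
F = gamma.T @ frames                             # 1st order: F_c
F_tilde = F - N[:, None] * ubm.means_            # centralized: Σ_t γ (o_t - m_c)
S = gamma.T @ frames**2                          # diag 2nd order, per mixture
```

Flattening F_tilde row-wise into an MD-dimensional supervector yields the centralized first-order statistic consumed by the EM sketch on the previous slide.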