Automatic Speaker Recognition (PPT lecture notes)

• Introduction
• The i-vector methodology of speaker recognition
• The d-vector methodology of speaker recognition
• The end-to-end methodology of speaker recognition
• Inter-speaker variability in speaker recognition
• Example of variations in speaker recognition
• State-of-the-art approach in SRE


Introduction: Feature extraction
• Converting the raw speech signal into a sequence of acoustic feature vectors carrying characteristic information about the signal
[Figure: processing chain applied to each frame: signal → framing → window → FFT → Mel filterbank (26 channels) → log → DCT]
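The per-frame chain on this slide (framing, windowing, FFT, Mel filterbank, log, DCT) can be sketched in a few lines of NumPy. This is a minimal illustration, not the lecturer's implementation: the 26 filterbank channels come from the slide's diagram, while the 16 kHz sample rate, 25 ms frames, 10 ms step, Hamming window, 512-point FFT, and 13 retained coefficients are assumptions chosen because they are common defaults. The function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):          # rising edge
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):         # falling edge
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def mfcc(signal, sample_rate=16000, frame_len=0.025, frame_step=0.010,
         n_fft=512, n_filters=26, n_ceps=13):
    """Return an (n_frames, n_ceps) array; assumes len(signal) >= one frame."""
    flen = int(round(frame_len * sample_rate))
    fstep = int(round(frame_step * sample_rate))
    n_frames = 1 + (len(signal) - flen) // fstep
    # Framing: each row of idx selects the samples of one frame.
    idx = np.arange(flen)[None, :] + fstep * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(flen)    # windowing
    # Power spectrum of each (zero-padded) frame.
    power = np.abs(np.fft.rfft(frames, n_fft, axis=1)) ** 2 / n_fft
    # Mel filterbank energies, then log compression.
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(energies + 1e-10)
    # Unnormalized DCT-II over the filterbank axis, keeping n_ceps coefficients.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), 2 * n + 1) / (2 * n_filters))
    return log_energies @ basis.T
```

Calling `mfcc(signal)` on a one-dimensional NumPy array of samples yields one feature vector per frame, which is the "sequence of acoustic feature vectors" the slide refers to.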

Introduction: Pattern matching
• Main approaches in pattern matching for speaker recognition:
  • Template matching: Vector quantization [F. Soong, 1985]
  • Probabilistic model: Gaussian Mixture Model [A. Reynolds, 2003]; Joint factor analysis [P. Kenny, 2006]; i-vector [N. Dehak, 2011]
  • Artificial Neural Network: d-vector [Variani, 2014]; End-to-end [G. Heigold, 2016]

The i-vector methodology of speaker recognition
• Over recent years, the i-vector approach has demonstrated state-of-the-art performance for speaker recognition.
[Figure: diagram with the labels GMM-UBM, JFA, i-vector framework, LDA, Cosine, PLDA]
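"Cosine" appears among the labels on this slide. Assuming it refers to cosine similarity scoring between an enrollment i-vector and a test i-vector, the scoring step might look like the sketch below. The function names and the decision threshold of 0.5 are hypothetical; in practice a threshold would be tuned on held-out trials.

```python
import numpy as np

def cosine_score(w_enroll, w_test):
    """Cosine similarity between two i-vectors (1-D arrays of equal length)."""
    w_enroll = np.asarray(w_enroll, dtype=float)
    w_test = np.asarray(w_test, dtype=float)
    return float(w_enroll @ w_test /
                 (np.linalg.norm(w_enroll) * np.linalg.norm(w_test)))

def verify(w_enroll, w_test, threshold=0.5):
    """Accept the identity claim when the score reaches the (assumed) threshold."""
    return cosine_score(w_enroll, w_test) >= threshold
```

Because the score depends only on the angle between the two vectors, rescaling either i-vector leaves the decision unchanged.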

Joint factor analysis
• A supervector for a speaker should be decomposable into speaker-independent, speaker-dependent, channel-dependent, and residual components
• Each component is represented by low-dimensional factors, which operate along the principal dimensions of the corresponding component
• Speaker-dependent component, known as the eigenvoice, and the corresponding factors: each speaker factor controls an eigendimension of the eigenvoice matrix
[Figure: eigenvoice matrix with columns V1, V2, ... and the low-dimensional eigenvoice factors]

• GMM supervector u for a speaker can be decomposed as

$$u = m + Vy + Ux + Dz$$

Here u is the speaker supervector, m the speaker-independent component, Vy the speaker-dependent component, Ux the channel-dependent component, and Dz the speaker-dependent residual component.

Where:
• m is a speaker-independent supervector from the UBM
• V is the eigenvoice matrix
• y ∼ N(0, I) is the speaker factor vector
• U is the eigenchannel matrix
• x ∼ N(0, I) is the channel factor vector
• D is the residual matrix, and is diagonal
• z ∼ N(0, I) is the speaker-specific residual factor vector
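To make the decomposition concrete, the sketch below synthesizes supervectors from u = m + Vy + Ux + Dz using random toy matrices. All dimensions and values are assumptions for illustration only; a real system would estimate m, V, U, and D from training data. The example draws two sessions of the same speaker (same y and z, different channel factors x) and shows that they differ only within the eigenchannel subspace.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes (assumed): 4 mixtures x 3 feature dimensions = 12-dim supervector.
n_mix, feat_dim = 4, 3
sv_dim = n_mix * feat_dim
n_speaker_factors, n_channel_factors = 2, 2

m = rng.standard_normal(sv_dim)                       # speaker-independent (UBM) supervector
V = rng.standard_normal((sv_dim, n_speaker_factors))  # eigenvoice matrix
U = rng.standard_normal((sv_dim, n_channel_factors))  # eigenchannel matrix
D = np.diag(rng.uniform(0.1, 0.5, size=sv_dim))       # diagonal residual matrix

def supervector(y, x, z):
    """u = m + V y + U x + D z"""
    return m + V @ y + U @ x + D @ z

# One speaker: fixed speaker factors y and residual factors z, all drawn from N(0, I).
y = rng.standard_normal(n_speaker_factors)
z = rng.standard_normal(sv_dim)

# Two recording sessions: different channel factors x.
x1 = rng.standard_normal(n_channel_factors)
x2 = rng.standard_normal(n_channel_factors)
u1 = supervector(y, x1, z)
u2 = supervector(y, x2, z)

# The session-to-session difference is explained entirely by U.
session_diff = u1 - u2
```

Here `session_diff` equals `U @ (x1 - x2)`, which is why estimating and removing the Ux term is a way to compensate for channel variability.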
