Deep generative model and speech information factorization

Speaker: Associate Professor Dong Wang, Tsinghua University
Venue: Room 312, Main Teaching Building
Start time: 2019-12-04 15:00
End time: 2019-12-04 17:00

About the speaker:

Prof. Dong Wang is an associate professor at Tsinghua University and deputy dean of the Center for Speech and Language Technologies (CSLT) at Tsinghua University. He obtained his bachelor's and master's degrees at Tsinghua University and his PhD at the University of Edinburgh in 2010. Prof. Wang has worked at Oracle China, IBM China, EURECOM France, and Nuance US. He has worked on speech processing since 1998 and has published more than 140 academic papers. He is the chair of the APSIPA SLA track and serves as a distinguished lecturer for 2018-2019.

Abstract:

State-of-the-art speech processing technologies, including speech recognition, language recognition, and speaker recognition, are mostly based on large-scale deep neural nets trained on large amounts of data. This approach, however, cannot fully utilize the information embedded in speech signals, which is assumed to be highly complex and convolved in an unknown way. Recently, we found that a deep generative model can effectively simulate the speech production process, paving the way for factorizing speech signals into independent information factors. This new approach integrates generative and discriminative models and combines the capabilities of neural nets and Bayesian methods.
