2021年语音合成论文统计（1~2月） - 代码天地

2021年语音合成论文统计（1~2月）

其他 2021-03-25 21:45:20 阅读次数: 0

论文统计每月第一周更新一次，主要跟踪语音合成的发展状况(很多文章都是在会议后才发出，但不影响统计。统计过程难免存在疏漏，因此统计结果仅供参考。读者有什么建议可以直接向我发消息，我将不断修改该统计。历年文章统计可访问 http://yqli.tech/page/tts_paper.html）。如有转载，请注明出处。欢迎关注微信公众号：低调奋进。

语音合成文章情况表（单位：篇）

		1月	2月
前端	多音字，韵律，g2p等等。	1	0
声学模型	语言特征转声学特征，attention工作以及双重学习	1	7
声码器	波形生成	1	3
个性化	少数据，脏数据应用等	1	1
多语言	多语言多说话人模型	0	0
歌唱合成	歌唱和音乐合成	0	1
情感	风格和情感	2	2
多模态	talking head等等	2	1
声音转换	基于GAN方案和特征解耦方案	4	2
其它	基于EEG合成，数据，MOS评测以及语音合成的应用	1	1

文章列表：

1月

		类型
1	Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis	am
2	Polyphone Disambiguition in Mandarin Chinese with Semi-Supervised Learning	frontend
3	Generating coherent spontaneous speech and gesture from text	multimodality
4	Creating Song From Lip and Tongue Videos With a Convolutional Vocoder	multimodality
5	On Interfacing the Brain with Quantum Computers: An Approach to Listen to the Logic of the Mind	other
6	Whispered and Lombard Neural Speech Synthesis	expression
7	Expressive Neural Voice Cloning	expression/ personalization
8	High-Quality Vocoding Design with Signal Processing for Speech Synthesis and Voice Conversion	vc
9	EmoCat: Language-agnostic Emotional Voice Conversion	vc
10	Hierarchical disentangled representation learning for singing voice conversion	vc
11	Adversarially learning disentangled speech representations for robust multi-factor voice conversion	vc
12	Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss	vocoder

2月

1	Triple M: A Practical Neural Text-to-speech System With Multi-guidance Attention And Multi-band Multi-time Lpcnet	am
2	Mixture Density Network for Phone-Level Prosody Modelling in Speech Synthesis	am
3	VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention	am
4	Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input	am
5	Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech	am
6	LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search	am
7	Data-Efficient Training Strategies for Neural TTS Systemsmatch	am
8	Model architectures to extrapolate emotional expressions in DNN-based text-to-speech	expression
9	Model architectures to extrapolate emotional expressions in DNN-based text-to-speech	expression
10	SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer	modal
11	MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network	other
12	Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning	personalization
13	Anyone GAN Sing	sing
14	Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram	vc
15	Investigating Deep Neural Structures and their Interpretability in the Domain of Voice Conversion	vc
16	Universal Neural Vocoding with Parallel WaveNet	vocoder
17	LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation	vocoder
18	High-Quality Vocoding Design with Signal Processing for Speech Synthesis and Voice Conversion	vocoder

猜你喜欢

转载自blog.csdn.net/liyongqiang2420/article/details/114940548

2021年语音合成论文统计（1~2月）

2021年语音合成论文统计（1~3月）

论文翻译-语音合成：Tacotron 2

2021年1月总结2月计划

语音合成论文优选：ICASSP 2021 M2VoC 第2名Investigating on Incorporating Pretrained and Learnable Speaker Repres

语音合成论文优选：ICASSP 2021 M2VoC文章 CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge

论文翻译-语音合成：Char2Wav

2021年1月-11月总结

2021年1月程序员工资统计出炉，你过平均线了吗？

论文翻译-语音合成：Tacotron

论文翻译-语音合成：WaveNet

TTS | 语音合成论文概述

Java2021年1月

IPFS矿工之家：IPFS各矿商挖矿FIL币数据统计表2021年1月15日

2021年2月程序员工资统计（喜大普奔，各位大佬工资都张了吧）

语音合成模型小抄(1)

语音合成论文优选：Anyone GAN Sing

论文阅读_语音合成_VALLE-X

论文阅读_语音合成_VALL-E

论文阅读_语音合成_Spear-TTS

crispr-cas9基因编辑技术最新进展（2021年1月-2月）

【阅读论文】Tacotron2，结合wavenet通过mel频谱实现自然语音合成

小红书APP接口-2021年1月25日

快手APP接口-2021年1月25日

2021年1月25日博客日记

GitHub趋势榜（2021年1月上旬）

2021年1月24日博客日记

2021年1月23日自我流放

2021年1月15日 jenkins 添加windows 节点

2021年1月15日安装 zabbix 和 grafana

今日推荐

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

最强开源大模型 Llama 3 上架 Gitee AI

虽然老乡鸡开源的不是代码，但背后的原因却让人很暖心

周排行

决策树的部分理解

STM32软件IIC的实现

RocketMQ原理解析-HA

vue-动态路由（路由的传参和接参）

利用python对Excel中的特定数据提取并写入新表

【Ubuntu】 Ubuntu16.04搭建NFS服务

Elasticsearch基础操作与对应的curl命令行，python对接实现

JVM数据存储结构 & Java的值传递和址传递

yum命令使用指南

java基础（一）：java语法基础

每日归档

更多

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)

2024-04-16(70)

2024-04-15(42)