会议:2021 interspeech
作者:Manh Luong
单位:VinAI Research, Hanoi, Vietnam
Abstract
基于VAE的说话人特征和内容的解耦,认为同一说话人两句话中的说话人信息是一样的,内容是不一样的。说话人特征+内容=一句话包含的信息。
Method
# NOTE(review): fragment of the VAE forward pass; `x1`/`x2` are two
# utterances assumed to be from the same speaker, and `train` toggles
# stochastic sampling — their definitions are outside this snippet.
# style_mu1/style_logvar1 are a dimension-split of one fc layer's output;
# content_mu1/content_logvar1 are likewise a split of another fc output.
style_mu1, style_logvar1, content_mu1, content_logvar1 = self.encode(x1)
z_content1 = self._reparameterize(content_mu1, content_logvar1, train)
# Encoded information of a second utterance by the same speaker.
style_mu2, style_logvar2, content_mu2, content_logvar2 = self.encode(x2)
z_content2 = self._reparameterize(content_mu2, content_logvar2, train)
# Average the speaker (style) mean/log-variance over the two utterances.
# The second utterance's stats are detached first, so gradients reach the
# style branch only through x1 — presumably intentional; confirm vs. paper.
style_mu2 = style_mu2.detach()
style_logvar2 = style_logvar2.detach()
z_style_mu = (style_mu1 + style_mu2)/2
z_style_logvar = (style_logvar1 + style_logvar2)/2
z_style = self._reparameterize(z_style_mu, z_style_logvar)
# Concatenate the shared speaker vector with each utterance's content.
z1 = torch.cat((z_style, z_content1), dim=-1)
z2 = torch.cat((z_style, z_content2), dim=-1)
## parameters of the approximate posterior q(z|x1) for sample 1
q_z1_mu = torch.cat((z_style_mu, content_mu1), dim=-1)
q_z1_logvar = torch.cat((z_style_logvar, content_logvar1), dim=-1)
## parameters of the approximate posterior q(z|x2) for sample 2
q_z2_mu = torch.cat((z_style_mu, content_mu2), dim=-1)
q_z2_logvar = torch.cat((z_style_logvar, content_logvar2), dim=-1)
recons_x1 = self.decode(z1)
recons_x2 = self.decode(z2)
其中
# 高斯采样
def _reparameterize(self, mu, logvar, train=True):
if train:
epsilon = Variable(torch.empty(logvar.size()).normal_()).cuda()
std = logvar.mul(0.5).exp_()
return epsilon.mul(std).add_(mu)
else:
return mu