arXiv每日推荐-3.2:语音/音频每日论文速递

同步公众号(arXiv每日学术速递)
【1】 A.I. based Embedded Speech to Text Using Deepspeech
标题:使用DeepSpeech的基于人工智能的嵌入式语音到文本
作者: Muhammad Hafidh Firmansyah, Gul Malik Urfa
链接:https://arxiv.org/abs/2002.12830

【2】 Deep Residual-Dense Lattice Network for Speech Enhancement
标题:用于语音增强的深层剩余稠密网格网络
作者: Mohammad Nikzad, Fanhua Shang
备注:8 pages, Accepted by AAAI-2020
链接:https://arxiv.org/abs/2002.12794

【3】 Multi-Modal Continuous Valence And Arousal Prediction in the Wild Using Deep 3D Features and Sequence Modeling
标题:使用深度3D特征和序列建模的多模态连续配价和野外唤醒预测
作者: Sowmya Rasipuram, Anutosh Maitra
链接:https://arxiv.org/abs/2002.12766

【4】 Towards Learning a Universal Non-Semantic Representation of Speech
标题:学习语音的普遍非语义表征
作者: Joel Shor, Yinnon Haviv
链接:https://arxiv.org/abs/2002.12764

【5】 DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team
标题:DIHARD II仍然很难:实验结果和来自dku-Lenovo团队的讨论
作者: Qingjian Lin, Ming Li
备注:Submitted to Odyssesy 2020
链接:https://arxiv.org/abs/2002.12761

【6】 A Novel Decision Tree for Depression Recognition in Speech
标题:一种新的用于语音抑郁识别的决策树
作者: Zhenyu Liu, Bin Hu
链接:https://arxiv.org/abs/2002.12759

【7】 Speech Synthesis using EEG
标题:基于EEG的语音合成
作者: Gautam Krishna, Mason Carnahan
备注:Accepted for publication at IEEE ICASSP 2020
链接:https://arxiv.org/abs/2002.12756

【8】 Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
标题:多说话人文语转换合成中用于自动质量估计的语音表示的比较
作者: Jennifer Williams, Simon King
备注:submitted to Odyssey 2020
链接:https://arxiv.org/abs/2002.1264

原文链接:https://zhuanlan.zhihu.com/p/110269386

发布了48 篇原创文章 · 获赞 29 · 访问量 3040

猜你喜欢

转载自blog.csdn.net/weixin_35894210/article/details/104610710
今日推荐