1.71K subscribers
15.5K photos
10 videos
16 files
8.37K links
ArXiv Papers Related to Computer Science, AI, Deep Learning, Computer Vision, NLP, etc

Admins:
@ffarzaddh
Download Telegram
All papers published in past two days:
Category: Sound
#Sound


🗒 VoxCeleb2: Deep Speaker Recognition
👥 Joon Son Chung, Arsha Nagrani, Andrew Zisserman
📗 PDF



AI Python & arXiv Channel
Latest Published Articles:
Sound
#Sound


🗒 VoxCeleb2: Deep Speaker Recognition
👥 Joon Son Chung, Arsha Nagrani, Andrew Zisserman
📗 PDF


🗒 Multi-View Networks for Denoising of Arbitrary Numbers of Channels
👥 Jonah Casebeer, Brian Luc, Paris Smaragdis
📗 PDF


🗒 A data-driven approach to mid-level perceptual musical feature modeling
👥 Anna Aljanaki, Mohammad Soleymani
📗 PDF


🗒 Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids
👥 Mathew Shaji Kavalekalam, Jesper K. Nielsen, Jesper B. Boldt, Mads G. Christensen
📗 PDF


🗒 Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
👥 Wei-Ning Hsu, Hao Tang, James Glass
📗 PDF


🗒 A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
👥 Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass
📗 PDF


🗒 Capsule Routing for Sound Event Detection
👥 Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang
📗 PDF


🗒 Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
👥 Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu
📗 PDF


🗒 Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
👥 Shiyu Zhou, Shuang Xu, Bo Xu
📗 PDF


🗒 The NES Music Database: A multi-instrumental dataset with expressive performance attributes
👥 Chris Donahue, Huanru Henry Mao, Julian McAuley
📗 PDF


🗒 Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models
👥 Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin
📗 PDF


🗒 Analysis of Length Normalization in End-to-End Speaker Verification System
👥 Weicheng Cai, Jinkun Chen, Ming Li
📗 PDF


🗒 Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
👥 Daniel Stoller, Sebastian Ewert, Simon Dixon
📗 PDF


🗒 On sound-based interpretation of neonatal EEG
👥 Sergi Gomez, Mark O'Sullivan, Emanuel Popovici, Sean Mathieson, Geraldine Boylan, Andriy Temko
📗 PDF


🗒 StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
👥 Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
📗 PDF


#Sound
AI Python & arXiv Channel