arXiv

All papers published in the past two days:
Category: Sound
#Sound

🗒 VoxCeleb2: Deep Speaker Recognition
👥 Joon Son Chung, Arsha Nagrani, Andrew Zisserman
📗 PDF

AI Python & arXiv Channel

114 views · 03:00

arXiv

Latest Published Articles:
Sound
#Sound

🗒 VoxCeleb2: Deep Speaker Recognition
👥 Joon Son Chung, Arsha Nagrani, Andrew Zisserman
📗 PDF

🗒 Multi-View Networks for Denoising of Arbitrary Numbers of Channels
👥 Jonah Casebeer, Brian Luc, Paris Smaragdis
📗 PDF

🗒 A data-driven approach to mid-level perceptual musical feature modeling
👥 Anna Aljanaki, Mohammad Soleymani
📗 PDF

🗒 Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids
👥 Mathew Shaji Kavalekalam, Jesper K. Nielsen, Jesper B. Boldt, Mads G. Christensen
📗 PDF

🗒 Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
👥 Wei-Ning Hsu, Hao Tang, James Glass
📗 PDF

🗒 A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
👥 Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass
📗 PDF

🗒 Capsule Routing for Sound Event Detection
👥 Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang
📗 PDF

🗒 Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
👥 Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu
📗 PDF

🗒 Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
👥 Shiyu Zhou, Shuang Xu, Bo Xu
📗 PDF

🗒 The NES Music Database: A multi-instrumental dataset with expressive performance attributes
👥 Chris Donahue, Huanru Henry Mao, Julian McAuley
📗 PDF

🗒 Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models
👥 Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin
📗 PDF

🗒 Analysis of Length Normalization in End-to-End Speaker Verification System
👥 Weicheng Cai, Jinkun Chen, Ming Li
📗 PDF

🗒 Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
👥 Daniel Stoller, Sebastian Ewert, Simon Dixon
📗 PDF

🗒 On sound-based interpretation of neonatal EEG
👥 Sergi Gomez, Mark O'Sullivan, Emanuel Popovici, Sean Mathieson, Geraldine Boylan, Andriy Temko
📗 PDF

🗒 StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
👥 Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
📗 PDF

#Sound
AI Python & arXiv Channel

132 views · 15:52

arXiv

Latest Published Articles:
Sound
#Sound

🗒 Learning Transposition-Invariant Interval Features from Symbolic Music and Audio
👥 Stefan Lattner, Maarten Grachten, Gerhard Widmer
📗 PDF

🗒 Towards Automated Single Channel Source Separation using Neural Networks
👥 Arpita Gang, Pravesh Biyani, Akshay Soni
📗 PDF

🗒 Synthesizing Diverse, High-Quality Audio Textures
👥 Joseph Antognini, Matt Hoffman, Ron J. Weiss
📗 PDF

🗒 Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
👥 Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio
📗 PDF

🗒 A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
👥 Eduardo Fonseca, Rong Gong, Xavier Serra
📗 PDF

🗒 Speaker Adapted Beamforming for Multi-Channel Automatic Speech Recognition
👥 Tobias Menne, Ralf Schlüter, Hermann Ney
📗 PDF

🗒 End-to-End Speech Recognition From the Raw Waveform
👥 Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux
📗 PDF

🗒 Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices
👥 Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino
📗 PDF

🗒 A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
👥 Branislav Gerazov, Gérard Bailly, Yi Xu
📗 PDF

🗒 Towards an efficient deep learning model for musical onset detection
👥 Rong Gong, Xavier Serra
📗 PDF

🗒 Towards multi-instrument drum transcription
👥 Richard Vogl, Gerhard Widmer, Peter Knees
📗 PDF

🗒 Cover Song Synthesis by Analogy
👥 Christopher J. Tralie
📗 PDF

🗒 Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
👥 Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu
📗 PDF

🗒 A 5-Dimensional Tonnetz for Nearly Symmetric Hexachords
👥 Vaibhav Mohanty
📗 PDF

🗒 Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
👥 Hiroaki Nakajima, Yu Takahashi, Kazunobu Kondo, Yuji Hisaminato
📗 PDF

#Sound
AI Python & arXiv Channel

101 views · 12:13

arXiv

5 of the Latest Published Articles:
Sound
#Sound

🗒 Text-Independent Speaker Verification Based on Deep Neural Networks and Segmental Dynamic Time Warping
👥 Mohamed Adel, Mohamed Afify, Akram Gaballah
📗 PDF

🗒 Conditioning Deep Generative Raw Audio Models for Structured Automatic Music
👥 Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis
📗 PDF

🗒 Frame-level Instrument Recognition by Timbre and Pitch
👥 Yun-Ning Hung, Yi-Hsuan Yang
📗 PDF

🗒 Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder
👥 Stephen Sinclair
📗 PDF

🗒 Convolutional Neural Networks to Enhance Coded Speech
👥 Ziyue Zhao, Huijun Liu, Tim Fingscheidt
📗 PDF

#Sound
AI Python & arXiv Channel

75 views · 18:56

arXiv

5 of the Latest Published Articles:
Sound
#Sound

🗒 Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation
👥 Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata
📗 PDF

🗒 Evaluating the Effects of Material Sonification in Tactile Devices
👥 Rodrigo Martín, Michael Weinmann, Matthias B. Hullin
📗 PDF

🗒 A Computational Study of the Role of Tonal Tension in Expressive Piano Performance
👥 Carlos Cancino-Chacón, Maarten Grachten
📗 PDF

🗒 Exploring End-to-End Techniques for Low-Resource Speech Recognition
👥 Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy
📗 PDF

🗒 An energy-based generative sequence model for testing sensory theories of Western harmony
👥 Peter M. C. Harrison, Marcus T. Pearce
📗 PDF

#Sound
AI Python & arXiv Channel

67 views · 12:56

arXiv

To access the articles in a category, tap its hashtag:

📗 Artificial Intelligence
👉 #ArtificialIntelligence 👈

📗 Hardware Architecture
👉 #HardwareArchitecture 👈

📗 Computational Complexity
👉 #ComputationalComplexity 👈

📗 Computational Engineering, Finance, and Science
👉 #ComputationalEngineeringFinanceandScience 👈

📗 Computational Geometry
👉 #ComputationalGeometry 👈

📗 Computation and Language
👉 #ComputationandLanguage 👈

📗 Cryptography and Security
👉 #CryptographyandSecurity 👈

📗 Computer Vision and Pattern Recognition
👉 #ComputerVisionandPatternRecognition 👈

📗 Computers and Society
👉 #ComputersandSociety 👈

📗 Databases
👉 #Databases 👈

📗 Distributed, Parallel, and Cluster Computing
👉 #DistributedParallelandClusterComputing 👈

📗 Digital Libraries
👉 #DigitalLibraries 👈

📗 Discrete Mathematics
👉 #DiscreteMathematics 👈

📗 Data Structures and Algorithms
👉 #DataStructuresandAlgorithms 👈

📗 Emerging Technologies
👉 #EmergingTechnologies 👈

📗 Formal Languages and Automata Theory
👉 #FormalLanguagesandAutomataTheory 👈

📗 General Literature
👉 #GeneralLiterature 👈

📗 Graphics
👉 #Graphics 👈

📗 Computer Science and Game Theory
👉 #ComputerScienceandGameTheory 👈

📗 Human-Computer Interaction
👉 #Human-ComputerInteraction 👈

📗 Information Retrieval
👉 #InformationRetrieval 👈

📗 Information Theory
👉 #InformationTheory 👈

📗 Learning
👉 #Learning 👈

📗 Logic in Computer Science
👉 #LogicinComputerScience 👈

📗 Multiagent Systems
👉 #MultiagentSystems 👈

📗 Multimedia
👉 #Multimedia 👈

📗 Mathematical Software
👉 #MathematicalSoftware 👈

📗 Numerical Analysis
👉 #NumericalAnalysis 👈

📗 Neural and Evolutionary Computing
👉 #NeuralandEvolutionaryComputing 👈

📗 Networking and Internet Architecture
👉 #NetworkingandInternetArchitecture 👈

📗 Other Computer Science
👉 #OtherComputerScience 👈

📗 Operating Systems
👉 #OperatingSystems 👈

📗 Performance
👉 #Performance 👈

📗 Programming Languages
👉 #ProgrammingLanguages 👈

📗 Robotics
👉 #Robotics 👈

📗 Symbolic Computation
👉 #SymbolicComputation 👈

📗 Sound
👉 #Sound 👈

📗 Software Engineering
👉 #SoftwareEngineering 👈

📗 Social and Information Networks
👉 #SocialandInformationNetworks 👈

📗 Systems and Control
👉 #SystemsandControl 👈

♨️ arXiv Channel

573 views · edited 13:11
