All papers published in past two days:
Category: Sound
#Sound
π VoxCeleb2: Deep Speaker Recognition
π₯ Joon Son Chung, Arsha Nagrani, Andrew Zisserman
π PDF
AI Python & arXiv Channel
Category: Sound
#Sound
π VoxCeleb2: Deep Speaker Recognition
π₯ Joon Son Chung, Arsha Nagrani, Andrew Zisserman
π PDF
AI Python & arXiv Channel
Latest Published Articles:
Sound
#Sound
π VoxCeleb2: Deep Speaker Recognition
π₯ Joon Son Chung, Arsha Nagrani, Andrew Zisserman
π PDF
π Multi-View Networks for Denoising of Arbitrary Numbers of Channels
π₯ Jonah Casebeer, Brian Luc, Paris Smaragdis
π PDF
π A data-driven approach to mid-level perceptual musical feature modeling
π₯ Anna Aljanaki, Mohammad Soleymani
π PDF
π Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids
π₯ Mathew Shaji Kavalekalam, Jesper K. Nielsen, Jesper B. Boldt, Mads G. Christensen
π PDF
π Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
π₯ Wei-Ning Hsu, Hao Tang, James Glass
π PDF
π A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
π₯ Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass
π PDF
π Capsule Routing for Sound Event Detection
π₯ Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang
π PDF
π Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
π₯ Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu
π PDF
π Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
π₯ Shiyu Zhou, Shuang Xu, Bo Xu
π PDF
π The NES Music Database: A multi-instrumental dataset with expressive performance attributes
π₯ Chris Donahue, Huanru Henry Mao, Julian McAuley
π PDF
π Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models
π₯ Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin
π PDF
π Analysis of Length Normalization in End-to-End Speaker Verification System
π₯ Weicheng Cai, Jinkun Chen, Ming Li
π PDF
π Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
π₯ Daniel Stoller, Sebastian Ewert, Simon Dixon
π PDF
π On sound-based interpretation of neonatal EEG
π₯ Sergi Gomez, Mark O'Sullivan, Emanuel Popovici, Sean Mathieson, Geraldine Boylan, Andriy Temko
π PDF
π StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
π₯ Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π VoxCeleb2: Deep Speaker Recognition
π₯ Joon Son Chung, Arsha Nagrani, Andrew Zisserman
π PDF
π Multi-View Networks for Denoising of Arbitrary Numbers of Channels
π₯ Jonah Casebeer, Brian Luc, Paris Smaragdis
π PDF
π A data-driven approach to mid-level perceptual musical feature modeling
π₯ Anna Aljanaki, Mohammad Soleymani
π PDF
π Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids
π₯ Mathew Shaji Kavalekalam, Jesper K. Nielsen, Jesper B. Boldt, Mads G. Christensen
π PDF
π Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
π₯ Wei-Ning Hsu, Hao Tang, James Glass
π PDF
π A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
π₯ Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass
π PDF
π Capsule Routing for Sound Event Detection
π₯ Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang
π PDF
π Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
π₯ Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu
π PDF
π Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages
π₯ Shiyu Zhou, Shuang Xu, Bo Xu
π PDF
π The NES Music Database: A multi-instrumental dataset with expressive performance attributes
π₯ Chris Donahue, Huanru Henry Mao, Julian McAuley
π PDF
π Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models
π₯ Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin
π PDF
π Analysis of Length Normalization in End-to-End Speaker Verification System
π₯ Weicheng Cai, Jinkun Chen, Ming Li
π PDF
π Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
π₯ Daniel Stoller, Sebastian Ewert, Simon Dixon
π PDF
π On sound-based interpretation of neonatal EEG
π₯ Sergi Gomez, Mark O'Sullivan, Emanuel Popovici, Sean Mathieson, Geraldine Boylan, Andriy Temko
π PDF
π StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
π₯ Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
π PDF
#Sound
AI Python & arXiv Channel
Latest Published Articles:
Sound
#Sound
π Learning Transposition-Invariant Interval Features from Symbolic Music and Audio
π₯ Stefan Lattner, Maarten Grachten, Gerhard Widmer
π PDF
π Towards Automated Single Channel Source Separation using Neural Networks
π₯ Arpita Gang, Pravesh Biyani, Akshay Soni
π PDF
π Synthesizing Diverse, High-Quality Audio Textures
π₯ Joseph Antognini, Matt Hoffman, Ron J. Weiss
π PDF
π Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
π₯ Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges LinarΓ¨s, Renato De Mori, Yoshua Bengio
π PDF
π A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
π₯ Eduardo Fonseca, Rong Gong, Xavier Serra
π PDF
π Speaker Adapted Beamforming for Multi-Channel Automatic Speech Recognition
π₯ Tobias Menne, Ralf SchlΓΌter, Hermann Ney
π PDF
π End-to-End Speech Recognition From the Raw Waveform
π₯ Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux
π PDF
π Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices
π₯ Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino
π PDF
π A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
π₯ Branislav Gerazov, GΓ©rard Bailly, Yi Xu
π PDF
π Towards an efficient deep learning model for musical onset detection
π₯ Rong Gong, Xavier Serra
π PDF
π Towards multi-instrument drum transcription
π₯ Richard Vogl, Gerhard Widmer, Peter Knees
π PDF
π Cover Song Synthesis by Analogy
π₯ Christopher J. Tralie
π PDF
π Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
π₯ Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu
π PDF
π A 5-Dimensional Tonnetz for Nearly Symmetric Hexachords
π₯ Vaibhav Mohanty
π PDF
π Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
π₯ Hiroaki Nakajima, Yu Takahashi, Kazunobu Kondo, Yuji Hisaminato
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Learning Transposition-Invariant Interval Features from Symbolic Music and Audio
π₯ Stefan Lattner, Maarten Grachten, Gerhard Widmer
π PDF
π Towards Automated Single Channel Source Separation using Neural Networks
π₯ Arpita Gang, Pravesh Biyani, Akshay Soni
π PDF
π Synthesizing Diverse, High-Quality Audio Textures
π₯ Joseph Antognini, Matt Hoffman, Ron J. Weiss
π PDF
π Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
π₯ Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges LinarΓ¨s, Renato De Mori, Yoshua Bengio
π PDF
π A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
π₯ Eduardo Fonseca, Rong Gong, Xavier Serra
π PDF
π Speaker Adapted Beamforming for Multi-Channel Automatic Speech Recognition
π₯ Tobias Menne, Ralf SchlΓΌter, Hermann Ney
π PDF
π End-to-End Speech Recognition From the Raw Waveform
π₯ Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux
π PDF
π Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices
π₯ Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino
π PDF
π A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
π₯ Branislav Gerazov, GΓ©rard Bailly, Yi Xu
π PDF
π Towards an efficient deep learning model for musical onset detection
π₯ Rong Gong, Xavier Serra
π PDF
π Towards multi-instrument drum transcription
π₯ Richard Vogl, Gerhard Widmer, Peter Knees
π PDF
π Cover Song Synthesis by Analogy
π₯ Christopher J. Tralie
π PDF
π Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
π₯ Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu
π PDF
π A 5-Dimensional Tonnetz for Nearly Symmetric Hexachords
π₯ Vaibhav Mohanty
π PDF
π Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
π₯ Hiroaki Nakajima, Yu Takahashi, Kazunobu Kondo, Yuji Hisaminato
π PDF
#Sound
AI Python & arXiv Channel
5 of Latest Published Articles:
Sound
#Sound
π Text-Independent Speaker Verification Based on Deep Neural Networks and Segmental Dynamic Time Warping
π₯ Mohamed Adel, Mohamed Afify, Akram Gaballah
π PDF
π Conditioning Deep Generative Raw Audio Models for Structured Automatic Music
π₯ Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis
π PDF
π Frame-level Instrument Recognition by Timbre and Pitch
π₯ Yun-Ning Hung, Yi-Hsuan Yang
π PDF
π Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder
π₯ Stephen Sinclair
π PDF
π Convolutional Neural Networks to Enhance Coded Speech
π₯ Ziyue Zhao, Huijun Liu, Tim Fingscheidt
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Text-Independent Speaker Verification Based on Deep Neural Networks and Segmental Dynamic Time Warping
π₯ Mohamed Adel, Mohamed Afify, Akram Gaballah
π PDF
π Conditioning Deep Generative Raw Audio Models for Structured Automatic Music
π₯ Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis
π PDF
π Frame-level Instrument Recognition by Timbre and Pitch
π₯ Yun-Ning Hung, Yi-Hsuan Yang
π PDF
π Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder
π₯ Stephen Sinclair
π PDF
π Convolutional Neural Networks to Enhance Coded Speech
π₯ Ziyue Zhao, Huijun Liu, Tim Fingscheidt
π PDF
#Sound
AI Python & arXiv Channel
5 of Latest Published Articles:
Sound
#Sound
π Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation
π₯ Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata
π PDF
π Evaluating the Effects of Material Sonification in Tactile Devices
π₯ Rodrigo MartΓn, Michael Weinmann, Matthias B. Hullin
π PDF
π A Computational Study of the Role of Tonal Tension in Expressive Piano Performance
π₯ Carlos Cancino-ChacΓ³n, Maarten Grachten
π PDF
π Exploring End-to-End Techniques for Low-Resource Speech Recognition
π₯ Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy
π PDF
π An energy-based generative sequence model for testing sensory theories of Western harmony
π₯ Peter M. C. Harrison, Marcus T. Pearce
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation
π₯ Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata
π PDF
π Evaluating the Effects of Material Sonification in Tactile Devices
π₯ Rodrigo MartΓn, Michael Weinmann, Matthias B. Hullin
π PDF
π A Computational Study of the Role of Tonal Tension in Expressive Piano Performance
π₯ Carlos Cancino-ChacΓ³n, Maarten Grachten
π PDF
π Exploring End-to-End Techniques for Low-Resource Speech Recognition
π₯ Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy
π PDF
π An energy-based generative sequence model for testing sensory theories of Western harmony
π₯ Peter M. C. Harrison, Marcus T. Pearce
π PDF
#Sound
AI Python & arXiv Channel
To access articles related to a category, touch the HashTag:
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
5 of Latest Published Articles:
Sound
#Sound
π Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network
π₯ Shinnosuke Takamichi, Yuki Saito, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
π PDF
π Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals
π₯ SΓΆren Becker, Marcel Ackermann, Sebastian Lapuschkin, Klaus-Robert MΓΌller, Wojciech Samek
π PDF
π On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition
π₯ Hao Tang, James Glass
π PDF
π Foreign English Accent Adjustment by Learning Phonetic Patterns
π₯ Fedor Kitashov, Elizaveta Svitanko, Debojyoti Dutta
π PDF
π Approximate k-space models and Deep Learning for fast photoacoustic reconstruction
π₯ Andreas Hauptmann, Ben Cox, Felix Lucka, Nam Huynh, Marta Betcke, Paul Beard, Simon Arridge
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network
π₯ Shinnosuke Takamichi, Yuki Saito, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
π PDF
π Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals
π₯ SΓΆren Becker, Marcel Ackermann, Sebastian Lapuschkin, Klaus-Robert MΓΌller, Wojciech Samek
π PDF
π On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition
π₯ Hao Tang, James Glass
π PDF
π Foreign English Accent Adjustment by Learning Phonetic Patterns
π₯ Fedor Kitashov, Elizaveta Svitanko, Debojyoti Dutta
π PDF
π Approximate k-space models and Deep Learning for fast photoacoustic reconstruction
π₯ Andreas Hauptmann, Ben Cox, Felix Lucka, Nam Huynh, Marta Betcke, Paul Beard, Simon Arridge
π PDF
#Sound
AI Python & arXiv Channel
Forwarded from arXiv
To access articles related to a category, touch the HashTag:
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
5 of Latest Published Articles:
Sound
#Sound
π Auto-adaptive Resonance Equalization using Dilated Residual Networks
π₯ Maarten Grachten, Emmanuel Deruty, Alexandre Tanguy
π PDF
π Unified Hypersphere Embedding for Speaker Recognition
π₯ Mahdi Hajibabaei, Dengxin Dai
π PDF
π Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model
π₯ Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
π PDF
π Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
π₯ Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-yi Lee, Lin-shan Lee
π PDF
π A Fully Convolutional Neural Network Approach to End-to-End Speech Enhancement
π₯ Frank Longueira, Sam Keene
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Auto-adaptive Resonance Equalization using Dilated Residual Networks
π₯ Maarten Grachten, Emmanuel Deruty, Alexandre Tanguy
π PDF
π Unified Hypersphere Embedding for Speaker Recognition
π₯ Mahdi Hajibabaei, Dengxin Dai
π PDF
π Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model
π₯ Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
π PDF
π Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
π₯ Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-yi Lee, Lin-shan Lee
π PDF
π A Fully Convolutional Neural Network Approach to End-to-End Speech Enhancement
π₯ Frank Longueira, Sam Keene
π PDF
#Sound
AI Python & arXiv Channel
Forwarded from arXiv
To access articles related to a category, touch the HashTag:
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
5 of Latest Published Articles:
Sound
#Sound
π Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
π₯ Emad M. Grais, Hagen Wierstorf, Dominic Ward, Russell Mason, Mark D. Plumbley
π PDF
π Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
π₯ Herman Kamper
π PDF
π End-to-end Models with auditory attention in Multi-channel Keyword Spotting
π₯ Haitong Zhang, Junbo Zhang, Yujun Wang
π PDF
π Sequence-to-sequence Models for Small-Footprint Keyword Spotting
π₯ Haitong Zhang, Junbo Zhang, Yujun Wang
π PDF
π Deep Learning for Tube Amplifier Emulation
π₯ Eero-Pekka DamskΓ€gg, Lauri Juvela, Etienne Thuillier, Vesa VΓ€limΓ€ki
π PDF
#Sound
AI Python & arXiv Channel
Sound
#Sound
π Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
π₯ Emad M. Grais, Hagen Wierstorf, Dominic Ward, Russell Mason, Mark D. Plumbley
π PDF
π Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
π₯ Herman Kamper
π PDF
π End-to-end Models with auditory attention in Multi-channel Keyword Spotting
π₯ Haitong Zhang, Junbo Zhang, Yujun Wang
π PDF
π Sequence-to-sequence Models for Small-Footprint Keyword Spotting
π₯ Haitong Zhang, Junbo Zhang, Yujun Wang
π PDF
π Deep Learning for Tube Amplifier Emulation
π₯ Eero-Pekka DamskΓ€gg, Lauri Juvela, Etienne Thuillier, Vesa VΓ€limΓ€ki
π PDF
#Sound
AI Python & arXiv Channel
Forwarded from arXiv
To access articles related to a category, touch the HashTag:
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel
π Artificial Intelligence
π #ArtificialIntelligence π
π Hardware Architecture
π #HardwareArchitecture π
π Computational Complexity
π #ComputationalComplexity π
π Computational Engineering, Finance, and Science
π #ComputationalEngineeringFinanceandScience π
π Computational Geometry
π #ComputationalGeometry π
π Computation and Language
π #ComputationandLanguage π
π Cryptography and Security
π #CryptographyandSecurity π
π Computer Vision and Pattern Recognition
π #ComputerVisionandPatternRecognition π
π Computers and Society
π #ComputersandSociety π
π Databases
π #Databases π
π Distributed, Parallel, and Cluster Computing
π #DistributedParallelandClusterComputing π
π Digital Libraries
π #DigitalLibraries π
π Discrete Mathematics
π #DiscreteMathematics π
π Data Structures and Algorithms
π #DataStructuresandAlgorithms π
π Emerging Technologies
π #EmergingTechnologies π
π Formal Languages and Automata Theory
π #FormalLanguagesandAutomataTheory π
π General Literature
π #GeneralLiterature π
π Graphics
π #Graphics π
π Computer Science and Game Theory
π #ComputerScienceandGameTheory π
π Human-Computer Interaction
π #Human-ComputerInteraction π
π Information Retrieval
π #InformationRetrieval π
π Information Theory
π #InformationTheory π
π Learning
π #Learning π
π Logic in Computer Science
π #LogicinComputerScience π
π Multiagent Systems
π #MultiagentSystems π
π Multimedia
π #Multimedia π
π Mathematical Software
π #MathematicalSoftware π
π Numerical Analysis
π #NumericalAnalysis π
π Neural and Evolutionary Computing
π #NeuralandEvolutionaryComputing π
π Networking and Internet Architecture
π #NetworkingandInternetArchitecture π
π Other Computer Science
π #OtherComputerScience π
π Operating Systems
π #OperatingSystems π
π Performance
π #Performance π
π Programming Languages
π #ProgrammingLanguages π
π Robotics
π #Robotics π
π Symbolic Computation
π #SymbolicComputation π
π Sound
π #Sound π
π Software Engineering
π #SoftwareEngineering π
π Social and Information Networks
π #SocialandInformationNetworks π
π Systems and Control
π #SystemsandControl π
β¨οΈ arXiv Channel