Deep Learning Publication Navigator - subtopic: speech

Year   Title  Author
2017   Restricted Boltzmann Machines for Vector Representation of Speech in Speaker Recognition  O Ghahabi, J Hernando
2017   Using deep neural networks to estimate tongue movements from speech face motion  C Kroos, RL Bundgaard
2017   Spectral-Domain Speech Enhancement for Speech Recognition  CH You, MA Bin
2017   An Improved Supervised Speech Separation Method Based on Perceptual Weighted Deep Recurrent Neural Networks  HAN Wei, X ZHANG, SUN Meng, LI Li, SHI Wenhua
2017   Discriminative Acoustic Features for Deployable Speech Recognition  A Faria
2017   On DNN Posterior Probability Combination In Multi-Stream Speech Recognition For Reverberant Environments  F Xiong, S Goetze, BT Meyer
2017   Automatic Parameter Switching of Noise Reduction for Speech Recognition  T Kawase, K Niwa, Y Hioka, K Kobayashi
2017   Parametric Representation Of The Speaker's Lips For Multimodal Sign Language And Speech Recognition  D Ryumin, AA Karpov
2017   Building An Assistant Mobile Application For Teaching Arabic Pronunciation Using A New Approach For Arabic Speech Recognition  B ALKHATIB, M KAWAS, A ALNAHHAS, R BONDOK
2017   Robust automatic speech recognition based on neural network in reverberant environments  L Bai, HL Li, YY He
2017   Accent Recognition System Using Deep Belief Networks for Telugu Speech Signals  K Mannepalli, PN Sastry, M Suman
2017   Speech recognition using deep neural network-recent trends  M Sarma
2017   Monolithic 3D IC designs for low-power deep neural networks targeting speech recognition  K Chang, D Kadetotad, Y Cao, J Seo, SK Lim
2017   Hierarchical Bayesian Combination of Plug-in Maximum A Posteriori Decoders in Deep Neural Networks-based Speech Recognition and Speaker Adaptation  Z Huang, SM Siniscalchi, CH Lee
2017   Random Deep Belief Networks for Recognizing Emotions from Speech Signals  G Wen, H Li, J Huang, D Li, E Xun
2017   Representations of language in a model of visually grounded speech signal  G Chrupała, L Gelderloos, A Alishahi
2017   Robust Speech Recognition Using Generative Adversarial Networks  A Sriram, H Jun, Y Gaur, S Satheesh 
2017   Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder  MS Al
2017   Speech Enhancement Using Bayesian Wavenet  K Qian, Y Zhang, S Chang, X Yang, D Florêncio
2017   Deep Bidirectional LSTM based RNN for Casual Speech to Clear Speech conversion  S Singh
2017   Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning  Z Wen, K Li, Z Huang, CH Lee, J Tao
2017   UTD-CRSS submission for MGB-3 Arabic dialect identification: Front-end and back-end advancements on broadcast speech  AE Bulut, Q Zhang, C Zhang, F Bahmaninezhad
2017   Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest  R Venkatesan, AB Ganesh 
2017   Speech Emotion Recognition Based on PSO-optimized SVM  C LI, F LIU, Y WANG, W Hui, Q ZHANG
2017   Learning spectro-temporal features with 3D CNNs for speech emotion recognition  J Kim, KP Truong, G Englebienne, V Evers
2017   Updating the silent speech challenge benchmark with deep learning  Y Ji, L Liu, H Wang, Z Liu, Z Niu, B Denby
2017   A wearable bone-conducted speech enhancement system for strong background noises  B Huang, Y Gong, J Sun, Y Shen
2017   Reducing speech recognition latency  MME Bisani, HE Secker
2017   A study on target feature activation and normalization and their impacts on the performance of DNN based speech dereverberation systems  B Wu, K Li, M Yang, CH Lee
2017   Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks  S Pillai
2017   Supervised Speech Separation Based on Deep Learning: An Overview  DL Wang, J Chen
2017   Attention-Based End-to-End Speech Recognition in Mandarin  C Shan, J Zhang, Y Wang, L Xie
2017   Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition  S Ganapathy
2017   Recurrent Convolutional Neural Network For Speech Processing  Y Zhao, X Jin, X Hu
2017   Deep Temporal Architecture for Audiovisual Speech Recognition  C Tian, Y Yuan, X Lu 
2017   Feature fusion methods research based on deep belief networks for speech emotion recognition under noise condition  Y Huang, K Tian, A Wu, G Zhang 
2017   Multi-resolution spectral input for convolutional neural network-based speech recognition  L Tóth
2017   Generative modeling of speech using neural networks  S Matsoukas, N Ström, A Rastrow, SVSSR Garimella
2017   Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech  M Neumann, NT Vu
2017   Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic Speech Recognition  A Thanda, SM Venkatesan
2017   An Unsupervised Deep Domain Adaptation Approach for Robust Speech Recognition  S Sun, B Zhang, L Xie, Y Zhang
2017   Speech Intention Classification with Multimodal Deep Learning  Y Gu, X Li, S Chen, J Zhang, I Marsic
2017   A comparison of perceptually motivated loss functions for binary mask estimation in speech separation  B Milner, D Websdale
2017   Nasal Speech Sounds Detection Using Connectionist Temporal Classification  M Cernak, S Tong
2017   Single-Channel Speech Enhancement Based on Sparse Regressive Deep Neural Network  H Sun, S Li
2017   Speech Quality Assessment Over Lossy Transmission Channels Using Deep Belief Networks  ET Affonso, RL Rosa, DZ Rodríguez 
2017   HMM/MLP speech recognition system using a novel data clustering approach  L Lazli, M Boukadoum, OA Mohamed
2017   Learning Modality-Invariant Representations for Speech and Images  K Leidal, D Harwath, J Glass 
2017   Kernel Approximation Methods for Speech Recognition  A May, AB Garakani, Z Lu, D Guo, K Liu, A Bellet, L Fan
2017   Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading  C Tian, W Ji, Y Yuan
2017   Learning Auxiliary Categorical Information For Speech Synthesis Based On Deep And Recurrent Neural Networks  Z Wen, K Li, Z Huang, J Tao, CH Lee
2017   Speech recognition in a dialog system: from conventional to deep processing  A Becerra, JI de la Rosa, E González
2017   Guest Editorial for the special issue on Multi-Microphone Speech Recognition in Everyday Environments  J Barker, R Marxer, E Vincent, S Watanabe
2017   Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework  S Yang, L Xie, X Chen, X Lou, D Huang, H Li
2017   Feature joint-state posterior estimation in factorial speech processing models using deep neural networks  M Khademian, MM Homayounpour
2017   Listening to the World Improves Speech Command Recognition  B McMahan, D Rao 
2017   14.4 A scalable speech recognizer with deep-neural-network acoustic models and voice-activated power gating  M Price, J Glass, AP Chandrakasan
2017   Lip2AudSpec: Speech reconstruction from silent lip movements video  H Akbari, H Arora, L Cao, N Mesgarani 
2017   Stimulated Training For Automatic Speech Recognition And Keyword Search In Limited Resource Conditions  A Ragni, C Wu, MJF Gales, J Vasilakes, KM Knill
2017   Content-Based Image Retrieval and Speech Enhancement System Using Deep Learning Structure  X Zhao
2017   Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments  Z Zhang, J Geiger, J Pohjalainen, AED Mousa
2017   Evaluating deep learning architectures for Speech Emotion Recognition  HM Fayek, M Lech, L Cavedon
2017   Deep LSTM for Large Vocabulary Continuous Speech Recognition  X Tian, J Zhang, Z Ma, Y He, J Wei, P Wu, W Situ, S Li
2017   Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition  C Donahue, B Li, R Prabhavalkar 
2017   Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks  Y Saito, S Takamichi, H Saruwatari
2017   A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement  JM Valin
2017   Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis  B Bollepalli, L Juvela, P Alku
2017   Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN  L Zhu, L Chen, D Zhao, J Zhou, W Zhang
2017   Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks  M Kolbæk, D Yu, ZH Tan, J Jensen
2017   Language and Noise Transfer in Speech Enhancement Generative Adversarial Network  S Pascual, M Park, J Serrà, A Bonafonte, KH Ahn 
2017   On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition  AMC Martinez, SH Mallidi, BT Meyer
2017   A breakthrough in Speech emotion recognition using Deep Retinal Convolution Neural Networks  Y Niu, D Zou, Y Niu, Z He, H Tan
2017   Towards Speech Emotion Recognition “in the wild” using Aggregated Corpora and Deep Multi-Task Learning  J Kim, G Englebienne, KP Truong, V Evers
2017   Paralinguistic Analysis of Children's Speech in Natural Environments  H Rao, MA Clements, Y Li, MR Swanson, J Piven
2017   Advanced Data Exploitation in Speech Analysis: An overview  Z Zhang, N Cummins, B Schuller
2017   Unsupervised Iterative Deep Learning of Speech Features and Acoustic Tokens with Applications to Spoken Term Detection  CT Chung, CY Tsai, CH Liu, LS Lee
2017   A Study on Speech Classification Based on Deep Neural Network under Adverse Environments  K Phapatanaburi
2017   Tamil Speech Emotion Recognition Using Deep Belief Network (DBN)  M Srikanth, D Pravena, D Govind
2017   A Wavenet for Speech Denoising  D Rethage, J Pons, X Serra
2017   Deep Learning for Hate Speech Detection in Tweets  P Badjatiya, S Gupta, M Gupta, V Varma
2017   Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis  Y Zheng, Y Li, Z Wen, B Liu, J Tao
2017   Deep Learning Based Binaural Speech Separation in Reverberant Environments  X Zhang, DL Wang
2017   Prediction-based learning for continuous emotion recognition in speech  J Han, Z Zhang, F Ringeval, B Schuller
2017   Multiple-target deep learning for LSTM-RNN based speech enhancement  L Sun, J Du, LR Dai, CH Lee
2017   Modelling speech emotion recognition using logistic regression and decision trees  A Jacob
2017   Deep Speech Recognition  L Deng
2017   Dynamic pruning in speech recognition  JS Kramer, AD Rosen, KJ Basye
2017   End to End Deep Neural Networks Radio receiver for Speech Signals  D Elbaz, M Zibulevsky
2017   Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification  D Michelsanti, ZH Tan
2017   Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation  WN Hsu, Y Zhang, J Glass
2017   Lying Speech Characteristic Extraction Based on SSAE Deep Learning Model  Y Zhou, H Zhao, L Shang
2017   An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition  B Wu, K Li, F Ge, Z Huang, M Yang, SM Siniscalchi
2017   Improved voicing decision using glottal activity features for statistical parametric speech synthesis  N Adiga, BK Khonglah, SRM Prasanna
2017   Unsupervised modulation filter learning for noise-robust speech recognition  P Agrawal, S Ganapathy
2017   Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition  L Sun, J Du, Z Xie, Y Xu
2017   Raw Waveform-based Speech Enhancement by Fully Convolutional Networks  SW Fu, Y Tsao, X Lu, H Kawai
2017   Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System  M Zimmermann, M Mehdipour Ghazi, HK Ekenel
2017   Automatic speaker identification using speech recognition features  HE Secker
2017   Deep learning and SVM‐based emotion recognition from Chinese speech for smart affective services  W Zhang, D Zhao, Z Chai, LT Yang, X Liu, F Gong
2017   Analysis and modeling for robust whispered speech recognition  S Ghaffarzadegan
2017   A Roadmap for Privacy Preserving Speech Processing  C Glackin, G Chollet, N Dugan, N Cannings, J Wall
2017   I-vector estimation as auxiliary task for multi-task learning based acoustic modeling for automatic speech recognition  G Pironkov, S Dupont, T Dutoit
2017   End-To-End Speech Recognition Applied To Brazilian Portuguese Using Deep Learning  IM Quintanilha
2017   Learning Representations of Emotional Speech with Deep Convolutional Generative Adversarial Networks  J Chang, S Scherer
2017   Deep features-based speech emotion recognition for smart affective services  AM Badshah, N Rahim, N Ullah, J Ahmad 
2017   On the relevance of auditory-based Gabor features for deep learning in robust speech recognition  AM Castro Martinez, SH Mallidi, BT Meyer
2017   An optimization method for speech enhancement based on deep neural network  H Sun, S Li
2017   Improving the Performance of Deep Learning Based Speech Enhancement System Using Fuzzy Restricted Boltzmann Machine  S Samui, I Chakrabarti, SK Ghosh 
2017   Direct Speech Reconstruction From Articulatory Sensor Data by Machine Learning  JA Gonzalez, LA Cheah, AM Gomez, PD Green 
2017   Whispered Speech Recognition Using Deep Denoising Autoencoder and Inverse Filtering  ĐT Grozdić, ST Jovičić 
2017   Methods for Addressing Data Diversity in Automatic Speech Recognition  M Doulaty Bashkand
2017   Language Adaptive Multilingual CTC Speech Recognition  M Müller, S Stüker, A Waibel
2016   Towards real-time Speech Emotion Recognition using deep neural networks  HM Fayek, M Lech, L Cavedon
2016   Large-Margin Structured Prediction Extensions of Neural Networks for Automatic Speech Recognition  S Ravuri
2016   A Spiking Network that Learns to Extract Spike Signatures from Speech Signals  A Tavanaei, AS Maida
2016   Perceptual improvement of deep neural networks for monaural speech enhancement  W Han, X Zhang, M Sun, W Shi, X Chen, Y Hu
2016   Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation  X Xiao, S Zhao, DHH Nguyen, X Zhong, DL Jones
2016   Investigating gated recurrent neural networks for speech synthesis  Z Wu, S King
2016   Speech steganalysis using evolutionary restricted Boltzmann machines  C Paulin, SA Selouani, É Hervet
2016   An Experimental Comparison of Deep Neural Networks for End-to-end Speech Recognition  Z Zhang, Z Sun, J Liu, J Chen, Z Huo, X Zhang
2016   Deep Learning for Statistical Parametric Speech Synthesis  ZH Ling
2016   DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis  YJ Hu, ZH Ling
2016   Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition  Y Liu
2016   Feature Transfer Learning for Speech Emotion Recognition  J Deng
2016   Improvements in the Detection of Vowel Onset and Offset Points in a Speech Sequence  A Kumar, S Shahnawazuddin, G Pradhan
2016   Emotion Recognition in Speech with Deep Learning Architectures  M Erdal, M Kächele, F Schwenker
2016   Wide matching - An approach to improving noise robustness for speech enhancement  J Ming, D Crookes
2016   Joint acoustic factor learning for robust deep neural network based automatic speech recognition  S Kundu, G Mantena, Y Qian, T Tan, M Delcroix
2016   Synaptic depression in deep neural networks for speech processing  W Zhang, H Li, M Yang, N Mesgarani
2016   Exploiting LSTM structure in deep neural networks for speech recognition  T He, J Droppo
2016   Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation  S Pascual, A Bonafonte
2016   3WRBM-Based Speech Factor Modeling for Arbitrary-Source and Non-Parallel Voice Conversion  T Nakashika, Y Minami
2016   Towards Improving Low-Resource Speech Recognition Using Articulatory and Language Features  M Müller, S Stüker, A Waibel
2016   Enhanced Factored Three-Way Restricted Boltzmann Machines for Speech Detection  P Sun, J Qin
2016   A Review on Deep Learning Algorithms for Speech and Facial Emotion Recognition  CP Latha, M Priya
2016   Isolated Word Speech Recognition System Using Deep Neural Networks  D Dhanashri, SB Dhonde
2016   Neural networks for proper name retrieval in the framework of automatic speech recognition  D Fohr, I Illina
2016   A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition  A Zeyer, P Doetsch, P Voigtlaender, R Schlüter, H Ney
2016   Toward growing modular deep neural networks for continuous speech recognition  Z Ansari, SA Seyyedsalehi
2016   Speech Emotion Recognition Based on Deep Belief Networks and Wavelet Packet Cepstral Coefficients  Y Huang, A Wu, G Zhang, Y Li
2016   Transfer Learning of Deep Neural Network for Speech Emotion Recognition  Y Huang, M Hu, X Yu, T Wang, C Yang
2016   Efficient memory compression in deep neural networks using coarse-grain sparsification for speech applications  D Kadetotad, S Arunachalam, C Chakrabarti, J Seo
2016   Feature Learning via Deep Belief Network for Chinese Speech Emotion Recognition  S Zhang, X Zhao, Y Chuang, W Guo, Y Chen
2016   ILMSAF Based Speech Enhancement with DNN and Noise Classification  R Li, Y Liu, Y Shi, L Dong, CUI Weili
2016   Personalized Speech recognition on mobile devices  I McGraw, R Prabhavalkar, R Alvarez, MG Arenas
2016   Audio and Speech Classification Applied to Child Sexual Abuse Investigation  O Montin, G Mörtberg
2016   Prosody Utilization in Continuous Speech Recognition  J Bartošek
2016   Deep Complementary Bottleneck Features For Visual Speech Recognition  S Petridis, M Pantic
2016   Visual Speech Feature Representations: Recent Advances  C Sui, M Bennamoun, R Togneri
2016   An efficient noise-robust automatic speech recognition system using artificial neural networks  S Gupta, KM Bhurchandi, AG Keskar
2016   Multi-Task Learning for Speech Recognition: An Overview  G Pironkov, S Dupont, T Dutoit
2016   Domain Adaptation Of Deep Neural Networks For Automatic Speech Recognition Via Wireless Sensors  G Gosztolya, T Grósz
2016   Emotion Recognition from Speech Signals Using Deep Learning Methods  S Pathak, MV Kolhe
2016   Selection of Optimal Array Noise Reduction Parameter Set for Accurate Speech Recognition in Various Noisy Environments  T Kawase, K Niwa, Y Hioka, K Kobayashi
2016   The Appropriate Hidden Layers of Deep Belief Networks for Speech Recognition  Q Wei, H Li, X Zhou
2016   An Analysis of Shallow and Deep Representations of Speech Based on Unsupervised Classification of Isolated Words  G Salvi
2016   Intonation classification for L2 English speech using multi-distribution deep neural networks  K Li, X Wu, H Meng
2016   Non-Invasive Detection of Alzheimer's Disease-Multifractality of Emotional Speech  S Bhaduri, R Das, D Ghosh
2016   Speech Recognition of Continuous Tamil phoneme using DBN  AC Banumathi
2016   Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Generation Error Training  Z Wu, S King
2016   Speech Synthesis Based on Hidden Markov Models and Deep Learning  M Coto
2016   Feature mapping using deep belief networks for robust speech recognition  M Gholmipour, B Nasersharif
2016   Multi-task Deep Learning for User Intention Understanding in Speech Interaction Systems  Y Ning, J Jia, Z Wu, R Li, Y An, Y Wang, H Meng
2016   Deep neural network acoustic models for multi-dialect Arabic speech recognition  NF Hmad
2016   At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech  M Schmitt, F Ringeval, B Schuller
2016   Deep Neural Network Based Phonological Feature Extraction for Bengali Continuous Speech  T Bhowmik, SKD Mandal
2016   Deep Neural Network Approach for Single Channel Speech Enhancement Processing  D Li
2016   Deep learning based parts of speech tagger for Bengali  MF Kabir, K Abdullah
2016   Improving Electroencephalography-Based Imagined Speech Recognition with a Simultaneous Video Data Stream  SJ Stolze
2016   Paralinguistic Event Detection in Children's Speech  H Rao
2016   Investigation of DNN-based audio-visual speech recognition  S TAMURA, H NINOMIYA, N KITAOKA, S OSUGA
2016   Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers  T OCHIAI, S MATSUDA, H WATANABE, LU Xugang
2016   Multi-channel speech enhancement and amplitude modulation analysis for noise robust automatic speech recognition  N Moritz, K Adiloğlu, J Anemüller, S Goetze
2016   Graph based manifold regularized deep neural networks for automatic speech recognition  VS Tomar, RC Rose
2016   Hybrid DNN-Latent structured SVM acoustic models for continuous speech recognition  S Ravuri
2016   Connectionist multivariate density-estimation and its application to speech synthesis  B Uria
2016   A Pairwise Algorithm Using Deep Stacking Network for Speech Separation and Pitch Estimation  X Zhang, H Zhang, S Nie, W Liu
2016   A Comparison between Deep Neural Nets and Kernel Acoustic Models for Speech Recognition  Z Lu, D Guo, AB Garakani, K Liu, A May, A Bellet, L Fan
2016   A Phoneme-Based Pre-Training Approach For Deep Neural Network With Application To Speech Enhancement  SE Chazan, S Gannot, J Goldberger
2016   Automatic Speech Recognition Based on Neural Networks  R Schlüter, P Doetsch, P Golik, M Kitza, T Menne, K Irie
2016   DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi  I Kipyatkova, A Karpov
2016   Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis  BP Tóth, KI Kis, G Szaszák, G Németh
2016   Towards End-to-End Speech Recognition  D PALAZ
2016   An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition  J Gao, J Du, C Kong, H Lu, E Chen, CH Lee
2016   Unsupervised domain adaptation for speech emotion recognition using PCANet  Z Huang, W Xue, Q Mao, Y Zhan
2016   Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training  Z Wu, S King
2016   Segment phoneme classification from speech under noisy conditions: Using amplitude-frequency modulation based two-dimensional auto-regressive features with …  R Rangslang
2016   A Review on Automatic Speech Recognition Architecture and Approaches  S Karpagavalli, E Chandra
2016   Deep Neural Networks For Automatic Detection Of Screams And Shouted Speech In Subway Trains  P Laffitte, D Sodoyer, C Tatkeu, L Girin
2016   Deep Learning Architectures, algorithms for speech recognition: An Overview  B Manickaraj
2016   Wav2Letter: an End-to-End ConvNet-based Speech Recognition System  R Collobert, C Puhrsch, G Synnaeve
2016   Deep Learning Methods For Improving The Perceptual Quality Of Noisy And Reverberant Speech  DS Williamson
2016   Speech recognition capability generation and control  Y Watanabe, A Rajasekaram, R Ramachandran
2016   Improving Speech Recognition Using Limited Accent Diverse British English Training Data With Deep Neural Networks  M Najafian, S Safavi, JHL Hansen, M Russell
2016   A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks  J Du, Y Tu, LR Dai, CH Lee
2016   Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology  A Asaei, M Cernak, H Bourlard
2016   Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition  T Gao, J Du, Y Xu, C Liu, LR Dai, CH Lee
2016   Towards Realizing Sign Language-to-Speech Conversion by Combining Deep Learning and Statistical Parametric Speech Synthesis  X An, H Yang, Z Gan
2016   Paralinguistic Speech Recognition: Classifying Emotion in Speech with Deep Learning Neural Networks  ER Segal
2016   A review on Speech and image classification applications using deep neural networks  EMK Dhaliwal, EA Singla
2016   A Fully Convolutional Neural Network for Speech Enhancement  SR Park, J Lee
2016   Training Maxout Neural Networks for Speech Recognition Tasks  A Prudnikov, M Korenevsky
2016   Investigating Gated Recurrent Networks For Speech Synthesis  Z Wu, S King
2016   A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network  HAN Wei, X ZHANG, MIN Gang, SUN Meng
2016   Multi-task Recurrent Model for Speech and Speaker Recognition  Z Tang, L Li, D Wang
2016   Signal Processing Algorithm-Integrated Deep Neural Network-Based Speech Recognition Apparatus And Learning Method …  H Chung, JG Park, SJ Lee, YK Lee
2016   Noise Robust Speech Recognition Using Deep Belief Networks  M Farahat, R Halavati
2016   Deep Learning Based Emotion Recognition from Chinese Speech  W Zhang, D Zhao, X Chen, Y Zhang
2016   Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces  F Bocquelet, T Hueber, L Girin, C Savariaux, B Yvert
2016   Speech Phoneme Classification by Intelligent Decision-Level Fusion  FAA Laleye, EC Ezin, C Motamed
2016   Investigating Very Deep Highway Networks for Parametric Speech Synthesis  X Wang, S Takaki, J Yamagishi
2016   Speech Emotion Recognition Using Voiced Segment Selection Algorithm  Y Gu, E Postma, HX Lin, J van den Herik
2016   Merlin: An Open Source Neural Network Speech Synthesis System  Z Wu, O Watts, S King
2015   Automatic dysfluency detection in dysarthric speech using deep belief networks  S Oue, R Marxer, F Rudzicz
2015   End-to-End Attention-based Large Vocabulary Speech Recognition  D Bahdanau, J Chorowski, D Serdyuk, P Brakel
2015   Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments  T Gao, J Du, Y Xu, C Liu, LR Dai, CH Lee
2015   New Trends In Machine Learning For Speech Recognition  I GAVAT, D MILITARU
2015   Using Deep Learning for Detecting Spoofing Attacks on Speech Signals  A Godoy, F Simões, JA Stuchi, MA Angeloni, M Uliani
2015   Tabby Talks: an automated tool for the assessment of childhood apraxia of speech  M Shahin, B Ahmed, A Parnandi, V Karappa
2015   Speech Music Discrimination Using an Ensemble of Biased Classifiers  K Kim, A Baijal, BS Ko, S Lee, I Hwang, Y Kim
2015   Deep Belief Neural Networks and Bidirectional Long-Short Term Memory Hybrid for Speech Recognition  Ł BROCKI, K MARASEK
2015   A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis  B Fan, SW Lee, X Tian, L Xie, M Dong
2015   Robust Excitation-Based Features For Automatic Speech Recognition  T Drugman, Y Stylianou, L Chen, X Chen, MJF Gales
2015   Machine Learning in Automatic Speech Recognition: A Survey  J Padmanabhan, MJ Johnson Premkumar
2015   Deep Neural Network Applications on Automatic Speech Recognition  P Wang
2015   Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends  ZH Ling, SY Kang, H Zen, A Senior, M Schuster
2015   Deep learning approaches to problems in speech recognition, computational chemistry, and natural language text processing  GE Dahl
2015   Bayesian Speech and Language Processing  S Watanabe, JT Chien
2015   Prediction, Bayesian inference and feedback in speech recognition  D Norris, JM McQueen, A Cutler
2015   Acoustic Modeling In Statistical Parametric Speech Synthesis–From HMM To LSTM-RNN  H Zen
2015   Multi-task Learning of Deep Neural Networks for Low-resource Speech Recognition  D Chen, B Mak
2015   Discriminative Template Learning in Group-Convolutional Networks for Invariant Speech Representations  C Zhang, S Voinea, G Evangelopoulos, L Rosasco
2015   Audio augmentation for speech recognition  T Ko, V Peddinti, D Povey, S Khudanpur
2015   Transfer Learning for Speech and Language Processing  D Wang, TF Zheng
2015   Linguistic Features Identify Alzheimer's Disease in Narrative Speech  KC Fraser, JA Meltzer, F Rudzicz
2015   Highway Long Short-Term Memory RNNs for Distant Speech Recognition  Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass
2015   Using Language Adaptive Deep Neural Networks for Improved Multilingual Speech Recognition  M Müller, A Waibel
2015   Supervised Speech Separation Using Deep Neural Networks  Y Wang
2015   Speech vocoding for laboratory phonology  M Cernak, Š Beňuš, A Lazaridis
2015   Classifying Phonological Categories In Imagined And Articulated Speech  S Zhao, F Rudzicz
2015   Speaker verification based on the fusion of speech acoustics and inverted articulatory signals  M Li, J Kim, A Lammert, PK Ghosh, V Ramanarayanan
2015   Feature Extraction Using Pre-Trained Convolutive Bottleneck Nets For Dysarthric Speech Recognition  Y Takashima, T Nakashika, T Takiguchi, Y Ariki
2015   Speech processing in Deep Learning and Computational Neuroscience SE367A: Cognitive Science  S Garg, S Gupta
2015   State of the Art in Statistical Methods for Language and Speech Processing  JR Bellegarda, C Monz
2015   Listening With Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines  C Sui, M Bennamoun, R Togneri
2015   Weighted Training For Speech Under Lombard Effect For Speaker Recognition  MM Saleem, G Liu, JHL Hansen
2015   On Using Heterogeneous Data For Vehicle-Based Speech Recognition: A DNN-Based Approach  X Feng, B Richardson, S Amman, J Glass
2015   Training Phoneme Models For Singing With “Songified” Speech Data  AM Kruspe
2015   Frame Based Postprocessor for Speech Recognition Based on Augmented Conditional Random Fields  Y Hifny
2015   Building DNN Acoustic Models for Large Vocabulary Speech Recognition  AL Maas, P Qi, Z Xie, AY Hannun, CT Lengerich
2015   Speech Enhancement Based on Analysis–Synthesis Framework with Improved Parameter Domain Enhancement  B Liu, J Tao, Z Wen, F Mo
2015   Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature  M Mimura, S Sakai, T Kawahara
2015   Acoustic Model Structuring for Improving Automatic Speech Recognition Performance  A Gorin
2015   F0 Modeling In HMM-Based Speech Synthesis System Using Deep Belief Network  S Mukherjee, SKD Mandal
2015   Speech Enhancement Based on Analysis–Synthesis Framework With Improved Pitch Estimation and Spectral Envelope Enhancement  B Liu, F Mo, J Tao
2015   Denoising Convolutional Autoencoders for Noisy Speech Recognition  M Kayser, V Zhong
2015   Model-based Approaches to Robust Speech Recognition in Diverse Environments  Y Wang
2015   Learning Speech Emotion Features by Joint Disentangling-Discrimination  W Xue, Z Huang, X Luo, Q Mao
2015   Duration Refinement for Hybrid Speech Synthesis System using Random Forest  R Zhang, X Lou, Q Wu
2015   Exploiting Low-Dimensional Structures To Enhance DNN Based Acoustic Modeling In Speech Recognition  P Dighe, G Luyet, A Asaei, H Bourlard
2015   Meta-Data Inputs To Front End Processing For Automatic Speech Recognition  D Willett, KJ Lööf, Y Pan, J Pinto, C Gollan
2015   Optimized multi-channel deep neural network with 2D graphical representation of acoustic speech features for emotion recognition  MN Stolar, M Lech, IS Burnett
2015   Deep Multimodal Learning for Audio-Visual Speech Recognition  Y Mroueh, E Marcheret, V Goel
2015   Noisy Training for Deep Neural Networks in Speech Recognition  S Yin, C Liu, Z Zhang, Y Lin, D Wang, J Tejedor, T Fang
2015   Unidirectional Long Short-Term Memory Recurrent Neural Network With Recurrent Output Layer For Low-Latency Speech …  H Zen, H Sak
2015   Improved speech inversion using general regression neural network  S Najnin, B Banerjee
2015   Localization based stereo speech separation using deep networks  Y Yu, W Wang, J Luo, P Feng
2015   Deep learning for speech classification and speaker recognition  MM Saleem
2015   Speech Separation based on Deep Belief Network  W Haijia, Z Xiongwei, Z Liangliang, Z Xia
2015   EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding  Y Miao, M Gowayyed, F Metze
2015   An Iterative Deep Learning Framework For Unsupervised Discovery Of Speech Features And Linguistic Units With Applications On …  CT Chung, CY Tsai, HH Lu, CH Liu, H Lee, L Lee
2015   A study of speaker adaptation for DNN-based speech synthesis  Z Wu, P Swietojanski, C Veaux, S Renals, S King
2015   Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition  K Han, Y He, D Bagchi, E Fosler
2015   Speech Attribute Detection Using Deep Learning  I Kukanov
2015   Deep Speech 2: End-to-End Speech Recognition in English and Mandarin  D Amodei, R Anubhai, E Battenberg, C Case, J Casper
2015   A Fast Learning Method for the Multi-layer Perceptron in Automatic Speech Recognition Systems  C Cai, Y Xu, D Ke, K Su
2015   Lie Detection from Speech Analysis Based on K–SVD Deep Belief Network Model  Y Zhou, H Zhao, X Pan
2015   Speech reconstruction from human auditory cortex with deep neural networks  M Yang, SA Sheth, CA Schevon, GMMK II
2015   The NAIST English Speech Recognition System for IWSLT 2015  M Heck, QT Do, S Sakti, G Neubig, S Nakamura
2015   Maxout neurons for deep convolutional and LSTM neural networks in speech recognition  M Cai, J Liu
2015   Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding  DA Barreda, JEG Lainez, D Sharma, P Naylor
2015   Emotion recognition from embedded bodily expressions and speech during dyadic interactions  PM Müller, S Amin, P Verma, M Andriluka, A Bulling
2015   Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization …  JH Hansen
2015   Integrating Acoustic and State-Transition Models for Free Phone Recognition in L2 English Speech Using Multi-Distribution Deep Neural Networks  K Li, X Qian, S Kang, P Liu, H Meng
2015   An investigation on DNN-derived bottleneck features for GMM-HMM based robust speech recognition  Y You, Y Qian, T He, K Yu
2015   Lip movement generation using restricted Boltzmann machines for visual speech synthesis  ZC Liu, ZH Ling, LR Dai
2015   Local trajectory based speech enhancement for robust speech recognition with deep neural network  Y You, Y Qian, K Yu
2015   Analysis of CNN-based Speech Recognition System using Raw Speech as Input  D Palaz, MM Doss, R Collobert
2015   Be at Odds? Deep and Hierarchical Neural Networks for Classification and Regression of Conflict in Speech  R Brueckner, B Schuller
2015   Teaching laryngeal endoscopy skills to speech and language therapists: applying learning theory to optimize practical skills mastery  HF Robinson, R Dennick
2015   Deep NMF For Speech Separation  J Le Roux, JR Hershey, F Weninger
2015   Phase-Sensitive And Recognition-Boosted Speech Separation Using Deep Recurrent Neural Networks  H Erdogan, JR Hershey, S Watanabe, J Le Roux
2015   Speech emotion recognition with unsupervised feature learning  Z Huang, W Xue, Q Mao
2015   Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder  H Wang, Y Zhao, XF Liu, XN Xu, L Wang, N Zhou
2015   Merging of Native and Non-native Speech for Low-resource Accented ASR  SS Juan, L Besacier, B Lecouteux, TP Tan
2015   Soft context clustering for F0 modeling in HMM-based speech synthesis  S Khorram, H Sameti, S King
2015   Attention-Based Models for Speech Recognition  J Chorowski, D Bahdanau, D Serdyuk, K Cho
2015   Development and Assessment of Visual Articulatory Biofeedback Systems for Speech Therapy  D Fabre, T Hueber, M Canault, N Bedoin, P Badin
2015   Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition  L Lu, S Renals
2015   The LIUM English-to-French Spoken Language Translation System and the Vecsys/LIUM Automatic Speech Recognition System for Italian Language for IWSLT 2014  A Rousseau, L Barrault, P Deléglise, Y Esteve
2015   A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech  N Kumar, SS Narayanan
2015   Strategies for distant speech recognition in reverberant environments  M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto
2015   Robust ASR using neural network based speech enhancement and feature simulation  S Sivasankaran, AA Nugraha, E Vincent
2015   Towards Structured Deep Neural Network for Automatic Speech Recognition  YH Liao, HY Lee, L Lee
2015   Deep learning in acoustic modeling for Automatic Speech Recognition and Understanding-an overview  I Gavat, D Militaru
2015   Fast adaptation of deep neural network based on discriminant codes for speech recognition  S Xue, O Abdel
2015   Deep Neural Networks Employing Multi-Task Learning And Stacked Bottleneck Features For Speech Synthesis  Z Wu, C Valentini
2014   Learning linearly separable features for speech recognition using convolutional neural networks  D Palaz, MM Doss, R Collobert
2014   End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results  J Chorowski, D Bahdanau, K Cho, Y Bengio
2014   Speech Emotion Recognition Using CNN  Z Huang, M Dong, Q Mao, Y Zhan
2014   Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers  Y Tu, J Du, Y Xu, L Dai, CH Lee
2014   Decision tree based state tying for speech recognition using DNN derived embeddings  X Li, X Wu
2014   Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition  S Xue, H Jiang, L Dai
2014   Labeling unsegmented sequence data with DNN-HMM and its application for speech recognition  X Li, X Wu
2014   Investigation of stochastic Hessian-Free optimization in Deep neural networks for speech recognition  Z You, B Xu
2014   Cross-language transfer learning for deep neural network based speech enhancement  Y Xu, J Du, LR Dai, CH Lee
2014   Improving generation performance of speech emotion recognition by denoising autoencoders  L Chao, J Tao, M Yang, Y Li
2014   Cross-language speech attribute detection and phone recognition for Tibetan using deep learning  H Wang, Y Zhao, Y Xu, X Xu, X Suo, Q Ji
2014   A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors  Y Wang, J Du, L Dai, CH Lee
2014   Mapping between ultrasound and vowel speech using DNN framework  X Zheng, J Wei, W Lu, Q Fang, J Dang
2014   Parametric Speech Synthesis Using Local and Global Sparse Gaussian  T Koriyama, T Nose, T Kobayashi
2014   A critical examination of deep learning approaches to automated speech recognition  NGA Layouss
2014   Improving Acoustic Model for Vietnamese Large Vocabulary Continuous Speech Recognition System Using Deep Bottleneck Features  QB Nguyen, TT Vu, CM Luong
2014   A Deep Learning Approach to Data-driven Parameterizations for Statistical Parametric Speech Synthesis  PK Muthukumar, AW Black
2014   Computational modeling and validation of the motor contribution to speech perception  L Badino, AD Ausilio, L Fadiga, G Metta
2014   Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks  Y Jiang, D Wang, R Liu, Z Feng
2014   Deep convolutional neural networks for large-scale speech tasks  TN Sainath, B Kingsbury, G Saon, H Soltau
2014   DeepSpeech: Scaling up end-to-end speech recognition  A Hannun, C Case, J Casper, B Catanzaro, G Diamos
2014   Audio-visual speech recognition using deep learning  K Noda, Y Yamaguchi, K Nakadai, HG Okuno, T Ogata
2014   Deep neural network adaptation for children's and adults' speech recognition  R Serizel, D Giuliani
2014   The relation of eye gaze and face pose: Potential impact on speech recognition  M Slaney, A Stolcke, D Hakkani
2014   Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array  W Li, L Wang, Y Zhou, J Dines, M Magimai
2014   Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks  D Palaz, R Collobert
2014   Dysarthric Speech Recognition Using a Convolutive Bottleneck Network  T Nakashika, T Yoshioka, T Takiguchi, Y Ariki
2014   Ensemble Learning Approaches in Speech Recognition  Y Zhao, J Xue, X Chen
2014   Neural Network Based Pitch Tracking In Very Noisy Speech  K Han, D Wang
2014   An Investigation of Implementation and Performance Analysis of DNN Based Speech Synthesis System  Z Chen, K Yu
2014   Speech Separation of A Target Speaker Based on Deep Neural Networks  Y Tu, J Du, Y Xu, L Dai, CH Lee
2014   Deep Neural Network Based Speech Separation for Robust Speech Recognition  Y Tu, J Du, Y Xu, L Dai, CH Lee
2014   Supervised Speech Separation And Processing  K Han
2014   Extracting Deep Bottleneck Features For Visual Speech Recognition  C Sui, R Togneri, M Bennamoun
2014   A Study of Deep Belief Network Based Chinese Speech Emotion Recognition  B Chen, Q Yin, P Guo
2014   Automatic Speech Recognition: A Deep Learning Approach  D Yu, L Deng
2014   Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments  A Schwarz, C Huemmer, R Maas, W Kellermann