BetterScholar
18
Title Level Year L/Y
Recurrent neural network based language model
Tomas Mikolov, M. Karafiát, L. Burget, J. Černocký, S. Khudanpur
12 2010 12
SRILM - an extensible language modeling toolkit
A. Stolcke
12 2002 12
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
7 auth. Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, E. D. Cubuk, ... Quoc V. Le
11 2019 11
Conformer: Convolution-augmented Transformer for Speech Recognition
11 auth. Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, ... Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang
11 2020 11
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
Hasim Sak, A. Senior, F. Beaufays
11 2014 11
VoxCeleb: A Large-Scale Speaker Identification Dataset
Arsha Nagrani, Joon Son Chung, Andrew Zisserman
11 2017 11
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung, Arsha Nagrani, Andrew Zisserman
11 2018 11
A database of German emotional speech
F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, Benjamin Weiss
11 2005 11
The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
D. Pearce, H. Hirsch
11 2000 11
LSTM Neural Networks for Language Modeling
M. Sundermeyer, Ralf Schlüter, H. Ney
10 2012 10
Tacotron: Towards End-to-End Speech Synthesis
14 auth. Yuxuan Wang, R. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, N. Jaitly, Zongheng Yang, Y. Xiao, Z. Chen, Samy Bengio, ... Quoc V. Le, Yannis Agiomyrgiannakis, R. Clark, R. Saurous
10 2017 10
ESPnet: End-to-End Speech Processing Toolkit
12 auth. Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Y. Unno, ... Nelson Yalta, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai
10 2018 10
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques, Jenthe Thienpondt, Kris Demuynck
10 2020 10
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual, A. Bonafonte, J. Serrà
10 2017 10
Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi
Michael McAuliffe, Michaela Socolof, Sarah Mihuc, M. Wagner, Morgan Sonderegger
10 2017 10
Audio augmentation for speech recognition
Tom Ko, Vijayaditya Peddinti, Daniel Povey, S. Khudanpur
10 2015 10
One billion word benchmark for measuring progress in statistical language modeling
7 auth. Ciprian Chelba, Tomas Mikolov, M. Schuster, Qi Ge, T. Brants, P. Koehn, ... T. Robinson
10 2013 10
Analysis of i-vector Length Normalization in Speaker Recognition Systems
D. Garcia-Romero, C. Espy-Wilson
10 2011 10