Recurrent neural network based language model
Tomas Mikolov,
M. Karafiát,
L. Burget,
J. Černocký,
S. Khudanpur
|
12 |
2010 |
12
2010
|
SRILM - an extensible language modeling toolkit
A. Stolcke
|
12 |
2002 |
12
2002
|
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
7 auth.
Daniel S. Park,
William Chan,
Yu Zhang,
Chung-Cheng Chiu,
Barret Zoph,
E. D. Cubuk,
...
Quoc V. Le
|
11 |
2019 |
11
2019
|
Conformer: Convolution-augmented Transformer for Speech Recognition
11 auth.
Anmol Gulati,
James Qin,
Chung-Cheng Chiu,
Niki Parmar,
Yu Zhang,
Jiahui Yu,
...
Wei Han,
Shibo Wang,
Zhengdong Zhang,
Yonghui Wu,
Ruoming Pang
|
11 |
2020 |
11
2020
|
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
Hasim Sak,
A. Senior,
F. Beaufays
|
11 |
2014 |
11
2014
|
VoxCeleb: A Large-Scale Speaker Identification Dataset
Arsha Nagrani,
Joon Son Chung,
Andrew Zisserman
|
11 |
2017 |
11
2017
|
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung,
Arsha Nagrani,
Andrew Zisserman
|
11 |
2018 |
11
2018
|
A database of German emotional speech
F. Burkhardt,
A. Paeschke,
M. Rolfes,
W. Sendlmeier,
Benjamin Weiss
|
11 |
2005 |
11
2005
|
The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
D. Pearce,
H. Hirsch
|
11 |
2000 |
11
2000
|
LSTM Neural Networks for Language Modeling
M. Sundermeyer,
Ralf Schlüter,
H. Ney
|
10 |
2012 |
10
2012
|
Tacotron: Towards End-to-End Speech Synthesis
14 auth.
Yuxuan Wang,
R. Skerry-Ryan,
Daisy Stanton,
Yonghui Wu,
Ron J. Weiss,
N. Jaitly,
Zongheng Yang,
Y. Xiao,
Z. Chen,
Samy Bengio,
...
Quoc V. Le,
Yannis Agiomyrgiannakis,
R. Clark,
R. Saurous
|
10 |
2017 |
10
2017
|
ESPnet: End-to-End Speech Processing Toolkit
12 auth.
Shinji Watanabe,
Takaaki Hori,
Shigeki Karita,
Tomoki Hayashi,
Jiro Nishitoba,
Y. Unno,
...
Nelson Yalta,
Jahn Heymann,
Matthew Wiesner,
Nanxin Chen,
Adithya Renduchintala,
Tsubasa Ochiai
|
10 |
2018 |
10
2018
|
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques,
Jenthe Thienpondt,
Kris Demuynck
|
10 |
2020 |
10
2020
|
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual,
A. Bonafonte,
J. Serrà
|
10 |
2017 |
10
2017
|
Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi
Michael McAuliffe,
Michaela Socolof,
Sarah Mihuc,
M. Wagner,
Morgan Sonderegger
|
10 |
2017 |
10
2017
|
Audio augmentation for speech recognition
Tom Ko,
Vijayaditya Peddinti,
Daniel Povey,
S. Khudanpur
|
10 |
2015 |
10
2015
|
One billion word benchmark for measuring progress in statistical language modeling
7 auth.
Ciprian Chelba,
Tomas Mikolov,
M. Schuster,
Qi Ge,
T. Brants,
P. Koehn,
...
T. Robinson
|
10 |
2013 |
10
2013
|
Analysis of i-vector Length Normalization in Speaker Recognition Systems
D. Garcia-Romero,
C. Espy-Wilson
|
10 |
2011 |
10
2011
|