Anomaly-based annotation error detection in speech-synthesis corpora (English)
- Matoušek, Jindřich
- Tihelka, Daniel
In: Computer Speech and Language ; 46 ; 1-35 ; 2017
- Article (Journal) / Electronic Resource
- Title: Anomaly-based annotation error detection in speech-synthesis corpora
- Contributors: Matoušek, Jindřich (author) / Tihelka, Daniel (author)
- Published in: Computer Speech and Language ; 46 ; 1-35
- Publisher: Elsevier Ltd
- Publication date: 2017-04-11
- Size: 35 pages
- Type of media: Article (Journal)
- Type of material: Electronic Resource
- Language: English
Table of contents – Volume 46
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1 | Anomaly-based annotation error detection in speech-synthesis corpora | Matoušek, Jindřich / Tihelka, Daniel | 2017
- 36 | Reversible speaker de-identification using pre-trained transformation functions | Magariños, Carmen / Lopez-Otero, Paula / Docio-Fernandez, Laura / Rodriguez-Banga, Eduardo / Erro, Daniel / Garcia-Mateo, Carmen et al. | 2017
- 53 | Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models | Zeinali, Hossein / Sameti, Hossein / Burget, Lukáš / Černocký, Jan “Honza” et al. | 2017
- 72 | Adaptive speaker diarization of broadcast news based on factor analysis | Desplanques, Brecht / Demuynck, Kris / Martens, Jean-Pierre et al. | 2017
- 94 | Constructing a Natural Language Inference dataset using generative neural networks | Starc, Janez / Mladenić, Dunja et al. | 2017
- 113 | A time-sensitive historical thesaurus-based semantic tagger for deep semantic annotation | Piao, Scott / Dallachy, Fraser / Baron, Alistair / Demmen, Jane / Wattam, Steve / Durkin, Philip / McCracken, James / Rayson, Paul / Alexander, Marc et al. | 2017
- 136 | Estimation of glottal closure instants from degraded speech using a phase-difference-based algorithm | Anushiya Rachel, G. / Vijayalakshmi, P. / Nagarajan, T. et al. | 2017
- 154 | A segmental framework for fully-unsupervised large-vocabulary speech recognition | Kamper, Herman / Jansen, Aren / Goldwater, Sharon et al. | 2017
- 175 | Towards the next generation of speech tools and corpora | Draxler, Christoph / Harrington, Jonathan / Schiel, Florian et al. | 2017
- 179 | Influence of speaker familiarity on blind and visually impaired children’s and young adults’ perception of synthetic voices | Pucher, Michael / Zillinger, Bettina / Toman, Markus / Schabus, Dietmar / Valentini-Botinhao, Cassia / Yamagishi, Junichi / Schmid, Erich / Woltron, Thomas et al. | 2017
- 196 | Characterisation of voice quality of Parkinson’s disease using differential phonological posterior features | Cernak, Milos / Orozco-Arroyave, Juan Rafael / Rudzicz, Frank / Christensen, Heidi / Vásquez-Correa, Juan Camilo / Nöth, Elmar et al. | 2017
- 209 | Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptation | Kim, Taehwan / Keane, Jonathan / Wang, Weiran / Tang, Hao / Riggle, Jason / Shakhnarovich, Gregory / Brentari, Diane / Livescu, Karen et al. | 2017
- 233 | Scalable algorithms for unsupervised clustering of acoustic data for speech recognition | Rath, Shakti P. et al. | 2017
- 249 | Spoken language understanding and interaction: machine learning for human-like conversational systems | Gašić, Milica / Hakkani-Tür, Dilek / Celikyilmaz, Asli et al. | 2017
- 252 | Multilingually trained bottleneck features in spoken language recognition | Fér, Radek / Matějka, Pavel / Grézl, František / Plchot, Oldřich / Veselý, Karel / Černocký, Jan Honza et al. | 2017
- 268 | Emotion, age, and gender classification in children’s speech by humans and machines | Kaya, Heysem / Salah, Albert Ali / Karpov, Alexey / Frolova, Olga / Grigorev, Aleksey / Lyakso, Elena et al. | 2017
- 284 | Improving the understanding of spoken referring expressions through syntactic-semantic and contextual-phonetic error-correction | Zukerman, Ingrid / Partovi, Andisheh et al. | 2017
- 311 | A Framework for pre-training hidden-unit conditional random fields and its extension to long short term memory networks | Kim, Young-Bum / Stratos, Karl / Sarikaya, Ruhi et al. | 2017
- 327 | Unsupervised crosslingual adaptation of tokenisers for spoken language recognition | Ng, Raymond W.M. / Nicolao, Mauro / Hain, Thomas et al. | 2017
- 343 | Using speech technology for quantifying behavioral characteristics in peer-led team learning sessions | Dubey, Harishchandra / Sangwan, Abhijeet / Hansen, John H.L. et al. | 2017
- 367 | Introduction to the special issue on deep learning approaches for machine translation | Costa-jussà, Marta R. / Allauzen, Alexandre / Barrault, Loïc / Cho, Kyunghun / Schwenk, Holger et al. | 2017
- 374 | A generic neural acoustic beamforming architecture for robust multi-channel speech processing | Heymann, Jahn / Drude, Lukas / Haeb-Umbach, Reinhold et al. | 2016
- 386 | Multi-microphone speech recognition in everyday environments | Barker, Jon / Marxer, Ricard / Vincent, Emmanuel / Watanabe, Shinji et al. | 2017
- 388 | Robust coherence-based spectral enhancement for speech recognition in adverse real-world environments | Barfuss, Hendrik / Huemmer, Christian / Schwarz, Andreas / Kellermann, Walter et al. | 2017
- 401 | Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend | Hori, Takaaki / Chen, Zhuo / Erdogan, Hakan / Hershey, John R. / Le Roux, Jonathan / Mitra, Vikramjit / Watanabe, Shinji et al. | 2017
- 419 | Room-localized spoken command recognition in multi-room, multi-microphone environments | Rodomagoulakis, Isidoros / Katsamanis, Athanasios / Potamianos, Gerasimos / Giannoulis, Panagiotis / Tsiami, Antigoni / Maragos, Petros et al. | 2017
- 444 | A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions | Sivasankaran, Sunit / Vincent, Emmanuel / Illina, Irina et al. | 2017
- 461 | Acoustic model training based on node-wise weight boundary model for fast and small-footprint deep neural networks | Takeda, Ryu / Nakadai, Kazuhiro / Komatani, Kazunori et al. | 2017
- 481 | Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT) | Lin, Payton / Lyu, Dau-Cheng / Chen, Fei / Wang, Syu-Siang / Tsao, Yu et al. | 2017
- 496 | Bayesian feature enhancement using independent vector analysis and reverberation parameter re-estimation for noisy reverberant speech recognition | Cho, Ji-Won / Park, Jong-Hyeon / Chang, Joon-Hyuk / Park, Hyung-Min et al. | 2017
- 517 | An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech | Tu, Yan-Hui / Du, Jun / Wang, Qing / Bao, Xiao / Dai, Li-Rong / Lee, Chin-Hui et al. | 2016
- 535 | An analysis of environment, microphone and data simulation mismatches in robust speech recognition | Vincent, Emmanuel / Watanabe, Shinji / Nugraha, Aditya Arie / Barker, Jon / Marxer, Ricard et al. | 2016
- 558 | Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition | Moritz, Niko / Adiloğlu, Kamil / Anemüller, Jörn / Goetze, Stefan / Kollmeier, Birger et al. | 2016
- 574 | Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures | Moore, A.H. / Peso Parada, P. / Naylor, P.A. et al. | 2016
- 585 | DNN adaptation by automatic quality estimation of ASR hypotheses | Falavigna, Daniele / Matassoni, Marco / Jalalvand, Shahab / Negri, Matteo / Turchi, Marco et al. | 2016
- 605 | The third ‘CHiME’ speech separation and recognition challenge: Analysis and outcomes | Barker, Jon / Marxer, Ricard / Vincent, Emmanuel / Watanabe, Shinji et al. | 2016