TONGUE AND LIP MOTION PATTERNS IN ALARYNGEAL SPEECH (English)
- New search for: Teplansky, Kristin J.
- New search for: Wisler, Alan
- New search for: Cao, Beiming
- New search for: Liang, Wendy
- New search for: Whited, Chad W.
- New search for: Mau, Ted
- New search for: Wang, Jun
- New search for: Teplansky, Kristin J.
- New search for: Wisler, Alan
- New search for: Cao, Beiming
- New search for: Liang, Wendy
- New search for: Whited, Chad W.
- New search for: Mau, Ted
- New search for: Wang, Jun
In:
Cognitive intelligence for speech processing ; Volume 7 of 7
; 4576-4580
;
2020
- Conference paper / Print
-
Title:TONGUE AND LIP MOTION PATTERNS IN ALARYNGEAL SPEECH
-
Contributors:Teplansky, Kristin J. ( author ) / Wisler, Alan ( author ) / Cao, Beiming ( author ) / Liang, Wendy ( author ) / Whited, Chad W. ( author ) / Mau, Ted ( author ) / Wang, Jun ( author )
-
Conference:INTERSPEECH ; 21. ; 2020 ; Online
-
Published in:
-
Publisher:
- New search for: Curran Associates, Inc.
-
Place of publication:Red Hook, NY
-
Publication date:2020
-
Type of media:Conference paper
-
Type of material:Print
-
Language:English
-
Source:
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 4362
-
SPEAKER CODE BASED SPEAKER ADAPTIVE TRAINING USING MODEL AGNOSTIC META-LEARNINGWu, Huaxin / Wan, Genshun / Pan, Jia et al. | 2020
- 4367
-
DOMAIN ADAPTATION USING CLASS SIMILARITY FOR ROBUST SPEECH RECOGNITIONZhu, Han / Zhao, Jiangjiang / Ren, Yuling / Wang, Li / Zhang, Pengyuan et al. | 2020
- 4372
-
INCREMENTAL MACHINE SPEECH CHAIN TOWARDS ENABLING LISTENING WHILE SPEAKING IN REAL TIMENovitasari, Sashi / Tjandra, Andros / Yanagita, Tomoya / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
- 4377
-
CONTEXT-DEPENDENT ACOUSTIC MODELING WITHOUT EXPLICIT PHONE CLUSTERINGRaissi, Tina / Beck, Eugen / Schluter, Ralf / Ney, Hermann et al. | 2020
- 4382
-
VOICE CONVERSION BASED DATA AUGMENTATION TO IMPROVE CHILDREN'S SPEECH RECOGNITION IN LIMITED DATA SCENARIOShahnawazuddin, S. / Adiga, Nagaraj / Kumar, Kunal / Poddar, Aayushi / Ahmad, Waquar et al. | 2020
- 4387
-
COPYCAT: MANY-TO-MANY FINE-GRAINED PROSODY TRANSFER FOR NEURAL TEXT-TO-SPEECHKarlapati, Sri / Moinet, Alexis / Joly, Arnaud / Klimkov, Viacheslav / Saez-Trigueros, Daniel / Drugman, Thomas et al. | 2020
- 4392
-
JOINT DETECTION OF SENTENCE STRESS AND PHRASE BOUNDARY FOR PROSODYLin, Binghuai / Wang, Liyuan / Feng, Xiaoli / Zhang, Jinsong et al. | 2020
- 4397
-
TRANSFER LEARNING OF THE EXPRESSIVITY USING FLOW METRIC LEARNING IN MULTISPEAKER TEXT-TO-SPEECH SYNTHESISKulkarni, Ajinkya / Colotte, Vincent / Jouvet, Denis et al. | 2020
- 4402
-
SPEAKING SPEED CONTROL OF END-TO-END SPEECH SYNTHESIS USING SENTENCE- LEVEL CONDITIONINGBae, Jae-Sung / Bae, Hanbin / Joo, Young-Sun / Lee, Junmo / Lee, Gyeong-Hoon / Cho, Hoon-Young et al. | 2020
- 4407
-
DYNAMIC PROSODY GENERATION FOR SPEECH SYNTHESIS USING LINGUISTICS- DRIVEN ACOUSTIC EMBEDDING SELECTIONTyagi, Shubhi / Nicolis, Marco / Rohnke, Jonas / Drugman, Thomas / Lorenzo-Trueba, Jaime et al. | 2020
- 4412
-
IMPROVING THE PROSODY OF RNN-BASED ENGLISH TEXT-TO-SPEECH SYNTHESIS BY INCORPORATING A BERT MODELKenter, Tom / Sharma, Manish / Clark, Rob et al. | 2020
- 4417
-
IMPROVED PROSODY FROM LEARNED F0 CODEBOOK REPRESENTATIONS FOR VQ-VAE SPEECH WAVEFORM RECONSTRUCTIONZhao, Yi / Li, Haoyu / Lai, Cheng-I / Williams, Jennifer / Cooper, Erica / Yamagishi, Junichi et al. | 2020
- 4422
-
PROSODY LEARNING MECHANISM FOR SPEECH SYNTHESIS SYSTEM WITHOUT TEXT LENGTH LIMITZeng, Zhen / Wang, Jianzong / Cheng, Ning / Xiao, Jing et al. | 2020
- 4427
-
DISCRIMINATIVE METHOD TO EXTRACT COARSE PROSODIC STRUCTURE AND ITS APPLICATION FOR STATISTICAL PHRASE/ACCENT COMMAND ESTIMATIONShirahata, Yuma / Saito, Daisuke / Minematsu, Nobuaki et al. | 2020
- 4432
-
CONTROLLABLE NEURAL TEXT-TO-SPEECH SYNTHESIS USING INTUITIVE PROSODIC FEATURESRaitio, Tuomo / Rasipuram, Ramya / Castellani, Dan et al. | 2020
- 4437
-
CONTROLLABLE NEURAL PROSODY SYNTHESISMorrison, Max / Jin, Zeyu / Salamon, Justin / Bryan, Nicholas J. / Mysore, Gautham J. et al. | 2020
- 4442
-
MULTI-REFERENCE NEURAL TTS STYLIZATION WITH ADVERSARIAL CYCLE CONSISTENCYWhitehill, Matt / Ma, Shuang / McDuff, Daniel / Song, Yale et al. | 2020
- 4447
-
INTERACTIVE TEXT-TO-SPEECH SYSTEM VIA JOINT STYLE ANALYSISGao, Yang / Zheng, Weiyi / Yang, Zhaojun / Kohler, Thilo / Fuegen, Christian / He, Qing et al. | 2020
- 4452
-
MOBILE-ASSISTED PROSODY TRAINING FOR LIMITED ENGLISH PROFICIENCY: LEARNER BACKGROUND AND SPEECH LEARNING PATTERNHirschi, Kevin / Kang, Okim / Cucchiarini, Catia / Hansen, John H. L. / Evanini, Keelan / Strik, Helmer et al. | 2020
- 4457
-
FINDING INTELLIGIBLE CONSONANT-VOWEL SOUNDS USING HIGH-QUALITY ARTICULATORY SYNTHESISNiekerk, Daniel R. Van / Xu, Anqi / Gerazov, Branislav / Krug, Paul K. / Birkholz, Peter / Xu, Yi et al. | 2020
- 4462
-
AUDIOVISUAL CORRESPONDENCE LEARNING IN HUMANS AND MACHINESKrishnamohan, Venkat / Soman, Akshara / Gupta, Anshul / Ganapathy, Sriram et al. | 2020
- 4467
-
PERCEPTION OF ENGLISH FRICATIVES AND AFFRICATES BY ADVANCED CHINESE LEARNERS OF ENGLISHLan, Yizhou et al. | 2020
- 4471
-
PERCEPTION OF JAPANESE CONSONANT LENGTH BY NATIVE SPEAKERS OF KOREAN DIFFERING IN JAPANESE LEARNING EXPERIENCETsukada, Kimiko / Kim, Joo-Yeon / Han, Jeong-Im et al. | 2020
- 4476
-
AUTOMATIC DETECTION OF PHONOLOGICAL ERRORS IN CHILD SPEECH USING SIAMESE RECURRENT AUTOENCODERLee, Si-Ioi Ng. Tan et al. | 2020
- 4481
-
A COMPARISON OF ENGLISH RHYTHM PRODUCED BY NATIVE AMERICAN SPEAKERS AND MANDARIN ESL PRIMARY SCHOOL LEARNERSDing, Hongwei / Lin, Binghuai / Wang, Liyuan / Wang, Hui / Fang, Ruomei et al. | 2020
- 4486
-
CROSS-LINGUISTIC INTERACTION BETWEEN PHONOLOGICAL CATEGORIZATION AND ORTHOGRAPHY PREDICTS PROSODIC EFFECTS IN THE ACQUISITION OF PORTUGUESE LIQUIDS BY LI-MANDARIN LEARNERSZhou, Chao / Hamann, Silke et al. | 2020
- 4491
-
CROSS-LINGUISTIC PERCEPTION OF UTTERANCES WITH WILLINGNESS AND RELUCTANCE IN MANDARIN BY KOREAN L2 LEARNERSLi, Wengian / Tu, Jung-Yueh et al. | 2020
- 4496
-
SPEECH ENHANCEMENT BASED ON BEAMFORMING AND POST-FILTERING BY COMBINING PHASE INFORMATIONCheng, Rui / Bao, Changchun et al. | 2020
- 4501
-
A NOISE-AWARE MEMORY-ATTENTION NETWORK ARCHITECTURE FOR REGRESSION-BASED SPEECH ENHANCEMENTWang, Yu-Xuan / Du, Jun / Chai, Li / Lee, Chin-Hui / Pan, Jia et al. | 2020
- 4506
-
HIFI-GAN: HIGH-FIDELITY DENOISING AND DEREVERBERATION BASED ON SPEECH DEEP FEATURES IN ADVERSARIAL NETWORKSSu, Jiagi / Jin, Zevu / Finkelstein, Adam et al. | 2020
- 4511
-
LEARNING COMPLEX SPECTRAL MAPPING FOR SPEECH ENHANCEMENT WITH IMPROVED CROSS-CORPUS GENERALIZATIONPandey, Ashutosh / Wang, Deliang et al. | 2020
- 4516
-
SPEECH ENHANCEMENT WITH STOCHASTIC TEMPORAL CONVOLUTIONAL NETWORKSRichter, Julius / Carbajal, Guillaume / Gerkmann, Timo et al. | 2020
- 4521
-
VISUAL SPEECH IN REAL NOISY ENVIRONMENTS (VISION): A NOVEL BENCHMARK DATASET AND DEEP LEARNING-BASED BASELINE SYSTEMGogate, Mandar / Dashtipour, Kia / Hussain, Amir et al. | 2020
- 4526
-
SPARSE MIXTURE OF LOCAL EXPERTS FOR EFFICIENT SPEECH ENHANCEMENTSivaraman, Aswin / Kim, Minje et al. | 2020
- 4531
-
IMPROVED SPEECH ENHANCEMENT USING TCN WITH MULTIPLE ENCODER-DECODER LAYERSKishore, Vinith / Tiwari, Nitya / Paramasivam, Periyasamy et al. | 2020
- 4536
-
JOINT TRAINING FOR SIMULTANEOUS SPEECH DENOISING AND DEREVERBERATION WITH DEEP EMBEDDING REPRESENTATIONSFan, Cunhang / Tao, Jianhua / Liu, Bin / Yi, Jiangyan / Wen, Zhengqi et al. | 2020
- 4541
-
UNSUPERVISED ROBUST SPEECH ENHANCEMENT BASED ON ALPHA-STABLE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATIONFontaine, Mathieu / Sekiguchi, Kouhei / Nugraha, Aditya Arie / Yoshii, Kazuyoshi et al. | 2020
- 4546
-
SQUEEZE FOR SNEEZE: COMPACT NEURAL NETWORKS FOR COLD AND FLU RECOGNITIONAlbes, Merlin / Ren, Zhao / Schuller, Bjorn W. / Cummins, Nicholas et al. | 2020
- 4551
-
EXTENDED STUDY ON THE USE OF VOCAL TRACT VARIABLES TO QUANTIFY NEUROMOTOR COORDINATION IN DEPRESSIONSeneviratne, Nadee / Williamson, James R. / Lammert, Adam C. / Quatieri, Thomas F. / Espy-Wilson, Carol et al. | 2020
- 4556
-
AFFECTIVE CONDITIONING ON HIERARCHICAL ATTENTION NETWORKS APPLIED TO DEPRESSION DETECTION FROM TRANSCRIBED CLINICAL INTERVIEWSXezonaki, Danai / Paraskevopoulos, Georgios / Potamianos, Alexandros / Narayanan, Shrikanth et al. | 2020
- 4561
-
DOMAIN ADAPTATION FOR ENHANCING SPEECH-BASED DEPRESSION DETECTION IN NATURAL ENVIRONMENTAL CONDITIONS USING DILATED CNNSHuang, Zhaocheng / Epps, Julien / Joachim, Dale / Stasak, Brian / Williamson, James R. / Quatieri, Thomas F. et al. | 2020
- 4566
-
MAKING A DISTINCTION BETWEEN SCHIZOPHRENIA AND BIPOLAR DISORDER BASED ON TEMPORAL PARAMETERS IN SPONTANEOUS SPEECHGosztolya, Gabor / Bagi, Anita / Szaloki, Szilvia / Szendi, Istvan / Hoffmann, Ildiko et al. | 2020
- 4571
-
PREDICTION OF SLEEPINESS RATINGS FROM VOICE BY MAN AND MACHINEHuckvale, Mark / Beke, Andras / Ikushima, Mirei et al. | 2020
- 4576
-
TONGUE AND LIP MOTION PATTERNS IN ALARYNGEAL SPEECHTeplansky, Kristin J. / Wisler, Alan / Cao, Beiming / Liang, Wendy / Whited, Chad W. / Mau, Ted / Wang, Jun et al. | 2020
- 4581
-
AUTOENCODER BOTTLENECK FEATURES WITH MULTI-TASK OPTIMISATION FOR IMPROVED CONTINUOUS DYSARTHRIC SPEECH RECOGNITIONYue, Zhengjun / Christensen, Heidi / Barker, Jon et al. | 2020
- 4586
-
RAW SPEECH WAVEFORM BASED CLASSIFICATION OF PATIENTS WITH ALS, PARKINSON'S DISEASE AND HEALTHY CONTROLS USING CNN-BLSTMMallela, Jhansi / Illa, Aravind / Belur, Yamini / Atchayaram, Nalini / Yadav, Ravi / Reddy, Pradeep / Gope, Dipanjan / Ghosh, Prasanta Kumar et al. | 2020
- 4591
-
ASSESSMENT OF PARKINSON'S DISEASE MEDICATION STATE THROUGH AUTOMATIC SPEECH ANALYSISPompili, Anna / Solera-Urena, Ruben / Abad, Alberto / Cardoso, Rita / Guimaraes, Isabel / Fabbri, Margherita / Martins, Isabel P. / Ferreira, Joaquim et al. | 2020
- 4596
-
IMPROVING REPLAY DETECTION SYSTEM WITH CHANNEL CONSISTENCY DENSENEXT FOR THE ASVSPOOF 2019 CHALLENGEZhang, Chao / Cheng, Junjie / Gu, Yanmei / Wang, Huacan / Ma, Jun / Wang, Shaojun / Xiao, Jing et al. | 2020
- 4601
-
SUBJECTIVE QUALITY EVALUATION OF SPEECH SIGNALS TRANSMITTED VIA BPL-PLC WIRED SYSTEMFalkowski-Gilski, Przemyslaw / Debita, Grzegorz / Habrych, Marcin / Miedzinski, Bogdan / Jedlikowski, Przemyslaw / Polnik, Bartosz / Wandzio, Jan / Wang, Xin et al. | 2020
- 4606
-
INVESTIGATING THE VISUAL LOMBARD EFFECT WITH GABOR BASED FEATURESChiu, Waito / Xu, Yan / Abel, Andrew / Lin, Chun / Tu, Zhengzheng et al. | 2020
- 4611
-
EXPLORATION OF AUDIO QUALITY ASSESSMENT AND ANOMALY LOCALISATION USING ATTENTION MODELSHuang, Qiang / Hain, Thomas et al. | 2020
- 4616
-
DEVELOPMENT OF A SPEECH QUALITY DATABASE UNDER UNCONTROLLED CONDITIONSRagano, Alessandro / Benetos, Emmanouil / Hines, Andrew et al. | 2020
- 4621
-
EVALUATING THE RELIABILITY OF ACOUSTIC SPEECH EMBEDDINGSAlgayres, Robin / Zaiem, Mohamed Salah / Sagot, Benoit / Dupoux, Emmanuel et al. | 2020
- 4626
-
FRAME-LEVEL SIGNAL-TO-NOISE RATIO ESTIMATION USING DEEP LEARNINGLi, Hao / Wang, Deliang / Zhang, Xueliang / Gao, Guanglai et al. | 2020
- 4631
-
A PYRAMID RECURRENT NETWORK FOR PREDICTING CROWDSOURCED SPEECH- QUALITY RATINGS OF REAL-WORLD SIGNALSDong, Xuan / Williamson, Donald S. et al. | 2020
- 4636
-
EFFECT OF SPECTRAL COMPLEXITY REDUCTION AND NUMBER OF INSTRUMENTS | ON MUSICAL ENJOYMENT WITH COCHLEAR IMPLANTSBrueggeman, Avamarie / Hansen, John H. L. et al. | 2020
- 4641
-
SPECTRUM CORRECTION: ACOUSTIC SCENE CLASSIFICATION WITH MISMATCHED RECORDING DEVICESKosmider, Michal et al. | 2020
- 4646
-
DISTRIBUTED SUMMATION PRIVACY FOR SPEECH ENHANCEMENTO'Connor, Matt / Kleijn, W. Bastiaan et al. | 2020
- 4651
-
PERCEPTION OF PRIVACY MEASURED IN THE CROWD --- PAIRED COMPARISON ON THE EFFECT OF BACKGROUND NOISESLeschanowsky, Anna / Das, Sneha / Backstrom, Tom / Zarazaga, Pablo Perez et al. | 2020
- 4656
-
HIDE AND SPEAK: TOWARDS DEEP NEURAL NETWORKS FOR SPEECH STEGANOGRAPHYKreuk, Felix / Adi, Yossi / Raj, Bhiksha / Singh, Rita / Keshet, Joseph et al. | 2020
- 4661
-
DETECTING ADVERSARIAL EXAMPLES FOR SPEECH RECOGNITION VIA UNCERTAINTY QUANTIFICATIONDaubener, Sina / Schonherr, Lea / Fischer, Asja / Kolossa, Dorothea et al. | 2020
- 4666
-
PRIVACY GUARANTEES FOR DE-IDENTIFYING TEXT TRANSFORMATIONSAdelani, David Ifeoluwa / Davody, Ali / Kleinbauer, Thomas / Klakow, Dietrich et al. | 2020
- 4671
-
DETECTING AUDIO ATTACKS ON ASR SYSTEMS WITH DROPOUT UNCERTAINTYJayashankar, Tejas / Roux, Jonathan Le / Moulin, Pierre et al. | 2020
- 4676
-
VOICE TRANSFORMER NETWORK: SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING TRANSFORMER WITH TEXT-TO-SPEECH PRETRAININGHuang, Wen-Chin / Hayashi, Tomoki / Wu, Yi-Chiao / Kameoka, Hirokazu / Toda, Tomoki et al. | 2020
- 4681
-
NONPARALLEL TRAINING OF EXEMPLAR-BASED VOICE CONVERSION SYSTEM USING INCA-BASED ALIGNMENT TECHNIQUESuda, Hitoshi / Kotani, Gaku / Saito, Daisuke et al. | 2020
- 4686
-
ENHANCING INTELLIGIBILITY OF DYSARTHRIC SPEECH USING GATED CONVOLUTIONAL-BASED VOICE CONVERSION SYSTEMChen, Chen-Yu / Zheng, Wei-Zhong / Wang, Syu-Siang / Tsao, Yu / Li, Pei-Chun / Lai, Ying-Hui et al. | 2020
- 4691
-
VQVC+: ONE-SHOT VOICE CONVERSION BY VECTOR QUANTIZATION AND U-NET ARCHITECTUREWu, Da-Yi / Chen, Yen-Hao / Lee, Hung-Yi et al. | 2020
- 4696
-
COTATRON: TRANSCRIPTION-GUIDED SPEECH ENCODER FOR ANY-TO-MANY VOICE CONVERSION WITHOUT PARALLEL DATAPark, Seung-Won / Kim, Doo-Young / Joe, Myun-Chul et al. | 2020
- 4701
-
DYNAMIC SPEAKER REPRESENTATIONS ADJUSTMENT AND DECODER FACTORIZATION FOR SPEAKER ADAPTATION IN END-TO-END SPEECH SYNTHESISFu, Ruibo / Tao, Jianhua / Wen, Zhenggi / Yi, Jiangyan / Wang, Tao / Qiang, Chunyu et al. | 2020
- 4706
-
ARVC: AN AUTO-REGRESSIVE VOICE CONVERSION SYSTEM WITHOUT PARALLEL TRAINING DATALian, Zheng / Wen, Zhengqi / Zhou, Xinyong / Pu, Songbai / Zhang, Shengkai / Tao, Jianhua et al. | 2020
- 4711
-
IMPROVED ZERO-SHOT VOICE CONVERSION USING EXPLICIT CONDITIONING SIGNALSNercessian, Shahan et al. | 2020
- 4716
-
NON-PARALLEL VOICE CONVERSION WITH FEWER LABELED DATA BY CONDITIONAL GENERATIVE ADVERSARIAL NETWORKSChen, Minchuan / Hou, Weijian / Ma, Jun / Wang, Shaojun / Xiao, Jing et al. | 2020
- 4721
-
TRANSFERRING SOURCE STYLE IN NON-PARALLEL VOICE CONVERSIONLiu, Songxiang / Cao, Yuewen / Kang, Shiyin / Hu, Na / Liu, Xunying / Su, Dan / Yu, Dong / Meng, Helen et al. | 2020
- 4726
-
VOICE CONVERSION USING SPEECH-TO-SPEECH NEURO-STYLE TRANSFERAlbadawy, Ehab A. / Lyu, Siwei et al. | 2020
- 4731
-
IMPROVING CROSS-LINGUAL TRANSFER LEARNING FOR END-TO-END SPEECH RECOGNITION WITH SPEECH TRANSLATIONWang, Changhan / Pino, Juan / Gu, Jiatao et al. | 2020
- 4736
-
TRANSLITERATION BASED DATA AUGMENTATION FOR TRAINING MULTILINGUAL ASR ACOUSTIC MODELS IN LOW RESOURCE SETTINGSThomas, Samuel / Audhkhasi, Kartik / Kingsbury, Brian et al. | 2020
- 4741
-
MULTILINGUAL SPEECH RECOGNITION WITH SELF-ATTENTION STRUCTURED PARAMETERIZATIONZhu, Yun / Haghani, Parisa / Tripathi, Anshuman / Ramabhadran, Bhuvana / Farris, Brian / Xu, Hainan / Lu, Han / Sak, Hasim / Leal, Isabel / Gaur, Neeraj et al. | 2020
- 4746
-
LATTICE-FREE MAXIMUM MUTUAL INFORMATION TRAINING OF MULTILINGUAL SPEECH RECOGNITION SYSTEMSMadikeri, Srikanth / Khonglah, Banriskhem K. / Tong, Sibo / Motlicek, Petr / Bourlard, Herve / Povev, Daniel et al. | 2020
- 4751
-
MASSIVELY MULTILINGUAL ASR: 50 LANGUAGES, 1 MODEL, 1 BILLION PARAMETERSPratap, Vineel / Sriram, Anuroop / Tomasello, Paden / Hannun, Awni / Liptchinsky, Vitaliy / Synnaeve, Gabriel / Collobert, Ronan et al. | 2020
- 4756
-
MULTILINGUAL SPEECH RECOGNITION USING LANGUAGE-SPECIFIC PHONEME RECOGNITION AS AUXILIARY TASK FOR INDIAN LANGUAGESSailor, Hardik B. / Hain, Thomas et al. | 2020
- 4761
-
STYLE VARIATION AS A VANTAGE POINT FOR CODE-SWITCHINGChandu, Khyathi Raghavi / Black, Alan W. et al. | 2020
- 4766
-
BI-ENCODER TRANSFORMER NETWORK FOR MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION USING MIXTURE OF EXPERTSLu, Yizhou / Huang, Mingkun / Li, Hao / Guo, Jiaqi / Qian, Yanmin et al. | 2020
- 4776
-
TOWARDS CONTEXT-AWARE END-TO-END CODE-SWITCHING SPEECH RECOGNITIONQiu, Zimeng / Li, Yiyuan / Li, Xinjian / Metze, Florian / Campbell, William M. et al. | 2020
- 4781
-
INCREASING THE INTELLIGIBILITY AND NATURALNESS OF ALARYNGEAL SPEECH USING VOICE CONVERSION AND SYNTHETIC FUNDAMENTAL FREQUENCYDinh, Tuan / Kain, Alexander / Samlan, Robin / Cao, Beiming / Wang, Jun et al. | 2020
- 4786
-
AUTOMATIC ASSESSMENT OF DYSARTHRIC SEVERITY LEVEL USING AUDIO-VIDEO CROSS-MODAL APPROACH IN DEEP LEARNINGTong, Han / Sharifzadeh, Hamid / McLoughlin, lan et al. | 2020
- 4791
-
STAGED KNOWLEDGE DISTILLATION FOR END-TO-END DYSARTHRIC SPEECH RECOGNITION AND SPEECH ATTRIBUTE TRANSCRIPTIONLin, Yugin / Wang, Longbiao / Li, Sheng / Dang, Jianwu / Ding, Chenchen et al. | 2020
- 4796
-
DYSARTHRIC SPEECH RECOGNITION BASED ON DEEP METRIC LEARNINGTakashima, Yuki / Takashima, Ryoichi / Takiguchi, Tetsuya / Ariki, Yasuo et al. | 2020
- 4801
-
AUTOMATIC GLOTTIS DETECTION AND SEGMENTATION IN STROBOSCOPIC VIDEOS USING CONVOLUTIONAL NETWORKSDegala, Divya / Rao, Achuth M. V. / Krishnamurthy, Rahul / Gopikishore, Pebbili / Priyadharshini, Veeramani / Prakash, T. K. / Ghosh, Prasanta Kumar et al. | 2020
- 4806
-
ACOUSTIC FEATURE EXTRACTION WITH INTERPRETABLE DEEP NEURAL NETWORK FOR NEURODEGENERATIVE RELATED DISORDER CLASSIFICATIONPan, Yilin / Mirheidari, Bahman / Tu, Zehai / O'Malley, Ronan / Walker, Traci / Venneri, Annalena / Reuber, Markus / Blackburn, Daniel / Christensen, Heidi et al. | 2020
- 4811
-
COSWARA --- A DATABASE OF BREATHING, COUGH, AND VOICE SOUNDS FOR COVID-19 DIAGNOSISSharma, Neeraj / Krishnan, Prashant / Kumar, Rohit / Ramoji, Shreyas / Chetupalli, Srikanth Raj / Nirmala, R. / Ghosh, Prasanta Kumar / Ganapathy, Sriram et al. | 2020
- 4816
-
ACOUSTIC-BASED ARTICULATORY PHENOTYPES OF AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON'S DISEASE: TOWARDS AN INTERPRETABLE, HYPOTHESIS-DRIVEN FRAMEWORK OF MOTOR CONTROLRowe, Hannah P. / Gutz, Sarah E. / Maffei, Marc F. / Green, Jordan R. et al. | 2020
- 4821
-
RECOGNISING EMOTIONS IN DYSARTHRIC SPEECH USING TYPICAL SPEECH DATAAlhinti, Lubna / Cunningham, Stuart / Christensen, Heidi et al. | 2020
- 4826
-
DETECTING AND ANALYSING SPONTANEOUS ORAL CANCER SPEECH IN THE WILDHalpern, Bence Mark / Son, Rob Van / Brekel, Michiel Van Den / Scharenborg, Odette et al. | 2020
- 4831
-
THE ZERO RESOURCE SPEECH CHALLENGE 2020: DISCOVERING DISCRETE SUBWORD AND WORD UNITSDunbar, Ewan / Karadayi, Julien / Bernard, Mathieu / Cao, Xuan-Nga / Algayres, Robin / Ondel, Lucas / Besacier, Laurent / Sakti, Sakriani / Dupoux, Emmanuel et al. | 2020
- 4836
-
VECTOR-QUANTIZED NEURAL NETWORKS FOR ACOUSTIC UNIT DISCOVERY IN THE ZEROSPEECH 2020 CHALLENGENiekerk, Benjamin Van / Nortje, Leanne / Kamper, Herman et al. | 2020
- 4841
-
EXPLORATION OF END-TO-END SYNTHESISERS FOR ZERO RESOURCE SPEECH CHALLENGE 2020Pandia, Karthik D. S. / Prakash, Anusha / Kumar, Mano Ranjith M. / Murthy, Hema A. et al. | 2020
- 4846
-
VECTOR QUANTIZED TEMPORALLY-AWARE CORRESPONDENCE SPARSE AUTOENCODERS FOR ZERO-RESOURCE ACOUSTIC UNIT DISCOVERYGundogdu, Batuhan / Yusuf, Bolaji / Yesilbursa, Mansur / Saraclar, Murat et al. | 2020
- 4851
-
TRANSFORMER VQ-VAE FOR UNSUPERVISED UNIT DISCOVERY AND SPEECH SYNTHESIS: ZEROSPEECH 2020 CHALLENGETjandra, Andros / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
- 4856
-
EXPLORING TTS WITHOUT T USING BIOLOGICALLY/PSYCHOLOGICALLY MOTIVATED NEURAL NETWORK MODULES (ZEROSPEECH 2020)Morita, Takashi / Koda, Hiroki et al. | 2020
- 4861
-
CYCLIC SPECTRAL MODELING FOR UNSUPERVISED UNIT DISCOVERY INTO VOICE CONVERSION WITH EXCITATION AND WAVEFORM MODELINGTobing, Patrick Lumban / Hayashi, Tomoki / Wu, Yi-Chiao / Kobayashi, Kazuhiro / Toda, Tomoki et al. | 2020
- 4866
-
UNSUPERVISED ACOUSTIC UNIT REPRESENTATION LEARNING FOR VOICE CONVERSION USING WAVENET AUTO-ENCODERSChen, Mingjie / Hain, Thomas et al. | 2020
- 4871
-
UNSUPERVISED DISCOVERY OF RECURRING SPEECH PATTERNS USING PROBABILISTIC ADAPTIVE METRICSRasanen, Okko / Blandon, Maria Andrea Cruz et al. | 2020
- 4876
-
SELF-EXPRESSING AUTOENCODERS FOR UNSUPERVISED SPOKEN TERM DISCOVERYBhati, Saurabhchand / Villalba, Jesus / Zelasko, Piotr / Dehak, Najim et al. | 2020
- 4881
-
PERCEPTIMATIC: A HUMAN SPEECH PERCEPTION BENCHMARK FOR UNSUPERVISED SUBWORD MODELLINGMillet, Juliette / Dunbar, Ewan et al. | 2020
- 4886
-
DECODING IMAGINED, HEARD, AND SPOKEN SPEECH: CLASSIFICATION AND REGRESSION OF EEG USING A 14-CHANNEL DRY-CONTACT MOBILE HEADSETClayton, Jonathan / Wellington, Scott / Valentini-Botinhao, Cassia / Watts, Oliver et al. | 2020
- 4891
-
GLOTTAL CLOSURE INSTANTS DETECTION FROM EGG SIGNAL BY CLASSIFICATION APPROACHReddy, Gurunath M. / Rao, K. Sreenivasa / Das, Partha Pratim et al. | 2020
- 4896
-
CLASSIFY IMAGINARY MANDARIN TONES WITH CORTICAL EEG SIGNALSLi, Hua / Chen, Fei et al. | 2020
- 4901
-
AUGMENTING IMAGES FOR ASR AND TTS THROUGH SINGLE-LOOP AND DUAL- LOOP MULTIMODAL CHAIN FRAMEWORKEffendi, Johanes / Tjandra, Andros / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
- 4906
-
PUNCTUATION PREDICTION IN SPONTANEOUS CONVERSATIONS: CAN WE MITIGATE ASR ERRORS WITH RETROFITTED WORD EMBEDDINGS?Augustyniak, Lukasz / Szymanski, Piotr / Morzy, Mikolaj / Zelasko, Piotr / Szymczak, Adrian / Mizgajski, Jan / Carmiel, Yishay / Dehak, Najim et al. | 2020
- 4911
-
MULTIMODAL SEMI-SUPERVISED LEARNING FRAMEWORK FOR PUNCTUATION PREDICTION IN CONVERSATIONAL SPEECHSunkara, Monica / Ronanki, Srikanth / Bekal, Dhanush / Bodapati, Sravan / Kirchhoff, Katrin et al. | 2020
- 4916
-
EFFICIENT MDI ADAPTATION FOR N-GRAM LANGUAGE MODELSHuang, Ruizhe / Li, Ke / Arora, Ashish / Povey, Daniel / Khudanpur, Sanjeev et al. | 2020
- 4921
-
IMPROVING TAIL PERFORMANCE OF A DELIBERATION E2E ASR MODEL USING A LARGE TEXT CORPUSPeyser, Cal / Mavandadi, Sepand / Sainath, Tara N. / Apfel, James / Pang, Ruoming / Kumar, Shankar et al. | 2020
- 4926
-
LANGUAGE MODEL DATA AUGMENTATION BASED ON TEXT DOMAIN TRANSFEROgawa, Atsunori / Tawara, Naohiro / Delcroix, Marc et al. | 2020
- 4931
-
CONTEMPORARY POLISH LANGUAGE MODEL (VERSION 2) USING BIG DATA AND SUB-WORD APPROACHWolk, Krzysztof et al. | 2020
- 4936
-
IMPROVING SPEECH RECOGNITION OF COMPOUND-RICH LANGUAGESPandey, Prabhat / Leutnant, Volker / Wiesler, Simon / Heymann, Jahn / Willett, Daniel et al. | 2020
- 4941
-
LANGUAGE MODELING FOR SPEECH ANALYTICS IN UNDER-RESOURCED LANGUAGESWills, Simone / Uys, Pieter / Heerden, Charl Van / Barnard, Etienne et al. | 2020
- 4946
-
AN EARLY STUDY ON INTELLIGENT ANALYSIS OF SPEECH UNDER COVID-19 SEVERITY, SLEEP QUALITY, FATIGUE, AND ANXIETYHan, Jing / Qian, Kun / Song, Meishu / Yang, Zijiang / Ren, Zhao / Liu, Shuo / Liu, Juan / Zheng, Huaiyuan / Ji, Wei / Koike, Tomoya et al. | 2020
- 4951
-
AN EVALUATION OF THE EFFECT OF ANXIETY ON SPEECH --- COMPUTATIONAL PREDICTION OF ANXIETY FROM SUSTAINED VOWELSBaird, Alice / Cummins, Nicholas / Schnieder, Sebastian / Krajewski, Jarek / Schuller, Bjorn W. et al. | 2020
- 4956
-
HYBRID NETWORK FEATURE EXTRACTION FOR DEPRESSION ASSESSMENT FROM SPEECHZhao, Ziping / Li, Qifei / Cummins, Nicholas / Liu, Bin / Wang, Haishuai / Tao, Jianhua / Schuller, Bjorn W. et al. | 2020
- 4961
-
IMPROVING DETECTION OF ALZHEIMER'S DISEASE USING AUTOMATIC SPEECH RECOGNITION TO IDENTIFY HIGH-QUALITY SEGMENTS FOR MORE ROBUST FEATURE EXTRACTIONPan, Yilin / Mirheidari, Bahman / Reuber, Markus / Venneri, Annalena / Blackburn, Daniel / Christensen, Heidi et al. | 2020
- 4966
-
CLASSIFICATION OF MANIFEST HUNTINGTON DISEASE USING VOWEL DISTORTION MEASURESRomana, Amrit / Bandon, John / Carlozzi, Noelle / Roberts, Angela / Provost, Emily Mower et al. | 2020
- 4971
-
PARKINSON'S DISEASE DETECTION FROM SPEECH USING SINGLE FREQUENCY FILTERING CEPSTRAL COEFFICIENTSKadiri, Sudarsana Reddy / Kethireddy, Rashmi / Alku, Paavo et al. | 2020
- 4976
-
AUTOMATIC PREDICTION OF SPEECH INTELLIGIBILITY BASED ON X-VECTORS IN THE CONTEXT OF HEAD AND NECK CANCERQuintas, Sebastiao / Mauclair, Julie / Woisard, Virginie / Pinquier, Julien et al. | 2020
- 4981
-
SPECTRAL MOMENT AND DURATION OF BURST OF PLOSIVES IN SPEECH OF CHILDREN WITH HEARING IMPAIRMENT AND TYPICALLY DEVELOPING CHILDREN - A COMPARATIVE STUDYAbraham, Ajish K. / Pushpavathi, M. / Sreedevi, N. / Navya, A. / Vikram, C. M. / Prasanna, S. R. Mahadeva et al. | 2020
- 4986
-
APHASIC SPEECH RECOGNITION USING A MIXTURE OF SPEECH INTELLIGIBILITY EXPERTSPerez, Matthew / Aldeneh, Zakaria / Provost, Emily Mower et al. | 2020
- 4991
-
AUTOMATIC DISCRIMINATION OF APRAXIA OF SPEECH AND DYSARTHRIA USING A MINIMALISTIC SET OF HANDCRAFTED FEATURESKodrasi, Ina / Pernon, Michaela / Laganaro, Marina / Bourlard, Herve et al. | 2020
- 4996
-
WEAK-ATTENTION SUPPRESSION FOR TRANSFORMER BASED SPEECH RECOGNITIONShi, Yangyang / Wang, Yongqiang / Wu, Chunyang / Fuegen, Christian / Zhang, Frank / Le, Duc / Yeh, Ching-Feng / Seltzer, Michael L. et al. | 2020
- 5001
-
CONV-TRANSFORMER TRANSDUCER: LOW LATENCY, LOW FRAME RATE. STREAMABLE END-TO-END SPEECH RECOGNITIONHuang, Wenvong / Hu, Wenchao / Yeung, Yu Ting / Chen, Xiao et al. | 2020
- 5006
-
IMPROVING TRANSFORMER-BASED SPEECH RECOGNITION WITH UNSUPERVISED PRE-TRAINING AND MULTI-TASK SEMANTIC KNOWLEDGE LEARNINGLi, Song / Li, Lin / Hong, Qingyang / Liu, Lingling et al. | 2020
- 5011
-
TRANSFORMER-BASED LONG-CONTEXT END-TO-END SPEECH RECOGNITIONHori, Takaaki / Moritz, Niko / Hori, Chiori / Roux, Jonathan Le et al. | 2020
- 5016
-
SELF-AND-MIXED ATTENTION DECODER WITH DEEP ACOUSTIC STRUCTURE FOR TRANSFORMER-BASED LVCSRZhou, Xinyuan / Lee, Grandee / Yilmaz, Emre / Long, Yanhua / Liang, Jiaen / Li, Haizhou et al. | 2020
- 5021
-
UNIVERSAL SPEECH TRANSFORMERZhao, Yingzhu / Ni, Chongjia / Leung, Cheung-Chi / Joty, Shafiq / Chng, Eng Siong / Ma, Bin et al. | 2020
- 5026
-
SPIKE-TRIGGERED NON-AUTOREGRESSIVE TRANSFORMER FOR END-TO-END SPEECH RECOGNITIONTian, Zhengkun / Yi, Jiangyan / Tao, Jianhua / Bai, Ye / Zhang, Shuai / Wen, Zhengqi et al. | 2020
- 5031
-
CROSS ATTENTION WITH MONOTONIC ALIGNMENT FOR SPEECH TRANSFORMERZhao, Yingzhu / Ni, Chongjia / Leung, Cheung-Chi / Joty, Shafiq / Chng, Eng Siong / Ma, Bin et al. | 2020
- 5036
-
CONFORMER: CONVOLUTION-AUGMENTED TRANSFORMER FOR SPEECH RECOGNITIONGulati, Anmol / Qin, James / Chiu, Chung-Cheng / Parmar, Niki / Zhang, Yu / Yu, Jiahui / Han, Wei / Wang, Shibo / Zhang, Zhengdong / Wu, Yonghui et al. | 2020
- 5041
-
EXPLORING TRANSFORMERS FOR LARGE-SCALE SPEECH RECOGNITIONLu, Liang / Liu, Changliang / Li, Jinyu / Gong, Yifan et al. | 2020
- 5046
-
SPARSENESS-AWARE DOA ESTIMATION WITH MAJORIZATION MINIMIZATIONTogami, Masahito / Scheibler, Robin et al. | 2020
- 5051
-
SPATIAL RESOLUTION OF EARLY REFLECTION FOR SPEECH AND WHITE NOISEZhong, Xiaoli / Song, Hao / Liu, Xuejie et al. | 2020
- 5056
-
EFFECT OF MICROPHONE POSITION MEASUREMENT ERROR ON RIR AND ITS IMPACT ON SPEECH INTELLIGIBILITY AND QUALITYRaikar, Aditya / Nathwani, Karan / Panda, Ashish / Kopparapu, Sunil Kumar et al. | 2020
- 5061
-
ONLINE BLIND REVERBERATION TIME ESTIMATION USING CRNNSDeng, Shuwen / Mack, Wolfgang / Habets, Emanuel A. P. et al. | 2020
- 5066
-
SINGLE-CHANNEL BLIND DIRECT-TO-REVERBERATION RATIO ESTIMATION USING MASKINGMack, Wolfgang / Deng, Shuwen / Habets, Emanuel A. P. et al. | 2020
- 5071
-
THE IMPORTANCE OF TIME-FREQUENCY AVERAGING FOR BINAURAL SPEAKER LOCALIZATION IN REVERBERANT ENVIRONMENTSBeit-On, Hanan / Tourbabin, Vladimir / Rafaely, Boaz et al. | 2020
- 5076
-
ACOUSTIC SIGNAL ENHANCEMENT USING RELATIVE HARMONIC COEFFICIENTS: SPHERICAL HARMONICS DOMAIN APPROACHHu, Yonggang / Samarasinghe, Prasanga N. / Abhayapala, Thushara D. et al. | 2020
- 5081
-
INSTANTANEOUS TIME DELAY ESTIMATION OF BROADBAND SIGNALSMurthy, B. H. V. S. Narayana / Satyanarayana, J. V. / Chennupati, Nivedita / Yegnanarayana, B. et al. | 2020
- 5086
-
U-NET BASED DIRECT-PATH DOMINANCE TEST FOR ROBUST DIRECTION-OF-ARRIVAL ESTIMATIONWang, Hao / Chen, Kai / Lu, Jing et al. | 2020
- 5091
-
SOUND EVENT LOCALIZATION AND DETECTION BASED ON MULTIPLE DOA BEAMFORMING AND MULTI-TASK LEARNINGXue, Wei / Tong, Ying / Zhang, Chao / Ding, Guohong / He, Xiaodong / Zhou, Bowen et al. | 2020
-
IMPROVING LOW RESOURCE CODE-SWITCHED ASR USING AUGMENTED CODE-SWITCHED TTSSharma, Yash / Abraham, Basil / Taneja, Karan / Jyothi, Preethi et al. | 2020