AN AUTOMATIC, SIMPLE ULTRASOUND BIOFEEDBACK PARAMETER FOR DISTINGUISHING ACCURATE AND MISARTICULATED RHOTIC SYLLABLES (English)
- New search for: Li, Sarah R.
- New search for: Annand, Colin T.
- New search for: Dugan, Sarah
- New search for: Schwab, Sarah M.
- New search for: Eary, Kathryn J.
- New search for: Swearengen, Michael
- New search for: Stack, Sarah
- New search for: Boyce, Suzanne
- New search for: Riley, Michael A.
- New search for: Mast, T. Douglas
- New search for: Li, Sarah R.
- New search for: Annand, Colin T.
- New search for: Dugan, Sarah
- New search for: Schwab, Sarah M.
- New search for: Eary, Kathryn J.
- New search for: Swearengen, Michael
- New search for: Stack, Sarah
- New search for: Boyce, Suzanne
- New search for: Riley, Michael A.
- New search for: Mast, T. Douglas
In:
22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 1 of 6
; 471-475
;
2021
- Conference paper / Print
-
Title:AN AUTOMATIC, SIMPLE ULTRASOUND BIOFEEDBACK PARAMETER FOR DISTINGUISHING ACCURATE AND MISARTICULATED RHOTIC SYLLABLES
-
Contributors:Li, Sarah R. ( author ) / Annand, Colin T. ( author ) / Dugan, Sarah ( author ) / Schwab, Sarah M. ( author ) / Eary, Kathryn J. ( author ) / Swearengen, Michael ( author ) / Stack, Sarah ( author ) / Boyce, Suzanne ( author ) / Riley, Michael A. ( author ) / Mast, T. Douglas ( author )
-
Conference:INTERSPEECH ; 22. ; 2021 ; Brünn; Online
-
Published in:
-
Publisher:
- New search for: Curran Associates, Inc.
-
Place of publication:Red Hook, NY
-
Publication date:2021
-
Type of media:Conference paper
-
Type of material:Print
-
Language:English
-
Source:
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
OPTIMIZING LATENCY FOR ONLINE VIDEO CAPTIONING USING AUDIO-VISUAL TRANSFORMERSHori, Chiori / Hori, Takaaki / Roux, Jonathan Le et al. | 2021
- 6
-
SPECMIX : A MIXED SAMPLE DATA AUGMENTATION METHOD FOR TRAINING WITH TIME-FREQUENCY DOMAIN FEATURESKim, Gwantae / Han, David K. / Ko, Hanseok et al. | 2021
- 11
-
EVENT SPECIFIC ATTENTION FOR POLYPHONIC SOUND EVENT DETECTIONSundar, Harshavardhan / Sun, Ming / Wang, Chao et al. | 2021
- 16
-
AN EVALUATION OF DATA AUGMENTATION METHODS FOR SOUND SCENE GEOTAGGINGBear, Helen L. / Morfi, Veronica / Benetos, Emmanouil et al. | 2021
- 21
-
SPECAUGMENT++: A HIDDEN SPACE DATA AUGMENTATION METHOD FOR ACOUSTIC SCENE CLASSIFICATIONWang, Helin / Zou, Yuexian / Wang, Wenwu et al. | 2021
- 26
-
ACOUSTIC SCENE CLASSIFICATION USING KERVOLUTION-BASED SUBSPECTRALNETNandi, Ritika / Shekhar, Shashank / Mulimani, Manjunath et al. | 2021
- 31
-
VARIATIONAL INFORMATION BOTTLENECK FOR EFFECTIVE LOW-RESOURCE AUDIO CLASSIFICATIONSi, Shijing / Wang, Jianzong / Sun, Huiming / Wu, Jianhan / Zhang, Chuanyao / Qu, Xiaoyang / Cheng, Ning / Chen, Lei / Xiao, Jing et al. | 2021
- 36
-
IMPROVING WEAKLY SUPERVISED SOUND EVENT DETECTION WITH SELF-SUPERVISED AUXILIARY TASKSDeshmukh, Soham / Raj, Bhiksha / Singh, Rita et al. | 2021
- 41
-
SHALLOW CONVOLUTION-AUGMENTED TRANSFORMER WITH DIFFERENTIABLE NEURAL COMPUTER FOR LOW-COMPLEXITY CLASSIFICATION OF VARIABLE-LENGTH ACOUSTIC SCENESeo, Soonshin / Lee, Donghyun / Kim, Ji-Hwan et al. | 2021
- 46
-
ACOUSTIC EVENT DETECTION WITH CLASSIFIER CHAINSKomatsu, Tatsuya / Watanabe, Shinji / Miyazaki, Koichi / Hayashi, Tomoki et al. | 2021
- 51
-
AN EFFECTIVE MUTUAL MEAN TEACHING BASED DOMAIN ADAPTATION METHOD FOR SOUND EVENT DETECTIONZheng, Xu / Song, Yan / Dai, Li-Rong / McLoughlin, Ian / Liu, Lin et al. | 2021
- 56
-
AST: AUDIO SPECTROGRAM TRANSFORMERGong, Yuan / Chung, Yu-An / Glass, James et al. | 2021
- 61
-
DEEP FEATURE TRANSFER LEARNING FOR AUTOMATIC PRONUNCIATION ASSESSMENTLin, Binghuai / Wang, Liyuan et al. | 2021
- 66
-
WEAKLY-SUPERVISED WORD-LEVEL PRONUNCIATION ERROR DETECTION IN NON-NATIVE ENGLISH SPEECHKorzekwa, Daniel / Lorenzo-Trueba, Jaime / Drugman, Thomas / Calamaro, Shira / Kostek, Bozena et al. | 2021
- 71
-
UNDERSTANDING MEDICAL CONVERSATIONS: RICH TRANSCRIPTION, CONFIDENCE SCORES & INFORMATION EXTRACTIONSoltau, Hagen / Wang, Minggiu / Shafran, Izhak / Shafey, Laurent El et al. | 2021
- 76
-
MULTILINGUAL SPEECH EVALUATION: CASE STUDIES ON ENGLISH, MALAY AND TAMILZhang, Huayun / Shi, Ke / Chen, Nancy F. et al. | 2021
- 81
-
PHONE-LEVEL PRONUNCIATION SCORING FOR SPANISH SPEAKERS LEARNING ENGLISH USING A GOP-DNN SYSTEMVidal, Jazmin / Bonomi, Cyntia / Sancinetti, Marcelo / Ferrer, Luciana et al. | 2021
- 86
-
A STUDY ON FINE-TUNING WAV2VEC2.0 MODEL FOR THE TASK OF MISPRONUNCIATION DETECTION AND DIAGNOSISPeng, Linkai / Fu, Kaiqi / Lin, Binghuai / Ke, Dengfeng / Zhan, Jinsong et al. | 2021
- 91
-
END-TO-END SPEAKER-ATTRIBUTED ASR WITH TRANSFORMERKanda, Naoyuki / Ye, Guoli / Gaur, Yashesh / Wang, Xiaofei / Meng, Zhong / Chen, Zhuo / Yoshioka, Takuya et al. | 2021
- 96
-
“YOU DON'T UNDERSTAND ME!": COMPARING ASR RESULTS FOR L1 AND L2 SPEAKERS OF SWEDISHCumbal, Ronald / Moell, Birger / Lopes, Jose / Engwall, Olov et al. | 2021
- 101
-
IMPROVEMENT OF AUTOMATIC ENGLISH PRONUNCIATION ASSESSMENT WITH SMALL NUMBER OF UTTERANCES USING SENTENCE SPEAKABILITYNaijo, Satsuki / Ito, Akinori / Nose, Takashi et al. | 2021
- 106
-
LEXICAL DENSITY ANALYSIS OF WORD PRODUCTIONS IN JAPANESE ENGLISH USING ACOUSTIC WORD EMBEDDINGSAndo, Shintaro / Minematsu, Nobuaki / Saito, Daisuke et al. | 2021
- 111
-
END-TO-END RICH TRANSCRIPTION-STYLE AUTOMATIC SPEECH RECOGNITION WITH SEMI-SUPERVISED LEARNINGTanaka, Tomohiro / Masumura, Ryo / Ihori, Mana / Takashima, Akihiko / Orihashi, Shota / Makishima, Naoki et al. | 2021
- 116
-
EXPLORE WAV2VEC 2.0 FOR MISPRONUNCIATION DETECTIONXu, Xiaoshuo / Kang, Yueteng / Cao, Songjun / Lin, Binghuai / Ma, Long et al. | 2021
- 121
-
NEMO INVERSE TEXT NORMALIZATION: FROM DEVELOPMENT TO PRODUCTIONZhang, Yang / Bakhturina, Evelina / Gorman, Kyle / Ginsburg, Boris et al. | 2021
- 126
-
THE IMPACT OF ASR ON THE AUTOMATIC ANALYSIS OF LINGUISTIC COMPLEXITY AND SOPHISTICATION IN SPONTANEOUS L2 SPEECHQiao, Yu / Zhou, Wei / Kerz, Elma / Schluter, Ralf et al. | 2021
- 131
-
FEARLESS STEPS CHALLENGE PHASE-3 (FSC P3): ADVANCING SLT FOR UNSEEN CHANNEL AND MISSION DATA ACROSS NASA APOLLO AUDIOJoglekar, Aditya / Sadjadi, Seyed Omid / Chandra-Shekar, Meena / Cieri, Christopher / Hansen, John H. L. et al. | 2021
- 136
-
NEURAL TEXT DENORMALIZATION FOR SPEECH TRANSCRIPTSSuter, Benjamin / Novak, Josef et al. | 2021
- 141
-
ALIGNED CONTRASTIVE PREDICTIVE CODINGChorowski, Jan / Ciesielski, Grzegorz / Dzikowski, Jaroslaw / Lancucki, Adrian / Marxer, Ricard / Opala, Mateusz / Pusz, Piotr / Rychlikowski, Pawel / Stypulkowski, Michal et al. | 2021
- 146
-
INFORMATION RETRIEVAL FOR ZEROSPEECH 2021: THE SUBMISSION BY UNIVERSITY OF WROCLAWChorowski, Jan / Ciesielski, Grzegorz / Dzikowski, Jaroslaw / Lancucki, Adrian / Marxer, Ricard / Opala, Mateusz / Pusz, Piotr / Rychlikowski, Pawel / Stypulkowski, Michal et al. | 2021
- 151
-
NEURAL SPEAKER EMBEDDINGS FOR ULTRASOUND-BASED SILENT SPEECH INTERFACESShandiz, Amin Honarmandi / Toth, Laszlo / Gosztolya, Gabor / Marko, Alexandra / Csapo, Tamas Gabor et al. | 2021
- 156
-
AUTOMATICALLY DETECTING ERRORS AND DISFLUENCIES IN READ SPEECH TO PREDICT COGNITIVE IMPAIRMENT IN PEOPLE WITH PARKINSON'S DISEASERomana, Amrit / Bandon, John / Perez, Matthew / Gutierrez, Stephanie / Richter, Richard / Roberts, Angela / Provost, Emily Mower et al. | 2021
- 161
-
LATE FUSION OF THE AVAILABLE LEXICON AND RAW WAVEFORM-BASED ACOUSTIC MODELING FOR DEPRESSION AND DEMENTIA RECOGNITIONVillatoro-Tello, Esau / Dubagunta, S. Pavankumar / Fritsch, Julian / Ramirez-De-La-Rosa, Gabriela / Motlicek, Petr / Magimai-Doss, Mathew et al. | 2021
- 166
-
SPEECH DISORDER CLASSIFICA TION USING EXTENDED FACTORIZED HIERARCHICAL VARIATIONAL AUTO-ENCODERSOi, Jinzi / Hamme, Hugo Van et al. | 2021
- 171
-
AUTOMATIC EXTRACTION OF SPEECH RHYTHM DESCRIPTORS FOR SPEECH INTELLIGIBILITY ASSESSMENT IN THE CONTEXT OF HEAD AND NECK CANCERSVaysse, Robin / Farinas, Jerome / Astesano, Corine / Andre-Obrecht, Regine et al. | 2021
- 176
-
THE IMPACT OF FORCED-ALIGNMENT ERRORS ON AUTOMATIC PRONUNCIATION EVALUATIONMathad, Vikram C. / Mahr, Tristan J. / Scherer, Nancy / Chapman, Kathy / Hustad, Katherine C. / Liss, Julie / Berisha, Visar et al. | 2021
- 181
-
ASSESSING POSTERIOR-BASED MISPRONUNCIATION DETECTION ON FIELD- COLLECTED RECORDINGS FROM CHILD SPEECH THERAPY SESSIONSHair, Adam / Zhao, Guanlong / Ahmed, Beena / Ballard, Kirrie J. / Gutierrez-Osuna, Ricardo et al. | 2021
- 186
-
IDENTIFYING COGNITIVE IMPAIRMENT USING SENTENCE VECTORSMirheidari, Bahman / Pan, Yilin / Blackburn, Daniel / O'Malley, Ronan / Christensen, Heidi et al. | 2021
- 191
-
PHONETIC COMPLEXITY, SPEECH ACCURACY AND INTELLIGIBILITY ASSESSMENT OF ITALIAN DYSARTHRIC SPEECHFivela, Barbara Gili / Sallustio, Vincenzo / Pede, Silvia / Patrocinio, Danilo et al. | 2021
- 196
-
UNSUPERVISED DOMAIN ADAPTATION FOR DYSARTHRIC SPEECH DETECTION VIA DOMAIN ADVERSARIAL TRAINING AND MUTUAL INFORMATION MINIMIZATIONWang, Disong / Deng, Liqun / Yeung, Yu Ting / Chen, Xiao / Liu, Xunying / Meng, Helen et al. | 2021
- 201
-
CLAC: A SPEECH CORPUS OF HEALTHY ENGLISH SPEAKERSHaulcy, R'Mani / Glass, James et al. | 2021
- 206
-
DETECTION OF CONSONANT ERRORS IN DISORDERED SPEECH BASED ON CONSONANT-VOWEL SEGMENT EMBEDDINGNg, Si-Ioi / Ng, Cymie Wing-Yee / Li, Jingyu / Lee, Tan et al. | 2021
- 211
-
SOURCE AND VOCAL TRACT CVUES FOR SPEECH-BASED CLASSIFICATION OF PATIENTS WITH PARKINSON'S DISEASE AND HEALTHY SUBJECTSBhattacharjee, Tanuka / Mallela, Jhansi / Belur, Yamini / Atchayaram, Nalini / Yadav, Ravi / Reddy, Pradeep / Gope, Dipanjan / Ghosh, Prasanta Kumar et al. | 2021
- 216
-
UNCERTAINTY-AWARE COVID-19 DETECTION FROM IMBALANCED SOUND DATAXia, Tong / Han, Jing / Oendro, Lorena / Dang, Ting / Mascolo, Cecilia et al. | 2021
- 221
-
VOCALIZATION RECOGNITION OF PEOPLE WITH PROFOUND INTELLECTUAL AND MULTIPLE DISABILITIES (PIMD) USING MACHINE LEARNING ALGORITHMSJesko, Waldemar et al. | 2021
- 226
-
SPEECH INTELLIGIBILITY OF DYSARTHRIC SPEECH: HUMAN SCORES AND ACOUSTIC-PHONETIC FEATURESXue, Wei / Hout, Roeland Van / Boogmans, Fleur / Ganzeboom, Mario / Cucchiarini, Catia / Strik, Helmer et al. | 2021
- 231
-
ANALYZING SHORT TERM DYNAMIC SPEECH FEATURES FOR UNDERSTANDING BEHAVIORAL TRAITS OF CHILDREN WITH AUTISM SPECTRUM DISORDERKim, Young-Kyung / Lahiri, Rimita / Nasir, Md. / Kim, So Hyun / Bishop, Somer / Lord, Catherine / Narayanan, Shrikanth S. et al. | 2021
- 236
-
PARENTAL SPOKEN SCAFFOLDING AND NARRATIVE SKILLS IN CROWD-SOURCED STORYTELLING SAMPLES OF YOUNG CHILDRENYue, Zhengjun / Barker, Jon / Christensen, Heidi / McKean, Cristina / Ashton, Elaine / Wren, Yvonne / Gadgil, Swapnil / Bright, Rebecca et al. | 2021
- 241
-
MODELING THE EFFECT OF MILITARY OXYGEN MASKS ON SPEECH CHARACTERISTICSElie, Benjamin / Gauvain, Jodie / Gauvain, Jean-Luc / Lamel, Lori et al. | 2021
- 246
-
DETECTING ENGLISH SPEECH IN THE AIR TRAFFIC CONTROL VOICE COMMUNICATIONSzoke, Igor / Kesiraju, Santosh / Novotny, Ondrej / Kocour, Martin / Vesely, Karel / Cernocky, Jan et al. | 2021
- 251
-
CONTEXTUAL SEMI-SUPERVISED LEARNING: AN APPROACH TO LEVERAGE AIR- SURVEILLANCE AND UNTRANSCRIBED ATC DATA IN ASR SYSTEMSZuluaga-Gomez, Juan / Nigmatulina, Iuliia / Prasad, Amrutha / Motlicek, Petr / Vesely, Karel / Kocour, Martin / Szoke, Igor et al. | 2021
- 256
-
BOOSTING OF CONTEXTUAL INFORMATION IN ASR FOR AIR-TRAFFIC CALL-SIGN RECOGNITIONKocour, Martin / Vesely, Karel / Blatt, Alexander / Gomez, Juan Zuluaga / Szoke, Igor / Cernocky, Jan / Klakow, Dietrich / Motlicek, Petr et al. | 2021
- 261
-
TOWARDS AN ACCENT-ROBUST APPROACH FOR ATC COMMUNICATIONS TRANSCRIPTIONJahchan, Nataly / Barbier, Florentin / Gita, Ariyanidevi Dharma / Khelif, Khaled / Delpech, Estelle et al. | 2021
- 266
-
ROBUST COMMAND RECOGNITION FOR LITHUANIAN AIR TRAFFIC CONTROL TOWER UTTERANCESOhneiser, Oliver / Sarfioo, Seyyed Saeed / Helmke, Hartmut / Shetty, Shruthi / Motlicek, Petr / Kleinert, Matthias / Ehr, Heiko / Murauskas, Sarunas et al. | 2021
- 271
-
EFFECTS OF VOICE TYPE AND TASK ON L2 LEARNERS' AWARENESS OF PRONUNCIATION ERRORSSilpachai, Alif / Rehman, Ivana / Barriuso, Taylor Anne / Levis, John / Chukharev-Hudilainen, Evgeny / Zhao, Guanlong / Gutierrez-Osuna, Ricardo et al. | 2021
- 276
-
LEXICAL ENTRAINMENT AND INTRA-SPEAKER VARIABILITY IN COOPERATIVE DIALOGUESMenshikova, Alla / Kocharov, Daniil / Kachkovskaia, Tatiana et al. | 2021
- 281
-
ANALYSIS OF EYE GAZE REASONS AND GAZE AVERSIONS DURING THREE-PARTY CONVERSATIONSIshi, Carlos Toshinori / Shintani, Taiken et al. | 2021
- 286
-
A PSYCHOLOGY-DRIVEN COMPUTATIONAL ANALYSIS OF POLITICAL INTERVIEWSCook, Darren / Zilka, Miri / Maskell, Simon / Alison, Laurence et al. | 2021
- 291
-
INVESTIGATING THE INTERPLAY BETWEEN AFFECTIVE, PHONATORY AND MOTORIC SUBSYSTEMS IN AUTISM SPECTRUM DISORDER USING A MULTIMODAL DIALOGUE AGENTKothare, Hardik / Ramanarayanan, Vikram / Roesler, Oliver / Neumann, Michael / Liscombe, Jackson / Burke, William / Cornish, Andrew / Habberstad, Doug / Sakallah, Alaa / Markuson, Sara et al. | 2021
- 296
-
CROSS-MODAL LEARNING FOR AUDIO-VISUAL VIDEO PARSINGLamba, Jatin / Abhishek / Akula, Jayaprakash / Dabral, Rishabh / Jyothi, Preethi / Ramakrishnan, Ganesh et al. | 2021
- 301
-
SPEECH EMOTION RECOGNITION BASED ON ATTENTION WEIGHT CORRECTION USING WORD-LEVEL CONFIDENCE MEASURESantoso, Jennifer / Yamada, Takeshi / Makino, Shoji / Ishizuka, Kenkichi / Hiramura, Takekatsu et al. | 2021
- 306
-
DETECTING ALZHEIMER'S DISEASE USING INTERACTIONAL AND ACOUSTIC FEATURES FROM SPONTANEOUS SPEECHNasreen, Shamila / Hough, Julian / Purver, Matthew et al. | 2021
- 311
-
REAL-TIME MULTI-CHANNEL SPEECH ENHANCEMENT BASED ON NEURAL NETWORK MASKING WITH ATTENTION MODELXue, Cheng / Huang, Weilong / Chen, Weiguang / Feng, Jinwei et al. | 2021
- 316
-
IMPROVING CHANNEL DECORRELATION FOR MULTI-CHANNEL TARGET SPEECH EXTRACTIONHan, Jiangyu / Rao, Wei / Wang, Yannan / Long, Yanhua et al. | 2021
- 321
-
INPLACE GATED CONVOLUTIONAL RECURRENT NEURAL NETWORK FOR DUAL-CHANNEL SPEECH ENHANCEMENTLiu, Jinjiang / Zhang, Xueliang et al. | 2021
- 326
-
SRIB-LEAP SUBMISSION TO FAR-FIELD MULTI-CHANNEL SPEECH ENHANCEMENT CHALLENGE FOR VIDEO CONFERENCINGRaj, R. G. Prithvi / Kumar, Rohit / Jayesh, M. K. / Purushothaman, Anurenjan / Ganapathy, Sriram / Shaik, M. A. Basha et al. | 2021
- 331
-
A PARTITIONED-BLOCK FREQUENCY-DOMAIN ADAPTIVE KALMAN FILTER FOR STEREOPHONIC ACOUSTIC ECHO CANCELLATIONZhu, Rui / Yang, Feiran / Li, Yuepeng / Shang, Shidong et al. | 2021
- 336
-
REAL-TIME INDEPENDENT VECTOR ANALYSIS USING SEMI-SUPERVISED NONNEGATIVE MATRIX FACTORIZATION AS A SOURCE MODELWang, Taihui / Yang, Feiran / Zhu, Rui / Yang, Jun et al. | 2021
- 341
-
A CAUSAL U-NET BASED NEURAL BEAMFORMING NETWORK FOR REAL-TIME MULTI-CHANNEL SPEECH ENHANCEMENTRen, Xinlei / Zhang, Xu / Chen, Lianwu / Zheng, Xiguang / Zhang, Chen / Guo, Liang / Yu, Bing et al. | 2021
- 346
-
UNSUPERVISED CROSS-LINGUAL REPRESENTATION LEARNING FOR SPEECH RECOGNITIONConneau, Alexis / Baevski, Alexei / Collobert, Ronan / Mohamed, Abdelrahman / Auli, Michael et al. | 2021
- 351
-
MUCS 2021: MULTILINGUAL AND CODE-SWITCHING ASR CHALLENGES FOR LOW RESOURCE INDIAN LANGUAGESDiwan, Anuj / Vaideeswaran, Rakesh / Shah, Sanket / Singh, Ankita / Raghavan, Srinivasa / Khare, Shreya / Unni, Vinit / Vyas, Saurabh / Rajpuria, Akash / Yarra, Chiranjeevi et al. | 2021
- 356
-
DIFFERENTIABLE ALLOPHONE GRAPHS FOR LANGUAGE-UNIVERSAL SPEECH RECOGNITIONYan, Brian / Dalmia, Siddharth / Mortensen, David R. / Metze, Florian / Watanabe, Shinji et al. | 2021
- 361
-
ADAPT-AND-ADJUST: OVERCOMING THE LONG-TAIL PROBLEM OF MULTILINGUAL SPEECH RECOGNITIONWinata, Genta Indra / Wang, Guangsen / Xiong, Caiming / Hoi, Steven et al. | 2021
- 366
-
SRI-B END-TO-END SYSTEM FOR MULTILINGUAL AND CODE-SWITCHING ASR CHALLENGES FOR LOW RESOURCE INDIAN LANGUAGESSailor, Hardik / Praveen, Kiran T. / Agrawal, Vikas / Jain, Abhinav / Pandey, Abhishek et al. | 2021
- 371
-
USING LARGE SELF-SUPERVISED MODELS FOR LOW-RESOURCE SPEECH RECOGNITIONKrishna, D. N. / Wang, Pinyi / Bozza, Bruno et al. | 2021
- 376
-
BOOTSTRAP AN END-TO-END ASR SYSTEM BY MULTILINGUAL TRAINING, TRANSFER LEARNING, TEXT-TO-TEXT MAPPING AND SYNTHETIC AUDIOGiollo, Manuel / Gunceler, Deniz / Liu, Yulan / Willett, Daniel et al. | 2021
- 381
-
DUAL SCRIPT E2E FRAMEWORK FOR MULTILINGUAL AND CODE-SWITCHING ASRKumar, Mari Ganesh / Kuriakose, Jom / Thyagachandran, Anand / Kumar, Arun A. / Seth, Ashish / Prasad, Lodagala V. S. V. Durga / Jaiswal, Saish / Prakash, Anusha / Murthy, Hema A. et al. | 2021
- 386
-
EFFICIENT WEIGHT FACTORIZATION FOR MULTILINGUAL SPEECH RECOCGNITIONPham, Ngoc-Quan / Nguyen, Tuan-Nam / Stuker, Sebastian / Waibel, Alex et al. | 2021
- 391
-
TOWARDS ONE MODEL TO RULE ALL: MULTILINGUAL STRATEGY FOR DIALECTAL CODE-SWITCHING ARABIC ASRChowdhury, Shammur Absar / Hussein, Amir / Abdelali, Ahmed / Ali, Ahmed et al. | 2021
- 396
-
LANGUAGE AND SPEAKER-INDEPENDENT FEATURE TRANSFORMATION FOR END- TO-END MULTILINGUAL SPEECH RECOGNITIONHayakawa, Tomoaki / Leow, Chee Siang / Kobayashi, Akio / Utsuro, Takehito / Nishizaki, Hiromitsu et al. | 2021
- 401
-
HIERARCHICAL PHONE RECOGNITION WITH COMPOSITIONAL PHONETICSLi, Xinjian / Li, Juncheng / Metze, Florian / Black, Alan W. et al. | 2021
- 406
-
ON MODELING GLOTTAL SOURCE INFORMATION FOR PHONATION ASSESSMENT IN PARKINSON'S DISEASEVasquez-Correa, J. C. / Fritsch, Julian / Orozco-Arroyave, J. R. / Noth, Elmar / Magimai-Doss, Mathew et al. | 2021
- 411
-
DISTORTION OF VOICED OBSTRUENTS FOR DIFFERENTIAL DIAGNOSIS BETWEEN PARKINSON'S DISEASE AND MULTIPLE SYSTEM A TROPHYDaoudi, Khalid / Das, Biswajit / Victor, Solange Milhe De Saint / Foubert-Samier, Alexandra / Traon, Anne Pavy-Le / Rascol, Olivier / Meissner, Wassilios G. / Woisard, Virginie et al. | 2021
- 416
-
A STUDY INTO PRE-TRAINING STRATEGIES FOR SPOKEN LANGUAGE UNDERSTANDING ON DYSARTHRIC SPEECHWang, Pu / Babaali, Bagher / Hamme, Hugo Van et al. | 2021
- 421
-
EASYCALL CORPUS: A DYSARTHRIC SPEECH DATASETTurrisi, Rosanna / Braccia, Arianna / Emanuele, Marco / Giulietti, Simone / Pugliatti, Maura / Sensi, Mariachiara / Fadiga, Luciano / Badino, Leonardo et al. | 2021
- 426
-
ACOUSTIC INDICATORS OF SPEECH MOTOR COORDINATION IN ADULTS WITH AND WITHOUT TRAUMATIC BRAIN INJURYTalkar, Tanya / Solomon, Nancy Pearl / Brungart, Douglas S. / Kuchinsky, Stefanie E. / Eitel, Megan M. / Lippa, Sara M. / Brickell, Tracey A. / French, Louis M. / Lange, Rael T. / Quatieri, Thomas F. et al. | 2021
- 431
-
IMAGE-BASED ASSESSMENT OF JAW PARAMETERS AND JAW KINEMATICS FOR ARTICULATORY SIMULATION: PRELIMINARY RESULTSAbraham, Ajish K. / Sivaramakrishnan, V. / Swapna, N. / Manohar, N. et al. | 2021
- 436
-
INVESTIGATING SPEECH RECONSTRUCTION FOR LARYNGECTOMEES FOR SILENT SPEECH INTERFACESCao, Beiming / Sebkhi, Nordine / Bhavsar, Arpan / Inan, Omer T. / Samlan, Robin / Mau, Ted / Wang, Jun et al. | 2021
- 441
-
RASSPER: RADAR-BASED SILENT SPEECH RECOGNITIONFerreira, David / Silva, Samuel / Curado, Francisco / Teixeira, Antonio et al. | 2021
- 446
-
EFFECT OF CARRIER BANDWIDTH ON UNDERSTANDING MANDARIN SENTENCES IN SIMULATED ELECTRIC-ACOUSTIC HEARINGWang, Feng / Chen, Jing / Chen, Fei et al. | 2021
- 451
-
AN ATTENTION SELF-SUPERVISED CONTRASTIVE LEARNING BASED THREE-STAGE MODEL FOR HAND SHAPE FEATURE REPRESENTATION IN CUED SPEECHWang, Jianrong / Gu, Nan / Yu, Mei / Li, Xuewei / Fang, Qiang / Liu, Li et al. | 2021
- 456
-
REMOTE SMARTPHONE-BASED SPEECH COLLECTION: ACCEPTANCE AND BARRIERS IN INDIVIDUALS WITH MAJOR DEPRESSIVE DISORDERDineley, Judith / Lavelle, Grace / Leightley, Daniel / Matcham, Faith / Siddi, Sara / Penarrubia-Maria, Maria Teresa / White, Katie M. / Ivan, Alina / Oetzmann, Carolin / Simblett, Sara et al. | 2021
- 461
-
A COMPARATIVE STUDY OF DIFFERENT EMG FEATURES FOR ACOUSTICS-TO-EMG MAPPINGSharma, Manthan / Gaddam, Navaneetha / Umesh, Tejas / Murthy, Aditya / Ghosh, Prasanta Kumar et al. | 2021
- 466
-
SILENT VERSUS MODAL MULTI-SPEAKER SPEECH RECOGNITION FROM ULTRASOUND AND VIDEORibeiro, Manuel Sam / Eshky, Aciel / Richmond, Korin / Renals, Steve et al. | 2021
- 471
-
AN AUTOMATIC, SIMPLE ULTRASOUND BIOFEEDBACK PARAMETER FOR DISTINGUISHING ACCURATE AND MISARTICULATED RHOTIC SYLLABLESLi, Sarah R. / Annand, Colin T. / Dugan, Sarah / Schwab, Sarah M. / Eary, Kathryn J. / Swearengen, Michael / Stack, Sarah / Boyce, Suzanne / Riley, Michael A. / Mast, T. Douglas et al. | 2021
- 476
-
SEGMENT AND TONE PRODUCTION IN CONTINUOUS SPEECH OF HEARING AND HEARING-IMPAIRED CHILDRENTseng, Shu-Chuan / Liu, Yi-Fen et al. | 2021
- 481
-
LEVERAGING SPEAKER ATTRIBUTE INFORMATION USING MULTI TASK LEARNING FOR SPEAKER VERIFICATION AND DIARIZATIONLuu, Chau / Bell, Peter / Renals, Steve et al. | 2021
- 486
-
ICSPK: INTERPRETABLE COMPLEX SPEAKER EMBEDDING EXTRACTOR FROM RAW WAVEFORMPeng, Junyi / Qu, Xiaoyang / Wang, Jianzong / Gu, Rongzhi / Xiao, Jing / Burget, Lukas / Cernocky, Jan et al. | 2021
- 491
-
SPINE2NET: SPINENET WITH RES2NET AND TIME-SQUEEZE-AND-EXCITATION BLOCKS FOR SPEAKER RECOGNITIONRybicka, Magdalena / Villalba, Jesus / Zelasko, Piotr / Dehak, Najim / Kowalczyk, Konrad et al. | 2021
- 496
-
SPEAKER EMBEDDINGS BY MODELING CHANNEL-WISE CORRELATIONSStafylakis, Themos / Rohdin, Johan / Burget, Lukas et al. | 2021
- 501
-
MULTI-TASK NEURAL NETWORK FOR ROBUST MULTIPLE SPEAKER EMBEDDING EXTRACTIONHe, Weipeng / Motlicek, Petr / Odobez, Jean-Marc et al. | 2021
- 506
-
SPEAKER ATTENTIVE SPEECH EMOTION RECOGNITIONMoine, Clement Le / Obin, Nicolas / Roebel, Axel et al. | 2021
- 511
-
M3: MULTIMODAL MASKING APPLIED TO SENTIMENT ANALYSISGeorgiou, Efthymios / Paraskevopoulos, Georgios / Potamianos, Alexandros et al. | 2021
- 516
-
SEPARATION OF EMOTIONAL AND RECONSTRUCTION EMBEDDINGS ON LADDER NETWORK TO IMPROVE SPEECH EMOTION RECOGNITION ROBUSTNESS IN NOISY CONDITIONSLeem, Seong-Gyun / Fulford, Daniel / Onnela, Jukka-Pekka / Gard, David / Busso, Carlos et al. | 2021
- 521
-
ACOUSTIC FEATURES AND NEURAL REPRESENTATIONS FOR CATEGORICAL EMOTION RECOGNITION FROM SPEECHKeesing, Aaron / Koh, Yun Sing / Witbrock, Michael et al. | 2021
- 526
-
AUTOMATIC ANALYSIS OF THE EMOTIONAL CONTENT OF SPEECH IN DAYLONG CHILD-CENTERED RECORDINGS FROM A NEONATAL INTENSIVE CARE UNITVaaras, Einari / Ahlqvist-Bjorkroth, Sari / Drossos, Konstantinos / Rasanen, Okko et al. | 2021
- 531
-
MULTIMODAL SENTIMENT ANALYSIS WITH TEMPORAL MODALITY ATTENTIONOian, Fan / Han, Jiging et al. | 2021
- 536
-
LEARNING FINE-GRAINED CROSS MODALITY EXCITEMENT FOR SPEECH EMOTION RECOGNITIONLi, Hang / Ding, Wenbiao / Wu, Zhongqin / Liu, Zitao et al. | 2021
- 541
-
ACTED VS. IMPROVISED: DOMAIN ADAPTATION FOR ELICITATION APPROACHES IN AUDIO-VISUAL EMOTION RECOGNITIONLi, Haoqi / Kim, Yelin / Kuo, Cheng-Hao / Narayanan, Shrikanth S. et al. | 2021
- 546
-
GRAPH ISOMORPHISM NETWORK FOR SPEECH EMOTION RECOGNITIONLiu, Jiawang / Wang, Haoxiang et al. | 2021
- 551
-
EMOTION RECOGNITION FROM SPEECH USING WAV2VEC 2.0 EMBEDDINGSPepino, Leonardo / Riera, Pablo / Ferrer, Luciana et al. | 2021
- 556
-
STOCHASTIC PROCESS REGRESSION FOR CROSS-CULTURAL SPEECH EMOTION RECOGNITIONKumar, Mani T. / Sanchez, Enrique / Tzimiropoulos, Georgios / Giesbrecht, Timo / Valstar, Michel et al. | 2021
- 561
-
APPLYING TDNN ARCHITECTURES FOR ANALYZING DURATION DEPENDENCIES ON SPEECH EMOTION RECOGNITIONKumawat, Pooja / Routray, Aurobinda et al. | 2021
- 566
-
LEVERAGING PRE-TRAINED LANGUAGE MODEL FOR SPEECH SENTIMENT ANALYSISShon, Suwon / Brusco, Pablo / Pan, Jing / Han, Kyu J. / Watanabe, Shinji et al. | 2021
- 571
-
TEMPORAL CONTEXT IN SPEECH EMOTION RECOGNITIONXia, Yangyang / Chen, Li-Wei / Rudnicky, Alexander / Stern, Richard M. et al. | 2021
- 576
-
PARAMETRIC DISTRIBUTIONS TO MODEL NUMERICAL EMOTION LABELSBose, Deboshree / Sethu, Vidhyasaharan / Ambikairajah, Eliathamby et al. | 2021
- 581
-
AFFECT RECOGNITION THROUGH SCALOGRAM AND MULTI-RESOLUTION COCHLEAGRAM FEATURESHaider, Fasih / Luz, Saturnino et al. | 2021
- 586
-
A SPEECH EMOTION RECOGNITION FRAMEWORK FOR BETTER DISCRIMINATION OF CONFUSIONSLiu, Jiawang / Wang, Haoxiang et al. | 2021
- 591
-
TIME-FREQUENCY REPRESENTATION LEARNING WITH GRAPH CONVOLUTIONAL NETWORK FOR DIALOGUE-LEVEL SPEECH EMOTION RECOGNITIONLiu, Jiaxing / Song, Yaodong / Wang, Longbiao / Dang, Jianwu / Yu, Ruiguo et al. | 2021
- 596
-
AUDIO-VISUAL SPEECH EMOTION RECOCGNITION BY DISENTANGLING EMOTION AND IDENTITY ATTRIBUTESIto, Koichiro / Fujioka, Takuya / Sun, Qinghua / Nagamatsu, Kenji et al. | 2021
- 601
-
GENERALIZED DILATED CNN MODELS FOR DEPRESSION DETECTION USING INVERTED VOCAL TRACT VARIABLESSeneviratne, Nadee / Espy-Wilson, Carol et al. | 2021
- 606
-
SPEECH EMOTION RECOCGNITION VIA MULTI-LEVEL CROSS-MODAL DISTILLATIONLi, Ruichen / Zhao, Jinming / Jin, Oin et al. | 2021
- 611
-
SPEECH EMOTION RECOGNITION WITH MULTI-TASK LEARNINGCai, Xingyu / Yuan, Jiahong / Zheng, Renjie / Huang, Liang / Church, Kenneth et al. | 2021
- 616
-
METRIC LEARNING BASED FEATURE REPRESENTATION WITH GATED FUSION MODEL FOR SPEECH EMOTION RECOGNITIONGao, Yuan / Liu, Jiaxing / Wang, Longbiao / Dang, Jianwu et al. | 2021
- 621
-
LEARNING MUTUAL CORRELATION IN MULTIMODAL TRANSFORMER FOR SPEECH EMOTION RECOGNITIONWang, Yuhua / Shen, Guang / Xu, Yuezhu / Li, Jiahang / Zhao, Zhengdao et al. | 2021
- 626
-
Y-VECTOR: MULTISCALE WAVEFORM ENCODER FOR SPEAKER EMBEDDINGZhu, Ge / Jiang, Fei / Duan, Zhiyao et al. | 2021
- 631
-
SERIALIZED MULTI-LA YER MULTI-HEAD ATTENTION FOR NEURAL SPEAKER EMBEDDINGZhu, Hongning / Lee, Kong Aik / Li, Haizhou et al. | 2021
- 636
-
BIDIRECTIONAL MULTISCALE FEATURE AGGREGATION FOR SPEAKER VERIFICATIONOi, Jiajun / Guo, Wu / Gu, Bin et al. | 2021
- 641
-
ADAPTIVE CONVOLUTIONAL NEURAL NETWORK FOR TEXT-INDEPENDENT SPEAKER RECOGNITIONKim, Seong-Hu / Park, Yong-Hwa et al. | 2021
- 646
-
BINARY NEURAL NETWORK FOR SPEAKER VERIFICATIONZhu, Tinglong / Oin, Xiaoyi / Li, Ming et al. | 2021
- 651
-
PHONEME-AWARE AND CHANNEL-WISE ATTENTIVE LEARNING FOR TEXT DEPENDENT SPEAKER VERIFICATIONLiu, Yan / Li, Zheng / Li, Lin / Hong, Qingyang et al. | 2021
- 656
-
IMPROVING DEEP CNN ARCHITECTURES WITH VARIABLE-LENGTH TRAINING SAMPLES FOR TEXT-INDEPENDENT SPEAKER VERIFICATIONWu, Yanfeng / Zhao, Junan / Guo, Chenkai / Xu, Jing et al. | 2021
- 661
-
MUTUAL INFORMATION ENHANCED TRAINING FOR SPEAKER EMBEDDINGTu, Youzhi / Mak, Man-Wai et al. | 2021
- 666
-
IMPROVING TIME DELAY NEURAL NETWORK BASED SPEAKER RECOGNITION WITH CONVOLUTIONAL BLOCK AND FEATURE AGGREGATION METHODSZhang, Yu-Jia / Wang, Yih-Wen / Chen, Chia-Ping / Lu, Chung-Li / Chan, Bo-Cheng et al. | 2021
- 671
-
REFORMULATING DOVER-LAP LABEL MAPPING AS A GRAPH PARTITIONING PROBLEMRaj, Desh / Khudanpur, Sanjeev et al. | 2021
- 676
-
GRAPH ATTENTION NETWORKS FOR ANTI-SPOOFINGTak, Hemlata / Jung, Jee-Weon / Patino, Jose / Todisco, Massimiliano / Evans, Nicholas et al. | 2021
- 681
-
EFFECTIVE PHASE ENCODING FOR END-TO-END SPEAKER VERIFICATIONPeng, Junyi / Qu, Xiaoyang / Gu, Rongzhi / Wang, Jianzong / Xiao, Jing / Burget, Lukas / Cernocky, Jan et al. | 2021
- 686
-
LOG-LIKELIHOOD-RATIO COST FUNCTION AS OBJECTIVE LOSS FOR SPEAKER VERIFICATION SYSTEMSMingote, Victoria / Miguel, Antonio / Ortega, Alfonso / Lleida, Eduardo et al. | 2021
- 691
-
ACOUSTIC-PROSODIC, LEXICAL AND DEMOGRAPHIC CUES TO PERSUASIVENESS IN COMPETITIVE DEBATE SPEECHESNguyen, Huyen / Vente, Ralph / Lupea, David / Levitan, Sarah Ita / Hirschberg, Julia et al. | 2021
- 696
-
AUDIO-VISUAL RECOGNITION OF EMOTIONAL ENGAGEMENT OF PEOPLE WITH DEMENTIASteinert, Lars / Putze, Felix / Kuster, Dennis / Schultz, Tanja et al. | 2021
- 701
-
SPEAKING CORONA? HUMAN AND MACHINE RECOGNITION OF COVID-19 FROM VOICEHecker, Pascal / Pokorny, Florian B. / Bartl-Pokorny, Katrin D. / Reichel, Uwe / Ren, Zhao / Hantke, Simone / Eyben, Florian / Schuller, Dagmar M. / Arnrich, Bert / Schuller, Bjorn W. et al. | 2021
- 706
-
MEASURING VOICE QUALITY PARAMETERS AFTER SPEAKER PSEUDONYMIZATIONSon, Rob J. J. H. Van et al. | 2021
- 711
-
EMOTION CARRIER RECOGNITION FROM PERSONAL NARRATIVESTammewar, Aniruddha / Cervone, Alessandra / Riccardi, Giuseppe et al. | 2021
- 716
-
VISUAL SPEECH FOR OBSTRUCTIVE SLEEP APNEA DETECTIONBotelho, Catarina / Abad, Alberto / Schultz, Tanja / Trancoso, Isabel et al. | 2021
- 721
-
TDCA-NET: TIME-DOMAIN CHANNEL ATTENTION NETWORK FOR DEPRESSION DETECTIONCai, Cong / Niu, Mingyue / Liu, Bin / Tao, Jianhua / Liu, Xuefei et al. | 2021
- 726
-
ANALYSIS OF CONTEXTUAL VOICE CHANGES IN REMOTE MEETINGSMaruri, Hector A. Cordourier / Aslan, Sinem / Stemmer, Georg / Alyuz, Nese / Nachman, Lama et al. | 2021
- 731
-
STACKED RECURRENT NEURAL NETWORKS FOR SPEECH-BASED INFERENCE OF ATTACHMENT CONDITION IN SCHOOL AGE CHILDRENAlsofyani, Huda / Vinciarelli, Alessandro et al. | 2021
- 736
-
ROBUST LAUGHTER DETECTION IN NOISY ENVIRONMENTSGillick, Jon / Deng, Wesley / Ryokai, Kimiko / Bamman, David et al. | 2021