Combining selection tree with observation reordering pruning for efficient speaker identification using GMM-UBM (English)
- New search for: Zhenyu Xiong,
- New search for: Zheng, T.F.
- New search for: Zhanjiang Song,
- New search for: Wenhu Wu,
- New search for: Zhenyu Xiong,
- New search for: Zheng, T.F.
- New search for: Zhanjiang Song,
- New search for: Wenhu Wu,
In:
Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
;
1
;
I/625-I/628 Vol. 1
;
2005
-
ISBN:
-
ISSN:
- Conference paper / Electronic Resource
-
Title:Combining selection tree with observation reordering pruning for efficient speaker identification using GMM-UBM
-
Contributors:Zhenyu Xiong, ( author ) / Zheng, T.F. ( author ) / Zhanjiang Song, ( author ) / Wenhu Wu, ( author )
-
Published in:
-
Publisher:
- New search for: IEEE
-
Publication date:2005-01-01
-
Size:216149 byte
-
ISBN:
-
ISSN:
-
DOI:
-
Type of media:Conference paper
-
Type of material:Electronic Resource
-
Language:English
-
Source:
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 0_1
-
2005 IEEE International Conference on Acoustics, Speech, and Signal Processing| 2005
- 1117
-
Author index| 2005
- cxviii
-
Breaker page| 2005
- cxviii
-
ICASSP 2005 Proceedings| 2005
- I
-
SP-L4.2: SPEAKER ADAPTIVE CONFIDENCE SCORING USING BAYESIAN COMBININGKim, T.-Y. / Ko, H. / IEEE et al. | 2005
- I
-
SP-L4.3: IMPROVING UTTERANCE VERIFICATION USING ADDITIONAL CONFIDENCE MEASURES IN ISOLATED SPEECH RECOGNITION INTERFACESGreenland, G. / Wong, W. / Kunov, H. / IEEE et al. | 2005
- I
-
SP-L5.1: ADAPTATION OF PRECISION MATRIX MODELS ON LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITIONSim, K. C. / Gales, M. J. F. / IEEE et al. | 2005
- I
-
SP-L4.6: COMBINATION OF MULTIPLE PREDICTORS TO IMPROVE CONFIDENCE MEASURE BASED ON LOCAL POSTERIOR PROBABILITIESFu, Y. / Du, L. / IEEE et al. | 2005
- I
-
SP-L6.2: NON-INTRUSIVE GMM-BASED SPEECH QUALITY MEASUREMENTFalk, T. / Xu, Q. / Chan, W.-Y. / IEEE et al. | 2005
- I
-
SP-P1.6: SLIDING WINDOW SMOOTHING FOR MAXIMUM ENTROPY BASED INTONATIONAL PHRASE PREDICTION IN CHINESELi, J.-F. / Hu, G.-P. / Wang, R.-H. / Dai, L.-R. / IEEE et al. | 2005
- I
-
SP-P2.2: "OF ALL THINGS THE MEASURE IS MAN": AUTOMATIC CLASSIFICATION OF EMOTIONS AND INTER-LABELER CONSISTENCYSteidl, S. / Levit, M. / Batliner, A. / Noth, E. / Niemann, H. / IEEE et al. | 2005
- I
-
SP-P2.9: SOFT DECODING OF TEMPORAL DERIVATIVES FOR ROBUST DISTRIBUTED SPEECH RECOGNITION IN PACKET LOSSJames, A. / Milner, B. / IEEE et al. | 2005
- I
-
SP-P3.7: BAYESIAN MODEL BASED NON-INTRUSIVE SPEECH QUALITY EVALUATIONChen, G. / Parsa, V. / IEEE et al. | 2005
- I
-
SP-P4.8: ANALYSIS OF A LARGE IN-CAR SPEECH CORPUS AND ITS APPLICATION TO THE MULTIMODEL ASRFujimura, H. / Miyajima, C. / Itou, K. / Takeda, K. / Itakura, F. / IEEE et al. | 2005
- I
-
SP-P5.9: AUTOMATIC PROCESSING OF AUDIO LECTURES FOR INFORMATION RETRIEVAL: VOCABULARY SELECTION AND LANGUAGE MODELINGPark, A. / Hazen, T. / Glass, J. / IEEE et al. | 2005
- I
-
SP-P6.7: MASK ESTIMATION BASED ON SOUND LOCALISATION FOR MISSING DATA SPEECH RECOGNITIONHarding, S. / Barker, J. / Brown, G. J. / IEEE et al. | 2005
- I
-
SP-P6.10: SUBSPACE-BASED SPEAKER-INDEPENDENT VOWEL RECOGNITIONMuralishankar, R. / O Shaughnessy, D. / IEEE et al. | 2005
- I
-
SP-P8.3: EXTRACTING ADDITIONAL INFORMATION FROM GAUSSIAN MIXTURE MODEL PROBABILITIES FOR IMPROVED TEXT-INDEPENDENT SPEAKER IDENTIFICATIONNarayanaswamy, B. / Gangadharaiah, R. / IEEE et al. | 2005
- I
-
SP-P8.9: A NEW COMMON COMPONENT GMM-BASED SPEAKER RECOGNITION METHODWang, Y.-R. / Chiang, C.-Y. / IEEE et al. | 2005
- I
-
SP-P10.8: ALIZE, A FREE TOOLKIT FOR SPEAKER RECOGNITIONBonastre, J.-F. / Wils, F. / Meignier, S. / IEEE et al. | 2005
- I
-
SP-P12.3: CROSS DOMAIN AUTOMATIC TRANSCRIPTION ON THE TC-STAR EPPS CORPUSGollan, C. / Bisani, M. / Kanthak, S. / Schluter, R. / Ney, H. / IEEE et al. | 2005
- I
-
SP-P13.3: VOICED/UNVOICED DETERMINATION OF SPEECH SIGNAL IN NOISY ENVIRONMENT USING HARMONICITY MEASURE BASED ON INSTANTANEOUS FREQUENCYArifianto, D. / Kobayashi, T. / IEEE et al. | 2005
- I
-
SP-P13.8: OBJECTIVE QUALITY MEASURES FOR GLOTTAL INVERSE FILTERING OF SPEECH PRESSURE SIGNALSBackstrom, T. / Airas, M. / Lehto, L. / Alku, P. / IEEE et al. | 2005
- I
-
SP-P16.9: INCORPORATING DIALOGUE CONTEXT AND TOPIC CLUSTERING IN OUT-OF-DOMAIN DETECTIONLane, I. / Kawahara, T. / IEEE et al. | 2005
- I
-
SP-P17.10: SPEECH ENHANCEMENT BASED ON FILTERING THE SPECTROTEMPORAL MODULATIONSMesgarani, N. / Shamma, S. / IEEE et al. | 2005
- I
-
SP-L1.1: POLYGLOT SYNTHESIS USING A MIXTURE OF MONOLINGUAL CORPORALatorre, J. / Iwano, K. / Furui, S. / IEEE et al. | 2005
- I
-
SP-L3.6: RELATIVE ENERGY AND INTELLIGIBILITY OF TRANSIENT SPEECH INFORMATIONYoo, S. / Boston, J. R. / Durrant, J. / Kovacyk, K. / Karn, S. / Shaiman, S. / El-Jaroudi, A. / Li, C.-C. / IEEE et al. | 2005
- I
-
SP-L5.5: MINIMUM CLASSIFICATION ERROR FOR LARGE SCALE SPEECH RECOGNITION TASKS USING WEIGHTED FINITE STATE TRANSDUCERSMcDermott, E. / Katagiri, S. / IEEE et al. | 2005
- I
-
SP-L6.3: A MULTIPLE-DESCRIPTION PCM SPEECH CODER USING STRUCTURED DUAL VECTOR QUANTIZERSVoran, S. / IEEE et al. | 2005
- I
-
SP-L7.3: ADAPTIVE TIME SEGMENTATION OF NOISY SPEECH FOR IMPROVED SPEECH ENHANCEMENTHendriks, R. C. / Heusdens, R. / Jensen, J. / IEEE et al. | 2005
- I
-
SP-L11.2: LOG-ENERGY DYNAMIC RANGE NORMALIZATION FOR ROBUST SPEECH RECOGNITIONZhu, W. / O Shaughnessy, D. / IEEE et al. | 2005
- I
-
SP-P2.7: A HIDDEN TRAJECTORY MODEL WITH BI-DIRECTIONAL TARGET-FILTERING: CASCADED VS. INTEGRATED IMPLEMENTATION FOR PHONETIC RECOGNITIONDeng, L. / Li, X. / Yu, D. / Acero, A. / IEEE et al. | 2005
- I
-
SP-P3.2: ADAPTIVE TRAINING FOR HIDDEN SEMI-MARKOV MODELYamagishi, J. / Kobayashi, T. / IEEE et al. | 2005
- I
-
SP-P6.9: INFLUENCE OF AUTOCORRELATION LAG RANGES ON ROBUST SPEECH RECOGNITIONShannon, B. J. / Paliwal, K. K. / IEEE et al. | 2005
- I
-
SP-P9.6: FUZZY PARAMETER CLUSTERING METHOD IN SPEECH RECOGNITIONXu, X. / Zhu, J. / IEEE et al. | 2005
- I
-
SP-P14.7: TONOTOPIC MULTI-LAYERED PERCEPTRON: A NEURAL NETWORK FOR LEARNING LONG-TERM TEMPORAL FEATURES FOR SPEECH RECOGNITIONChen, B. / Zhu, Q. / Morgan, N. / IEEE et al. | 2005
- I
-
SP-P14.9: QUASI-CONTINUOUS LOCAL CODEBOOK FEATURES FOR MULTILINGUAL ACOUSTIC PHONETIC MODELLINGDiehl, F. / Moreno, A. / IEEE et al. | 2005
- I
-
SP-P15.2: AUTOMATIC DISFLUENCY REMOVAL ON RECOGNIZED SPONTANEOUS SPEECH - RAPID ADAPTATION TO SPEAKER DEPENDENT DISFLUENCIESHonal, M. / Schultz, T. / IEEE et al. | 2005
- I
-
SP-P16.13: AUTOMATIC DIALOG ACT SEGMENTATION AND CLASSIFICATION IN MULTIPARTY MEETINGSAng, J. / Liu, Y. / Shriberg, E. / IEEE et al. | 2005
- I
-
SP-L2.6: UNSUPERVISED SEMANTIC INTENT DISCOVERY FROM CALL LOG ACOUSTICSLi, X. / Gunawardana, A. / Acero, A. / IEEE et al. | 2005
- I
-
SP-L6.6: CODING WITH SIDE INFORMATION TECHNIQUES FOR LSF RECONSTRUCTION IN VOICE OVER IPAgiomyrgiannakis, Y. / Stylianou, Y. / IEEE et al. | 2005
- I
-
SP-L7.4: SPEECH ENHANCEMENT USING HARMONIC REGENERATIONPlapous, C. / Marro, C. / Scalart, P. / IEEE et al. | 2005
- I
-
SP-L7.5: INSTANT NOISE ESTIMATION USING FOURIER TRANSFORM OF AMDF AND VARIABLE START MINIMA SEARCHLin, Z. / Goubran, R. A. / IEEE et al. | 2005
- I
-
SP-L8.5: PROSODY MODELING AND EIGEN-PROSODY ANALYSIS FOR ROBUST SPEAKER RECOGNITIONChen, Z.-H. / Liao, Y.-F. / Juang, Y.-T. / IEEE et al. | 2005
- I
-
SP-L9.6: LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOPHasegawa-Johnson, M. / Baker, J. / Borys, S. / Chen, K. / Coogan, E. / Greenberg, S. / Juneja, A. / Kirchhoff, K. / Livescu, K. / Mohan, S. et al. | 2005
- I
-
SP-L11.5: PARTICLE FILTER BASED NON-STATIONARY NOISE TRACKING FOR ROBUST SPEECH RECOGNITIONFujimoto, M. / Nakamura, S. / IEEE et al. | 2005
- I
-
SP-P1.5: PROSODY ANALYSIS AND MODELING FOR EMOTIONAL SPEECH SYNTHESISJiang, D.-n. / Zhang, W. / Shen, L.-q. / Cai, L.-h. / IEEE et al. | 2005
- I
-
SP-P1.7: IDENTIFICATION AND SYNTHESIS OF CANTONESE TONES BASED ON THE COMMAND-RESPONSE MODEL FOR F0 CONTOUR GENERATIONGu, W. / Hirose, K. / Fujisaki, H. / IEEE et al. | 2005
- I
-
SP-P1.8: COMPRESSION OF EXCEPTION LEXICONS FOR SMALL FOOTPRINT GRAPHEME-TO-PHONEME CONVERSIONMeron, J. / Veprek, P. / IEEE et al. | 2005
- I
-
SP-P3.4: SPEECH RECOGNITION IN THE BLIND CONDITION BASED ON MULTIPLE DIRECTIVITY PATTERNS USING A MICROPHONE ARRAYSekiya, T. / Kobayashi, T. / IEEE et al. | 2005
- I
-
SP-P3.5: AN UNSUPERVISED QUANTITATIVE MEASURE FOR WORD PROMINENCE IN SPONTANEOUS SPEECHWang, D. / Narayanan, S. / IEEE et al. | 2005
- I
-
SP-P3.13: VOICING-STATE CLASSIFICATION OF CO-CHANNEL SPEECH USING NONLINEAR STATE-SPACE RECONSTRUCTIONMahgoub, Y. / Dansereau, R. / IEEE et al. | 2005
- I
-
SP-P4.2: CONTEXT-DEPENDENT DURATION MODELINGWillett, D. / IEEE et al. | 2005
- I
-
SP-P4.3: RECOGNISING SPEECH IN THE PRESENCE OF A COMPETING SPEAKER USING A `SPEECH FRAGMENT DECODER'Coy, A. / Barker, J. / IEEE et al. | 2005
- I
-
SP-P3.14: SPEECH RATE ESTIMATION VIA TEMPORAL CORRELATION AND SELECTED SUB-BAND CORRELATIONNarayanan, S. / Wang, D. / IEEE et al. | 2005
- I
-
SP-P4.11: ACOUSTIC FEATURE COMBINATION FOR ROBUST SPEECH RECOGNITIONZolnay, A. / Schlueter, R. / Ney, H. / IEEE et al. | 2005
- I
-
SP-P5.5: FAST TWO-STAGE VOCABULARY-INDEPENDENT SEARCH IN SPONTANEOUS SPEECHYu, P. / Seide, F. / IEEE et al. | 2005
- I
-
SP-P6.2: PITCH-SYNCHRONOUS ZCPA (PS-ZCPA)-BASED FEATURE EXTRACTION WITH AUDITORY MASKINGGhulam, M. / Fukuda, T. / Horikawa, J. / Nitta, T. / IEEE et al. | 2005
- I
-
SP-P8.6: IMPROVED SPEAKER MODEL MIGRATION VIA STOCHASTIC SYNTHESISNavratil, J. / Ramaswamy, G. / IEEE et al. | 2005
- I
-
SP-P9.7: AUTOMATIC TRAINING SET SEGMENTATION FOR MULTI-PASS SPEECH RECOGNITIONMao, M. / Vanhoucke, V. / Strope, B. / IEEE et al. | 2005
- I
-
SP-P9.8: GENERALIZED STATISTICAL MODELING OF PRONUNCIATION VARIATIONS USING VARIABLE-LENGTH PHONE CONTEXTAkita, Y. / Kawahara, T. / IEEE et al. | 2005
- I
-
SP-P10.1: A PROBABILISTIC MEASURE OF MODALITY RELIABILITY IN SPEAKER VERIFICATIONRichiardi, J. / Prodanov, P. / Drygajlo, A. / IEEE et al. | 2005
- I
-
SP-P11.6: VOICE ACTIVITY DETECTION BASED ON GENERALIZED GAMMA DISTRIBUTIONShin, J. W. / Chang, J.-H. / Yun, H. S. / Kim, N. S. / IEEE et al. | 2005
- I
-
SP-P12.9: INVESTIGATION OF ACOUSTIC MODELING TECHNIQUES FOR LVCSR SYSTEMSLiu, X. / Gales, M. J. F. / Sim, K. C. / Yu, K. / IEEE et al. | 2005
- I
-
SP-P12.8: BAYESIAN MODEL COMBINATION (BAYCOM) FOR IMPROVED RECOGNITIONSankar, A. / IEEE et al. | 2005
- I
-
SP-P15.4: TWO-STAGE SPEAKER ADAPTATION OF HYBRID TIED-POSTERIOR ACOUSTIC MODELSStadermann, J. / Rigoll, G. / IEEE et al. | 2005
- I
-
SP-P15.11: ALTERNATE PHONE MODELS FOR CONVERSATIONAL SPEECHLamel, L. / Gauvain, J.-L. / IEEE et al. | 2005
- I
-
SP-P16.11: A NEW ASR EVALUATION MEASURE AND MINIMUM BAYES-RISK DECODING FOR OPEN-DOMAIN SPEECH UNDERSTANDINGNanjo, H. / Kawahara, T. / IEEE et al. | 2005
- I
-
SP-P17.4: OVERCOMING THE STATISTICAL INDEPENDENCE ASSUMPTION W.R.T. FREQUENCY IN SPEECH ENHANCEMENTFingscheidt, T. / Beaugeant, C. / Suhadi, S. / IEEE et al. | 2005
- I
-
SP-P17.5: A TWO-STAGE ALGORITHM FOR ENHANCEMENT OF REVERBERANT SPEECHWu, M. / Wang, D. / IEEE et al. | 2005
- I
-
SP-P17.13: AN IMPROVED ESTIMATION OF A PRIORI SPEECH ABSENCE PROBABILITY FOR SPEECH ENHANCEMENT: IN PERSPECTIVE OF SPEECH PERCEPTIONChoi, M. S. / Kang, H.-G. / IEEE et al. | 2005
- I
-
SP-L1.3: SPECTRAL CONVERSION BASED ON MAXIMUM LIKELIHOOD ESTIMATION CONSIDERING GLOBAL VARIANCE OF CONVERTED PARAMETERToda, T. / Black, A. W. / Tokuda, K. / IEEE et al. | 2005
- I
-
SP-L8.4: SPEAKER VERIFICATION USING ADAPTED ARTICULATORY FEATURE-BASED CONDITIONAL PRONUNCIATION MODELINGLeung, K.-Y. / Mak, M.-W. / Siu, M. / Kung, S.-Y. / IEEE et al. | 2005
- I
-
SP-L8.6: PROSODIC MODELING FOR SPEAKER RECOGNITION BASED ON SUB-BAND ENERGY TEMPORAL TRAJECTORIESAdami, A. / IEEE et al. | 2005
- I
-
SP-L9.4: THE IBM 2004 CONVERSATIONAL TELEPHONY SYSTEM FOR RICH TRANSCRIPTIONSoltau, H. / Kingsbury, B. / Mangu, L. / Povey, D. / Saon, G. / Zweig, G. / IEEE et al. | 2005
- I
-
SP-L10.3: SPEECH SIGNAL ANALYSIS WITH EXPONENTIAL AUTOREGRESSIVE MODELIshizuka, K. / Kato, H. / Nakatani, T. / IEEE et al. | 2005
- I
-
SP-L10.6: AN AUTO-REGRESSIVE, NON-STATIONARY EXCITED SIGNAL PARAMETER ESTIMATION METHOD AND AN EVALUATION OF A SINGING-VOICE RECOGNITIONSasou, A. / Goto, M. / Hayamizu, S. / Tanaka, K. / IEEE et al. | 2005
- I
-
SP-P2.10: DBN-BASED MULTI-STREAM MODELS FOR MANDARIN TONEME RECOGNITIONLei, X. / Ji, G. / Ng, T. / Bilmes, J. / Ostendorf, M. / IEEE et al. | 2005
- I
-
SP-P3.1: SCALABLE CONCATENATIVE SPEECH SYNTHESIS BASED ON THE PLURAL UNIT SELECTION AND FUSION METHODTamura, M. / Mizutani, T. / Kagoshima, T. / IEEE et al. | 2005
- I
-
SP-P4.5: EFFECT OF PHASE-SENSITIVE ENVIRONMENT MODEL AND HIGHER ORDER VTS ON NOISY SPEECH FEATURE ENHANCEMENTStouten, V. / Van hamme, H. / Wambacq, P. / IEEE et al. | 2005
- I
-
SP-P6.6: TWO-STAGE NOISE SPECTRA ESTIMATION AND REGRESSION BASED IN-CAR SPEECH RECOGNITION USING SINGLE DISTANT MICROPHONELi, W. / Itou, K. / Takeda, K. / Itakura, F. / IEEE et al. | 2005
- I
-
SP-P6.8: SPEECH PROCESSING USING JOINT FEATURES DERIVED FROM THE MODIFIED GROUP DELAY FUNCTIONHegde, R. / Murthy, H. / Rao, G. V. R. / IEEE et al. | 2005
- I
-
SP-P7.2: LANGUAGE MODEL ESTIMATION FOR OPTIMIZING END-TO-END PERFORMANCE OF A NATURAL LANGUAGE CALL ROUTING SYSTEMGoel, V. / Kuo, H.-K. / Deligne, S. / Wu, C. / IEEE et al. | 2005
- I
-
SP-P9.3: OPTIMAL CLUSTERING AND NON-UNIFORM ALLOCATION OF GAUSSIAN KERNELS IN SCALAR DIMENSION FOR HMM COMPRESSIONLi, X.-B. / Soong, F. K. / Myrvoll, T. A. / Wang, R.-H. / IEEE et al. | 2005
- I
-
SP-P10.6: T-NORM FOR TEXT-DEPENDENT COMMERCIAL SPEAKER VERIFICATION APPLICATIONS: EFFECT OF LEXICAL MISMATCHHebert, M. / Boies, D. / IEEE et al. | 2005
- I
-
SP-P10.9: SPEAKER ADAPTIVE COHORT SELECTION FOR TNORM IN TEXT-INDEPENDENT SPEAKER VERIFICATIONSturim, D. / Reynolds, D. / IEEE et al. | 2005
- I
-
SP-P10.10: HYBRID SPEAKER-BASED SEGMENTATION SYSTEM USING MODEL-LEVEL CLUSTERINGKim, H.-G. / Ertelt, D. / Sikora, T. / IEEE et al. | 2005
- I
-
SP-P11.1: IMPROVING THE 2.4 KB/S MILITARY STANDARD MELP (MS-MELP) CODER USING PITCH-SYNCHRONOUS ANALYSIS AND SYNTHESIS TECHNIQUESErtan, A. E. / Barnwell, T. P. / IEEE et al. | 2005
- I
-
SP-P11.3: TOWARDS ILBC SPEECH CODING AT LOWER RATES THROUGH A NEW FORMULATION OF THE START STATE SEARCHGarrido, C. M. / Murthi, M. N. / Andersen, S. V. / IEEE et al. | 2005
- I
-
SP-L7.2: A WAVELET KALMAN FILTER WITH PERCEPTUAL MASKING FOR SPEECH ENHANCEMENT IN COLORED NOISEMa, N. / Bouchard, M. / Goubran, R. A. / IEEE et al. | 2005
- I
-
SP-L9.2: CONTRUCTING ENSEMBLES OF ASR SYSTEMS USING RANDOMIZED DECISION TREESSiohan, O. / Ramabhadran, B. / Kingsbury, B. / IEEE et al. | 2005
- I
-
SP-L9.5: TRAINING LVCSR SYSTEMS ON THOUSANDS OF HOURS OF DATAEvermann, G. / Chan, H. Y. / Gales, M. J. F. / Jia, B. / Mrva, D. / Woodland, P. / Yu, K. / IEEE et al. | 2005
- I
-
SP-L10.4: COMPARISON OF AUTOREGRESSIVE PARAMETER ESTIMATION ALGORITHMS FOR SPEECH PROCESSING AND RECOGNITIONMorris, R. / Arrowood, J. / Clements, M. / IEEE et al. | 2005
- I
-
SP-P3.9: FUNDAMENTAL FREQUENCY ESTIMATION AND VOCAL TREMOR ANALYSIS BY MEANS OF MORLET WAVELET TRANSFORMSCnockaert, L. / Grenez, F. / Schoentgen, J. / IEEE et al. | 2005
- I
-
SP-P6.5: ON DESENSITIZING THE MEL-CEPSTRUM TO SPURIOUS SPECTRAL COMPONENTS FOR ROBUST SPEECH RECOGNITIONTyagi, V. / Wellekens, C. / IEEE et al. | 2005
- I
-
SP-P7.12: INTEGRATING MULTIPLE LAYERS OF CONCEPT INFORMATION INTO N-GRAM MODELING FOR SPOKEN LANGUAGE UNDERSTANDINGWang, N. J.-C. / IEEE et al. | 2005
- I
-
SP-P8.4: COMBINING SELECTION TREE WITH OBSERVATION REORDERING PRUNING FOR EFFICIENT SPEAKER IDENTIFICATION USING GMM-UBMXiong, Z. / Zheng, T. / Song, Z. / Wu, W. / IEEE et al. | 2005
- I
-
SP-P8.12: NOISE ROBUST SPEAKER VERIFICATION USING MEL-FREQUENCY DISCRETE WAVELET COEFFICIENTS AND PARALLEL MODEL COMPENSATIONTufekci, Z. / Gurbuz, S. / IEEE et al. | 2005
- I
-
SP-P10.11: ROBUSTNESS OF BIT-STREAM BASED FEATURES FOR SPEAKER VERIFICATIONMoreno-Daniel, A. / Juang, B.-H. / Nolazco-Flores, J. A. / IEEE et al. | 2005
- I
-
SP-P12.10: IMPROVED CONFUSION NETWORK ALGORITHM AND SHORTEST PATH SEARCH FROM WORD LATTICEXue, J. / Zhao, Y. / IEEE et al. | 2005
- I
-
SP-P13.1: ANALYSIS OF SPECTRAL MEASURES FOR VOICED SPEECH WITH VARYING NOISE AND PERTUBATION LEVELSO Leidhin, E. / Murphy, P. / IEEE et al. | 2005
- I
-
SP-P16.6: THE AT&T WATSON SPEECH RECOGNIZERGoffin, V. / Allauzen, C. / Bocchieri, E. / Hakkani-Tur, D. / Ljolje, A. / Parthasarathy, S. / Rahim, M. / Riccardi, G. / Saraclar, M. / IEEE et al. | 2005
- I
-
SP-P16.10: STRUCTURING BASEBALL LIVE GAMES BASED ON SPEECH RECOGNITION USING TASK DEPENDENT KNOWLEDGE AND EMOTION STATE RECOGNITIONSako, A. / Ariki, Y. / IEEE et al. | 2005
- I
-
SP-P16.12: SPEECH RECOGNITION OF A NAMED ENTITYTomita, T. / Okimoto, Y. / Yamamoto, H. / Sagisaka, Y. / IEEE et al. | 2005
- I
-
SP-P16.14: SENTENCE EXTRACTION-BASED PRESENTATION SUMMARIZATION TECHNIQUES AND EVALUATION METRICSHirohata, M. / Shinnaka, Y. / Iwano, K. / Furui, S. / IEEE et al. | 2005
- I
-
SP-P17.11: IMPROVED KALMAN FILTERING FOR SPEECH ENHANCEMENTGrancharov, V. / Samuelsson, J. / Kleijn, B. / IEEE et al. | 2005
- I
-
SP-P17.14: SPEECH ENHANCEMENT USING A SWITCHING KALMAN FILTER WITH A PERCEPTUAL POST-FILTERDeng, J. / Bouchard, M. / Yeap, T. H. / IEEE et al. | 2005
- I
-
SP-L4.5: ROBUST SPEECH RECOGNITION BY INTEGRATING SPEECH SEPARATION AND HYPOTHESIS TESTINGSrinivasan, S. / Wang, D. / IEEE et al. | 2005
- I
-
SP-L5.2: DISCRIMINATIVE TRAINING OF CDHMMS FOR MAXIMUM RELATIVE SEPARATION MARGINLiu, C. / Jiang, H. / Li, X. / IEEE et al. | 2005
- I
-
SP-L7.1: SIGNAL SUBSPACE SPEECH ENHANCEMENT FOR AUDIBLE NOISE REDUCTIONYou, C. / Koh, S. N. / Rahardja, S. / IEEE et al. | 2005
- I
-
SP-P2.4: META-CLASSIFIERS IN ACOUSTIC AND LINGUISTIC FEATURE FUSION-BASED AFFECT RECOGNITIONSchuller, B. / Villar, R. J. / Rigoll, G. / Lang, M. / IEEE et al. | 2005
- I
-
SP-P2.11: SPARSE KPCA FOR FEATURE EXTRACTION IN SPEECH RECOGNITIONLima, A. / Zen, H. / Nankaku, Y. / Tokuda, K. / Kitamura, T. / Resende, F. G. / IEEE et al. | 2005
- I
-
SP-P4.7: NOISY SPEECH RECOGNITION BASED ON ROBUST END-POINT DETECTION AND MODEL ADAPTATIONZhang, Z. / Furui, S. / IEEE et al. | 2005
- I
-
SP-P5.4: NOVEL TECHNIQUES FOR TIME-COMPRESSING SPEECH: AN EXPLORATORY STUDYTucker, S. / Whittaker, S. / IEEE et al. | 2005
- I
-
SP-P5.6: AN HMM-BASED TEXT SEGMENTATION METHOD USING VARIATIONAL BAYES APPROACH AND ITS APPLICATION TO LVCSR FOR BROADCAST NEWSKoshinaka, T. / Iso, K.-i. / Okumura, A. / IEEE et al. | 2005
- I
-
SP-P6.11: ROBUST SPEECH RECOGNITION BASED ON SPECTRAL ADJUSTING AND WARPINGZhao, R. / Wang, Z. / IEEE et al. | 2005
- I
-
SP-P7.4: RAPID LANGUAGE MODEL DEVELOPMENT USING EXTERNAL RESOURCES FOR NEW SPOKEN DIALOG DOMAINSSarikaya, R. / Gravano, A. / Gao, Y. / IEEE et al. | 2005
- I
-
SP-P7.9: AN EFFICIENT ALGORITHM FOR CLUSTERING SHORT SPOKEN UTTERANCESLiu, Z. / IEEE et al. | 2005
- I
-
SP-P9.11: MODELING SUCCESSIVE FRAME DEPENDENCIES WITH HYBRID HMM/BN ACOUSTIC MODELMarkov, K. / Nakamura, S. / IEEE et al. | 2005
- I
-
SP-P11.14: A SOFT DECISION BASED NOISE CROSS POWER SPECTRAL DENSITY ESTIMATION FOR TWO-MICROPHONE SPEECH ENHANCEMENT SYSTEMSZhang, X. / Jia, Y. / IEEE et al. | 2005
- I
-
SP-P12.13: CROSS-LANGUAGE ACOUSTIC MODEL REFINEMENT FOR THE INDONESIAN LANGUAGEMartin, T. / Sridharan, S. / IEEE et al. | 2005
- I
-
SP-P13.5: DETECTION OF SYMBOLIC GESTURAL EVENTS IN ARTICULATORY DATA FOR USE IN STRUCTURAL REPRESENTATIONS OF CONTINUOUS SPEECHGutkin, A. / King, S. / IEEE et al. | 2005
- I
-
SP-P13.9: EFFECTS OF GLOTTAL AND LIP BOUNDARY CONDITIONS ON VOCAL-TRACT AREA FUNCTION ESTIMATES FROM SPEECH SIGNALSDeng, H. / Ward, R. K. / Beddoes, M. / Hodgson, M. / IEEE et al. | 2005
- I
-
SP-P14.2: MINIMUM PHONEME ERROR BASED HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS FOR SPEECH RECOGNITIONZhang, B. / Matsoukas, S. / IEEE et al. | 2005
- I
-
SP-P15.10: LEARNING PRONUNCIATION AND FORMULATION VARIANTS IN CONTINUOUS SPEECH APPLICATIONSColibro, D. / Fissore, L. / Popovici, C. / Vair, C. / Laface, P. / IEEE et al. | 2005
- I
-
SP-P16.3: UNSUPERVISED VOCABULARY EXPANSION FOR AUTOMATIC TRANSCRIPTION OF BROADCAST NEWSOhtsuki, K. / Hiroshima, N. / Oku, M. / Imamura, A. / IEEE et al. | 2005
- I
-
SP-P17.7: LEAKAGE MODEL AND TEETH CLACK REMOVAL FOR AIR- AND BONE-CONDUCTIVE INTEGRATED MICROPHONESLiu, Z. / Subramanya, A. / Zhang, Z. / Droppo, J. / Acero, A. / IEEE et al. | 2005
- I
-
SP-P17.6: MATRIX QUANTIZATION BASED TIME-VARYING FILTER SPEECH ENHANCEMENTRao K, S. / Thippur, S. / IEEE et al. | 2005
- I
-
SP-L3.1: PROPOSAL ON OBJECTIVE SPEECH QUALITY ASSESSMENT FOR WIDEBAND IP TELEPHONYMorioka, C. / Kurashima, A. / Takahashi, A. / IEEE et al. | 2005
- I
-
SP-L4.1: REJECTION USING RANK STATISTICS BASED ON HMM STATE SHORTLISTSBocchieri, E. / Parthasarathy, S. / IEEE et al. | 2005
- I
-
SP-L6.1: MULTI-FRAME GMM-BASED BLOCK QUANTISATION OF LINE SPECTRAL FREQUENCIES FOR WIDEBAND SPEECH CODINGSo, S. / Paliwal, K. K. / IEEE et al. | 2005
- I
-
SP-L11.1: STATIC AND DYNAMIC SPECTRAL FEATURES: THEIR NOISE ROBUSTNESS AND OPTIMAL WEIGHTS FOR ASRYang, C. / Soong, F. K. / Lee, T. / IEEE et al. | 2005
- I
-
SP-L11.3: A COMPANDING FRONT END FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITIONGuinness, J. / Raj, B. / Schmidt-Nielsen, B. / Turicchia, L. / Sarpeshkar, R. / IEEE et al. | 2005
- I
-
SP-P2.3: DISORDERED SPEECH EVALUATION USING OBJECTIVE QUALITY MEASURESGu, L. / Harris, J. / Shrivastav, R. / Sapienza, C. / IEEE et al. | 2005
- I
-
SP-P3.3: PERCEPTUALLY WEIGHTED LONG TERM MODELING OF SINUSOIDAL SPEECH AMPLITUDE TRAJECTORIESFirouzmand, M. / Girin, L. / IEEE et al. | 2005
- I
-
SP-P4.1: CLOSELY COUPLED ARRAY PROCESSING AND MODEL-BASED COMPENSATION FOR MICROPHONE ARRAY SPEECH RECOGNITIONZhao, X. / Ou, Z. / Chen, M. / Wang, Z. / IEEE et al. | 2005
- I
-
SP-P4.9: BUILDING AN EFFECTIVE CORPUS BY USING ACOUSTIC SPACE VISUALIZATION (COSMOS) METHODNagino, G. / Shozakai, M. / IEEE et al. | 2005
- I
-
SP-P5.1: DYNAMIC MATCH PHONE-LATTICE SEARCHES FOR VERY FAST AND ACCURATE UNRESTRICTED VOCABULARY KEYWORD SPOTTINGThambiratnam, K. / Sridharan, S. / IEEE et al. | 2005
- I
-
SP-P5.7: DETECTING GROUP INTEREST-LEVEL IN MEETINGSGatica-Perez, D. / McCowan, I. / Zhang, D. / Bengio, S. / IEEE et al. | 2005
- I
-
SP-P6.4: SPEECH FEATURE SMOOTHING FOR ROBUST ASRChen, C.-P. / Bilmes, J. / Ellis, D. / IEEE et al. | 2005
- I
-
SP-P6.12: ROBUST SPEECH ACTIVITY DETECTION USING LDA APPLIED TO FF PARAMETERSPadrell, J. / Macho, D. / Nadeu, C. / IEEE et al. | 2005
- I
-
SP-P7.1: JOINT DISCRIMINATIVE LANGUAGE MODELING AND UTTERANCE CLASSIFICATIONSaraclar, M. / Roark, B. / IEEE et al. | 2005
- I
-
SP-P7.6: RANDOM CLUSTERINGS FOR LANGUAGE MODELINGEmami, A. / Jelinek, F. / IEEE et al. | 2005
- I
-
SP-P7.5: USING LOCAL & GLOBAL PHONOTACTIC FEATURES IN CHINESE DIALECT IDENTIFICATIONLim, B. P. / Li, H. / Ma, B. / IEEE et al. | 2005
- I
-
SP-P8.7: FACTOR ANALYSIS SIMPLIFIEDKenny, P. / Boulianne, G. / Ouellet, P. / Dumouchel, P. / IEEE et al. | 2005
- I
-
SP-P11.4: A MISSING-DATA APPROACH TO NOISE-ROBUST LPC EXTRACTION FOR VOICED SPEECH USING AUXILIARY SENSORSDemiroglu, C. / Barnwell, T. P. / IEEE et al. | 2005
- I
-
SP-P11.7: INCREASING THE ROBUSTNESS OF CELP-BASED CODERS BY CONSTRAINED OPTIMIZATIONChibani, M. / Gournay, P. / Lefebvre, R. / IEEE et al. | 2005
- I
-
SP-P11.12: A ROBUST NARROWBAND TO WIDEBAND EXTENSION SYSTEM FEATURING ENHANCED CODEBOOK MAPPINGUnno, T. / McCree, A. / IEEE et al. | 2005
- I
-
SP-P12.6: A STUDY ON KNOWLEDGE SOURCE INTEGRATION FOR CANDIDATE RESCORING IN AUTOMATIC SPEECH RECOGNITIONLi, J. / Tsao, Y. / Lee, C.-H. / IEEE et al. | 2005
- I
-
SP-P12.12: DEVELOPMENT OF THE CU-HTK 2004 BROADCAST NEWS TRANSCRIPTION SYSTEMSKim, D. Y. / Chan, H. Y. / Evermann, G. / Gales, M. J. F. / Mrva, D. / Sim, K. C. / Woodland, P. / IEEE et al. | 2005
- I
-
SP-P13.2: AUTOMATIC DYSPHONIA RECOGNITION USING BIOLOGICALLY-INSPIRED AMPLITUDE-MODULATION FEATURESMalyska, N. / Quatieri, T. / Sturim, D. / IEEE et al. | 2005
- I
-
SP-P13.10: ADAPTIVE FILTERBANKS INSPIRED BY THE AUDITORY SYSTEM FOR SPEECH FEATURE EXTRACTIONKumaresan, R. / Allu, G. K. / Cariani, P. / IEEE et al. | 2005
- I
-
SP-P13.12: A GRAPHICAL MODEL FOR FORMANT TRACKINGMalkin, J. / Li, X. / Bilmes, J. / IEEE et al. | 2005
- I
-
SP-P14.3: A STUDY OF AUDITORY MODELING AND PROCESSING FOR SPEECH SIGNALSJeon, W. / Juang, B.-H. / IEEE et al. | 2005
- I
-
SP-P15.6: KERNEL EIGENSPACE-BASED MLLR ADAPTATION USING MULTIPLE REGRESSION CLASSESHsiao, R. / Mak, B. / IEEE et al. | 2005
- I
-
SP-P15.7: AUTOMATICALLY TRANSCRIBING MEETINGS USING DISTANT MICROPHONESMetze, F. / Fugen, C. / Pan, Y. / Alexander, W. / IEEE et al. | 2005
- I
-
SP-P16.5: MAXIMUM ENTROPY SEGMENTATION OF BROADCAST NEWSChristensen, H. / Kolluru, B. / Gotoh, Y. / Renals, S. / IEEE et al. | 2005
- I
-
SP-P16.4: CLASSIFICATION OF STRUCTURED DESCRIPTIONSBangalore, S. / Rambow, O. / IEEE et al. | 2005
- I
-
SP-P16.8: ERROR PREDICTION IN SPOKEN DIALOG: FROM SIGNAL-TO-NOISE RATIO TO SEMANTIC CONFIDENCE SCORESHakkani-Tur, D. / Tur, G. / Riccardi, G. / Kim, H. K. / IEEE et al. | 2005
- I
-
SP-P17.8: SPEECH ENHANCEMENT USING A MMSE SHORT TIME SPECTRAL AMPLITUDE ESTIMATOR WITH LAPLACIAN SPEECH MODELINGChen, B. / Loizou, P. / IEEE et al. | 2005
- I
-
SP-L2.1: INCORPORATING DISCOURSE FEATURES INTO CONFIDENCE SCORING OF INTENTION RECOGNITION RESULTS IN SPOKEN DIALOGUE SYSTEMSHigashinaka, R. / Sudoh, K. / Nakano, M. / IEEE et al. | 2005
- I
-
SP-L3.2: NEURAL CELL TYPE RECOGNITION BETWEEN GLOBUS PALLIDUS EXTERNUS AND GLOBUS PALLIDUS INTERNUS BY GAUSSIAN MIXTURE MODELINGFu, Q. / Clements, M. / Mewes, K. / IEEE et al. | 2005
- I
-
SP-L6.5: PREDICTIVE VQ FOR BANDWIDTH SCALABLE LSP QUANTIZATIONEhara, H. / Morii, T. / Oshikiri, M. / Yoshida, K. / IEEE et al. | 2005
- I
-
SP-L7.6: SPEECH ENHANCEMENT BASED ON SPEECH SPECTRAL COMPLEX GAUSSIAN MIXTURE MODELDing, G.-H. / Wang, X. / Cao, Y. / Ding, F. / Tang, Y. / IEEE et al. | 2005
- I
-
SP-L8.3: THE 2004 MIT LINCOLN LABORATORY SPEAKER RECOGNITION SYSTEMReynolds, D. / Campbell, W. / Gleason, T. / Quillen, C. / Sturim, D. / Torres-Carrasquillo, P. / Adami, A. / IEEE et al. | 2005
- I
-
SP-L10.2: COHERENT ENVELOPE DETECTION FOR MODULATION FILTERING OF SPEECHSchimmel, S. / Atlas, L. / IEEE et al. | 2005
- I
-
SP-L11.6: ONLINE CEPSTRAL FILTERING USING A SEQUENTIAL EM APPROACH WITH POLYAK AVERAGING AND FEEDBACKMyrvoll, T. A. / Nakamura, S. / IEEE et al. | 2005
- I
-
SP-P1.2: AN AUTOMATIC PROSODY RECOGNIZER USING A COUPLED MULTI-STREAM ACOUSTIC MODEL AND A SYNTACTIC-PROSODIC LANGUAGE MODELAnanthakrishnan, S. / Narayanan, S. / IEEE et al. | 2005
- I
-
SP-P1.3: F0 CONTROL CHARACTERIZATION BY PERCEPTUAL IMPRESSIONS ON SPEAKING ATTITUDES USING MULTIPLE DIMENSIONAL SCALING ANALYSISKokenawa, Y. / Tsuzaki, M. / Kato, H. / Sagisaka, Y. / IEEE et al. | 2005
- I
-
SP-P3.6: SPEECH MODELLING BASED ON GENERALIZED GAUSSIAN PROBABILITY DENSITY FUNCTIONSKokkinakis, K. / Nandi, A. K. / IEEE et al. | 2005
- I
-
SP-P5.11: COMBINING MULTIPLE SUBWORD REPRESENTATIONS FOR OPEN-VOCABULARY SPOKEN DOCUMENT RETRIEVALLee, S.-w. / Tanaka, K. / Itoh, Y. / IEEE et al. | 2005
- I
-
SP-P7.3: LANGUAGE IDENTIFICATION USING PHONETIC AND PROSODIC HMMS WITH FEATURE NORMALIZATIONObuchi, Y. / Sato, N. / IEEE et al. | 2005
- I
-
SP-P7.7: DIALECT/ACCENT CLASSIFICATION VIA BOOSTED WORD MODELINGHuang, R. / Hansen, J. H. L. / IEEE et al. | 2005
- I
-
SP-P8.5: ADVANCES IN CHANNEL COMPENSATION FOR SVM SPEAKER RECOGNITIONSolomonoff, A. / Campbell, W. / Boardman, I. / IEEE et al. | 2005
- I
-
SP-P8.8: MINIMUM CLASSIFICATION ERROR INTERACTIVE TRAINING FOR SPEAKER IDENTIFICATIONKida, Y. / Yamamoto, H. / Miyajima, C. / Tokuda, K. / Kitamura, T. / IEEE et al. | 2005
- I
-
SP-P10.7: A SESSION-GMM GENERATIVE MODEL USING TEST UTTERANCE GAUSSIAN MIXTURE MODELING FOR SPEAKER VERIFICATIONAronowitz, H. / Burshtein, D. / Amir, A. / IEEE et al. | 2005
- I
-
SP-P11.11: STOCHASTIC INTEGRATION AND LONG TERM PREDICTOR ESTIMATION UNDER NOISY CONDITIONS FOR SPEECH ENHANCEMENTKuropatwinski, M. / Kleijn, B. / IEEE et al. | 2005
- I
-
SP-P11.13: ARTIFICIAL BANDWIDTH EXPANSION METHOD TO IMPROVE INTELLIGIBILITY AND QUALITY OF AMR-CODED NARROWBAND SPEECHLaaksonen, L. / Kontio, J. / Alku, P. / IEEE et al. | 2005
- I
-
SP-P12.4: USING RULE-BASED KNOWLEDGE TO IMPROVE LVCSRBeutler, R. / Kaufmann, T. / Pfister, B. / IEEE et al. | 2005
- I
-
SP-P14.10: GARCH COEFFICIENTS AS FEATURE FOR SPEECH RECOGNITION IN PERSIAN ISOLATED DIGITAbdolahi, M. / Amindavar, H. / IEEE et al. | 2005
- I
-
SP-P15.1: VARIATIONAL BAYESIAN ADAPTATION FOR SPEAKER CLUSTERINGValente, F. / Wellekens, C. / IEEE et al. | 2005
- I
-
SP-L1.2: INTRODUCING ROUGHNESS IN INDIVIDUALITY TRANSFORMATION THROUGH JITTER MODELING AND MODIFICATIONVerma, A. / Kumar, A. / IEEE et al. | 2005
- I
-
SP-L2.5: MODEL ADAPTATION FOR SPOKEN LANGUAGE UNDERSTANDINGTur, G. / IEEE et al. | 2005
- I
-
SP-L3.4: CAN YOU UNDERSTAND HIM? LET'S LOOK AT HIS WORD ACCURACY - AUTOMATIC EVALUATION OF TRACHEOESOPHAGEAL SPEECHSchuster, M. / Noeth, E. / Haderl, T. / Steidl, S. / Batliner, A. / Rosanowski, F. / IEEE et al. | 2005
- I
-
SP-L5.3: STATISTICAL PERFORMANCE ANALYSIS OF MCE/GPD LEARNING IN GAUSSIAN CLASSIFIERS AND HIDDEN MARKOV MODELSAfify, M. / Li, X.-W. / Jiang, H. / IEEE et al. | 2005
- I
-
SP-L6.4: A NEW SEGMENT QUANTIZER FOR LINE SPECTRAL FREQUENCIES USING LEMPEL-ZIV ALGORITHMKohata, M. / Suzuki, M. / Makino, S. / IEEE et al. | 2005
- I
-
SP-L8.1: IMPROVED PHONETIC SPEAKER RECOGNITION USING LATTICE DECODINGHatch, A. / Peskin, B. / Stolcke, A. / IEEE et al. | 2005
- I
-
SP-L9.1: SUB-PHONETIC POLYNOMIAL SEGMENT MODEL FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITIONYeung, S.-K. A. / Li, C.-F. / Siu, M.-H. / IEEE et al. | 2005
- I
-
SP-L10.1: SPEECH ANALYSIS BY ESTIMATING PERCEPTUALLY RELEVANT POLE LOCATIONSAtti, V. / Spanias, A. / IEEE et al. | 2005
- I
-
SP-P5.8: SEMANTIC DATA MINING OF SHORT UTTERANCESBegeja, L. / Drucker, H. / Gibbon, D. / Haffner, P. / Liu, Z. / Renger, B. / Shahraray, B. / IEEE et al. | 2005
- I
-
SP-P7.8: WEB-DATA AUGMENTED LANGUAGE MODELS FOR MANDARIN CONVERSATIONAL SPEECH RECOGNITIONNg, T. / Ostendorf, M. / Hwang, M.-Y. / Siu, M. / Bulyko, I. / Lei, X. / IEEE et al. | 2005
- I
-
SP-P7.10: MAXIMUM ENTROPY BASED GENERIC FILTER FOR LANGUAGE MODEL ADAPTATIONYu, D. / Mahajan, M. / Mau, P. / Acero, A. / IEEE et al. | 2005
- I
-
SP-P8.10: GMM-BASED BHATTACHARYYA KERNEL FISHER DISCRIMINANT ANALYSIS FOR SPEAKER RECOGNITIONChao, Y.-H. / Wang, H.-M. / Chang, R.-C. / IEEE et al. | 2005
- I
-
SP-P9.10: ACOUSTIC MODEL TRAINING USING GREEDY EMHu, R. / Li, X. / Zhao, Y. / IEEE et al. | 2005
- I
-
SP-P10.5: CLUSTERING SPEECH UTTERANCES BY SPEAKER USING EIGENVOICE-MOTIVATED VECTOR SPACE MODELSTsai, W.-H. / Cheng, S.-S. / Chao, Y.-H. / Wang, H.-M. / IEEE et al. | 2005
- I
-
SP-P11.8: JOINT OPTIMIZATION OF EXCITATION PARAMETERS IN ANALYSIS-BY-SYNTHESIS SPEECH CODERS HAVING MULTI-TAP LONG TERM PREDICTORMittal, U. / Ashley, J. / Cruz-Zeno, E. / Jasiuk, M. / IEEE et al. | 2005
- I
-
SP-P11.9: BLOCK-BASED BANDWIDTH EXTENSION OF NARROWBAND SPEECH SIGNAL BY USING CDHMMYao, S. / Chan, C.-F. / IEEE et al. | 2005
- I
-
SP-P14.5: AUTOMATIC SYLLABLE STRESS DETECTION USING PROSODIC FEATURES FOR PRONUNCIATION EVALUATION OF LANGUAGE LEARNERSTepperman, J. / Narayanan, S. / IEEE et al. | 2005
- I
-
SP-P14.11: FMPE: DISCRIMINATIVELY TRAINED FEATURES FOR SPEECH RECOGNITIONPovey, D. / Kingsbury, B. / Mangu, L. / Saon, G. / Soltau, H. / Zweig, G. / IEEE et al. | 2005
- I
-
SP-P15.3: AGGREGATE A POSTERIORI LINEAR REGRESSION FOR SPEAKER ADAPTATIONHuang, C.-H. / Chien, J.-T. / IEEE et al. | 2005
- I
-
SP-P15.12: WHISPERY SPEECH RECOGNITION USING ADAPTED ARTICULATORY FEATURESJou, S.-C. / Schultz, T. / Waibel, A. / IEEE et al. | 2005
- I
-
SP-L1.5: VOICE FORGERY USING ALISP: INDEXATION IN A CLIENT MEMORYPerrot, P. / Aversano, G. / Blouet, R. / Charbit, M. / Chollet, G. / IEEE et al. | 2005
- I
-
SP-L2.3: DIALOG ACT TAGGING USING GRAPHICAL MODELSJi, G. / Bilmes, J. / IEEE et al. | 2005
- I
-
SP-L3.3: ANALYSIS OF RELATIONSHIP BETWEEEN OVERALL QUALITY AND PSYCHOLOGICAL FACTORS AFFECTING HIGH-QUALITY SPEECH COMMUNICATION SERVICESAoki, H. / Takahashi, A. / IEEE et al. | 2005
- I
-
SP-L3.5: A WARPED BANDWIDTH EXPANSION FILTERBoillot, M. / Harris, J. / IEEE et al. | 2005
- I
-
SP-L5.4: DISCRIMINATIVE TRAINING OF ACOUSTIC MODELS APPLIED TO DOMAINS WITH UNRELIABLE TRANSCRIPTSMathias, L. / Yegnanarayanan, G. / Fritsch, J. / IEEE et al. | 2005
- I
-
SP-L5.6: DISCRIMINATIVE TRAINING BASED ON THE CRITERION OF LEAST PHONE COMPETING TOKENS FOR LARGE VOCABULARY SPEECH RECOGNITIONLiu, B. / Jiang, H. / Zhou, J.-L. / Wang, R.-H. / IEEE et al. | 2005
- I
-
SP-L9.3: EFFICIENT GENERATION OF HIGH-ORDER CONTEXT-DEPENDENT WEIGHTED FINITE STATE TRANSDUCERS FOR SPEECH RECOGNITIONSchuster, M. / Hori, T. / IEEE et al. | 2005
- I
-
SP-L11.4: MULTI-RESOLUTION SPECTRAL ENTROPY FEATURE FOR ROBUST ASRMisra, H. / Ikbal, S. / Sivadas, S. / Bourlard, H. / IEEE et al. | 2005
- I
-
SP-P1.1: IMPROVING THE UNDERSTANDABILITY OF SPEECH SYNTHESIS BY MODELING SPEECH IN NOISELangner, B. / Black, A. W. / IEEE et al. | 2005
- I
-
SP-P1.9: PREDICTION OF PRONUNCIATION VARIATIONS FOR SPEECH SYNTHESIS: A DATA-DRIVEN APPROACHBennett, C. / Black, A. W. / IEEE et al. | 2005
- I
-
SP-P2.1: INCREASED ROBUSTNESS AGAINST BIT ERRORS FOR DISTRIBUTED SPEECH RECOGNITION IN WIRELESS ENVIRONMENTSDelaney, B. / IEEE et al. | 2005
- I
-
SP-P2.5: PACKET LOSS CONCEALMENT BASED ON VQ REPLICAS AND MMSE ESTIMATION APPLIED TO DISTRIBUTED SPEECH RECOGNITIONPeinado, A. M. / Gomez, A. M. / Sanchez, V. E. / Perez-Cordoba, J. L. / Rubio, A. J. / IEEE et al. | 2005
- I
-
SP-P2.6: A COMPARISON OF SOFT-FEATURE DISTRIBUTED SPEECH RECOGNITION WITH CANDIDATE CODECS FOR SPEECH ENABLED MOBILE SERVICESIon, V. / Haeb-Umbach, R. / IEEE et al. | 2005
- I
-
SP-P2.8: A COMPARISON OF CLASSIFIERS FOR DETECTING EMOTION FROM SPEECHShafran, I. / Mohri, M. / IEEE et al. | 2005
- I
-
SP-P3.10: AUTOMATIC SPEECH SEGMENTATION USING AVERAGE LEVEL CROSSING RATE INFORMATIONSarkar, A. / Sreenivas, T. V. / IEEE et al. | 2005
- I
-
SP-P3.11: DWT-BASED PHONETIC GROUPS CLASSIFICATION USING NEURAL NETWORKSPham, V. T. / Kubin, G. / IEEE et al. | 2005
- I
-
SP-P3.12: A NOVEL KLT ALGORITHM OPTIMIZED FOR SMALL SIGNAL SETSGianfelici, F. / Biagetti, G. / Crippa, P. / Turchetti, C. / IEEE et al. | 2005
- I
-
SP-P4.4: AN ENVIRONMENT COMPENSATED MAXIMUM LIKELIHOOD TRAINING APPROACH BASED ON STOCHASTIC VECTOR MAPPINGWu, J. / Huo, Q. / Zhu, D. / IEEE et al. | 2005
- I
-
SP-P8.1: DISCRIMINATIVE POWER OF TRANSIENT FRAMES IN SPEAKER RECOGNITIONLouradour, J. / Daoudi, K. / Andre-Obrecht, R. / IEEE et al. | 2005
- I
-
SP-P9.5: CLUSTER-DEPENDENT ACOUSTIC MODELINGXiang, B. / Nguyen, L. / Matsoukas, S. / Schwartz, R. / IEEE et al. | 2005
- I
-
SP-P9.9: ON INITIALIZATION OF GAUSSIAN MIXTURES: A HYBRID GENETIC EM ALGORITHMPernkopf, F. / IEEE et al. | 2005
- I
-
SP-P9.12: IMPROVED COVARIANCE MODELING FOR MAXIMUM LIKELIHOOD MULTIPLE SUBSPACE TRANSFORMATIONSZhou, X. / Tian, Y. / Zhou, J. / Dai, B. / IEEE et al. | 2005
- I
-
SP-P10.2: A CORRELATION METRIC FOR SPEAKER TRACKING USING ANCHOR MODELSCollet, M. / Charlet, D. / Bimbot, F. / IEEE et al. | 2005
- I
-
SP-P10.4: F-RATIO CLIENT-DEPENDENT NORMALISATION FOR BIOMETRIC AUTHENTICATION TASKSPoh, N. / Bengio, S. / IEEE et al. | 2005
- I
-
SP-P10.12: TWO-WAY CLUSTER VOTING TO IMPROVE SPEAKER DIARISATION PERFORMANCETranter, S. / IEEE et al. | 2005
- I
-
SP-P10.13: SPEAKER DETECTION WITHOUT MODELSGillick, D. / Stafford, S. / Peskin, B. / IEEE et al. | 2005
- I
-
SP-P11.5: A TECHNIQUE OF MULTI-TAP LONG TERM PREDICTOR (LTP) FILTER USING SUB-SAMPLE RESOLUTION DELAYJasiuk, M. / Ramabadran, T. / Mittal, U. / Ashley, J. / McLaughlin, M. / IEEE et al. | 2005
- I
-
SP-P12.2: FIRST STEPS IN FAST ACOUSTIC MODELING FOR A NEW TARGET LANGUAGE: APPLICATION TO VIETNAMESELe, V.-B. / Besacier, L. / IEEE et al. | 2005
- I
-
SP-P12.11: THAI AUTOMATIC SPEECH RECOGNITIONSuebvisai, S. / Charoenpornsawat, P. / Black, A. W. / Woszczyna, M. / Schultz, T. / IEEE et al. | 2005
- I
-
SP-P13.7: MODELING OF THE FRONT CAVITY AND SUBLINGUAL SPACE IN AMERICAN ENGLISH RHOTIC SOUNDSZhang, Z. / Espy-Wilson, C. / Boyce, S. / Tiede, M. / IEEE et al. | 2005
- I
-
SP-P14.4: A WAVELET AND FILTER BANK FRAMEWORK FOR PHONETIC CLASSIFICATIONChoueiter, G. / Glass, J. / IEEE et al. | 2005
- I
-
SP-P15.5: VARIOUS REFERENCE SPEAKERS DETERMINATION METHODS FOR EMBEDDED KERNEL EIGENVOICE SPEAKER ADAPTATIONMak, B. / Ho, S. / IEEE et al. | 2005
- I
-
SP-P15.9: ADAPTIVE TRAINING USING SIMPLE TARGET MODELSStemmer, G. / Brugnara, F. / Giuliani, D. / IEEE et al. | 2005
- I
-
SP-P16.7: OPEN VOCABULARY CHINESE NAME RECOGNITION WITH THE HELP OF CHARACTER DESCRIPTION AND SYLLABLE SPELLING RECOGNITIONTsai, C.-H. / Wang, N. J.-C. / Huang, P. / Shen, J.-L. / IEEE et al. | 2005
- I
-
SP-L1.6: AN IMPROVED SPECTRAL AND PROSODIC TRANSFORMATION METHOD IN STRAIGHT-BASED VOICE CONVERSIONQin, L. / Chen, G. / Ling, Z. / Dai, L. / IEEE et al. | 2005
- I
-
SP-L2.2: SEMANTIC INTERPRETATION WITH ERROR CORRECTIONRaymond, C. / Bechet, F. / Camelin, N. / De Mori, R. / Damnati, G. / IEEE et al. | 2005
- I
-
SP-L2.4: A CLARIFICATION ALGORITHM FOR SPOKEN DIALOGUE SYSTEMSLewis, C. / Di Fabbrizio, G. / IEEE et al. | 2005
- I
-
SP-L8.2: SRI'S 2004 NIST SPEAKER RECOGNITION EVALUATION SYSTEMKajarekar, S. / Ferrer, L. / Shriberg, E. / Sonmez, K. / Stolcke, A. / Venkataraman, A. / Zheng, J. / IEEE et al. | 2005
- I
-
SP-P1.4: ADDITIVE MODELING OF ENGLISH F0 CONTOUR FOR SPEECH SYNTHESISSakai, S. / IEEE et al. | 2005
- I
-
SP-P1.10: RECORDING SCRIPT DESIGN FOR CORPUS-BASED TTS SYSTEM BASED ON COVERAGE OF VARIOUS PHONETIC ELEMENTSIsogai, M. / Mizuno, H. / Mano, K. / IEEE et al. | 2005
- I
-
SP-P1.12: COMPARATIVE STUDY OF AUTOMATIC PHONE SEGMENTATION METHODS FOR TTSAdell, J. / Bonafonte, A. / Gomez, J. A. / Castro, M. J. / IEEE et al. | 2005
- I
-
SP-P2.12: EFFECTS OF PHONEME CHARACTERISTICS ON TEO FEATURE-BASED AUTOMATIC STRESS DETECTION IN SPEECHRuzanski, E. / Hansen, J. H. L. / Meyerhoff, J. L. / Saviolakis, G. / Koenig, M. / IEEE et al. | 2005
- I
-
SP-P4.10: HMM/ANN BASED SPECTRAL PEAK LOCATION ESTIMATION FOR NOISE ROBUST SPEECH RECOGNITIONIkbal, S. / Bourlard, H. / Magimai-Doss, M. / IEEE et al. | 2005
- I
-
SP-P5.2: A STREAM-WEIGHT OPTIMIZATION METHOD FOR MULTI-STREAM HMMS BASED ON LIKELIHOOD VALUE NORMALIZATIONTamura, S. / Iwano, K. / Furui, S. / IEEE et al. | 2005
- I
-
SP-P5.10: BLIND CHANGE DETECTION FOR AUDIO SEGMENTATIONOmar, M. / Chaudhari, U. / Ramaswamy, G. / IEEE et al. | 2005
- I
-
SP-P6.1: VARIATIONAL BAYESIAN FEATURE SALIENCY FOR AUDIO TYPE CLASSIFICATIONValente, F. / Wellekens, C. / IEEE et al. | 2005
- I
-
SP-P7.13: AUTOMATIC LANGUAGE IDENTIFICATION USING ERGODIC HMMSantoshKumar, S. A. / Ramasubramanian, V. / IEEE et al. | 2005
- I
-
SP-P9.4: HIERARCHICAL CORRELATION COMPENSATION FOR HIDDEN MARKOV MODELSLin, H. / Tian, Y. / Zhou, J. / Jiang, H. / IEEE et al. | 2005
- I
-
SP-P11.2: ULTRA LOW BIT RATE SPEECH CODING USING AN ERGODIC HIDDEN MARKOV MODELLee, M. / Durey, A. / Moore, E. / Clements, M. / IEEE et al. | 2005
- I
-
SP-P11.10: SEGMENTATION-BASED SPEECH ENHANCEMENT FOR INTELLIGIBILITY IMPROVEMENT IN MELP CODERS USING AUXILIARY SENSORSDemiroglu, C. / Kamath, S. / Anderson, D. / IEEE et al. | 2005
- I
-
SP-P12.1: LATTICE SEGMENTATION AND SUPPORT VECTOR MACHINES FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITIONVenkataramani, V. / Byrne, W. / IEEE et al. | 2005
- I
-
SP-P12.7: DEVELOPMENT OF THE CUHTK 2004 MANDARIN CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEMGales, M. J. F. / Jia, B. / Liu, A. / Sim, K. C. / Woodland, P. / Yu, K. / IEEE et al. | 2005
- I
-
SP-P13.6: MATHEMATICAL EVIDENCE OF THE ACOUSTIC UNIVERSAL STRUCTURE IN SPEECHMinematsu, N. / IEEE et al. | 2005
- I
-
SP-P15.8: A NOVEL METHOD FOR RAPID SPEAKER ADAPTATION BASED ON SUPPORT SPEAKER WEIGHTINGCai, T. / Zhu, J. / IEEE et al. | 2005
- I
-
SP-P16.2: CONSTRAINED PHRASE-BASED TRANSLATION USING WEIGHTED FINITE STATE TRANSDUCERZhou, B. / Chen, S. / Gao, Y. / IEEE et al. | 2005
- I
-
SP-P17.3: CODEBOOK-BASED BAYESIAN SPEECH ENHANCEMENTSrinivasan, S. / Samuelsson, J. / Kleijn, B. / IEEE et al. | 2005
- I
-
SP-P17.9: SEPARATION OF FRICATIVES AND AFFRICATESHu, G. / Wang, D. / IEEE et al. | 2005
- I
-
SP-L1.4: A STUDY ON RESIDUAL PREDICTION TECHNIQUES FOR VOICE CONVERSIONSuendermann, D. / Bonafonte, A. / Ney, H. / Hoege, H. / IEEE et al. | 2005
- I
-
SP-L4.4: GENERALIZED POSTERIOR PROBABILITY FOR MINIMUM ERROR VERIFICATION OF RECOGNIZED SENTENCESLo, W. K. / Soong, F. K. / IEEE et al. | 2005
- I
-
SP-L10.5: AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY MARKERS IN SPEECH SIGNALSDikshit, P. / Zahorian, S. / Nagulapati, S. / IEEE et al. | 2005
- I
-
SP-P1.11: OPTIMAL SUBSET SELECTION FROM TEXT DATABASESTian, J. / Nurminen, J. / Kiss, I. / IEEE et al. | 2005
- I
-
SP-P3.8: ROBUST PITCH ESTIMATION AT VERY LOW SNR EXPLOITING TIME AND FREQUENCY DOMAIN CUESShahnaz, C. / Zhu, W.-P. / Ahmad, M. O. / IEEE et al. | 2005
- I
-
SP-P4.6: TOWARDS SPEECH RECOGNITION ORIENTED DEREVERBERATIONJinachitra, P. / Prieto, R. / IEEE et al. | 2005
- I
-
SP-P4.12: ACOUSTIC TRAINING FROM HETEROGENEOUS DATA SOURCES: EXPERIMENTS IN MANDARIN CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTIONTsakalidis, S. / Byrne, W. / IEEE et al. | 2005
- I
-
SP-P5.3: LIP READING FOR ROBUST SPEECH RECOGNITION ON EMBEDDED DEVICESPerez, J. F. G. / Frangi, A. F. / Solano, E. L. / Lukas, K. / IEEE et al. | 2005
- I
-
SP-P5.12: ROBUST LIP-MOTION FEATURES FOR SPEAKER IDENTIFICATIONCetingul, H. E. / Yemez, Y. / Erzin, E. / Tekalp, A. M. / IEEE et al. | 2005
- I
-
SP-P6.3: MFCC COMPENSATION FOR IMPROVED RECOGNITION OF FILTERED AND BAND-LIMITED SPEECHMorales, N. / Hansen, J. H. L. / Toledano, D. T. / IEEE et al. | 2005
- I
-
SP-P7.11: LANGUAGE IDENTIFICATION USING PITCH CONTOUR INFORMATIONLin, C.-Y. / Wang, H.-C. / IEEE et al. | 2005
- I
-
SP-P8.2: SPEAKER IDENTIFICATION IN UNKNOWN NOISY CONDITIONS - A UNIVERSAL COMPENSATION APPROACHMing, J. / Stewart, D. / Vaseghi, S. / IEEE et al. | 2005
- I
-
SP-P8.11: A STUDY OF THE RELATIVE IMPORTANCE OF TEMPORAL CHARACTERISTICS IN TEXT-DEPENDENT AND TEXT-CONSTRAINED SPEAKER VERIFICATIONNealand, J. / Pelecanos, J. / Zilca, R. / Ramaswamy, G. / IEEE et al. | 2005
- I
-
SP-P9.1: INITIALIZING SUBSPACE CONSTRAINED GAUSSIAN MIXTURE MODELSOlsen, P. / Visweswariah, K. / Gopinath, R. / IEEE et al. | 2005
- I
-
SP-P9.2: MULTI-RATE AND VARIABLE-RATE MODELING OF SPEECH AT PHONE AND SYLLABLE TIME SCALESCetin, O. / Ostendorf, M. / IEEE et al. | 2005
- I
-
SP-P10.3: ESTIMATING AND EVALUATING CONFIDENCE FOR FORENSIC SPEAKER RECOGNITIONCampbell, W. / Reynolds, D. / Campbell, J. / Brady, K. / IEEE et al. | 2005
- I
-
SP-P12.5: ADAPTATION STRATEGIES FOR THE ACOUSTIC AND LANGUAGE MODELS IN BILINGUAL SPEECH TRANSCRIPTIONDieguez-Tirado, J. / Garcia-Mateo, C. / Docio-Fernandez, L. / Cardenal-Lopez, A. / IEEE et al. | 2005
- I
-
SP-P13.4: SNR AND LOCAL NOISE POWER ESTIMATIONS BASED ON GAUSSIAN MIXTURE MODELING ON THE LOG-POWER DOMAINTakeda, K. / Dat, T. H. / Fujimura, H. / Itakura, F. / IEEE et al. | 2005
- I
-
SP-P13.11: MULTI-SPEAKER ARTICULATORY RECONSTRUCTION BASED ON AN EIGEN ARTICULATORY HMMHiroya, S. / Mochida, T. / IEEE et al. | 2005
- I
-
SP-P13.13: DYSPHONIC SPEECH ANALYSIS USING GENERALIZED VARIOGRAMKacha, A. / Grenez, F. / Schoentgen, J. / Benmahammed, K. / IEEE et al. | 2005
- I
-
SP-P14.1: TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSIONSeltzer, M. / Acero, A. / IEEE et al. | 2005
- I
-
SP-P14.6: PREDICTING FORMANT FREQUENCIES FROM MFCC VECTORSDarch, J. / Milner, B. / Shao, X. / Vaseghi, S. / Yan, Q. / IEEE et al. | 2005
- I
-
SP-P14.8: TOWARDS AN INTELLIGENT ACOUSTIC FRONT-END FOR AUTOMATIC SPEECH RECOGNITION: BUILT-IN SPEAKER NORMALIZATION (BISN)Yapanel, U. / Hansen, J. H. L. / IEEE et al. | 2005
- I
-
SP-P16.1: OPEN VOCABULARY ASR FOR AUDIOVISUAL DOCUMENT INDEXATIONAllauzen, A. / Gauvain, J.-L. / IEEE et al. | 2005
- I
-
SP-P17.1: BLIND DEREVERBERATION BASED ON ESTIMATES OF SIGNAL TRANSMISSION CHANNELS WITHOUT PRECISE INFORMATION OF CHANNEL ORDERHikichi, T. / Delcroix, M. / Miyoshi, M. / IEEE et al. | 2005
- I
-
SP-P17.2: FAST ESTIMATION OF A PRECISE DEREVERBERATION FILTER BASED ON SPEECH HARMONICITYKinoshita, K. / Nakatani, T. / Miyoshi, M. / IEEE et al. | 2005
- I
-
SP-P17.12: ADAPTIVE DECORRELATION FILTERING ALGORITHM FOR SPEECH SOURCE SEPARATION IN UNCORRELATED NOISESHu, R. / Zhao, Y. / IEEE et al. | 2005
- I/1
-
Polyglot synthesis using a mixture of monolingual corporaLatorre, J. / Iwano, K. / Furui, S. et al. | 2005
- I/5
-
Introducing roughness in individuality transformation through jitter modeling and modificationVerma, A. / Kumar, A. et al. | 2005
- I/9
-
Spectral conversion based on maximum likelihood estimation considering global variance of converted parameterToda, T. / Black, A.W. / Tokuda, K. et al. | 2005
- I/13
-
A study on residual prediction techniques for voice conversionSundermann, D. / Bonafonte, A. / Ney, H. et al. | 2005
- I/17
-
Voice forgery using ALISP: indexation in a client memoryPatrick, P.Z. / Aversano, G. / Blouet, R. / Charbit, M. / Chollet, G. et al. | 2005
- I/21
-
An improved spectral and prosodic transformation method in STRAIGHT-based voice conversionLong Qin, / Gao-Peng Chen, / Zhen-Hua Ling, / Li-Rong Dai, et al. | 2005
- I/25
-
Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systemsHigashinaka, R. / Sudoh, K. / Nakano, M. et al. | 2005
- I/29
-
Semantic interpretation with error correctionRaymond, C. / Bechet, F. / Camelin, N. / De Mori, R. / Damnati, G. et al. | 2005
- I/33
-
Dialog act tagging using graphical modelsGang Ji, / Bilmes, J. et al. | 2005
- I/37
-
A clarification algorithm for spoken dialogue systemsLewis, C. / Di Fabbrizio, G. et al. | 2005
- I/41
-
Model adaptation for spoken language understandingTur, G. et al. | 2005
- I/45
-
Unsupervised semantic intent discovery from call log acousticsXiao Li, / Gunawardana, A. / Acero, A. et al. | 2005
- I/49
-
Proposal on objective speech quality assessment for wideband IP telephonyMorioka, C. / Kurashima, A. / Takahashi, A. et al. | 2005
- I/53
-
Neural cell type recognition between globus pallidus externus and globus pallidus internus by Gaussian mixture modelingQiang Fu, / Clements, M. / Mewes, K. et al. | 2005
- I/57
-
Analysis of relationship between overall quality and psychological factors affecting high-quality speech communication servicesAoki, H. / Takahashi, A. et al. | 2005
- I/61
-
Can you understand him? Let's look at his word accuracy-automatic evaluation of tracheoesophageal speechSchuster, M. / Noth, E. / Haderlein, T. / Steidl, S. / Batliner, A. / Rosanowski, F. et al. | 2005
- I/65
-
A warped bandwidth expansion filterBoillot, M.A. / Harris, J.G. et al. | 2005
- I/69
-
Relative energy and intelligibility of transient speech informationSungyub Yoo, / Boston, J.R. / Durrant, J.D. / Kovacyk, K. / Karn, S. / Shaiman, S. / El-Jaroudi, A. / Ching-Chung Li, et al. | 2005
- I/73
-
Rejection using rank statistics based on HMM state shortlistsBocchieri, E. / Parthasarathy, S. et al. | 2005
- I/77
-
Speaker adaptive confidence scoring using Bayesian combiningTae-Yoon Kim, / Hanseok Ko, et al. | 2005
- I/81
-
Improving utterance verification using additional confidence measures in isolated speech recognition interfacesGreenland, G. / Wong, W. / Kunov, H. et al. | 2005
- I/85
-
Generalized posterior probability for minimum error verification of recognized sentencesWai Kit Lo, / Soong, F.K. et al. | 2005
- I/89
-
Robust speech recognition by integrating speech separation and hypothesis testingSrinivasan, S. / DeLiang Wang, et al. | 2005
- I/93
-
Combination of multiple predictors to improve confidence measure based on local posterior probabilitiesYuewen Fu, / Limin Du, et al. | 2005
- I/97
-
Adaptation of precision matrix models on large vocabulary continuous speech recognitionSim, K.C. / Gales, M. et al. | 2005
- I/101
-
Discriminative training of CDHMMs for maximum relative separation marginChaojun Liu, / Hui Jiang, / Xinwei Li, et al. | 2005
- I/105
-
Statistical performance analysis of MCE/GPD learning in Gaussian classifiers and hidden Markov models [speech recognition example]Afify, M. / Xin-Wei Lai, / Hui Jiang, et al. | 2005
- I/109
-
Discriminative training of acoustic models applied to domains with unreliable transcripts [speech recognition applications]Mathias, L. / Yegnanarayanan, G. / Fritsch, J. et al. | 2005
- I/113
-
Minimum classification error for large scale speech recognition tasks using weighted finite state transducersMcDermott, E. / Katagiri, S. et al. | 2005
- I/117
-
Discriminative training based on the criterion of least phone competing tokens for large vocabulary speech recognitionBo Liu, / Hui Jiang, / Jian-Lai Zhou, / Ren-Hua Wang, et al. | 2005
- I/121
-
Multi-frame GMM-based block quantisation of line spectral frequencies for wideband speech codingSo, S. / Paliwal, K.K. et al. | 2005
- I/125
-
Non-intrusive GMM-based speech quality measurementFalk, T.H. / Qingfeng Xu, / Wai-Yip Chan, et al. | 2005
- I/129
-
A multiple-description PCM speech coder using structured dual vector quantizersVoran, S.D. et al. | 2005
- I/133
-
A new segment quantizer for line spectral frequencies using Lempel-Ziv algorithm [speech coding applications]Kohata, M. / Suzuki, M. / Makino, S. et al. | 2005
- I/137
-
Predictive VQ for bandwidth scalable LSP quantization [speech coding applications]Ehara, H. / Morii, T. / Oshikiri, M. / Yoshida, K. et al. | 2005
- I/141
-
Coding with side information techniques for LSF reconstruction in voice over IPAgiomyrgiannakis, Y. / Stylianou, Y. et al. | 2005
- I/145
-
Signal subspace speech enhancement for audible noise reductionChang Huai You, / Soo Ngee Koh, / Rahardja, S. et al. | 2005
- I/149
-
A wavelet Kalman filter with perceptual masking for speech enhancement in colored noiseNing Ma, / Bouchard, M. / Goubran, R.A. et al. | 2005
- I/153
-
Adaptive time segmentation of noisy speech for improved speech enhancementHendriks, R.C. / Heusdens, R. / Jensen, J. et al. | 2005
- I/157
-
Speech enhancement using harmonic regenerationPlapous, C. / Marro, C. / Scalart, P. et al. | 2005
- I/161
-
Instant noise estimation using Fourier transform of AMDF and variable start minima searchZhong Lin, / Goubran, R. et al. | 2005
- I/165
-
Speech enhancement based on speech spectral complex Gaussian mixture modelGuo-Hong Ding, / Xia Wang, / Yang Cao, / Feng Ding, / Yuezhong Tang, et al. | 2005
- I/169
-
Improved phonetic speaker recognition using lattice decodingHatch, A.O. / Peskin, B. / Stolcke, A. et al. | 2005
- I/173
-
SRI's 2004 NIST speaker recognition evaluation systemKajarekar, S.S. / Ferrer, L. / Shriberg, E. / Sonmez, K. / Stolcke', A. / Venkataraman, A. / Jing Zheng, et al. | 2005
- I/177
-
The 2004 MIT Lincoln Laboratory speaker recognition systemReynolds, D.A. / Campbell, W. / Gleason, T. / Quillen, C. / Sturim, D. / Torres-Carrasquillo, P. / Adami, A. et al. | 2005
- I/181
-
Speaker verification using adapted articulatory feature-based conditional pronunciation modelingKa-Yee Leung, / Man-Wai Mak, / Manhung Siu, / Sun-Yuan Kung, et al. | 2005
- I/185
-
Prosody modeling and eigen-prosody analysis for robust speaker recognitionZi-He Chen, / Yuan-Fu Liao, / Yau-Tarng Juang, et al. | 2005
- I/189
-
Prosodic modeling for speaker recognition based on sub-band energy temporal trajectoriesAdami, A.G. et al. | 2005
- I/193
-
Sub-phonetic polynomial segment model for large vocabulary continuous speech recognitionSiu-Kei Au Yeung, / Chak-Fai Li, / Man-Hung Siu, et al. | 2005
- I/197
-
Constructing ensembles of ASR systems using randomized decision treesSiohan, O. / Ramabhadran, B. / Kingsbury, B. et al. | 2005
- I/201
-
Efficient generation of high-order context dependent weighted finite state transducers for speech recognitionSchuster, M. / Hori, T. et al. | 2005
- I/205
-
The IBM 2004 conversational telephony system for rich transcriptionSoltau, H. / Kingsbury, B. / Mangu, L. / Povey, D. / Saon, G. / Zweig, G. et al. | 2005
- I/209
-
Training LVCSR systems on thousands of hours of dataEvermann, G. / Chan, H.Y. / Gales, M.J.F. / Jia, B. / Mrva, D. / Woodland, P.C. / Yu, K. et al. | 2005
- I/213
-
Landmark-based speech recognition: report of the 2004 Johns Hopkins summer workshopHasegawa-Johnson, M. / Baker, J. / Borys, S. / Chen, K. / Coogan, E. / Greenberg, S. / Juneja, A. / Kirchhoff, K. / Livescu, K. / Mohan, S. et al. | 2005
- I/217
-
Speech analysis by estimating perceptually relevant pole locationsAtti, V. / Spanias, A. et al. | 2005
- I/221
-
Coherent envelope detection for modulation filtering of speechSchimmel, S. / Atlas, L. et al. | 2005
- I/225
-
Speech signal analysis with exponential autoregressive modelIshizuka, K. / Kato, H. / Nakatani, T. et al. | 2005
- I/229
-
Comparison of autoregressive parameter estimation algorithms for speech processing and recognitionMorris, R.W. / Arrowood, J.A. / Clements, M.A. et al. | 2005
- I/233
-
An algorithm for locating fundamental frequency markers in speech signalsDikshit, P. / Zahorian, S.A. / Nagulapati, S. et al. | 2005
- I/237
-
An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognitionSasou, A. / Goto, M. / Hayamizu, S. / Tanaka, K. et al. | 2005
- I/241
-
Static and dynamic spectral features: their noise robustness and optimal weights for ASRChen Yang, / Soong, F.K. / Tan Lee, et al. | 2005
- I/245
-
Log-energy dynamic range normalization for robust speech recognitionWeizhong Zhu, / O'Shaughnessy, D. et al. | 2005
- I/249
-
A companding front end for noise-robust automatic speech recognitionGuinness, J. / Raj, B. / Schmidt-Nielsen, B. / Turicchia, L. / Sarpeshkars, R. et al. | 2005
- I/253
-
Multi-resolution spectral entropy feature for robust ASRMisra, H. / Ikbal, S. / Sivadas, S. / Bourlard, H. et al. | 2005
- I/257
-
Particle filter based non-stationary noise tracking for robust speech recognitionFujimoto, M. / Nakamura, S. et al. | 2005
- I/261
-
Online cepstral filtering using a sequential EM approach with Polyak averaging and feedback [speech recognition applications]Myrvoll, T.A. / Nakamura, S. et al. | 2005
- I/265
-
Improving the understandability of speech synthesis by modeling speech in noiseLangner, B. / Black, A.W. et al. | 2005
- I/269
-
An automatic prosody recognizer using a coupled multi-stream acoustic model and a syntactic-prosodic language modelAnanthakrishnan, S. / Narayanan, S.S. et al. | 2005
- I/273
-
F0 control characterization by perceptual impressions on speaking attitudes using multiple dimensional scaling analysisKokenawa, Y. / Tsuzaki, M. / Kato, H. / Sagisaka, Y. et al. | 2005
- I/277
-
Additive modeling of English F0 contour for speech synthesisSakai, S. et al. | 2005
- I/281
-
Prosody analysis and modeling for emotional speech synthesisDan-ning Jiang, / Wei Zhang, / Li-qin Shen, / Lian-hong Cai, et al. | 2005
- I/285
-
Sliding window smoothing for maximum entropy based intonational phrase prediction in ChineseJian-Feng Li, / Guo-Ping Hu, / Ren-Hua Wang, / Li-Rong Dai, et al. | 2005
- I/289
-
Identification and synthesis of Cantonese tones based on the command-response model for F/sub 0/ contour generationWentao Gu, / Hirose, K. / Fujisaki, H. et al. | 2005
- I/293
-
Compression of exception lexicons for small footprint grapheme-to-phoneme conversionMeron, J. / Veprek, P. et al. | 2005
- I/297
-
Prediction of pronunciation variations for speech synthesis: a data-driven approachBennett, C.L. / Black, A.W. et al. | 2005
- I/301
-
Recording script design for corpus-based TTS system based on coverage of various phonetic elementsIsogai, M. / Mizuno, H. / Mano, K. et al. | 2005
- I/305
-
Optimal subset selection from text databasesJilei Tian, / Nurminen, J. / Kiss, I. et al. | 2005
- I/309
-
Comparative study of automatic phone segmentation methods for TTSAdell, J. / Bonafonte, A. / Gomez, J.A. / Castro, M.J. et al. | 2005
- I/313
-
Increased robustness against bit errors for distributed speech recognition in wireless environmentsDelaney, B. et al. | 2005
- I/317
-
"Of all things the measure is man" automatic classification of emotions and inter-labeler consistency [speech-based emotion recognition]Steidl, S. / Levit, M. / Batliner, A. / Noth, E. / Niemann, H. et al. | 2005
- I/321
-
Disordered speech evaluation using objective quality measuresLingyun Gu, / Harris, J.G. / Shrivastav, R. / Sapienza, C. et al. | 2005
- I/325
-
Meta-classifiers in acoustic and linguistic feature fusion-based affect recognitionSchuller, B. / Villar, R.J. / Rigoll, G. / Lang, M. et al. | 2005
- I/329
-
Packet loss concealment based on VQ replicas and MMSE estimation applied to distributed speech recognitionPeinado, A.M. / Gomez, A.M. / Sanchez, V. / Perez-Cordoba, J.L. / Rubio, A.J. et al. | 2005
- I/333
-
A comparison of soft-feature distributed speech recognition with candidate codecs for speech enabled mobile servicesIon, V. / Haeb-Umbach, R. et al. | 2005
- I/337
-
A hidden trajectory model with bi-directional target filtering: cascaded vs. integrated implementation for phonetic recognitionLi Deng, / Xiang Li, / Dong Yu, / Acero, A. et al. | 2005
- I/341
-
A comparison of classifiers for detecting emotion from speechShafran, I. / Mohri, M. et al. | 2005
- I/345
-
Soft decoding of temporal derivatives for robust distributed speech recognition in packet lossJames, A. / Milner, B. et al. | 2005
- I/349
-
DBN-based multi-stream models for Mandarin toneme recognitionXin Lei, / Gang Ji, / Ng, T. / Bilmes, J. / Ostendorf, M. et al. | 2005
- I/353
-
Sparse KPCA for feature extraction in speech recognitionLima, A. / Zen, H. / Nankaku, Y. / Tokuda, K. / Kitamura, T. / Resende, F.G. et al. | 2005
- I/357
-
Effects of phoneme characteristics on TEO feature-based automatic stress detection in speechRuzanski, E. / Hansen, J.H.L. / Meyerhoff, J. / Saviolakis, G. / Koenig, M. et al. | 2005
- I/361
-
Scalable concatenative speech synthesis based on the plural unit selection and fusion methodTamura, M. / Mizutani, T. / Kagoshima, T. et al. | 2005
- I/365
-
Adaptive training for hidden semi-Markov model [speech synthesis applications]Yamagishi, J. / Kobayashi, T. et al. | 2005
- I/369
-
Perceptually weighted long term modeling of sinusoidal speech amplitude trajectoriesFirouzmand, M.Z. / Girin, L. et al. | 2005
- I/373
-
Speech recognition in the blind condition based on multiple directivity patterns using a microphone arraySekiya, T. / Kobayashi, T. et al. | 2005
- I/377
-
An unsupervised quantitative measure for word prominence in spontaneous speechDagen Wang, / Narayanan, S. et al. | 2005
- I/381
-
Speech modelling based on generalized Gaussian probability density functionsKokkinakis, K. / Nandi, A.K. et al. | 2005
- I/385
-
Bayesian model based non-intrusive speech quality evaluationGuo Chen, / Parsa, V. et al. | 2005
- I/389
-
Robust pitch estimation at very low SNR exploiting time and frequency domain cuesShahnaz, C. / Zhu, W.-P. / Ahmad, M.O. et al. | 2005
- I/393
-
Fundamental frequency estimation and vocal tremor analysis by means of Morlet wavelet transformsCnockaert, L. / Grenez, F. / Schoentgen, J. et al. | 2005
- I/397
-
Automatic speech segmentation using average level crossing rate informationSarkar, A. / Sreenivas, T.V. et al. | 2005
- I/401
-
DWT-based phonetic groups classification using neural networksPham, T.V. / Kubin, G. et al. | 2005
- I/405
-
A novel KLT algorithm optimized for small signal sets [speech processing applications]Gianfelici, F. / Biagetti, G. / Crippa, P. / Turchetti, C. et al. | 2005
- I/409
-
Voicing-state classification of co-channel speech using nonlinear state-space reconstructionMahgoub, Y.A. / Dansereau, R.M. et al. | 2005
- I/413
-
Speech rate estimation via temporal correlation and selected sub-band correlationNarayanan, S. / Dagen Wang, et al. | 2005
- I/417
-
Closely coupled array processing and model-based compensation for microphone array speech recognitionXianyu Zhao, / Zhijian Ou, / Minhua Chen, / Zuoying Wang, et al. | 2005
- I/421
-
Context dependent duration modeling [speech recognition applications]Willett, D. et al. | 2005
- I/425
-
Recognising speech in the presence of a competing speaker using a 'speech fragment decoder'Coy, A. / Barker, J. et al. | 2005
- I/429
-
An environment compensated maximum likelihood training approach based on stochastic vector mapping [speech recognition applications]Jian Wu, / Qiang Huo, / Donglai Zhu, et al. | 2005
- I/433
-
Effect of phase-sensitive environment model and higher order VTS on noisy speech feature enhancement [speech recognition applications]Stouten, V. / Van Hamme, H. / Wambacq, P. et al. | 2005
- I/437
-
Towards speech recognition oriented dereverberationJinachitra, P. / Prieto, R.E. et al. | 2005
- I/441
-
Noisy speech recognition based on robust end-point detection and model adaptationZhipeng Zhang, / Furui, S. et al. | 2005
- I/445
-
Analysis of a large in-car speech corpus and its application to the multimodel ASRFujimua, H. / Miyajima, C. / Itou, K. / Takeda, K. / Itakura, F. et al. | 2005
- I/449
-
Building an effective corpus by using acoustic space visualization (COSMOS) method [speech recognition applications]Nagino, G. / Shozakai, M. et al. | 2005
- I/453
-
HMM/ANN based spectral peak location estimation for noise robust speech recognitionIkbal, S. / Bourlard, H. / Magimai-Doss, M. et al. | 2005
- I/457
-
Acoustic feature combination for robust speech recognitionZolnay, A. / Schluter, R. / Ney, H. et al. | 2005
- I/461
-
Acoustic training from heterogeneous data sources: experiments in Mandarin conversational telephone speech transcriptionTsakalidis, S. / Byrne, W. et al. | 2005
- I/465
-
Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spottingThambiratnam, K. / Sridharan, S. et al. | 2005
- I/469
-
A stream-weight optimization method for multi-stream HMMs based on likelihood value normalizationTamura, S. / Iwano, K. / Furui, S. et al. | 2005
- I/473
-
Lip reading for robust speech recognition on embedded devicesPerez, J.F.G. / Frangi, A.F. / Solano, E.L. / Lukas, K. et al. | 2005
- I/477
-
Novel techniques for time-compressing speech: an exploratory studyTucker, S. / Whittaker, S. et al. | 2005
- I/481
-
Fast two-stage vocabulary independent search in spontaneous speechPeng Yu, / Seide, F. et al. | 2005
- I/485
-
An HMM-based text segmentation method using variational Bayes approach and its application to LVCSR for broadcast newsKoshinaka, T. / Iso, K. / Okumura, A. et al. | 2005
- I/489
-
Detecting group interest-level in meetingsGatica-Perez, D. / McCowan, L. / Dong Zhang, / Bengio, S. et al. | 2005
- I/493
-
Semantic data mining of short utterancesBegeja, L. / Drucker, H. / Gibbon, D. / Haffner, P. / Zhu Liu, / Renger, B. / Shahraray, B. et al. | 2005
- I/497
-
Automatic processing of audio lectures for information retrieval: vocabulary selection and language modelingPark, A. / Hazen, T.J. / Glass, J.R. et al. | 2005
- I/501
-
Blind change detection for audio segmentationOmar, M.K. / Chaudhari, U. / Ramaswamy, G. et al. | 2005
- I/505
-
Combining multiple subword representations for open-vocabulary spoken document retrievalShi-Wook Lee, / Tanaka, K. / Itoh, Y. et al. | 2005
- I/509
-
Robust lip-motion features for speaker identificationCetingul, H.E. / Yemez, Y. / Erzin, E. / Tekalp, A.M. et al. | 2005
- I/513
-
Variational Bayesian feature saliency for audio type classificationValente, F. / Wellekens, C. et al. | 2005
- I/517
-
Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory maskingGhulam, M. / Fukuda, T. / Horikawa, J. / Nitta, T. et al. | 2005
- I/521
-
MFCC compensation for improved recognition of filtered and bandlimited speechMorales, N. / Hansen, J.H.L. / Toledano, D.T. et al. | 2005
- I/525
-
Speech feature smoothing for robust ASRChia-Ping Chen, / Bilmes, J. / Ellis, D.P.W. et al. | 2005
- I/529
-
On desensitizing the Mel-cepstrum to spurious spectral components for robust speech recognitionTyagi, V. / Wellekens, C. et al. | 2005
- I/533
-
Two-stage noise spectra estimation and regression based in-car speech recognition using single distant microphoneWeifeng Li, / Itou, K. / Takeda, K. / Itakura, F. et al. | 2005
- I/537
-
Mask estimation based on sound localisation for missing data speech recognitionHarding, S. / Barker, J. / Brown, G.J. et al. | 2005
- I/541
-
Speech processing using joint features derived from the modified group delay functionHegde, R.M. / Murthy, H.A. / Rao, G.V.R. et al. | 2005
- I/545
-
Influence of autocorrelation lag ranges on robust speech recognitionShannon, B.J. / Paliwal, K.K. et al. | 2005
- I/549
-
Subspace-based speaker-independent vowel recognitionMuralishankar, R. / O'Shaughnessy, D. et al. | 2005
- I/553
-
Robust speech recognition based on spectral adjusting and warpingRui Zhao, / Zuoying Wang, et al. | 2005
- I/557
-
Robust speech activity detection using LDA applied to FF parametersPadrell, J. / Macho, D. / Nadeu, C. et al. | 2005
- I/561
-
Joint discriminative language modeling and utterance classificationSaraclar, M. / Roark, B. et al. | 2005
- I/565
-
Language model estimation for optimizing end-to-end performance of a natural language call routing systemGoel, V. / Kuo, H.-K.J. / Deligne, S. / Cheng Wu, et al. | 2005
- I/569
-
Language identification using phonetic and prosodic HMMs with feature normalizationObuchi, Y. / Sato, N. et al. | 2005
- I/573
-
Rapid language model development using external resources for new spoken dialog domainsSarikaya, R. / Gravano, A. / Yuqing Gao, et al. | 2005
- I/577
-
Using local & global phonotactic features in Chinese dialect identificationBoon Pang Lim, / Haizhou Li, / Bin Ma, et al. | 2005
- I/581
-
Random clusterings for language modelingEmami, A. / Jelinek, F. et al. | 2005
- I/585
-
Dialect/accent classification via boosted word modelingRongqing Huang, / Hansen, J.H.L. et al. | 2005
- I/589
-
Web-data augmented language models for Mandarin conversational speech recognitionNg, T. / Ostendorf, M. / Mei-Yuh Hwang, / Manhung Siu, / Bulyko, I. / Xin Lei, et al. | 2005
- I/593
-
An efficient algorithm for clustering short spoken utterancesZhu Liu, et al. | 2005
- I/597
-
Maximum entropy based generic filter for language model adaptationDong Yu, / Mahajan, M. / Mau, P. / Acero, A. et al. | 2005
- I/601
-
Language identification using pitch contour informationChi-Yueh Lin, / Hsiao-Chuan Wang, et al. | 2005
- I/605
-
Integrating multiple layers of concept information into n-gram modeling for spoken language understandingWang, N.J.C. et al. | 2005
- I/609
-
Automatic language identification using ergodic-HMMSantosh Kumar, S.A. / Ramasubramanian, V. et al. | 2005
- I/613
-
Discriminative power of transient frames in speaker recognitionLouradour, J. / Daoudi, K. / Andre-Obrecht, R. et al. | 2005
- I/617
-
Speaker identification in unknown noisy conditions - a universal compensation approachJi Ming, / Stewart, D. / Vaseghi, S. et al. | 2005
- I/621
-
Extracting additional information from Gaussian mixture model probabilities for improved text independent speaker identificationNarayanaswamy, B. / Gangadharaiah, R. et al. | 2005
- I/625
-
Combining selection tree with observation reordering pruning for efficient speaker identification using GMM-UBMZhenyu Xiong, / Zheng, T.F. / Zhanjiang Song, / Wenhu Wu, et al. | 2005
- I/629
-
Advances in channel compensation for SVM speaker recognitionSolomonoff, A. / Campbell, W.M. / Boardman, I. et al. | 2005
- I/633
-
Improved speaker model migration via stochastic synthesis [speaker recognition applications]Navratil, J. / Ramaswamy, G.N. et al. | 2005
- I/637
-
Factor analysis simplified [speaker verification applications]Kenny, P. / Boulianne, G. / Ouellet, P. / Dumouchel, P. et al. | 2005
- I/641
-
Minimum classification error interactive training for speaker identification [interactive robot applications]Kida, Y. / Yamamoto, H. / Miyajima, C. / Tokuda, K. / Kitamura, T. et al. | 2005
- I/645
-
A new common component GMM-based speaker recognition methodYih-Ru Wang, / Chen-Yu Chiang, et al. | 2005
- I/649
-
GMM-based Bhattacharyya kernel Fisher discriminant analysis for speaker recognitionYi-Hsiang Chao, / Hsin-Min Wang, / Ruei-Chuan Chang, et al. | 2005
- I/653
-
A study of the relative importance of temporal characteristics in text dependent and text constrained speaker verificationNealand, J.H. / Pelecanos, J.W. / Zilca, R.D. / Ramaswamy, G.N. et al. | 2005
- I/657
-
Noise robust speaker verification using mel-frequency discrete wavelet coefficients and parallel model compensationTufekci, Z. / Gurbuz, S. et al. | 2005
- I/661
-
Initializing subspace constrained Gaussian mixture modelsOlsen, P.A. / Visweswariah, K. / Gopinath, R. et al. | 2005
- I/665
-
Multi-rate and variable-rate modeling of speech at phone and syllable time scales [speech recognition applications]Cetin, O. / Ostendorf, M. et al. | 2005
- I/669
-
Optimal clustering and non-uniform allocation of Gaussian kernels in scalar dimension for HMM compression [speech recognition applications]Xiao-Bing Li, / Soong, F.K. / Myrvoll, T.A. / Ren-Hua Wang, et al. | 2005
- I/673
-
Hierarchical correlation compensation for hidden Markov models [speech recognition applications]Hui Lin, / Ye Tian, / Jian-Lai Zhou, / Hui Jiang, et al. | 2005
- I/677
-
Cluster-dependent acoustic modeling [speech recognition applications]Bing Xiang, / Long Nguyen, / Matsoukas, S. / Schwartz, R. et al. | 2005
- I/681
-
Fuzzy parameter clustering method in speech recognitionXianghua Xu, / Jie Zhu, et al. | 2005
- I/685
-
Automatic training set segmentation for multi-pass speech recognitionMao, M.Z. / Vanhoucke, V. / Strope, B. et al. | 2005
- I/689
-
Generalized statistical modeling of pronunciation variations using variable-length phone contextAkita, Y. / Kawahara, T. et al. | 2005
- I/693
-
On initialization of Gaussian mixtures: a hybrid genetic EM algorithmPernkopf, F. et al. | 2005
- I/697
-
Acoustic model training using greedy EMRusheng Hu, / Xiaolong Li, / Yunxin Zhao, et al. | 2005
- I/701
-
Modeling successive frame dependencies with hybrid HMM/BN acoustic modelMarkov, K. / Nakamura, S. et al. | 2005
- I/705
-
Improved covariance modeling for maximum likelihood multiple subspace transformations [speech recognition applications]Xi Zhou, / Ye Tian, / Jian-lai Zhou, / Bei-qian Dai, et al. | 2005
- I/709
-
A probabilistic measure of modality reliability in speaker verificationRichiardi, J. / Prodanov, P. / Drygajlo, A. et al. | 2005
- I/713
-
A correlation metric for speaker tracking using anchor modelsCollet, M. / Charlet, D. / Bimbot, F. et al. | 2005
- I/717
-
Estimating and evaluating confidence for forensic speaker recognitionCampbell, W.M. / Reynolds, D.A. / Campbell, J.P. / Brady, K.J. et al. | 2005
- I/721
-
F-ratio client dependent normalisation for biometric authentication tasksPoh, N. / Bengio, S. et al. | 2005
- I/725
-
Clustering speech utterances by speaker using Eigenvoice-motivated vector space modelsWei-Ho Tsai, / Shih-Sian Cheng, / Yi-Hsiang Chao, / Hsin-Min Wang, et al. | 2005
- I/729
-
T-Norm for text-dependent commercial speaker verification applications: effect of lexical mismatchHebert, M. / Boies, D. et al. | 2005
- I/733
-
A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verificationAronowitz, H. / Burshtein, D. / Amir, A. et al. | 2005
- I/737
-
ALIZE, a free toolkit for speaker recognitionBonastre, J.-F. / Wils, F. / Meignier, S. et al. | 2005
- I/741
-
Speaker adaptive cohort selection for Tnorm in text-independent speaker verificationSturim, D.E. / Reynolds, D.A. et al. | 2005
- I/745
-
Hybrid speaker-based segmentation system using model-level clusteringHyoung-Gook Kim, / Ertelt, D. / Sikora, T. et al. | 2005
- I/749
-
Robustness of bit-stream based features for speaker verificationMoreno-Daniel, A. / Juang, B.H. / Nolazco-Flores, J.A. et al. | 2005
- I/753
-
Two-way cluster voting to improve speaker diarisation performanceTranter, S.E. et al. | 2005
- I/757
-
Speaker detection without modelsGillick, D. / Stafford, S. / Peskin, B. et al. | 2005
- I/761
-
Improving the 2.4 kb/s military standard-MELP (MS-MELP) coder using pitch-synchronous analysis and synthesis techniques [speech coding]Ertan, A.E. / Barnwell, T.P. et al. | 2005
- I/765
-
Ultra low bit rate speech coding using an ergodic hidden Markov modelLee, M.E. / Durey, A.S. / Moore, E. / Clements, M. et al. | 2005
- I/769
-
Towards iLBC speech coding at lower rates through a new formulation of the start state searchGarrido, C.M. / Murthi, M.N. / Andersen, S.Y. et al. | 2005
- I/773
-
A missing-data approach to noise-robust LPC extraction for voiced speech using auxiliary sensorsDemiroglu, C. / Barnwell, T. et al. | 2005
- I/777
-
A technique of multi-tap long term predictor (LTP) filter using sub-sample resolution delay [speech coding applications]Jasiuk, M.A. / Ramabadran, T. / Mittal, U. / Ashley, J.P. / McLaughlin, M.J. et al. | 2005
- I/781
-
Voice activity detection based on generalized gamma distributionJong Won Shin, / Joon-Hyuk Chang, / Hwan Sik Yun, / Nam Soo Kim, et al. | 2005
- I/785
-
Increasing the robustness of CELP-based coders by constrained optimizationChibani, M. / Gournay, P. / Lefebvre, R. et al. | 2005
- I/789
-
Joint optimization of excitation parameters in analysis-by-synthesis speech coders having multi-tap long term predictorMittal, U. / Ashley, J.P. / Cruz-Zeno, E.M. / Jasiuk, M.A. et al. | 2005
- I/793
-
Block-based bandwidth extension of narrowband speech signal by using CDHMMSheng Yao, / Cheung-Fat Chan, et al. | 2005
- I/797
-
Segmentation-based speech enhancement for intelligibility improvement in MELP coders using auxiliary sensorsDemiroglu, C. / Kamath, S.D. / Anderson, D.V. et al. | 2005