MSP-P1.10: PROBABILISTIC FACE RECOGNITION FROM COMPRESSED IMAGERY (English)
- Li, J.
- Zhou, S.
- IEEE Signal Processing Society
In: ICASSP; 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing: proceedings : May 17-21, 2004, Fairmont Queen Elizabeth Hotel, Montreal, Quebec, Canada / ; V - 909-912 ; 2004
- Conference paper / Print
-
Title:MSP-P1.10: PROBABILISTIC FACE RECOGNITION FROM COMPRESSED IMAGERY
-
Contributors:
-
Conference:29th, ICASSP; 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing: proceedings : May 17-21, 2004, Fairmont Queen Elizabeth Hotel, Montreal, Quebec, Canada / ; 2004 ; Montreal, Quebec
-
Published in:IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING ; 5 ; V - 909-912
-
Publisher:IEEE
-
Place of publication:Piscataway, N.J.
-
Publication date:2004-01-01
-
Size:V - 909-912
-
Remarks:Conference number extrapolated. "IEEE Catalog Number: 04CH37568"--T.p. verso. Includes bibliographical references and author index. Acoustics, speech, and signal processing ICASSP 2004; Vol 5 of 5
-
Type of media:Conference paper
-
Type of material:Print
-
Language:English
-
Source:
© Metadata Copyright the British Library Board and other contributors. All rights reserved.
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 0_1
-
2004 IEEE International Conference on Acoustics, Speech and Signal Processing| 2004
- 0_1
-
2004 IEEE International Conference on Acoustics, Speech and Signal Processing| 2004
- I
-
Speaking style adaptation using context clustering decision tree for HMM-based speech synthesisYamagishi, J. / Tachibana, M. / Masuko, T. / Kobayashi, T. et al. | 2004
- I
-
Soft decoding strategies for distributed speech recognition over IP networksCardenal-Lopez, A. / Docio-Fernandez, L. / Garcia-Mateo, C. et al. | 2004
- I
-
A subvector-based error concealment algorithm for speech recognition over mobile networksZheng-Hua Tan, / Dalsgaard, P. / Lindberg, B. et al. | 2004
- 61
-
A complexity reduction of ETSI advanced front-end for DSRLi, Jin-Yu / Liu, Bo / Wang, Ren-Hua / Dai, Li-Rong et al. | 2004
- I
-
Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleavingWei-hao Hsu, / Lin-shan Lee, et al. | 2004
- I
-
A novel method for computation of periodicity, aperiodicity and pitch of speech signalsDeshmukh, O. / Singh, J. / Espy-Wilson, C. et al. | 2004
- I
-
Non-uniform speaker normalization using affine-transformationKumar, S.V.B. / Umesh, S. / Sinha, R. et al. | 2004
- I
-
Speech feature extraction method representing periodicity and aperiodicity in sub bands for robust speech recognitionIshizuka, K. / Miyazaki, N. et al. | 2004
- 169
-
Performance analysis for a class of robust adaptive beamformersBesson, O. / Vincent, F. et al. | 2004
- 189
-
Spatial filtering of RF interference in radio astronomy using a reference antennaVeen, A.J. van der / Boonstra, A.J. et al. | 2004
- I
-
Higher order cepstral moment normalization (HOCMN) for robust speech recognitionChang-wen Hsu, / Lin-shan Lee, et al. | 2004
- I
-
Speech enhancement based on a combined multi-channel array with constrained iterative and auditory masked processingXianxian Zhang, / Hansen, J.H.L. / Rehar, K.A. et al. | 2004
- 237
-
An improved array interpolation approach to DOA estimation in correlated signal environmentsLau, B.K. / Cook, G.J. / Leung, Y.H. et al. | 2004
- I
-
Meta-data conditional language modelingBacchiani, M. / Roark, B. et al. | 2004
- 249
-
Direct position determination of narrowband radio transmittersWeiss, A.J. et al. | 2004
- I
-
Cross-lingual latent semantic analysis for language modelingWoosung Kim, / Khudanpur, S. et al. | 2004
- 293
-
A Kalman filter based registration approach for asynchronous sensors in multiple sensor fusion applicationsZhou, Yifeng et al. | 2004
- 329
-
A single-carrier/OFDM comparison for broadband wireless communicationVan der Perre, L. / Tubbax, J. / Horlin, F. / De Man, H. et al. | 2004
- I
-
Speaker indexing and adaptation using speaker clustering based on statistical model selectionNishida, M. / Kawahara, T. et al. | 2004
- 361
-
Geolocation by time difference of arrival using hyperbolic asymptotesDrake, S.R. / Dogancay, K. et al. | 2004
- 393
-
Design of complex allpass filtersFernandez-Vazquez, A. / Jovanovic-Dolecek, G. et al. | 2004
- 397
-
Multiplier-free band-selectable digital filtersSantraine, A. / Leprince, S. / Taylor, F. et al. | 2004
- 433
-
Public speech-oriented guidance system with adult and child discrimination capabilityNisimura, R. / Lee, A. / Saruwatari, H. / Shikano, K. et al. | 2004
- I
-
Improving phoneme recognition of telephone quality speechQiang Huang, / Cox, S. et al. | 2004
- 449
-
A stochastic model for the affine projection algorithm operating in a nonstationary environmentAlmeida, S.J.M. de / Bermudez, J.C.M. / Bershad, N.J. et al. | 2004
- 457
-
A statistical analysis of the multi-split LMS algorithmResende, L.S. / Rocha, C.A.F. / Bermudez, J.C.M. / Bellanger, M.G. et al. | 2004
- 461
-
Sufficient condition for tap-length gradient adaption of LMS algorithmGu, Yuantao / Tang, Kun / Cui, Huijuan et al. | 2004
- 469
-
A modified constant-Q transform for audio signalsSantos, C.N. dos / Netto, S.L. / Biscainho, L.W.R. / Graziosi, D.B. et al. | 2004
- 501
-
Weighted low rank approximation and reduced rank linear regressionWerner, K. / Jansson, M. et al. | 2004
- 505
-
Wavelet packets-based direction-of-arrival estimationXue, Yanbo / Wang, Jinkuan / Liu, Zhigang et al. | 2004
- I
-
An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic modelKen Chen, / Hasegawa-Johnson, M. / Cohen, A. et al. | 2004
- 521
-
A new signal model and identification algorithm for hidden semi-Markov signalsAzimi, M. / Nasiopoulos, P. / Ward, R.K. et al. | 2004
- 561
-
Polyphase analysis of aliasing effects in enlargementsSeidner, D. et al. | 2004
- 569
-
Can timing jitter improve random process reconstruction in presence of aliasing?Lacaze, B. / Mailhes, C. et al. | 2004
- I
-
Importance of window shape for phase-only reconstruction of speechAlsteris, L.D. / Paliwal, K.K. et al. | 2004
- 581
-
Frequency analysis using non-uniform sampling with application to active queue managementGunnarsson, F. / Gustafsson, F. et al. | 2004
- I
-
Automatic emotional speech classificationVerveridis, D. / Kotropoulos, C. / Pitas, I. et al. | 2004
- 597
-
Parametric smoothing of spline interpolationIbanez, J. / Santamaria, I. / Pantaleon, C. / Vielva, L. et al. | 2004
- 629
-
Diffusion equations for adaptive affine distributionsGosme, J. / Richard, C. / Goncalves, P. et al. | 2004
- 633
-
Comparative study of three time-frequency representations with applications to a novel correlation methodSejdic, E. / Jiang, J. et al. | 2004
- 637
-
A bootstrap scheme for time-frequency auto-term selection in antenna arraysCirillo, L.A. / Zoubir, A.M. et al. | 2004
- 649
-
Pole optimisation in adaptive Laguerre filteringden Brinker, A.C. / Sarroukh, B.E. et al. | 2004
- 665
-
On adaptive interpolated FIR filtersBilcu, R.C. / Kuosmanen, P. / Egiazarian, K. et al. | 2004
- 673
-
A recursive least squares algorithm robust to low-power excitationLudovico, C.S. / Bermudez, J.C.M. et al. | 2004
- 709
-
Kalman filtering in stochastic gradient algorithms: construction of a stopping ruleBittner, B. / Pronzato, L. et al. | 2004
- I
-
Combining equalization and estimation for bandwidth extension of narrowband speechYasheng Qian, / Kabal, P. et al. | 2004
- 753
-
Novel approach to AM-FM decomposition with applications to speech and music analysisSekhar, S.C. / Sreenivas, T.V. et al. | 2004
- 757
-
Time-frequency-moving-average processes: principles and cepstral methods for parameter estimationJachan, M. / Matz, G. / Hlawatsch, F. et al. | 2004
- I
-
Studies in massively speaker-specific speech recognitionYu Shi, / Eric Chang, et al. | 2004
- I
-
Codebook design for ASR systems using custom arithmetic unitsXiao Li, / Malkin, J. / Bilmes, J. et al. | 2004
- I
-
Parameter sharing in subband likelihood-maximizing beamforming for speech recognition using microphone arraysSeltzer, M.L. / Stern, R.M. et al. | 2004
- I
-
Extended cluster information vector quantization (ECI-VQ) for robust classificationArrowood, J.A. / Clements, M.A. et al. | 2004
- 933
-
Notions of strong ergodicity for stochastic analysis of multirate systemsMarelli, D. / Fu, Minyue et al. | 2004
- 945
-
An extended sure approach for multicomponent image denoisingBenazza-Benyahia, A. / Pesquet, J.C. et al. | 2004
- 1009
-
Blind deconvolution using Bayesian methods with application to the dereverberation of speechDaly, M.J. / Reilly, J.R. et al. | 2004
- I
-
Automatic recognition of Bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniquesNour-Eldin, A.H. / Tolba, H. / O'Shaughnessy, D. et al. | 2004
- 1037
-
SP-P16.12: SENSITIVITY ANALYSIS OF NOISE ROBUSTNESS METHODSBrayda, L. / Rigazio, L. / Boman, R. / Junqua, J.-C. / IEEE Signal Processing Society et al. | 2004
- 1041
-
Fast MCMC computations for the estimation of sparse processes from noisy observationsDavy, M. / Idier, J. et al. | 2004
- 1041
-
Author index| 2004
- 1053
-
An approach based on influence function to evaluate robustness and detection performance of CFAR detectorsMeng, Huadong / Wang, Xiqin / Zhang, Hao / Peng, Yingning et al. | 2004
- 1069
-
Detection performance for discrete test statistics. Application to low-flux imageryFerrari, A. / Tourneret, J.Y. et al. | 2004
- 1097
-
Signal detection and estimation using atomic decomposition and information-theoretic criteriaLopez-Risueno, G. / Grajal, J. / Yeste-Ojeda, O.A. et al. | 2004
- I
-
Combination of hidden Markov models with dynamic time warping for speech recognitionAxelrod, S. / Maison, B. et al. | 2004
- I
-
Exact training of a neural syntactic language modelEmami, A. / Jelinek, F. et al. | 2004
- I
-
A two-step noise reduction techniquePlapous, C. / Marro, C. / Mauuary, L. / Scalart, P. et al. | 2004
- I
-
Enrollment in low-resource speech recognition systemsDeligne, S. / Dharanipragada, S. et al. | 2004
- I
-
A multimedia approach for audio segmentation in TV broadcast newsPerez-Freire, L. / Garcia-Mateo, C. et al. | 2004
- I
-
The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluationMoraru, D. / Meignier, S. / Fredouille, C. / Besacier, L. / Bonastre, J.F. et al. | 2004
- I
-
Fusing language identification systems using performance confidence indexesGutierrez, J. / Rouas, J.L. / Andre-Obrecht, R. et al. | 2004
- I
-
Enhancement of mismatched conditions in speaker recognition for multimedia applicationsFakhr, W. / Abdelsalam, A. / Hamdy, N. et al. | 2004
- I
-
A detection based approach to robust speech understandingKuansan Wang, et al. | 2004
- I
-
Automatic learning of interpretation strategies for spoken dialogue systemsRaymond, C. / Bechet, F. / De Mori, R. / Damnati, G. / Esteve, Y. et al. | 2004
- I
-
Automatically derived units for segment vocodersRamasubramanian, V. / Sreenivas, T.V. et al. | 2004
- I
-
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architectureSchuller, B. / Rigoll, G. / Lang, M. et al. | 2004
- I
-
Clustering and segmenting speakers and their locations in meetingsAjmera, J. / Lathoud, G. / McCowan, I. et al. | 2004
- I
-
Analysis by synthesis of acoustic correlates of British, Australian and American accentsQin Yan, / Vaseghi, S. / Rentzos, D. / Ching-Hsiang Ho, et al. | 2004
- I
-
A low-band spectrum envelope modeling for high quality pitch modificationMochizuki, R. / Kobayashi, A. et al. | 2004
- I
-
Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesisToda, T. / Kawai, H. / Tsuzaki, M. et al. | 2004
- I
-
Corrective language modeling for large vocabulary ASR with the perceptron algorithmRoark, B. / Saraclar, M. / Collins, M. et al. | 2004
- I
-
Improved name recognition with meta-data dependent name networksMaskey, S.R. / Bacchiani, M. / Roark, B. / Sproat, R. et al. | 2004
- I
-
A new voice activity detector using subband order-statistics filters for robust speech recognitionRamirez, J. / Segura, J.C. / Benitez, C. / de la Torre, A. / Rubio, A. et al. | 2004
- I
-
Fusion based speech segmentation in DARPA SPINE2 taskChengyi Zheng, / Yonghong Yan, et al. | 2004
- I
-
Discriminative feature transformation by guided discriminative trainingHsiao, R. / Mak, B. et al. | 2004
- I
-
Decision tree based tone modeling for Chinese speech recognitionPui-Fung WONG, / Man-Hung SIU, et al. | 2004
- I
-
Joint removal of additive and convolutional noise with model-based feature enhancementStouten, V. / Van Hamme, H. / Wambacq, P. et al. | 2004
- I
-
Minimum classification error training of landmark models for real-time continuous speech recognitionMcDermott, E. / Hazen, T.J. et al. | 2004
- I
-
Universal compensation -- an approach to noisy speech recognition assuming no knowledge of noiseJi Ming, et al. | 2004
- I
-
SP-L2.1: DISCRIMINATIVE TRAINING FOR SPEAKER IDENTIFICATION BASED ON MAXIMUM MODEL DISTANCE ALGORITHMHong, Q. Y. / Kwong, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L1.6: VOICE CONVERSION THROUGH TRANSFORMATION OF SPECTRAL AND INTONATION FEATURESRentzos, D. / Vaseghi, S. / Yan, Q. / Ho, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.3: A SUBVECTOR-BASED ERROR CONCEALMENT ALGORITHM FOR SPEECH RECOGNITION OVER MOBILE NETWORKSTan, Z.-H. / Dalsgaard, P. / Lindberg, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.6: EFFICIENT AND ROBUST DISTRIBUTED SPEECH RECOGNITION (DSR) OVER WIRELESS FADING CHANNELS: 2D-DCT COMPRESSION, ITERATIVE BIT ALLOCATION, SHORT BCH CODE AND INTERLEAVINGHsu, W.-h. / Lee, L.-s. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.3: TEXT-INDEPENDENT SPEAKER RECOGNITION BY COMBINING SPEAKER-SPECIFIC GMM WITH SPEAKER ADAPTED SYLLABLE-BASED HMMNakagawa, S. / Zhang, W. / Takahashi, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.1: HIGH-LEVEL SPEAKER VERIFICATION USING SUPPORT VECTOR MACHINESCampbell, W. / Campbell, J. / Reynolds, D. / Jones, D. / Leek, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.5: WEIGHTED AUTOCORRELATION-BASED F0 ESTIMATION FOR DISTANT-TALKING INTERACTION WITH A DISTRIBUTED MICROPHONE NETWORKArmani, L. / Omologo, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.6: SPEECH FEATURE EXTRACTION METHOD REPRESENTING PERIODICITY AND APERIODICITY IN SUB BANDS FOR ROBUST SPEECH RECOGNITIONIshizuka, K. / Miyazaki, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.5: IMPROVED QUANTIZATION STRUCTURES USING GENERALIZED HMM MODELLING WITH APPLICATION TO WIDEBAND SPEECH CODINGDuni, E. / Subramaniam, A. / Rao, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.3: OVERDETERMINED BLIND SEPARATION FOR CONVOLUTIVE MIXTURES OF SPEECH BASED ON MULTISTAGE ICA USING SUBARRAY PROCESSINGNishikawa, T. / Abe, H. / Saruwatari, H. / Shikano, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.5: MULTIPLE-MICROPHONE TIME-VARYING FILTERS FOR ROBUST SPEECH RECOGNITIONLai, C. / Aarabi, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.5: CROSS-LINGUAL LATENT SEMANTIC ANALYSIS FOR LANGUAGE MODELINGKim, W. / Khudanpur, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.11: SPEAKER INDEXING AND ADAPTATION USING SPEAKER CLUSTERING BASED ON STATISTICAL MODEL SELECTIONNishida, M. / Kawahara, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.6: LANGUAGE BOUNDARY DETECTION AND IDENTIFICATION OF MIXED-LANGUAGE SPEECH BASED ON MAP ESTIMATIONShia, C.-J. / Chiu, Y.-H. / Hsieh, J.-H. / Wu, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.6: MULTISENSOR MELPE USING PARAMETER SUBSTITUTIONBrady, K. / Quatieri, T. / Campbell, J. / Campbell, W. / Brandstein, M. / Weinstein, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.11: AN IMPROVED CORRECTION FORMULA FOR THE ESTIMATION OF HARMONIC MAGNITUDES AND ITS APPLICATION TO OPEN QUOTIENT ESTIMATIONIseli, M. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.6: HMM-BASED FREQUENCY BANDWIDTH EXTENSION FOR SPEECH ENHANCEMENT USING LINE SPECTRAL FREQUENCIESChen, G. / Parsa, V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.8: PERCEPTUAL KALMAN FILTERING FOR SPEECH ENHANCEMENT IN COLORED NOISEMa, N. / Bouchard, M. / Goubran, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.10: AN MMSE SPEECH ENHANCEMENT APPROACH INCORPORATING MASKING PROPERTIESYou, C. h. / Koh, S. n. / Rahardja, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.1: IMPROVING BROADCAST NEWS TRANSCRIPTION BY LIGHTLY SUPERVISED DISCRIMINATIVE TRAININGChan, H. Y. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.10: SEQUENTIAL CLUSTERING ALGORITHM FOR GAUSSIAN MIXTURE INITIALIZATIONMessina, R. / Jouvet, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.2: A NEW VOICE ACTIVITY DETECTOR USING SUBBAND ORDER-STATISTICS FILTERS FOR ROBUST SPEECH RECOGNITIONRamirez, J. / Segura, J. C. / Benitez, C. / de la Torre, A. / Rubio, A. J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.4: A STREAM-WEIGHT OPTIMIZATION METHOD FOR AUDIO-VISUAL SPEECH RECOGNITION USING MULTI-STREAM HMMSTamura, S. / Iwano, K. / Furui, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.5: A FACTORIAL HMM APPROACH TO SIMULTANEOUS RECOGNITION OF ISOLATED DIGITS SPOKEN BY MULTIPLE TALKERS ON ONE AUDIO CHANNELDeoras, A. / Hasegawa-Johnson, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.10: PARAMETER SHARING IN SUBBAND LIKELIHOOD-MAXIMIZING BEAMFORMING FOR SPEECH RECOGNITION USING MICROPHONE ARRAYSSeltzer, M. / Stern, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.7: CHINESE-ENGLISH BILINGUAL PHONE MODELING FOR CROSS-LANGUAGE SPEECH RECOGNITIONYu, S. / Zhang, S. / Xu, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.10: MINIMUM KULLBACK-LEIBLER DISTANCE BASED MULTIVARIATE GAUSSIAN FEATURE ADAPTATION FOR DISTANT-TALKING SPEECH RECOGNITIONPan, Y. / Waibel, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.1: PITCH PREDICTION FROM MFCC VECTORS FOR SPEECH RECONSTRUCTIONShao, X. / Milner, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.4: EXTRACTION OF PITCH IN ADVERSE CONDITIONSMahadeva Prasanna, S. R. / Yegnanarayana, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L9.2: HIGHER ORDER CEPSTRAL MOMENT NORMALIZATION (HOCMN) FOR ROBUST SPEECH RECOGNITIONHsu, C.-w. / Lee, L.-s. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.9: EMPLOYING LAPLACIAN-GAUSSIAN DENSITIES FOR SPEECH ENHANCEMENTGazor, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.10: ROBUST ADAPTIVE KALMAN FILTERING-BASED SPEECH ENHANCEMENT ALGORITHMGabrea, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.10: EIGEN-MLLRS APPLIED TO UNSUPERVISED SPEAKER ENROLLMENT FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITIONAubert, X. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.7: EFFICIENT SPECTRUM CODING FOR SUPER-WIDEBAND SPEECH AND ITS APPLICATION TO 7/10/15 KHZ BANDWIDTH SCALABLE CODERSOshikiri, M. / Ehara, H. / Yoshida, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.7: FRACTIONAL FOURIER TRANSFORM FEATURES FOR SPEECH RECOGNITIONSarikaya, R. / Gao, Y. / Saon, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.1: MINIMUM SEGMENTATION ERROR BASED DISCRIMINATIVE TRAINING FOR SPEECH SYNTHESIS APPLICATIONWu, Y.-J. / Kawai, H. / Ni, J. / Wang, R.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.9: EVALUATION OF THE EFFECT OF STRESS ON FORMANTS IN FARSI VOWELSGharavian, D. / Ahadi, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.14: SCALING OF WAVEFORM SEGMENTS ALONG THE TIME AXIS FOR CONCATENATIVE SPEECH SYNTHESISNishizawa, N. / Kawai, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.8: CROSS-DIALECTAL ACOUSTIC DATA SHARING FOR ARABIC SPEECH RECOGNITIONKirchhoff, K. / Vergyri, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.3: SEGMENTAL TONAL MODELING FOR PHONE SET DESIGN IN MANDARIN LVCSRHuang, C. / Shi, Y. / Zhou, J.-L. / Chu, M. / Wang, T. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.4: DECISION TREE BASED TONE MODELING FOR CHINESE SPEECH RECOGNITIONWong, P.-F. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
Non-parallel training for voice conversion by maximum likelihood constrained adaptationMouchtaris, A. / Van der Spiegel, J. / Mueller, P. et al. | 2004
- I
-
Parameter sharing and minimum classification error training of mixtures of factor analyzers for speaker identificationYamamoto, H. / Nankaku, Y. / Miyajima, C. / Tokuda, K. / Kitamura, T. et al. | 2004
- I
-
Generalized locally recurrent probabilistic neural networks for text-independent speaker verificationGanchev, T. / Fakotakis, N. / Tasoulis, D.K. / Vrahatis, M.N. et al. | 2004
- I
-
Robust speech recognition techniques evaluation for telephony server based in-car applicationsDelphin-Poulat, L. et al. | 2004
- I
-
High-level speaker verification with support vector machinesCampbell, W.M. / Campbell, J.R. / Reynolds, D.A. / Jones, D.A. / Leek, T.R. et al. | 2004
- I
-
Weighted autocorrelation-based F0 estimation for distant-talking interaction with a distributed microphone networkArmani, L. / Omologo, M. et al. | 2004
- I
-
Product of power spectrum and group delay function for speech recognitionDonglai Zhu, / Paliwal, K.K. et al. | 2004
- I
-
Low-complexity predictive trellis coded quantization of wideband speech LSF parametersYongwon Shin, / Sangwon Kang, / Fischer, T.R. / Changyong Son, / Yongbeom Lee, et al. | 2004
- I
-
Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture modelsLindblom, J. / Hedelin, P. et al. | 2004
- I
-
Robust speech recognition using cepstral domain missing data techniques and noisy masksVan Hamme, H. et al. | 2004
- I
-
Noise suppression for automotive applications based on directional informationFuchs, M. / Haulick, T. / Schmidt, G. et al. | 2004
- I
-
Vocabulary-independent search in spontaneous speechSeide, F. / Peng Yu, / Chengyuan Ma, / Chang, E. et al. | 2004
- I
-
Robust adaptive Kalman filtering-based speech enhancement algorithmGabrea, M. et al. | 2004
- I
-
Language boundary detection and identification of mixed-language speech based on MAP estimationChi-Jiun Shia, / Yu-Hsien Chiu, / Jia-Hsin Hsieh, / Chung-Hsien Wu, et al. | 2004
- I
-
Language identification using parallel syllable-like unit recognitionNagarajan, T. / Murthy, H.A. et al. | 2004
- I
-
A multi-pass linear fold algorithm for sentence boundary detection using prosodic cuesDagen Wang, / Narayanan, S.S. et al. | 2004
- I
-
An evaluation of automatic phone segmentation for concatenative speech synthesisKawai, H. / Toda, T. et al. | 2004
- I
-
Estimation of short-term predictor parameters for coding and enhancement of noisy speechSrinivasan, S. / Samuelsson, J. / Kleijn, W.B. et al. | 2004
- I
-
HMM-based frequency bandwidth extension for speech enhancement using line spectral frequenciesChen, G. / Parsa, V. et al. | 2004
- I
-
Optimizing acoustic models for commercial speech recognition using foreground scores and data weightingBoies, D. / Strope, B. / Weintraub, M. / Su-Lin Wu, et al. | 2004
- I
-
Hidden spectral peak trajectory model for phone classificationYiu-Pong LAI, / Man-Hung SIU, et al. | 2004
- I
-
Microphone array post-filter for separation of simultaneous non-stationary sourcesValin, J.M. / Rouat, J. / Michaud, F. et al. | 2004
- I
-
A pitch synchronous feature extraction method for speaker recognitionKim, S. / Eriksson, T. / Hong-Goo Kang, / Dae Hee Youn, et al. | 2004
- I
-
Automatic indexing of key sentences for lecture archives using statistics of presumed discourse markersNanjo, H. / Kitade, T. / Kawahara, T. et al. | 2004
- I
-
Predicting foreground SH, SL and BNH DAM scores for multidimensional objective measure of speech qualitySen, D. et al. | 2004
- I
-
Application of the modified group delay function to speaker identification and discriminationHegde, R.M. / Murthy, H.A. / Rao, G.V.R. et al. | 2004
- I
-
Trapping conversational speech: extending TRAP/tandem approaches to conversational telephone speech recognitionMorgan, N. / Chen, B.Y. / Zhu, Q. / Stolcke, A. et al. | 2004
- I
-
On use of task independent training data in tandem feature extractionSivadas, S. / Hermansky, H. et al. | 2004
- I
-
Hybrid language models for out of vocabulary word detection in large vocabulary conversational speech recognitionYazgan, A. / Saraclar, M. et al. | 2004
- I
-
Generating and evaluating segmentations for automatic speech recognition of conversational telephone speechTranter, S.E. / Yu, K. / Evermann, G. / Woodland, P.C. et al. | 2004
- I
-
Advances in the automatic transcription of lecturesCettolo, M. / Brugnara, F. / Federico, M. et al. | 2004
- I
-
An evaluation of a nonlinear feature transformation for conversational speech recognitionOmar, M.K. / Kingsbury, B. et al. | 2004
- I
-
Extended Baum transformations for general functionsKanevsky, D. et al. | 2004
- I
-
SP-L1.3: HIGH QUALITY VOICE MORPHINGYe, H. / Young, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L1.2: SPEAKING STYLE ADAPTATION USING CONTEXT CLUSTERING DECISION TREE FOR HMM-BASED SPEECH SYNTHESISYamagishi, J. / Tachibana, M. / Masuko, T. / Kobayashi, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.2: USING HAAR TRANSFORMED VOCAL SOURCE INFORMATION FOR AUTOMATIC SPEAKER RECOGNITIONZheng, N. / Ching, P. C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.5: ROBUST SPEECH RECOGNITION TECHNIQUES EVALUATION FOR TELEPHONY SERVER BASED IN-CAR APPLICATIONSDelphin-Poulat, L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.3: THE ETSI EXTENDED DISTRIBUTED SPEECH RECOGNITION (DSR) STANDARDS: CLIENT SIDE PROCESSING AND TONAL LANGUAGE RECOGNITION EVALUATIONSorin, A. / Ramabadran, T. / Chazan, D. / Hoory, R. / McLaughlin, M. / Pearce, D. / Wang, F. / Zhang, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L9.4: PHASE AUTOCORRELATION (PAC) FEATURES IN ENTROPY BASED MULTI-STREAM FOR ROBUST SPEECH RECOGNITIONIkbal, S. / Misra, H. / Bourlard, H. / Hermansky, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.4: VOCABULARY-INDEPENDENT SEARCH IN SPONTANEOUS SPEECHSeide, F. / Yu, P. / Ma, C. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.12: LOW DISTORTION SPEECH DENOISING USING AN ADAPTIVE PARAMETRIC WIENER FILTERFan, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.2: ADAPTIVE TRAINING USING STRUCTURED TRANSFORMSYu, K. / Gales, M. J. F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.7: PRIOR KNOWLEDGE GUIDED MEL BASED MODEL SELECTION AND ADAPTATION FOR NONNATIVE SPEECH RECOGNITIONHe, X. / Zhao, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.9: AN INVESTIGATION INTO FRONT-END SIGNAL PROCESSING FOR SPEAKER NORMALIZATIONUmesh, S. / Sinha, R. / Bharath Kumar, S. V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.10: IMPROVING PHONEME RECOGNITION OF TELEPHONE QUALITY SPEECHHuang, Q. / Cox, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.1: NOISE-DEPENDENT POSTFILTERINGGrancharov, V. / Samuelsson, J. / Kleijn, W. B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.3: ADAPTIVE TIME-SEGMENTATION FOR SPEECH CODING WITH LIMITED DELAYRodbro, C. A. / Jensen, J. / Heusdens, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.5: TOWARDS MULTILINGUAL SPEECH RECOGNITION USING DATA DRIVEN SOURCE/TARGET ACOUSTICAL UNITS ASSOCIATIONBayeh, R. / Lin, S.-S. / Chollet, G. / Mokbel, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.2: A STRUCTURED SPEECH MODEL WITH CONTINUOUS HIDDEN DYNAMICS AND PREDICTION-RESIDUAL TRAINING FOR TRACKING VOCAL TRACT RESONANCESDeng, L. / Lee, L. / Attias, H. / Acero, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.7: SPEECH EMOTION RECOGNITION COMBINING ACOUSTIC FEATURES AND LINGUISTIC INFORMATION IN A HYBRID SUPPORT VECTOR MACHINE - BELIEF NETWORK ARCHITECTURESchuller, B. / Rigoll, G. / Lang, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.3: ANALYSIS BY SYNTHESIS OF ACOUSTIC CORRELATES OF BRITISH, AUSTRALIAN AND AMERICAN ACCENTSYan, Q. / Vaseghi, S. / Rentzos, D. / Ho, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.3: FEATURE SELECTION FOR IMPROVED BANDWIDTH EXTENSION OF SPEECH SIGNALSJax, P. / Vary, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.10: THE 2003 ISL RICH TRANSCRIPTION SYSTEM FOR CONVERSATIONAL TELEPHONY SPEECHSoltau, H. / Yu, H. / Metze, F. / Fugen, C. / Jin, Q. / Jou, S.-C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.4: RAO-BLACKWELLISED GIBBS SAMPLING FOR SWITCHING LINEAR DYNAMICAL SYSTEMSRosti, A.-V. / Gales, M. J. F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.7: EXTENDED BAUM TRANSFORMATIONS FOR GENERAL FUNCTIONSKanevsky, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.12: EXTENDED CLUSTER INFORMATION VECTOR QUANTIZATION (ECI-VQ) FOR ROBUST CLASSIFICATIONArrowood, J. / Clements, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.9: SPEECH ENHANCEMENT BASED ON MULTIPLE DIRECTIVITY PATTERNS USING A MICROPHONE ARRAYSekiya, T. / Kobayashi, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.12: NONLINEAR NOISE COMPENSATION IN FEATURE DOMAIN FOR SPEECH RECOGNITION WITH NUMERICAL METHODSJiang, H. / Wang, Q. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.5: ASYNCHRONOUS HMM WITH APPLICATIONS TO SPEECH RECOGNITIONGarg, A. / Balakrishnan, S. / Vaithyanathan, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.11: AUTOMATIC RECOGNITION OF BLUETOOTH SPEECH IN 802.11 INTERFERENCE AND THE EFFECTIVENESS OF INSERTION-BASED COMPENSATION TECHNIQUESNour-Eldin, A. / Tolba, H. / O'Shaughnessy, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.2: MULTIPLE FRAME BLOCK QUANTISATION OF LINE SPECTRAL FREQUENCIES USING GAUSSIAN MIXTURE MODELSPaliwal, K. K. / So, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.1: EFFECTS OF TRANSCRIPTION ERRORS ON SUPERVISED LEARNING IN SPEECH RECOGNITIONSundaram, R. / Picone, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.3: MPE-BASED DISCRIMINATIVE LINEAR TRANSFORM FOR SPEAKER ADAPTATIONWang, L. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.4: A STUDY OF VARIOUS COMPOSITE KERNELS FOR KERNEL EIGENVOICE SPEAKER ADAPTATIONMak, B. / Kwok, J. / Ho, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.2: A DETECTION BASED APPROACH TO ROBUST SPEECH UNDERSTANDINGWang, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.8: JOINT FREQUENCY DOMAIN AND RECONSTRUCTED PHASE SPACE FEATURES FOR SPEECH RECOGNITIONLindgren, A. / Johnson, M. / Povinelli, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.8: FORMANT FREQUENCY ESTIMATION IN NOISEChen, B. / Loizou, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.2: SPEECH DISCRIMINATION BASED ON MULTISCALE SPECTRO-TEMPORAL MODULATIONSMesgarani, N. / Shamma, S. / Slaney, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.4: VOICE ACTIVITY DETECTION USING VISUAL INFORMATIONLiu, P. / Wang, Z. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.5: SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELSOveisgharan, S. / Shamsollahi, M. B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.6: SOUND FEATURE DETECTION USING LEAKY INTEGRATE-AND-FIRE NEURONSSmith, L. / Fraser, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.7: A REAL-TIME CANTONESE TEXT-TO-AUDIOVISUAL SPEECH SYNTHESIZERWang, J.-Q. / Wong, K.-H. / Heng, P.-A. / Meng, H. M.-L. / Wong, T.-T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.15: SPEECH SYNTHESIS FROM REAL TIME ULTRASOUND IMAGES OF THE TONGUEDenby, B. / Stone, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.12: FILLER MODEL BASED CONFIDENCE MEASURES FOR SPOKEN DIALOGUE SYSTEMS: A CASE STUDY FOR TURKISHAkyol, A. / Erdogan, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.2: BASIS SUPERPOSITION PRECISION MATRIX MODELLING FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITIONSim, K. C. / Gales, M. J. F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.3: JOINT REMOVAL OF ADDITIVE AND CONVOLUTIONAL NOISE WITH MODEL-BASED FEATURE ENHANCEMENTStouten, V. / Van hamme, H. / Wambacq, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.7: ON TRACKING NOISE WITH LINEAR DYNAMICAL SYSTEM MODELSRaj, B. / Singh, R. / Stern, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.8: MITIGATION OF CHANNEL ERRORS IN EFR-BASED SPEECH RECOGNITIONGomez, A. M. / Peinado, A. M. / Sanchez, V. E. / Perez-Cordoba, J. L. / Rubio, A. J. / IEEE Signal Processing Society et al. | 2004
- I
-
Improvement of speaker recognition by combining residual and prosodic features with acoustic featuresShi-Han Chen, / Hsiao-Chuan Wang, et al. | 2004
- I
-
Dimensionality reduction using MCE-optimized LDA transformationXiao-Bing Li, / Jin-Yu Li, / Ren-Hua Wang, et al. | 2004
- I
-
Light supervision in acoustic model trainingLong Nguyen, / Bing Xiang, et al. | 2004
- I
-
Overdetermined blind separation for convolutive mixtures of speech based on multistage ICA using subarray processingNishikawa, T. / Abe, H. / Saruwatari, H. / Shikano, K. et al. | 2004
- I
-
A study of design compromises for speech coders in packet networksLefebvre, R. / Philippe, G.T. / Salami, R. et al. | 2004
- I
-
Improvement issues on transcoding algorithms: for the flexible usage to the various pairs of speech codecJin-Kyu Choi, / Chang-Heon Lee, / Hong-Goo, K. / Young-Cheol Park, / Dae Hee Youn, et al. | 2004
- I
-
Parameterization of the score threshold for a text-dependent adaptive speaker verification systemMirghafori, N. / Hebert, M. et al. | 2004
- I
-
Wideband audio over narrowband low-resolution mediaHeping Ding, et al. | 2004
- I
-
A differential spectral voice activity detectorGarner, P.N. / Fukada, T. / Komori, Y. et al. | 2004
- I
-
Scaling of waveform segments along the time axis for concatenative speech synthesisNishizawa, N. / Kawai, H. et al. | 2004
- I
-
Sequential clustering algorithm for Gaussian mixture initializationMessina, R. / Jouvet, D. et al. | 2004
- I
-
An analysis of interleavers for robust speech recognition in burst-like packet lossJames, A.B. / Milner, B.P. et al. | 2004
- I
-
A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channelDeoras, A.N. / Hasegawa-Johnson, M. et al. | 2004
- I
-
PCMM-based feature compensation schemes using model interpolation and mixture sharingWooil Kim, / Ohil Kwon, / Hanseok Ko, et al. | 2004
- I
-
Asynchronous HMM with applications to speech recognitionGarg, A. / Balakrishnan, S. / Vaithyanathan, S. et al. | 2004
- I
-
SP-L4.4: APPLYING ARTICULATORY FEATURES TO TELEPHONE-BASED SPEAKER VERIFICATIONLeung, K.-Y. / Mak, M.-W. / Kung, S.-Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.4: ON SPLIT QUANTIZATION OF LSF PARAMETERSNorden, F. / Eriksson, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.2: COMBINATION OF HIDDEN MARKOV MODELS WITH DYNAMIC TIME WARPING FOR SPEECH RECOGNITIONAxelrod, S. / Maison, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.3: JOINT DECODING FOR PHONEME-GRAPHEME CONTINUOUS SPEECH RECOGNITIONMagimai-Doss, M. / Bengio, S. / Bourlard, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.5: LIGHT SUPERVISION IN ACOUSTIC MODEL TRAININGNguyen, L. / Xiang, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.2: EXACT TRAINING OF A NEURAL SYNTACTIC LANGUAGE MODELEmami, A. / Jelinek, F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.8: ON THE DECISION-DIRECTED ESTIMATION APPROACH OF EPHRAIM AND MALAHCohen, I. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.5: FEATURE SPACE GAUSSIANIZATIONSaon, G. / Dharanipragada, S. / Povey, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.12: EIGENSPACE-BASED MLLR WITH SPEAKER ADAPTIVE TRAINING IN LARGE VOCABULARY CONVERSATIONAL SPEECH RECOGNITIONDoumpiotis, V. / Deng, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.4: COMBINED ESTIMATION/CODING OF HIGHBAND SPECTRAL ENVELOPES FOR SPEECH SPECTRUM EXPANSIONAgiomyrgiannakis, Y. / Stylianou, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.5: AUTOMATICALLY DERIVED UNITS FOR SEGMENT VOCODERSRamasubramanian, V. / Sreenivas, T. V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.1: BAYESIAN MODELLING OF THE SPEECH SPECTRUM USING MIXTURE OF GAUSSIANSZolfaghari, P. / Watanabe, S. / Nakamura, A. / Katagiri, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.4: FORMANT TRACKING BY MIXTURE STATE PARTICLE FILTERZheng, Y. / Hasegawa-Johnson, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.4: REFINING SEGMENTAL BOUNDARIES FOR TTS DATABASE USING FINE CONTEXTUAL-DEPENDENT BOUNDARY MODELSWang, L. / Zhao, Y. / Chu, M. / Zhou, J.-L. / Cao, Z. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.13: AN EVALUATION OF AUTOMATIC PHONE SEGMENTATION FOR CONCATENATIVE SPEECH SYNTHESISKawai, H. / Toda, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.1: SPHERICAL HARMONIC ANALYSIS OF EQUALIZATION IN A REVERBERANT ROOMBetlehem, T. / Abhayapala, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.4: AUTOMATED LIP-READING FOR IMPROVED SPEECH INTELLIGIBILITYMcClain, M. / Brady, K. / Brandstein, M. / Quatieri, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.14: IMPROVED NAME RECOGNITION WITH META-DATA DEPENDENT NAME NETWORKSMaskey, S. / Bacchiani, M. / Roark, B. / Sproat, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.11: LIGHTLY SUPERVISED AND DATA-DRIVEN APPROACHES TO MANDARIN BROADCAST NEWS TRANSCRIPTIONChen, B. / Kuo, J.-W. / Tsai, W.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.8: VOICING FEATURE INTEGRATION IN SRI'S DECIPHER LVCSR SYSTEMGraciarena, M. / Franco, H. / Zheng, J. / Vergyri, D. / Stolcke, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.11: TONE VARIATION MODELING FOR FLUENT MANDARIN TONE RECOGNITION BASED ON CLUSTERINGLin, W.-Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.9: PARSING SPEECH INTO ARTICULATORY EVENTSHacioglu, K. / Pellom, B. / Ward, W. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.2: ASSESSMENT OF SIGNAL SUBSPACE BASED SPEECH ENHANCEMENT FOR NOISE ROBUST SPEECH RECOGNITIONHermus, K. / Wambacq, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.7: MODELING SUB-BAND CORRELATION FOR NOISE-ROBUST SPEECH RECOGNITIONMcAuley, J. / Ming, J. / Hanna, P. / Stewart, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.4: BAYESIAN DURATION MODELING AND LEARNING FOR SPEECH RECOGNITIONChien, J.-T. / Huang, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
High quality voice morphingHui Ye, / Young, S. et al. | 2004
- I
-
Voice characteristics conversion for TTS using reverse VTLNEichner, M. / Wolff, M. / Hoffmann, R. et al. | 2004
- I
-
Discovering relations among discriminative training objectives [speak recognition applications]Qi Li, et al. | 2004
- I
-
Disentangling speaker and channel effects in speaker verificationKenny, P. / Dumouchel, P. et al. | 2004
- I
-
The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstructionRamabadran, T. / Sorin, A. / McLaughlin, M. / Chazan, D. / Pearce, D. / Hoory, R. et al. | 2004
- I
-
Using Haar transformed vocal source information for automatic speaker recognitionNengheng Zheng, / Ching, P.C. et al. | 2004
- I
-
Multiple frame block quantisation of line spectral frequencies using Gaussian mixture modelsPaliwal, K.K. / So, S. et al. | 2004
- I
-
On split quantization of LSF parametersNorden, F. / Eriksson, T. et al. | 2004
- I
-
On the decision-directed estimation approach of Ephraim and MalahCohen, I. et al. | 2004
- I
-
Adaptive time-segmentation for speech coding with limited delayRodbro, C.A. / Jensen, J. / Heusdens, R. et al. | 2004
- I
-
Closed-form estimation of the amplitude commands in the automatic extraction of the Fujisaki's modelSilva, S.D.S. / Netto, S.L. et al. | 2004
- I
-
A real-time Cantonese text-to-audiovisual speech synthesizerJian-Qing Wang, / Ka-Ho Wong, / Pheng-Ann Heng, / Meng, H.M. / Tien-Tsin Wong, et al. | 2004
- I
-
Modeling pronunciation variation for spontaneous speech synthesisWerner, S. / Wolff, M. / Eichner, M. / Hoffmann, R. et al. | 2004
- I
-
Basis superposition precision matrix modelling for large vocabulary continuous speech recognitionSim, K.C. / Gales, M.J.F. et al. | 2004
- I
-
Voicing feature integration in SRI's decipher LVCSR systemGraciarena, M. / Franco, H. / Jing Zheng, / Vergyri, D. / Stolcke, A. et al. | 2004
- I
-
Chinese-English bilingual phone modeling for cross-language speech recognitionShengmin Yu, / Shuwu Zhang, / Bo Xu, et al. | 2004
- I
-
Prosody-based recognition of spoken German varietiesDizdarevic, V. / Hagmuller, M. / Kubin, G. / Pernkopf, F. / Baum, M. et al. | 2004
- I
-
Assessment of signal subspace based speech enhancement for noise robust speech recognitionHermus, K. / Wambacq, P. et al. | 2004
- I
-
DBN based multi-stream models for audio-visual speech recognitionGowdy, J.N. / Subramanya, A. / Bartels, C. / Bilmes, J. et al. | 2004
- I
-
Modeling sub-band correlation for noise-robust speech recognitionMcAuley, J. / Ji Ming, / Hanna, P. / Stewart, D. et al. | 2004
- I
-
SP-L1.5: VOICE CHARACTERISTICS CONVERSION FOR TTS USING REVERSE VTLNEichner, M. / Wolff, M. / Hoffmann, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.4: A COMPLEXITY REDUCTION OF ETSI ADVANCED FRONT-END FOR DSRLi, J.-Y. / Liu, B. / Wang, R.-H. / Dai, L.-R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.5: SPEAKER IDENTIFICATION USING SUPRA-SEGMENTAL PITCH PATTERN DYNAMICSFarahani, F. / Georgiou, P. / Narayanan, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.2: PRODUCT OF POWER SPECTRUM AND GROUP DELAY FUNCTION FOR SPEECH RECOGNITIONZhu, D. / Paliwal, K. K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.4: A LOCALLY, WEIGHTED DISTANCE MEASURE FOR EXAMPLE BASED SPEECH RECOGNITIONDe Wachter, M. / Demuynck, K. / Wambacq, P. / Van Compernolle, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L9.6: ROBUST SPEECH RECOGNITION USING CEPSTRAL DOMAIN MISSING DATA TECHNIQUES AND NOISY MASKSVan hamme, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L9.5: CEPSTRAL GAIN NORMALIZATION FOR NOISE ROBUST SPEECH RECOGNITIONYoshizawa, S. / Hayasaka, N. / Wada, N. / Miyanaga, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.5: AUTOMATIC LEARNING OF INTERPRETATION STRATEGIES FOR SPOKEN DIALOGUE SYSTEMSRaymond, C. / Bechet, F. / De Mori, R. / Damnati, G. / Esteve, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.6: UNSUPERVISED AND ACTIVE LEARNING IN AUTOMATIC SPEECH RECOGNITION FOR CALL CLASSIFICATIONHakkani-Tur, D. / Tur, G. / Rahim, M. / Riccardi, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.12: SPEECH-ACTIVATED TEXT RETRIEVAL SYSTEM FOR MULTIMODAL CELLULAR PHONESIshikawa, S.-y. / Ikeda, T. / Miki, K. / Adachi, F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.8: ENHANCED STANDARD COMPLIANT DISTRIBUTED SPEECH RECOGNITION (AURORA ENCODER) USING RATE ALLOCATIONSrinivasamurthy, N. / Ortega, A. / Narayanan, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.9: TRAPPING CONVERSATIONAL SPEECH: EXTENDING TRAP/TANDEM APPROACHES TO CONVERSATIONAL TELEPHONE SPEECH RECOGNITIONMorgan, N. / Chen, B. / Zhu, Q. / Stolcke, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.3: AN ESTIMATE OF PHYSICAL SCALE FROM SPEECHSmith, L. / Nelson, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.2: WATERMARKING OF SPEECH SIGNALS USING THE SINUSOIDAL MODEL AND FREQUENCY MODULATION OF THE PARTIALSGirin, L. / Marchand, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.5: A LOW-BAND SPECTRUM ENVELOPE MODELING FOR HIGH QUALITY PITCH MODIFICATIONMochizuki, R. / Kobayashi, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.8: OPTIMIZING SUB-COST FUNCTIONS FOR SEGMENT SELECTION BASED ON PERCEPTUAL EVALUATIONS IN CONCATENATIVE SPEECH SYNTHESISToda, T. / Kawai, H. / Tsuzaki, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.9: SPEECH ENHANCEMENT USING ROBUST WEIGHTING FACTORS FOR CRITICAL-BAND-WAVELET-PACKET TRANSFORMLu, C.-T. / Wang, H.-C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.4: CORRECTIVE LANGUAGE MODELING FOR LARGE VOCABULARY ASR WITH THE PERCEPTRON ALGORITHMRoark, B. / Saraclar, M. / Collins, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.7: A GENERALIZED CONSTRUCTION OF INTEGRATED SPEECH RECOGNITION TRANSDUCERSAllauzen, C. / Mohri, M. / Riley, M. / Roark, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.3: AUTOMATIC GENERATION OF NON-UNIFORM HMM STRUCTURES BASED ON VARIATIONAL BAYESIAN APPROACHJitsuhiro, T. / Nakamura, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.6: INVESTIGATIONS INTO THE RELATIONSHIP BETWEEN MEASURABLE SPEECH QUALITY AND SPEECH RECOGNITION RATE FOR TELEPHONY SPEECHSun, H. / Shue, L. / Chen, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.11: FUSION BASED SPEECH SEGMENTATION IN DARPA SPINE2 TASKZheng, C. / Yan, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.6: A STUDY ON ROBUST SEGMENTATION AND LOCATION OF TONE NUCLEI IN CHINESE CONTINUOUS SPEECHZhang, J. / Hirose, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.1: ROBUST SPEECH RECOGNITION IN ADDITIVE AND CHANNEL NOISE ENVIRONMENTS USING GMM AND EM ALGORITHMFujimoto, M. / Ariki, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.8: COMBINING FEATURE COMPENSATION AND WEIGHTED VITERBI DECODING FOR NOISE ROBUST SPEECH RECOGNITION WITH LIMITED ADAPTATION DATACui, X. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.11: A TREE-STRUCTURED CLUSTERING METHOD INTEGRATING NOISE AND SNR FOR PIECEWISE LINEAR-TRANSFORMATION-BASED NOISE ADAPTATIONZhang, Z. / Sugimura, T. / Furui, S. / IEEE Signal Processing Society et al. | 2004
- I
-
Discrimination power weighted subword-based speaker verificationSiu-Man Chan, / Man-Hung Siu, et al. | 2004
- I
-
Applying articulatory features to telephone-based speaker verificationKa-Yee Leung, / Man-Wai Mak, / Sun-Yuan Kung, et al. | 2004
- I
-
Pitch prediction from MFCC vectors for speech reconstructionXu Shao, / Milner, B. et al. | 2004
- I
-
Joint decoding for phoneme-grapheme continuous speech recognitionMagimai-Doss, M. / Bengio, S. / Bourlard, H. et al. | 2004
- I
-
Spectral entropy based feature for robust ASRMisra, H. / Ikbal, S. / Bourlard, H. / Hermansky, H. et al. | 2004
- I
-
A multiple description speech coder based on AMR-WB for mobile ad hoc networksDong, H. / Gersho, A. / Gibson, J.D. / Cuperman, V. et al. | 2004
- I
-
A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standardSung-Kyo Jung, / Kyung-Tae Kim, / Hong-Goo Kang, et al. | 2004
- I
-
Employing Laplacian-Gaussian densities for speech enhancementGazor, S. et al. | 2004
- I
-
Online speaker clusteringLiu, D. / Kubala, F. et al. | 2004
- I
-
Robust multimodal understandingBangalore, S. / Johnston, M. et al. | 2004
- I
-
A distributed framework for enterprise level speech recognition servicesArizmendi, I. / Rose, R.C. et al. | 2004
- I
-
Speech-activated text retrieval system for multimodal cellular phonesIshikawa, S.Y. / Ikeda, T. / Miki, K. / Adachi, F. / Isotani, R. / Iso, K.I. / Okumura, A. et al. | 2004
- I
-
Enhanced standard compliant distributed speech recognition (Aurora encoder) using rate allocationSrinivasamurthy, N. / Ortega, A. / Narayanan, S. et al. | 2004
- I
-
Variational Bayesian feature selection for Gaussian mixture modelsValente, F. / Wellekens, C. et al. | 2004
- I
-
Joint frequency domain and reconstructed phase space features for speech recognitionLindgren, A.C. / Johnson, M.T. / Povinelli, R.J. et al. | 2004
- I
-
Refining segmental boundaries for TTS database using fine contextual-dependent boundary modelsLijuan Wang, / Yong Zhao, / Min Chu, / Jianlai Zhou, / Zhigang Cao, et al. | 2004
- I
-
Evaluation of the effect of stress on formants in Farsi vowelsGharavian, D. / Ahadi, S.M. et al. | 2004
- I
-
Improving broadcast news transcription by lightly supervised discriminative trainingChan, H.Y. / Woodland, P. et al. | 2004
- I
-
The 2003 ISL rich transcription system for conversational telephony speechSoltau, H. / Hua Yu, / Metze, F. / Fugen, C. / Qin Jin, / Szu-Chen Jou, et al. | 2004
- I
-
A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation-based noise adaptationZhang, Z. / Sugimura, T. / Furui, S. et al. | 2004
- I
-
Spatio-temporal processing for distant speech recognitionSiow Yong Low, / Togneri, R. / Nordholm, S. et al. | 2004
- I
-
Sensitivity analysis of noise robustness methodsBrayda, L. / Rigazio, L. / Boman, R. / Junqua, J.C. et al. | 2004
- I
-
Can back-ends be more robust than front-ends? Investigation over the Aurora-2 databaseBernard, A. / Yifan Gong, / Xiaodong Cui, et al. | 2004
- I
-
SP-L9.3: ROBUSTNESS OF SPEECH RECOGNITION USING GENETIC ALGORITHMS AND A MEL-CEPSTRAL SUBSPACE APPROACHSelouani, S.-A. / O'Shaughnessy, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.1: OPTIMAL BLIND SEPARATION OF CONVOLUTIVE AUDIO MIXTURES WITHOUT TEMPORAL CONSTRAINTSKokkinakis, K. / Nandi, A. K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.5: ON THE ARCHITECTURE OF THE CDMA2000® VARIABLE-RATE MULTIMODE WIDEBAND (VMR-WB) SPEECH CODING STANDARDJelinek, M. / Salami, R. / Ahmadi, S. / Bessette, B. / Gournay, P. / Laflamme, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.7: A TWO-STEP NOISE REDUCTION TECHNIQUEPlapous, C. / Marro, C. / Mauuary, L. / Scalart, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.11: A NOISE ESTIMATION ALGORITHM WITH RAPID ADAPTATION FOR HIGHLY NON-STATIONARY ENVIRONMENTSRangachari, S. / Loizou, P. / Hu, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.1: PERFORMANCE COMPARISONS OF ALL-PASS TRANSFORM ADAPTATION WITH MAXIMUM LIKELIHOOD LINEAR REGRESSIONMcDonough, J. / Waibel, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.9: IDENTIFYING IN-SET AND OUT-OF-SET SPEAKERS USING NEIGHBORHOOD INFORMATIONAngkititrakul, P. / Hansen, J. H. L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.8: EXTENDING BOOSTING FOR CALL CLASSIFICATION USING WORD CONFUSION NETWORKSTur, G. / Hakkani-Tur, D. / Riccardi, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.2: A DATA MINING APPROACH TO OBJECTIVE SPEECH QUALITY MEASUREMENTZha, W. / Chan, W.-Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.1: A MODEL-BASED TONE LABELING METHOD FOR MIN-NAN/TAIWANESE SPEECHKuo, W.-C. / Wang, Y.-R. / Chen, S.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.2: AN AUTOMATIC PROSODY LABELING SYSTEM USING ANN-BASED SYNTACTIC-PROSODIC MODEL AND GMM-BASED ACOUSTIC-PROSODIC MODELChen, K. / Hasegawa-Johnson, M. / Cohen, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.3: VARIATIONAL BAYESIAN FEATURE SELECTION FOR GAUSSIAN MIXTURE MODELSValente, F. / Wellekens, C. J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.3: CLUSTERING AND SEGMENTING SPEAKERS AND THEIR LOCATIONS IN MEETINGSAjmera, J. / McCowan, I. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.6: OPTIMIZING ACOUSTIC MODELS FOR COMMERCIAL SPEECH RECOGNITION USING FOREGROUND SCORES AND DATA WEIGHTINGBoies, D. / Strope, B. / Weintraub, M. / Wu, S.-L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.5: AUTOMATIC DETERMINATION OF ACOUSTIC MODEL TOPOLOGY USING VARIATIONAL BAYESIAN ESTIMATION AND CLUSTERINGWatanabe, S. / Sako, A. / Nakamura, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.1: CODEBOOK DESIGN FOR ASR SYSTEMS USING CUSTOM ARITHMETIC UNITSLi, X. / Malkin, J. / Bilmes, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.10: PROSODY-BASED RECOGNITION OF SPOKEN GERMAN VARIETIESDizdarevic, V. / Hagmuller, M. / Kubin, G. / Pernkopf, F. / Baum, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.4: NOISE ROBUST SPEECH RECOGNITION WITH A SWITCHING LINEAR DYNAMIC MODELDroppo, J. / Acero, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.10: MINIMUM MEAN SQUARE ERROR FILTERING OF NOISY CEPSTRAL COEFFICIENTS WITH APPLICATIONS TO ASRMyrvoll, T. A. / Nakamura, S. / IEEE Signal Processing Society et al. | 2004
- I
-
Robust speech feature extraction by growth transformation in reproducing kernel Hilbert spaceChakrabartty, S. / Yunbin Deng, / Cauwenberghs, G. et al. | 2004
- I
-
Phase autocorrelation (PAC) features in entropy based multi-stream for robust speech recognitionIkbal, S. / Misra, H. / Bourlard, H. / Hermansky, H. et al. | 2004
- I
-
Multiple-microphone time-varying filters for robust speech recognitionCalvin Yiu-Kit Lai, / Aarabi, P. et al. | 2004
- I
-
A scalable speech and audio coding scheme with continuous bitrate flexibilityKovesi, B. / Massaloux, D. / Sollaud, A. et al. | 2004
- I
-
A study of various composite kernels for kernel eigenvoice speaker adaptationMak, B. / Kwok, J.T. / Ho, S. et al. | 2004
- I
-
Eigen-MLLRs applied to unsupervised speaker enrollment for large vocabulary continuous speech recognitionAubert, X.L. et al. | 2004
- I
-
Unsupervised and active learning in automatic speech recognition for call classificationHakkani-Tur, D. / Tur, G. / Rahim, M. / Riccardi, G. et al. | 2004
- I
-
A model-based tone labeling method for Min-Nan/Taiwanese speechWei-Chih Kuo, / Yih-Ru Wang, / Sin-Horng Chen, et al. | 2004
- I
-
Feature generation based on maximum normalized acoustic likelihood for improved speech recognitionXiang Li, / Stern, R.M. et al. | 2004
- I
-
Acoustic analysis of friendly speechFangxin Chen, / Aijun Li, / Haibo Wang, / Tianqing Wang, / Qiang Fang, et al. | 2004
- I
-
Yet another acoustic representation of speech soundsMinematsu, N. et al. | 2004
- I
-
Estimating vocal-tract area functions from vowel sound signals over closed glottal phasesHuiqun Deng, / Ward, R.K. / Beddoes, M.P. / Hodgson, M. et al. | 2004
- I
-
A voice activity detector using the chi-square testAhmed, B. / Holmes, P.H. et al. | 2004
- I
-
Perceptual Kalman filtering for speech enhancement in colored noiseNing Ma, / Bouchard, M. / Goubran, R.A. et al. | 2004
- I
-
New speech harmonic structure measure and its application to post speech enhancementAn-Tze Yu, / Hsiao-chuan Wang, et al. | 2004
- I
-
Model complexity control and compression using discriminative growth functionsLiu, X. / Gales, M.J.F. et al. | 2004
- I
-
Robust speech recognition in additive and channel noise environments using GMM and EM algorithmFujimoto, M. / Ariki, Y. et al. | 2004
- I
-
Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation dataXiaodong Cui, / Alwan, A. et al. | 2004
- I
-
SP-L1.1: NON-PARALLEL TRAINING FOR VOICE CONVERSION BY MAXIMUM LIKELIHOOD CONSTRAINED ADAPTATIONMouchtaris, A. / Van der Spiegel, J. / Mueller, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.2: PARAMETER SHARING AND MINIMUM CLASSIFICATION ERROR TRAINING OF MIXTURES OF FACTOR ANALYZERS FOR SPEAKER IDENTIFICATIONYamamoto, H. / Nankaku, Y. / Miyajima, C. / Tokuda, K. / Kitamura, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.3: DISCOVERING RELATIONS AMONG DISCRIMINATIVE TRAINING OBJECTIVESLi, Q. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.5: DIMENSIONALITY REDUCTION USING MCE-OPTIMIZED LDA TRANSFORMATIONLi, X.-B. / Li, J.-Y. / Wang, R.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.6: THE USE OF A LINGUISTICALLY MOTIVATED LANGUAGE MODEL IN CONVERSATIONAL SPEECH RECOGNITIONWang, W. / Stolcke, A. / Harper, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.4: A MULTIPLE DESCRIPTION SPEECH CODER BASED ON AMR-WB FOR MOBILE AD HOC NETWORKSDong, H. / Gersho, A. / Gibson, J. / Cuperman, V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.8: CONFIDENCE MEASURES IN MULTIPLE PRONUNCIATIONS MODELING FOR SPEAKER VERIFICATIONBenZeghiba, M. F. / Bourlard, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.10: BENEFITS OF PRIOR ACOUSTIC SEGMENTATION FOR AUTOMATIC SPEAKER SEGMENTATIONMeignier, S. / Moraru, D. / Fredouille, C. / Besacier, L. / Bonastre, J.-F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.11: NOISE REDUCTION ON SPEECH CODEC PARAMETERSTaddei, H. / Beaugeant, C. / de Meuleneire, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.10: PREDICTING FOREGROUND SH, SL AND BNH DAM SCORES FOR MULTIDIMENSIONAL OBJECTIVE MEASURE OF SPEECH QUALITYSen, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.4: APPLICATION OF THE MODIFIED GROUP DELAY FUNCTION TO SPEAKER IDENTIFICATION AND DISCRIMINATIONHegde, R. / Murthy, H. / Gadde, V. R. R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.5: ACOUSTIC ANALYSIS OF FRIENDLY SPEECHChen, F. / Li, A. / Wang, H. / Wang, T. / Fang, Q. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.12: MODELING PRONUNCIATION VARIATION FOR SPONTANEOUS SPEECH SYNTHESISWerner, S. / Wolff, M. / Eichner, M. / Hoffmann, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.15: REAL-TIME WORD CONFIDENCE SCORING USING LOCAL POSTERIOR PROBABILITIES ON TREE TRELLIS SEARCHLee, A. / Shikano, K. / Kawahara, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.8: EXPERIMENTS IN KEYPAD-AIDED SPELLING RECOGNITIONParthasarathy, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.5: A MODIFIED EPHRAIM-MALAH NOISE SUPPRESSION RULE FOR AUTOMATIC SPEECH RECOGNITIONGemello, R. / Mana, F. / De Mori, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.13: PCMM-BASED FEATURE COMPENSATION SCHEMES USING MODEL INTERPOLATION AND MIXTURE SHARINGKim, W. / Kwon, O. / Ko, H. / IEEE Signal Processing Society et al. | 2004
- I
-
Speaker identification using supra-segmental pitch pattern dynamicsFarahani, F. / Georgiou, P.G. / Narayanan, S.S. et al. | 2004
- I
-
Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approachSelouani, S.A. / O'Shaughnessy, D. et al. | 2004
- I
-
Cepstral gain normalization for noise robust speech recognitionYoshizawa, S. / Hayasaka, N. / Wada, N. / Miyanaga, Y. et al. | 2004
- I
-
Development of the 2003 CU-HTK conversational telephone speech transcription systemEvermann, G. / Chan, H.Y. / Gales, M.J.F. / Hain, T. / Liu, X. / Mrva, D. / Wang, L. / Woodland, P.C. et al. | 2004
- I
-
On the architecture of the cdma2000® variable-rate multimode wideband (VMR-WB) speech coding standardJelinek, M. / Salami, R. / Ahmadi, S. / Bessette, B. / Gournay, P. / Laflamme, C. et al. | 2004
- I
-
An investigation into front-end signal processing for speaker normalizationUmesh, S. / Sinha, R. / Kumar, S.V.B. et al. | 2004
- I
-
Eigenspace-based MLLR with speaker adaptive training in large vocabulary conversational speech recognitionDoumpiotis, V. / Yonggang Deng, et al. | 2004
- I
-
Bootstrap estimates for confidence intervals in ASR performance evaluationBisani, M. / Ney, H. et al. | 2004
- I
-
Noise-dependent postfilteringGrancharov, V. / Samuelsson, J. / Kleijn, W.B. et al. | 2004
- I
-
Combined estimation/coding of highband spectral envelopes for speech spectrum expansionAgiomyrgiannakis, Y. / Stylianou, Y. et al. | 2004
- I
-
Multisensor MELPe using parameter substitutionBrady, K. / Quatieri, T.F. / Campbell, J.P. / Campbell, W.M. / Brandstein, M. / Weinstein, C.J. et al. | 2004
- I
-
Noise reduction on speech codec parametersTaddei, H. / Beaugeant, C. / de Meuleneire, M. et al. | 2004
- I
-
Sound feature detection using leaky integrate-and-fire neuronsSmith, L.S. / Fraser, D.S. et al. | 2004
- I
-
Minimum segmentation error based discriminative training for speech synthesis applicationYi-Jian Wu, / Hisashi Kawai, / Jinfu Ni, / Ren-Hua Wang, et al. | 2004
- I
-
Probability based prosody model for unit selectionXijun Ma, / Wei Zhang, / Weibin Zhu, / Qin Shi, / Ling Jin, et al. | 2004
- I
-
A strategy to solve data scarcity problems in corpus based intonation modellingCardenoso, V. / Escudero, D. et al. | 2004
- I
-
Speech synthesis from real time ultrasound images of the tongueDenby, B. / Stone, M. et al. | 2004
- I
-
Speech enhancement by perceptual filter with sequential noise parameter estimationTe-Won Lee, / Kaisheng Yao, et al. | 2004
- I
-
Speech enhancement with missing data techniques using recurrent neural networksParveen, S. / Green, P. et al. | 2004
- I
-
A generalized construction of integrated speech recognition transducersAllauzen, C. / Mohri, M. / Riley, M. / Roark, B. et al. | 2004
- I
-
A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMsTamura, S. / Iwano, K. / Furui, S. et al. | 2004
- I
-
Speech enhancement based on multiple directivity patterns using a microphone arraySekiya, T. / Kobayashi, T. et al. | 2004
- I
-
Noise robust speech recognition with a switching linear dynamic modelDroppo, J. / Acero, A. et al. | 2004
- I
-
Minimum mean square error filtering of noisy cepstral coefficients with applications to ASRMyrvoll, T.A. / Nakamura, S. et al. | 2004
- I
-
Lightly supervised acoustic model training using consensus networksLangzhou Chen, / Lamel, L. / Gauvain, J.L. et al. | 2004
- I
-
Performance comparisons of all-pass transform adaptation with maximum likelihood linear regressionMcDonough, J. / Waibel, A. et al. | 2004
- I
-
Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognitionXiaodong He, / Yunxin Zhao, et al. | 2004
- I
-
Identifying in-set and out-of-set speakers using neighborhood informationAngkititrakul, P. / Hansen, J.H.L. et al. | 2004
- I
-
A data mining approach to objective speech quality measurementWei Zha, / Wai-Yip Chan, et al. | 2004
- I
-
Efficient spectrum coding for super-wideband speech and its application to 7/10/15 kHz bandwidth scalable codersOshikiri, M. / Ehara, H. / Yoshida, K. et al. | 2004
- I
-
Towards multilingual speech recognition using data driven source/target acoustical units associationBayeh, R. / Lin, S. / Chollet, G. / Mokbel, C. et al. | 2004
- I
-
Formant frequency estimation in noiseBin Chen, / Loizou, P.C. et al. | 2004
- I
-
Watermarking of speech signals using the sinusoidal model and frequency modulation of the partialsGirin, L. / Marchand, S. et al. | 2004
- I
-
Automated lip-reading for improved speech intelligibilityMcClain, M. / Brady, K. / Brandstein, M. / Quatieri, T. et al. | 2004
- I
-
Out-of-domain detection based on confidence measures from multiple topic classificationLane, I.R. / Kawahara, T. / Matsui, T. / Nakamura, S. et al. | 2004
- I
-
Cross-dialectal acoustic data sharing for Arabic speech recognitionKirchhoff, K. / Vergyri, D. et al. | 2004
- I
-
Filler model based confidence measures for spoken dialogue systems: a case study for TurkishAkyol, A. / Erdogan, H. et al. | 2004
- I
-
Rao-Blackwellised Gibbs sampling for switching linear dynamical systemsRosti, A.V.I. / Gales, M.J.F. et al. | 2004
- I
-
Training for polynomial segment model using the expectation maximization algorithmChak-Fai Li, / Man-Hung Siu, et al. | 2004
- I
-
Acoustic model adaptation using first order prediction for reverberant speechTakiguchi, T. / Nishimura, M. et al. | 2004
- I
-
On tracking noise with linear dynamical system modelsRaj, B. / Singh, R. / Stern, R. et al. | 2004
- I
-
Nonlinear noise compensation in feature domain for speech recognition with numerical methodsHui Jiang, / Qi Wang, et al. | 2004
- I
-
Tone articulation modeling for Mandarin spontaneous speech recognitionJian-lai Zhou, / Ye Tian, / Yu Shi, / Chao Huang, / Chang, E. et al. | 2004
- I
-
SP-L1.4: ALGORITHM AMALGAM: MORPHING WAVEFORM BASED METHODS, SINUSOIDAL MODELS AND STRAIGHTKawahara, H. / Banno, H. / Irino, T. / Zolfaghari, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.5: GENERALIZED LOCALLY RECURRENT PROBABILISTIC NEURAL NETWORKS FOR TEXT-INDEPENDENT SPEAKER VERIFICATIONGanchev, T. / Fakotakis, N. / Tasoulis, D. / Vrahatis, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.6: DISCRIMINATION POWER WEIGHTED SUBWORD-BASED SPEAKER VERIFICATIONChan, S.-M. / Si, M.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.1: SOFT DECODING STRATEGIES FOR DISTRIBUTED SPEECH RECOGNITION OVER IP NETWORKSCardenal-Lopez, A. / Docio-Fernandez, L. / Garcia-Mateo, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.6: A NOVEL METHOD FOR COMPUTATION OF PERIODICITY, APERIODICITY AND PITCH OF SPEECH SIGNALSDeshmukh, O. / Singh, J. / Espy-Wilson, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.1: LOW-COMPLEXITY PREDICTIVE TRELLIS CODED QUANTIZATION OF WIDEBAND SPEECH LSF PARAMETERSShin, Y. / Kang, S. / Fischer, T. R. / Son, C. / Lee, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.4: SPEECH ENHANCEMENT BASED ON A COMBINED MULTI-CHANNEL ARRAY WITH CONSTRAINED ITERATIVE AND AUDITORY MASKED PROCESSINGZhang, X. / Hansen, J. H. L. / Arehart, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.5: ENHANCEMENT OF MISMATCHED CONDITIONS IN SPEAKER RECOGNITION FOR MULTIMEDIA APPLICATIONSFakhr, W. / Abdelsalam, A. / Hamdy, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.12: A PITCH SYNCHRONOUS FEATURE EXTRACTION METHOD FOR SPEAKER RECOGNITIONKim, S. / Eriksson, T. / Kang, H.-G. / Youn, D. H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.3: ROBUST MULTIMODAL UNDERSTANDINGBangalore, S. / Johnston, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.9: WIDEBAND AUDIO OVER NARROWBAND LOW-RESOLUTION MEDIADing, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.12: ENTROPY-BASED VARIABLE FRAME RATE ANALYSIS OF SPEECH SIGNALS AND ITS APPLICATION TO ASRYou, H. / Zhu, Q. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.6: IMPORTANCE OF WINDOW SHAPE FOR PHASE-ONLY RECONSTRUCTION OF SPEECHAlsteris, L. / Paliwal, K. K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.7: CLOSED-FORM ESTIMATION OF THE AMPLITUDE COMMANDS IN THE AUTOMATIC EXTRACTION OF FUJISAKI'S MODELSilva, S. / Netto, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.5: ESTIMATION OF SHORT-TERM PREDICTOR PARAMETERS FOR CODING AND ENHANCEMENT OF NOISY SPEECHSrinivasan, S. / Samuelsson, J. / Kleijn, W. B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.12: SPEECH ENHANCEMENT WITH MISSING DATA TECHNIQUES USING RECURRENT NEURAL NETWORKSParveen, S. / Green, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.5: GENERATING AND EVALUATING SEGMENTATIONS FOR AUTOMATIC SPEECH RECOGNITION OF CONVERSATIONAL TELEPHONE SPEECHTranter, S. / Yu, K. / Evermann, G. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.12: TRAINING FOR POLYNOMIAL SEGMENT MODEL USING THE EXPECTATION MAXIMIZATION ALGORITHMLi, C.-F. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.6: UNIVERSAL COMPENSATION - AN APPROACH TO NOISY SPEECH RECOGNITION ASSUMING NO KNOWLEDGE OF NOISEMing, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.9: SNR-DEPENDENT NON-UNIFORM SPECTRAL COMPRESSION FOR NOISY SPEECH RECOGNITIONChu, K.-k. / Leung, S. H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.2: TONE ARTICULATION MODELING FOR MANDARIN SPONTANEOUS SPEECH RECOGNITIONZhou, J.-L. / Tian, Y. / Shi, Y. / Huang, C. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.3: SPATIO-TEMPORAL PROCESSING FOR DISTANT SPEECH RECOGNITIONLow, S. Y. / Togneri, R. / Nordholm, S. / IEEE Signal Processing Society et al. | 2004
- I
-
Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMMNakagawa, S. / Zhang, W. / Takahashi, M. et al. | 2004
- I
-
Tone recognition with fractionized models and outlined featuresYe Tian, / Jian-Lai Zhou, / Min Chu, / Chang, E. et al. | 2004
- I
-
The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluationSorin, A. / Ramabadran, T. / Chazan, D. / Hoory, R. / Mclaughlin, M. / Pearce, D. / Wang, F.C. / Yaxin Zhang, et al. | 2004
- I
-
Improved quantization structures using generalized HMM modelling with application to wideband speech codingDuni, E.R. / Subramaniam, A.D. / Rao, B.D. et al. | 2004
- I
-
Effects of transcription errors on supervised learning in speech recognitionSundaram, R. / Picone, J. et al. | 2004
- I
-
Optimal blind separation of convolutive audio mixtures without temporal constraintsKokkinakis, K. / Nandi, A.K. et al. | 2004
- I
-
The use of a linguistically motivated language model in conversational speech recognitionWen Wang, / Stolcke, A. / Harper, M.P. et al. | 2004
- I
-
Low distortion speech denoising using an adaptive parametric Wiener filterNingping Fan, et al. | 2004
- I
-
MPE-based discriminative linear transform for speaker adaptationWang, L. / Woodland, P. et al. | 2004
- I
-
Adaptive training using structured transformsYu, K. / Gales, M.J.F. et al. | 2004
- I
-
Benefits of prior acoustic segmentation for automatic speaker segmentationMeignier, S. / Moraru, D. / Fredouille, C. / Besacier, L. / Bonastre, J.F. et al. | 2004
- I
-
Extending boosting for call classification using word confusion networksTur, G. / Hakkani-Tur, D. / Riccardi, G. et al. | 2004
- I
-
Dialog trajectory analysisAbella, A. / Wright, J. / Gorin, A.L. et al. | 2004
- I
-
Fractional Fourier transform features for speech recognitionSarikaya, R. / Gao, Y. / Saon, G. et al. | 2004
- I
-
Bayesian modelling of the speech spectrum using mixture of GaussiansZolfaghari, P. / Watanabe, S. / Nakamura, A. / Katagiri, S. et al. | 2004
- I
-
An estimate of physical scale from speechSmith, L.H. / Nelson, D.J. et al. | 2004
- I
-
Formant tracking by mixture state particle filterYanli Zheng, / Hasegawa-Johnson, M. et al. | 2004
- I
-
Speech modeling and voiced/unvoiced/mixed/silence speech segmentation with fractionally Gaussian noise based modelsOveisgharan, S. / Shamsollahi, M.B. et al. | 2004
- I
-
Spherical harmonic analysis of equalization in a reverberant roomBetlehem, T. / Abhayapala, T.D. et al. | 2004
- I
-
Automatic generation of non-uniform HMM structures based on variational Bayesian approachJitsuhiro, T. / Nakamura, S. et al. | 2004
- I
-
A Viterbi algorithm for a trajectory model derived from HMM with explicit relationship between static and dynamic featuresZen, H. / Tokuda, K. / Kitamura, T. et al. | 2004
- I
-
Parsing speech into articulatory eventsHacioglu, K. / Pellom, B. / Ward, W. et al. | 2004
- I
-
Bayesian duration modeling and learning for speech recognitionJen-Tzung Chien, / Chih-Hsien Huang, et al. | 2004
- I
-
SP-L4.6: IMPROVEMENT OF SPEAKER RECOGNITION BY COMBINING RESIDUAL AND PROSODIC FEATURES WITH ACOUSTIC FEATURESChen, S.-H. / Wang, H.-C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.3: TONE RECOGNITION WITH FRACTIONIZED MODELS AND OUTLINED FEATURESTian, Y. / Zhou, J.-L. / Chu, M. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.6: NOISE SUPPRESSION FOR AUTOMOTIVE APPLICATIONS BASED ON DIRECTIONAL INFORMATIONFuchs, M. / Haulick, T. / Schmidt, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.8: ENROLLMENT IN LOW-RESOURCE SPEECH RECOGNITION SYSTEMSDeligne, S. / Dharanipragada, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.2: DESPERATELY SEEKING IMPOSTORS: DATA-MINING FOR COMPETITIVE IMPOSTOR TESTING IN A TEXT-DEPENDENT SPEAKER VERIFICATION SYSTEMHebert, M. / Mirghafori, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.3: A MULTIMEDIA APPROACH FOR AUDIO SEGMENTATION IN TV BROADCAST NEWSPerez-Freire, L. / Garcia-Mateo, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.7: PUBLIC SPEECH-ORIENTED GUIDANCE SYSTEM WITH ADULT AND CHILD DISCRIMINATION CAPABILITYNisimura, R. / Lee, A. / Saruwatari, H. / Shikano, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.9: DIALOG TRAJECTORY ANALYSISAbella, A. / Wright, J. / Gorin, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.11: AUTOMATIC INDEXING OF KEY SENTENCES FOR LECTURE ARCHIVES USING STATISTICS OF PRESUMED DISCOURSE MARKERSNanjo, H. / Kitade, T. / Kawahara, T. / IEEE Signal Processing Society et al. | 2004