SP-L2.2: PARAMETER SHARING AND MINIMUM CLASSIFICATION ERROR TRAINING OF MIXTURES OF FACTOR ANALYZERS FOR SPEAKER IDENTIFICATION (English)
- Yamamoto, H.
- Nankaku, Y.
- Miyajima, C.
- Tokuda, K.
- Kitamura, T.
- IEEE Signal Processing Society
In: ICASSP; 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing; I - 29-32; 2004
- Article (conference) / Print
- Title: SP-L2.2: PARAMETER SHARING AND MINIMUM CLASSIFICATION ERROR TRAINING OF MIXTURES OF FACTOR ANALYZERS FOR SPEAKER IDENTIFICATION
- Contributors: Yamamoto, H. (author) / Nankaku, Y. (author) / Miyajima, C. (author) / Tokuda, K. (author) / Kitamura, T. (author) / IEEE Signal Processing Society
- Conference: 29th, ICASSP; 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing; 2004; Montreal, Quebec
- Publisher: IEEE
- Place of publication: Piscataway, N.J.
- Publication date: 01.01.2004
- Format / extent: I - 29-32
- Notes: Conference number extrapolated. IEEE Catalog Number: 04CH37568. Conference held in 5 separate issues. Subject of Volume 1 is Speech processing.
- Media type: Article (conference)
- Format: Print
- Language: English
© Metadata Copyright the British Library Board and other contributors. All rights reserved.
Table of contents: conference proceedings
The tables of contents are generated automatically and are based on the individual records of the contained contributions that are available in the index of the TIB portal. The tables of contents shown may therefore be incomplete or contain gaps.
- 0_1 | 2004 IEEE International Conference on Acoustics, Speech and Signal Processing | 2004
- I | Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis | Yamagishi, J. / Tachibana, M. / Masuko, T. / Kobayashi, T. et al. | 2004
- I | Soft decoding strategies for distributed speech recognition over IP networks | Cardenal-Lopez, A. / Docio-Fernandez, L. / Garcia-Mateo, C. et al. | 2004
- 57 | A subvector-based error concealment algorithm for speech recognition over mobile networks | Tan, Zheng-Hua / Dalsgaard, P. / Lindberg, B. et al. | 2004
- I | A complexity reduction of ETSI advanced front-end for DSR | Jin-Yu Li / Bo Liu / Ren-Hua Wang / Li-Rong Dai et al. | 2004
- I | Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving | Wei-hao Hsu / Lin-shan Lee et al. | 2004
- I | A novel method for computation of periodicity, aperiodicity and pitch of speech signals | Deshmukh, O. / Singh, J. / Espy-Wilson, C. et al. | 2004
- I | Non-uniform speaker normalization using affine-transformation | Kumar, S.V.B. / Umesh, S. / Sinha, R. et al. | 2004
- I | Speech feature extraction method representing periodicity and aperiodicity in sub bands for robust speech recognition | Ishizuka, K. / Miyazaki, N. et al. | 2004
- 169 | Performance analysis for a class of robust adaptive beamformers | Besson, O. / Vincent, F. et al. | 2004
- 189 | Spatial filtering of RF interference in radio astronomy using a reference antenna | Veen, A.J. van der / Boonstra, A.J. et al. | 2004
- I | Higher order cepstral moment normalization (HOCMN) for robust speech recognition | Chang-wen Hsu / Lin-shan Lee et al. | 2004
- 229 | Speech enhancement based on a combined multi-channel array with constrained iterative and auditory masked processing | Zhang, Xianxian / Hansen, J.H.L. / Rehar, K.A. et al. | 2004
- 237 | An improved array interpolation approach to DOA estimation in correlated signal environments | Lau, B.K. / Cook, G.J. / Leung, Y.H. et al. | 2004
- I | Meta-data conditional language modeling | Bacchiani, M. / Roark, B. et al. | 2004
- 249 | Direct position determination of narrowband radio transmitters | Weiss, A.J. et al. | 2004
- I | Cross-lingual latent semantic analysis for language modeling | Woosung Kim / Khudanpur, S. et al. | 2004
- 293 | A Kalman filter based registration approach for asynchronous sensors in multiple sensor fusion applications | Zhou, Yifeng et al. | 2004
- 329 | A single-carrier/OFDM comparison for broadband wireless communication | Van der Perre, L. / Tubbax, J. / Horlin, F. / De Man, H. et al. | 2004
- I | Speaker indexing and adaptation using speaker clustering based on statistical model selection | Nishida, M. / Kawahara, T. et al. | 2004
- 361 | Geolocation by time difference of arrival using hyperbolic asymptotes | Drake, S.R. / Dogancay, K. et al. | 2004
- 393 | Design of complex allpass filters | Fernandez-Vazquez, A. / Jovanovic-Dolecek, G. et al. | 2004
- 397 | Multiplier-free band-selectable digital filters | Santraine, A. / Leprince, S. / Taylor, F. et al. | 2004
- I | Public speech-oriented guidance system with adult and child discrimination capability | Nisimura, R. / Lee, A. / Saruwatari, H. / Shikano, K. et al. | 2004
- I | Improving phoneme recognition of telephone quality speech | Qiang Huang / Cox, S. et al. | 2004
- 449 | A stochastic model for the affine projection algorithm operating in a nonstationary environment | Almeida, S.J.M. de / Bermudez, J.C.M. / Bershad, N.J. et al. | 2004
- 457 | A statistical analysis of the multi-split LMS algorithm | Resende, L.S. / Rocha, C.A.F. / Bermudez, J.C.M. / Bellanger, M.G. et al. | 2004
- 461 | Sufficient condition for tap-length gradient adaption of LMS algorithm | Gu, Yuantao / Tang, Kun / Cui, Huijuan et al. | 2004
- 469 | A modified constant-Q transform for audio signals | Santos, C.N. dos / Netto, S.L. / Biscainho, L.W.R. / Graziosi, D.B. et al. | 2004
- 501 | Weighted low rank approximation and reduced rank linear regression | Werner, K. / Jansson, M. et al. | 2004
- 505 | Wavelet packets-based direction-of-arrival estimation | Xue, Yanbo / Wang, Jinkuan / Liu, Zhigang et al. | 2004
- I | An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model | Ken Chen / Hasegawa-Johnson, M. / Cohen, A. et al. | 2004
- 521 | A new signal model and identification algorithm for hidden semi-Markov signals | Azimi, M. / Nasiopoulos, P. / Ward, R.K. et al. | 2004
- 561 | Polyphase analysis of aliasing effects in enlargements | Seidner, D. et al. | 2004
- 569 | Can timing jitter improve random process reconstruction in presence of aliasing? | Lacaze, B. / Mailhes, C. et al. | 2004
- I | Importance of window shape for phase-only reconstruction of speech | Alsteris, L.D. / Paliwal, K.K. et al. | 2004
- 581 | Frequency analysis using non-uniform sampling with application to active queue management | Gunnarsson, F. / Gustafsson, F. et al. | 2004
- I | Automatic emotional speech classification | Ververidis, D. / Kotropoulos, C. / Pitas, I. et al. | 2004
- 597 | Parametric smoothing of spline interpolation | Ibanez, J. / Santamaria, I. / Pantaleon, C. / Vielva, L. et al. | 2004
- 629 | Diffusion equations for adaptive affine distributions | Gosme, J. / Richard, C. / Goncalves, P. et al. | 2004
- 633 | Comparative study of three time-frequency representations with applications to a novel correlation method | Sejdic, E. / Jiang, J. et al. | 2004
- 637 | A bootstrap scheme for time-frequency auto-term selection in antenna arrays | Cirillo, L.A. / Zoubir, A.M. et al. | 2004
- 649 | Pole optimisation in adaptive Laguerre filtering | den Brinker, A.C. / Sarroukh, B.E. et al. | 2004
- 665 | On adaptive interpolated FIR filters | Bilcu, R.C. / Kuosmanen, P. / Egiazarian, K. et al. | 2004
- 673 | A recursive least squares algorithm robust to low-power excitation | Ludovico, C.S. / Bermudez, J.C.M. et al. | 2004
- 709 | Kalman filtering in stochastic gradient algorithms: construction of a stopping rule | Bittner, B. / Pronzato, L. et al. | 2004
- 713 | Combining equalization and estimation for bandwidth extension of narrowband speech | Qian, Yasheng / Kabal, P. et al. | 2004
- 753 | Novel approach to AM-FM decomposition with applications to speech and music analysis | Sekhar, S.C. / Sreenivas, T.V. et al. | 2004
- 757 | Time-frequency-moving-average processes: principles and cepstral methods for parameter estimation | Jachan, M. / Matz, G. / Hlawatsch, F. et al. | 2004
- 825 | Studies in massively speaker-specific speech recognition | Shi, Yu / Chang, Eric et al. | 2004
- I | Codebook design for ASR systems using custom arithmetic units | Xiao Li / Malkin, J. / Bilmes, J. et al. | 2004
- I | Parameter sharing in subband likelihood-maximizing beamforming for speech recognition using microphone arrays | Seltzer, M.L. / Stern, R.M. et al. | 2004
- I | Extended cluster information vector quantization (ECI-VQ) for robust classification | Arrowood, J.A. / Clements, M.A. et al. | 2004
- 933 | Notions of strong ergodicity for stochastic analysis of multirate systems | Marelli, D. / Fu, Minyue et al. | 2004
- 945 | An extended sure approach for multicomponent image denoising | Benazza-Benyahia, A. / Pesquet, J.C. et al. | 2004
- 1009 | Blind deconvolution using Bayesian methods with application to the dereverberation of speech | Daly, M.J. / Reilly, J.R. et al. | 2004
- 1033 | Automatic recognition of Bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques | Nour-Eldin, A.H. / Tolba, H. / O'Shaughnessy, D. et al. | 2004
- 1037 | SP-P16.12: SENSITIVITY ANALYSIS OF NOISE ROBUSTNESS METHODS | Brayda, L. / Rigazio, L. / Boman, R. / Junqua, J.-C. / IEEE Signal Processing Society et al. | 2004
- 1041 | Fast MCMC computations for the estimation of sparse processes from noisy observations | Davy, M. / Idier, J. et al. | 2004
- 1041 | Author index | 2004
- 1053 | An approach based on influence function to evaluate robustness and detection performance of CFAR detectors | Meng, Huadong / Wang, Xiqin / Zhang, Hao / Peng, Yingning et al. | 2004
- 1069 | Detection performance for discrete test statistics. Application to low-flux imagery | Ferrari, A. / Tourneret, J.Y. et al. | 2004
- 1097 | Signal detection and estimation using atomic decomposition and information-theoretic criteria | Lopez-Risueno, G. / Grajal, J. / Yeste-Ojeda, O.A. et al. | 2004
- I | SP-L5.1: PITCH PREDICTION FROM MFCC VECTORS FOR SPEECH RECONSTRUCTION | Shao, X. / Milner, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-L5.4: EXTRACTION OF PITCH IN ADVERSE CONDITIONS | Mahadeva Prasanna, S. R. / Yegnanarayana, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-L9.2: HIGHER ORDER CEPSTRAL MOMENT NORMALIZATION (HOCMN) FOR ROBUST SPEECH RECOGNITION | Hsu, C.-w. / Lee, L.-s. / IEEE Signal Processing Society et al. | 2004
- I | SP-P1.9: EMPLOYING LAPLACIAN-GAUSSIAN DENSITIES FOR SPEECH ENHANCEMENT | Gazor, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P1.10: ROBUST ADAPTIVE KALMAN FILTERING-BASED SPEECH ENHANCEMENT ALGORITHM | Gabrea, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P2.10: EIGEN-MLLRS APPLIED TO UNSUPERVISED SPEAKER ENROLLMENT FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION | Aubert, X. / IEEE Signal Processing Society et al. | 2004
- I | SP-P5.7: EFFICIENT SPECTRUM CODING FOR SUPER-WIDEBAND SPEECH AND ITS APPLICATION TO 7/10/15 KHZ BANDWIDTH SCALABLE CODERS | Oshikiri, M. / Ehara, H. / Yoshida, K. / IEEE Signal Processing Society et al. | 2004
- I | SP-P6.7: FRACTIONAL FOURIER TRANSFORM FEATURES FOR SPEECH RECOGNITION | Sarikaya, R. / Gao, Y. / Saon, G. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.1: MINIMUM SEGMENTATION ERROR BASED DISCRIMINATIVE TRAINING FOR SPEECH SYNTHESIS APPLICATION | Wu, Y.-J. / Kawai, H. / Ni, J. / Wang, R.-H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.9: EVALUATION OF THE EFFECT OF STRESS ON FORMANTS IN FARSI VOWELS | Gharavian, D. / Ahadi, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.14: SCALING OF WAVEFORM SEGMENTS ALONG THE TIME AXIS FOR CONCATENATIVE SPEECH SYNTHESIS | Nishizawa, N. / Kawai, H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.8: CROSS-DIALECTAL ACOUSTIC DATA SHARING FOR ARABIC SPEECH RECOGNITION | Kirchhoff, K. / Vergyri, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.3: SEGMENTAL TONAL MODELING FOR PHONE SET DESIGN IN MANDARIN LVCSR | Huang, C. / Shi, Y. / Zhou, J.-L. / Chu, M. / Wang, T. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.4: DECISION TREE BASED TONE MODELING FOR CHINESE SPEECH RECOGNITION | Wong, P.-F. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I | Non-parallel training for voice conversion by maximum likelihood constrained adaptation | Mouchtaris, A. / Van der Spiegel, J. / Mueller, P. et al. | 2004
- I | Parameter sharing and minimum classification error training of mixtures of factor analyzers for speaker identification | Yamamoto, H. / Nankaku, Y. / Miyajima, C. / Tokuda, K. / Kitamura, T. et al. | 2004
- I | Generalized locally recurrent probabilistic neural networks for text-independent speaker verification | Ganchev, T. / Fakotakis, N. / Tasoulis, D.K. / Vrahatis, M.N. et al. | 2004
- I | Robust speech recognition techniques evaluation for telephony server based in-car applications | Delphin-Poulat, L. et al. | 2004
- I | High-level speaker verification with support vector machines | Campbell, W.M. / Campbell, J.R. / Reynolds, D.A. / Jones, D.A. / Leek, T.R. et al. | 2004
- I | Weighted autocorrelation-based F0 estimation for distant-talking interaction with a distributed microphone network | Armani, L. / Omologo, M. et al. | 2004
- I | Product of power spectrum and group delay function for speech recognition | Donglai Zhu / Paliwal, K.K. et al. | 2004
- I | Low-complexity predictive trellis coded quantization of wideband speech LSF parameters | Yongwon Shin / Sangwon Kang / Fischer, T.R. / Changyong Son / Yongbeom Lee et al. | 2004
- I | Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture models | Lindblom, J. / Hedelin, P. et al. | 2004
- I | Robust speech recognition using cepstral domain missing data techniques and noisy masks | Van Hamme, H. et al. | 2004
- I | Noise suppression for automotive applications based on directional information | Fuchs, M. / Haulick, T. / Schmidt, G. et al. | 2004
- I | Vocabulary-independent search in spontaneous speech | Seide, F. / Peng Yu / Chengyuan Ma / Chang, E. et al. | 2004
- I | Robust adaptive Kalman filtering-based speech enhancement algorithm | Gabrea, M. et al. | 2004
- I | Language boundary detection and identification of mixed-language speech based on MAP estimation | Chi-Jiun Shia / Yu-Hsien Chiu / Jia-Hsin Hsieh / Chung-Hsien Wu et al. | 2004
- I | Language identification using parallel syllable-like unit recognition | Nagarajan, T. / Murthy, H.A. et al. | 2004
- I | A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues | Dagen Wang / Narayanan, S.S. et al. | 2004
- I | An evaluation of automatic phone segmentation for concatenative speech synthesis | Kawai, H. / Toda, T. et al. | 2004
- I | Estimation of short-term predictor parameters for coding and enhancement of noisy speech | Srinivasan, S. / Samuelsson, J. / Kleijn, W.B. et al. | 2004
- I | HMM-based frequency bandwidth extension for speech enhancement using line spectral frequencies | Chen, G. / Parsa, V. et al. | 2004
- I | Optimizing acoustic models for commercial speech recognition using foreground scores and data weighting | Boies, D. / Strope, B. / Weintraub, M. / Su-Lin Wu et al. | 2004
- I | Hidden spectral peak trajectory model for phone classification | Yiu-Pong Lai / Man-Hung Siu et al. | 2004
- I | SP-L1.5: VOICE CHARACTERISTICS CONVERSION FOR TTS USING REVERSE VTLN | Eichner, M. / Wolff, M. / Hoffmann, R. / IEEE Signal Processing Society et al. | 2004
- I | SP-L3.4: A COMPLEXITY REDUCTION OF ETSI ADVANCED FRONT-END FOR DSR | Li, J.-Y. / Liu, B. / Wang, R.-H. / Dai, L.-R. / IEEE Signal Processing Society et al. | 2004
- I | SP-L4.5: SPEAKER IDENTIFICATION USING SUPRA-SEGMENTAL PITCH PATTERN DYNAMICS | Farahani, F. / Georgiou, P. / Narayanan, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-L6.2: PRODUCT OF POWER SPECTRUM AND GROUP DELAY FUNCTION FOR SPEECH RECOGNITION | Zhu, D. / Paliwal, K. K. / IEEE Signal Processing Society et al. | 2004
- I | SP-L8.4: A LOCALLY WEIGHTED DISTANCE MEASURE FOR EXAMPLE BASED SPEECH RECOGNITION | De Wachter, M. / Demuynck, K. / Wambacq, P. / Van Compernolle, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-L9.6: ROBUST SPEECH RECOGNITION USING CEPSTRAL DOMAIN MISSING DATA TECHNIQUES AND NOISY MASKS | Van Hamme, H. / IEEE Signal Processing Society et al. | 2004
- I | SP-L9.5: CEPSTRAL GAIN NORMALIZATION FOR NOISE ROBUST SPEECH RECOGNITION | Yoshizawa, S. / Hayasaka, N. / Wada, N. / Miyanaga, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P4.5: AUTOMATIC LEARNING OF INTERPRETATION STRATEGIES FOR SPOKEN DIALOGUE SYSTEMS | Raymond, C. / Bechet, F. / De Mori, R. / Damnati, G. / Esteve, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P4.6: UNSUPERVISED AND ACTIVE LEARNING IN AUTOMATIC SPEECH RECOGNITION FOR CALL CLASSIFICATION | Hakkani-Tur, D. / Tur, G. / Rahim, M. / Riccardi, G. / IEEE Signal Processing Society et al. | 2004
- I | SP-P4.12: SPEECH-ACTIVATED TEXT RETRIEVAL SYSTEM FOR MULTIMODAL CELLULAR PHONES | Ishikawa, S.-y. / Ikeda, T. / Miki, K. / Adachi, F. / IEEE Signal Processing Society et al. | 2004
- I | SP-P5.8: ENHANCED STANDARD COMPLIANT DISTRIBUTED SPEECH RECOGNITION (AURORA ENCODER) USING RATE ALLOCATION | Srinivasamurthy, N. / Ortega, A. / Narayanan, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P6.9: TRAPPING CONVERSATIONAL SPEECH: EXTENDING TRAP/TANDEM APPROACHES TO CONVERSATIONAL TELEPHONE SPEECH RECOGNITION | Morgan, N. / Chen, B. / Zhu, Q. / Stolcke, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P7.3: AN ESTIMATE OF PHYSICAL SCALE FROM SPEECH | Smith, L. / Nelson, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.2: WATERMARKING OF SPEECH SIGNALS USING THE SINUSOIDAL MODEL AND FREQUENCY MODULATION OF THE PARTIALS | Girin, L. / Marchand, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.5: A LOW-BAND SPECTRUM ENVELOPE MODELING FOR HIGH QUALITY PITCH MODIFICATION | Mochizuki, R. / Kobayashi, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.8: OPTIMIZING SUB-COST FUNCTIONS FOR SEGMENT SELECTION BASED ON PERCEPTUAL EVALUATIONS IN CONCATENATIVE SPEECH SYNTHESIS | Toda, T. / Kawai, H. / Tsuzaki, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P10.9: SPEECH ENHANCEMENT USING ROBUST WEIGHTING FACTORS FOR CRITICAL-BAND-WAVELET-PACKET TRANSFORM | Lu, C.-T. / Wang, H.-C. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.4: CORRECTIVE LANGUAGE MODELING FOR LARGE VOCABULARY ASR WITH THE PERCEPTRON ALGORITHM | Roark, B. / Saraclar, M. / Collins, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.7: A GENERALIZED CONSTRUCTION OF INTEGRATED SPEECH RECOGNITION TRANSDUCERS | Allauzen, C. / Mohri, M. / Riley, M. / Roark, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-P12.3: AUTOMATIC GENERATION OF NON-UNIFORM HMM STRUCTURES BASED ON VARIATIONAL BAYESIAN APPROACH | Jitsuhiro, T. / Nakamura, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P13.6: INVESTIGATIONS INTO THE RELATIONSHIP BETWEEN MEASURABLE SPEECH QUALITY AND SPEECH RECOGNITION RATE FOR TELEPHONY SPEECH | Sun, H. / Shue, L. / Chen, J. / IEEE Signal Processing Society et al. | 2004
- I | SP-P13.11: FUSION BASED SPEECH SEGMENTATION IN DARPA SPINE2 TASK | Zheng, C. / Yan, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.6: A STUDY ON ROBUST SEGMENTATION AND LOCATION OF TONE NUCLEI IN CHINESE CONTINUOUS SPEECH | Zhang, J. / Hirose, K. / IEEE Signal Processing Society et al. | 2004
- I | SP-P15.1: ROBUST SPEECH RECOGNITION IN ADDITIVE AND CHANNEL NOISE ENVIRONMENTS USING GMM AND EM ALGORITHM | Fujimoto, M. / Ariki, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P15.8: COMBINING FEATURE COMPENSATION AND WEIGHTED VITERBI DECODING FOR NOISE ROBUST SPEECH RECOGNITION WITH LIMITED ADAPTATION DATA | Cui, X. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P15.11: A TREE-STRUCTURED CLUSTERING METHOD INTEGRATING NOISE AND SNR FOR PIECEWISE LINEAR-TRANSFORMATION-BASED NOISE ADAPTATION | Zhang, Z. / Sugimura, T. / Furui, S. / IEEE Signal Processing Society et al. | 2004
- I | Discrimination power weighted subword-based speaker verification | Siu-Man Chan / Man-Hung Siu et al. | 2004
- I | Applying articulatory features to telephone-based speaker verification | Ka-Yee Leung / Man-Wai Mak / Sun-Yuan Kung et al. | 2004
- I | Pitch prediction from MFCC vectors for speech reconstruction | Xu Shao / Milner, B. et al. | 2004
- I | Joint decoding for phoneme-grapheme continuous speech recognition | Magimai-Doss, M. / Bengio, S. / Bourlard, H. et al. | 2004
- I | Spectral entropy based feature for robust ASR | Misra, H. / Ikbal, S. / Bourlard, H. / Hermansky, H. et al. | 2004
- I | A multiple description speech coder based on AMR-WB for mobile ad hoc networks | Dong, H. / Gersho, A. / Gibson, J.D. / Cuperman, V. et al. | 2004
- I | A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard | Sung-Kyo Jung / Kyung-Tae Kim / Hong-Goo Kang et al. | 2004
- I | Employing Laplacian-Gaussian densities for speech enhancement | Gazor, S. et al. | 2004
- I | Online speaker clustering | Liu, D. / Kubala, F. et al. | 2004
- I | Robust multimodal understanding | Bangalore, S. / Johnston, M. et al. | 2004
- I | A distributed framework for enterprise level speech recognition services | Arizmendi, I. / Rose, R.C. et al. | 2004
- I | Speech-activated text retrieval system for multimodal cellular phones | Ishikawa, S.Y. / Ikeda, T. / Miki, K. / Adachi, F. / Isotani, R. / Iso, K.I. / Okumura, A. et al. | 2004
- I | Enhanced standard compliant distributed speech recognition (Aurora encoder) using rate allocation | Srinivasamurthy, N. / Ortega, A. / Narayanan, S. et al. | 2004
- I | Variational Bayesian feature selection for Gaussian mixture models | Valente, F. / Wellekens, C. et al. | 2004
- I | Joint frequency domain and reconstructed phase space features for speech recognition | Lindgren, A.C. / Johnson, M.T. / Povinelli, R.J. et al. | 2004
- I | Refining segmental boundaries for TTS database using fine contextual-dependent boundary models | Lijuan Wang / Yong Zhao / Min Chu / Jianlai Zhou / Zhigang Cao et al. | 2004
- I | Evaluation of the effect of stress on formants in Farsi vowels | Gharavian, D. / Ahadi, S.M. et al. | 2004
- I | Improving broadcast news transcription by lightly supervised discriminative training | Chan, H.Y. / Woodland, P. et al. | 2004
- I | The 2003 ISL rich transcription system for conversational telephony speech | Soltau, H. / Hua Yu / Metze, F. / Fugen, C. / Qin Jin / Szu-Chen Jou et al. | 2004
- I | A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation-based noise adaptation | Zhang, Z. / Sugimura, T. / Furui, S. et al. | 2004
- I | Spatio-temporal processing for distant speech recognition | Siow Yong Low / Togneri, R. / Nordholm, S. et al. | 2004
- I | Sensitivity analysis of noise robustness methods | Brayda, L. / Rigazio, L. / Boman, R. / Junqua, J.C. et al. | 2004
- I | Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database | Bernard, A. / Yifan Gong / Xiaodong Cui et al. | 2004
- I | SP-L2.4: DISENTANGLING SPEAKER AND CHANNEL EFFECTS IN SPEAKER VERIFICATION | Kenny, P. / Dumouchel, P. / IEEE Signal Processing Society et al. | 2004
- I | SP-L3.2: THE ETSI EXTENDED DISTRIBUTED SPEECH RECOGNITION (DSR) STANDARDS: SERVER-SIDE SPEECH RECONSTRUCTION | Ramabadran, T. / Sorin, A. / McLaughlin, M. / Chazan, D. / Pearce, D. / Hoory, R. / IEEE Signal Processing Society et al. | 2004
- I | SP-L5.2: ALGORITHM FOR AUTOMATIC GLOTTAL WAVEFORM ESTIMATION WITHOUT THE RELIANCE ON PRECISE GLOTTAL CLOSURE INFORMATION | Moore, E. / Clements, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-L7.3: VARIABLE-DIMENSION QUANTIZATION OF SINUSOIDAL AMPLITUDES USING GAUSSIAN MIXTURE MODELS | Lindblom, J. / Hedelin, P. / IEEE Signal Processing Society et al. | 2004
- I | SP-L7.6: WAVEFORM QUANTIZATION OF SPEECH USING GAUSSIAN MIXTURE MODELS | Samuelsson, J. / IEEE Signal Processing Society et al. | 2004
- I | SP-L9.1: SPECTRAL ENTROPY BASED FEATURE FOR ROBUST ASR | Misra, H. / Ikbal, S. / Bourlard, H. / Hermansky, H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P1.3: A SCALABLE SPEECH AND AUDIO CODING SCHEME WITH CONTINUOUS BITRATE FLEXIBILITY | Kovesi, B. / Massaloux, D. / Sollaud, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P1.6: A BIT-RATE/BANDWIDTH SCALABLE SPEECH CODER BASED ON ITU-T G.723.1 STANDARD | Jung, S.-K. / Kim, K.-T. / Kang, H.-G. / IEEE Signal Processing Society et al. | 2004
- I | SP-P3.1: PARAMETERIZATION OF THE SCORE THRESHOLD FOR A TEXT-DEPENDENT ADAPTIVE SPEAKER VERIFICATION SYSTEM | Mirghafori, N. / Hebert, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P3.4: THE ELISA CONSORTIUM APPROACHES IN BROADCAST NEWS SPEAKER SEGMENTATION DURING THE NIST 2003 RICH TRANSCRIPTION EVALUATION | Moraru, D. / Meignier, S. / Fredouille, C. / Besacier, L. / Bonastre, J.-F. / IEEE Signal Processing Society et al. | 2004
- I | SP-P3.11: LANGUAGE IDENTIFICATION USING PARALLEL SYLLABLE-LIKE UNIT RECOGNITION | Thangavelu, N. / Murthy, H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P4.4: A DISTRIBUTED FRAMEWORK FOR ENTERPRISE LEVEL SPEECH RECOGNITION SERVICES | Arizmendi, I. / Rose, R. / IEEE Signal Processing Society et al. | 2004
- I | SP-P6.6: A MULTI-PASS LINEAR FOLD ALGORITHM FOR SENTENCE BOUNDARY DETECTION USING PROSODIC CUES | Wang, D. / Narayanan, S. S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P7.9: YET ANOTHER ACOUSTIC REPRESENTATION OF SPEECH SOUNDS | Minematsu, N. / IEEE Signal Processing Society et al. | 2004
- I | SP-P7.10: ESTIMATING VOCAL-TRACT AREA FUNCTIONS FROM VOWEL SOUND SIGNALS OVER CLOSED GLOTTAL PHASES | Deng, H. / Ward, R. K. / Beddoes, M. / Hodgson, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P8.8: A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST | Ahmed, B. / Holmes, W. H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.9: ADVANCES IN THE AUTOMATIC TRANSCRIPTION OF LECTURES | Cettolo, M. / Brugnara, F. / Federico, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P12.9: PHONE DURATION MODELING FOR LVCSR | Povey, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.1: INTEGRATING THUMBNAIL FEATURES FOR SPEECH RECOGNITION USING CONDITIONAL EXPONENTIAL MODELS | Yu, H. / Waibel, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.2: DISCRIMINATIVE FEATURE TRANSFORMATION BY GUIDED DISCRIMINATIVE TRAINING | Hsiao, R. / Mak, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.5: HIDDEN SPECTRAL PEAK TRAJECTORY MODEL FOR PHONE CLASSIFICATION | Lai, Y.-P. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.12: MINIMUM CLASSIFICATION ERROR TRAINING OF LANDMARK MODELS FOR REAL-TIME CONTINUOUS SPEECH RECOGNITION | McDermott, E. / Hazen, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-P16.6: MULTI-ENVIRONMENT MODELS BASED LINEAR NORMALIZATION FOR SPEECH RECOGNITION IN CAR CONDITIONS | Buera, L. / Lleida, E. / Miguel, A. / Ortega, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P16.9: CAN BACK-ENDS BE MORE ROBUST THAN FRONT-ENDS? INVESTIGATION OVER THE AURORA-2 DATABASE | Bernard, A. / Gong, Y. / Cui, X. / IEEE Signal Processing Society et al. | 2004
- I | Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT | Kawahara, H. / Banno, H. / Irino, T. / Zolfaghari, P. et al. | 2004
- I | Algorithm for automatic glottal waveform estimation without the reliance on precise glottal closure information | Moore, E. / Clements, M. et al. | 2004
- I | Waveform quantization of speech using Gaussian mixture models | Samuelsson, J. et al. | 2004
- I | Confidence measures in multiple pronunciations modeling for speaker verification | BenZeghiba, M.F. / Bourlard, H. et al. | 2004
- I | Entropy-based variable frame rate analysis of speech signals and its application to ASR | You, H. / Zhu, Q. / Alwan, A. et al. | 2004
- I | A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances | Li Deng / Lee, L.J. / Attias, H. / Acero, A. et al. | 2004
- I | An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation | Iseli, M. / Alwan, A. et al. | 2004
- I | Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora | Huang, R. / Hansen, J.H.L. et al. | 2004
- I | Lightly supervised and data-driven approaches to Mandarin broadcast news transcription | Berlin Chen / Jen-Wei Kuo / Wen-Hung Tsai et al. | 2004
- I | Investigations into the relationship between measurable speech quality and speech recognition rate for telephony speech | Hanwu Sun / Shue, L. / Jianfeng Chen et al. | 2004
- I | A study on robust segmentation and location of tone nuclei in Chinese continuous speech | Jin-Song Zhang / Keikichi Hirose et al. | 2004
- I | Tone variation modeling for fluent Mandarin tone recognition based on clustering | Wan-Yi Lin et al. | 2004
- I | A modified Ephraim-Malah noise suppression rule for automatic speech recognition | Gemello, R. / Mana, F. / De Mori, R. et al. | 2004
- I | SNR-dependent non-uniform spectral compression for noisy speech recognition | Chu, K.K. / Leung, S.H. et al. | 2004
- I | Multi-environment models based linear normalization for speech recognition in car conditions | Buera, L. / Lleida, E. / Miguel, A. / Ortega, A. et al. | 2004
- I | Minimum Kullback-Leibler distance based multivariate Gaussian feature adaptation for distant-talking speech recognition | Yue Pan / Waibel, A. et al. | 2004
- I | SP-L4.4: APPLYING ARTICULATORY FEATURES TO TELEPHONE-BASED SPEAKER VERIFICATION | Leung, K.-Y. / Mak, M.-W. / Kung, S.-Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-L7.4: ON SPLIT QUANTIZATION OF LSF PARAMETERS | Norden, F. / Eriksson, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-L8.2: COMBINATION OF HIDDEN MARKOV MODELS WITH DYNAMIC TIME WARPING FOR SPEECH RECOGNITION | Axelrod, S. / Maison, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-L8.3: JOINT DECODING FOR PHONEME-GRAPHEME CONTINUOUS SPEECH RECOGNITION | Magimai-Doss, M. / Bengio, S. / Bourlard, H. / IEEE Signal Processing Society et al. | 2004
- I | SP-L8.5: LIGHT SUPERVISION IN ACOUSTIC MODEL TRAINING | Nguyen, L. / Xiang, B. / IEEE Signal Processing Society et al. | 2004
- I | SP-L11.2: EXACT TRAINING OF A NEURAL SYNTACTIC LANGUAGE MODEL | Emami, A. / Jelinek, F. / IEEE Signal Processing Society et al. | 2004
- I | SP-P1.8: ON THE DECISION-DIRECTED ESTIMATION APPROACH OF EPHRAIM AND MALAH | Cohen, I. / IEEE Signal Processing Society et al. | 2004
- I | SP-P2.5: FEATURE SPACE GAUSSIANIZATION | Saon, G. / Dharanipragada, S. / Povey, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-P2.12: EIGENSPACE-BASED MLLR WITH SPEAKER ADAPTIVE TRAINING IN LARGE VOCABULARY CONVERSATIONAL SPEECH RECOGNITION | Doumpiotis, V. / Deng, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P5.4: COMBINED ESTIMATION/CODING OF HIGHBAND SPECTRAL ENVELOPES FOR SPEECH SPECTRUM EXPANSION | Agiomyrgiannakis, Y. / Stylianou, Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P5.5: AUTOMATICALLY DERIVED UNITS FOR SEGMENT VOCODERS | Ramasubramanian, V. / Sreenivas, T. V. / IEEE Signal Processing Society et al. | 2004
- I | SP-P7.1: BAYESIAN MODELLING OF THE SPEECH SPECTRUM USING MIXTURE OF GAUSSIANS | Zolfaghari, P. / Watanabe, S. / Nakamura, A. / Katagiri, S. / IEEE Signal Processing Society et al. | 2004
- I | SP-P7.4: FORMANT TRACKING BY MIXTURE STATE PARTICLE FILTER | Zheng, Y. / Hasegawa-Johnson, M. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.4: REFINING SEGMENTAL BOUNDARIES FOR TTS DATABASE USING FINE CONTEXTUAL-DEPENDENT BOUNDARY MODELS | Wang, L. / Zhao, Y. / Chu, M. / Zhou, J.-L. / Cao, Z. / IEEE Signal Processing Society et al. | 2004
- I | SP-P9.13: AN EVALUATION OF AUTOMATIC PHONE SEGMENTATION FOR CONCATENATIVE SPEECH SYNTHESIS | Kawai, H. / Toda, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-P10.1: SPHERICAL HARMONIC ANALYSIS OF EQUALIZATION IN A REVERBERANT ROOM | Betlehem, T. / Abhayapala, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-P10.4: AUTOMATED LIP-READING FOR IMPROVED SPEECH INTELLIGIBILITY | McClain, M. / Brady, K. / Brandstein, M. / Quatieri, T. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.14: IMPROVED NAME RECOGNITION WITH META-DATA DEPENDENT NAME NETWORKS | Maskey, S. / Bacchiani, M. / Roark, B. / Sproat, R. / IEEE Signal Processing Society et al. | 2004
- I | SP-P11.11: LIGHTLY SUPERVISED AND DATA-DRIVEN APPROACHES TO MANDARIN BROADCAST NEWS TRANSCRIPTION | Chen, B. / Kuo, J.-W. / Tsai, W.-H. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.8: VOICING FEATURE INTEGRATION IN SRI'S DECIPHER LVCSR SYSTEM | Graciarena, M. / Franco, H. / Zheng, J. / Vergyri, D. / Stolcke, A. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.11: TONE VARIATION MODELING FOR FLUENT MANDARIN TONE RECOGNITION BASED ON CLUSTERING | Lin, W.-Y. / IEEE Signal Processing Society et al. | 2004
- I | SP-P14.9: PARSING SPEECH INTO ARTICULATORY EVENTS | Hacioglu, K. / Pellom, B. / Ward, W. / IEEE Signal Processing Society et al. | 2004
- I | SP-P15.2: ASSESSMENT OF SIGNAL SUBSPACE BASED SPEECH ENHANCEMENT FOR NOISE ROBUST SPEECH RECOGNITION | Hermus, K. / Wambacq, P. / IEEE Signal Processing Society et al. | 2004
- I | SP-P16.7: MODELING SUB-BAND CORRELATION FOR NOISE-ROBUST SPEECH RECOGNITION | McAuley, J. / Ming, J. / Hanna, P. / Stewart, D. / IEEE Signal Processing Society et al. | 2004
- I | SP-P16.4: BAYESIAN DURATION MODELING AND LEARNING FOR SPEECH RECOGNITION | Chien, J.-T. / Huang, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
High quality voice morphingHui Ye, / Young, S. et al. | 2004
- I
-
Voice characteristics conversion for TTS using reverse VTLNEichner, M. / Wolff, M. / Hoffmann, R. et al. | 2004
- I
-
Discovering relations among discriminative training objectives [speak recognition applications]Qi Li, et al. | 2004
- I
-
Disentangling speaker and channel effects in speaker verificationKenny, P. / Dumouchel, P. et al. | 2004
- I
-
The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstructionRamabadran, T. / Sorin, A. / McLaughlin, M. / Chazan, D. / Pearce, D. / Hoory, R. et al. | 2004
- I
-
Using Haar transformed vocal source information for automatic speaker recognitionNengheng Zheng, / Ching, P.C. et al. | 2004
- I
-
Multiple frame block quantisation of line spectral frequencies using Gaussian mixture modelsPaliwal, K.K. / So, S. et al. | 2004
- I
-
On split quantization of LSF parametersNorden, F. / Eriksson, T. et al. | 2004
- I
-
On the decision-directed estimation approach of Ephraim and MalahCohen, I. et al. | 2004
- I
-
Adaptive time-segmentation for speech coding with limited delayRodbro, C.A. / Jensen, J. / Heusdens, R. et al. | 2004
- I
-
Closed-form estimation of the amplitude commands in the automatic extraction of the Fujisaki's modelSilva, S.D.S. / Netto, S.L. et al. | 2004
- I
-
A real-time Cantonese text-to-audiovisual speech synthesizerJian-Qing Wang, / Ka-Ho Wong, / Pheng-Ann Heng, / Meng, H.M. / Tien-Tsin Wong, et al. | 2004
- I
-
Modeling pronunciation variation for spontaneous speech synthesisWerner, S. / Wolff, M. / Eichner, M. / Hoffmann, R. et al. | 2004
- I
-
Basis superposition precision matrix modelling for large vocabulary continuous speech recognitionSim, K.C. / Gales, M.J.F. et al. | 2004
- I
-
Voicing feature integration in SRI's decipher LVCSR systemGraciarena, M. / Franco, H. / Jing Zheng, / Vergyri, D. / Stolcke, A. et al. | 2004
- I
-
Chinese-English bilingual phone modeling for cross-language speech recognitionShengmin Yu, / Shuwu Zhang, / Bo Xu, et al. | 2004
- I
-
Prosody-based recognition of spoken German varietiesDizdarevic, V. / Hagmuller, M. / Kubin, G. / Pernkopf, F. / Baum, M. et al. | 2004
- I
-
Assessment of signal subspace based speech enhancement for noise robust speech recognitionHermus, K. / Wambacq, P. et al. | 2004
- I
-
DBN based multi-stream models for audio-visual speech recognitionGowdy, J.N. / Subramanya, A. / Bartels, C. / Bilmes, J. et al. | 2004
- I
-
Modeling sub-band correlation for noise-robust speech recognitionMcAuley, J. / Ji Ming, / Hanna, P. / Stewart, D. et al. | 2004
- I
-
SP-L9.3: ROBUSTNESS OF SPEECH RECOGNITION USING GENETIC ALGORITHMS AND A MEL-CEPSTRAL SUBSPACE APPROACHSelouani, S.-A. / O'Shaughnessy, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.1: OPTIMAL BLIND SEPARATION OF CONVOLUTIVE AUDIO MIXTURES WITHOUT TEMPORAL CONSTRAINTSKokkinakis, K. / Nandi, A. K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.5: ON THE ARCHITECTURE OF THE CDMA2000® VARIABLE-RATE MULTIMODE WIDEBAND (VMR-WB) SPEECH CODING STANDARDJelinek, M. / Salami, R. / Ahmadi, S. / Bessette, B. / Gournay, P. / Laflamme, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.7: A TWO-STEP NOISE REDUCTION TECHNIQUEPlapous, C. / Marro, C. / Mauuary, L. / Scalart, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.11: A NOISE ESTIMATION ALGORITHM WITH RAPID ADAPTATION FOR HIGHLY NON-STATIONARY ENVIRONMENTSRangachari, S. / Loizou, P. / Hu, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.1: PERFORMANCE COMPARISONS OF ALL-PASS TRANSFORM ADAPTATION WITH MAXIMUM LIKELIHOOD LINEAR REGRESSIONMcDonough, J. / Waibel, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.9: IDENTIFYING IN-SET AND OUT-OF-SET SPEAKERS USING NEIGHBORHOOD INFORMATIONAngkititrakul, P. / Hansen, J. H. L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.8: EXTENDING BOOSTING FOR CALL CLASSIFICATION USING WORD CONFUSION NETWORKSTur, G. / Hakkani-Tur, D. / Riccardi, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.2: A DATA MINING APPROACH TO OBJECTIVE SPEECH QUALITY MEASUREMENTZha, W. / Chan, W.-Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.1: A MODEL-BASED TONE LABELING METHOD FOR MIN-NAN/TAIWANESE SPEECHKuo, W.-C. / Wang, Y.-R. / Chen, S.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.2: AN AUTOMATIC PROSODY LABELING SYSTEM USING ANN-BASED SYNTACTIC-PROSODIC MODEL AND GMM-BASED ACOUSTIC-PROSODIC MODELChen, K. / Hasegawa-Johnson, M. / Cohen, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.3: VARIATIONAL BAYESIAN FEATURE SELECTION FOR GAUSSIAN MIXTURE MODELSValente, F. / Wellekens, C. J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.3: CLUSTERING AND SEGMENTING SPEAKERS AND THEIR LOCATIONS IN MEETINGSAjmera, J. / McCowan, I. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.6: OPTIMIZING ACOUSTIC MODELS FOR COMMERCIAL SPEECH RECOGNITION USING FOREGROUND SCORES AND DATA WEIGHTINGBoies, D. / Strope, B. / Weintraub, M. / Wu, S.-L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.5: AUTOMATIC DETERMINATION OF ACOUSTIC MODEL TOPOLOGY USING VARIATIONAL BAYESIAN ESTIMATION AND CLUSTERINGWatanabe, S. / Sako, A. / Nakamura, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.1: CODEBOOK DESIGN FOR ASR SYSTEMS USING CUSTOM ARITHMETIC UNITSLi, X. / Malkin, J. / Bilmes, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.10: PROSODY-BASED RECOGNITION OF SPOKEN GERMAN VARIETIESDizdarevic, V. / Hagmuller, M. / Kubin, G. / Pernkopf, F. / Baum, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.4: NOISE ROBUST SPEECH RECOGNITION WITH A SWITCHING LINEAR DYNAMIC MODELDroppo, J. / Acero, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.10: MINIMUM MEAN SQUARE ERROR FILTERING OF NOISY CEPSTRAL COEFFICIENTS WITH APPLICATIONS TO ASRMyrvoll, T. A. / Nakamura, S. / IEEE Signal Processing Society et al. | 2004
- I
-
Robust speech feature extraction by growth transformation in reproducing kernel Hilbert spaceChakrabartty, S. / Yunbin Deng, / Cauwenberghs, G. et al. | 2004
- I
-
Phase autocorrelation (PAC) features in entropy based multi-stream for robust speech recognitionIkbal, S. / Misra, H. / Bourlard, H. / Hermansky, H. et al. | 2004
- I
-
Multiple-microphone time-varying filters for robust speech recognitionCalvin Yiu-Kit Lai, / Aarabi, P. et al. | 2004
- I
-
A scalable speech and audio coding scheme with continuous bitrate flexibilityKovesi, B. / Massaloux, D. / Sollaud, A. et al. | 2004
- I
-
A study of various composite kernels for kernel eigenvoice speaker adaptationMak, B. / Kwok, J.T. / Ho, S. et al. | 2004
- I
-
Eigen-MLLRs applied to unsupervised speaker enrollment for large vocabulary continuous speech recognitionAubert, X.L. et al. | 2004
- I
-
Unsupervised and active learning in automatic speech recognition for call classificationHakkani-Tur, D. / Tur, G. / Rahim, M. / Riccardi, G. et al. | 2004
- I
-
A model-based tone labeling method for Min-Nan/Taiwanese speechWei-Chih Kuo, / Yih-Ru Wang, / Sin-Horng Chen, et al. | 2004
- I
-
Feature generation based on maximum normalized acoustic likelihood for improved speech recognitionXiang Li, / Stern, R.M. et al. | 2004
- I
-
Acoustic analysis of friendly speechFangxin Chen, / Aijun Li, / Haibo Wang, / Tianqing Wang, / Qiang Fang, et al. | 2004
- I
-
Yet another acoustic representation of speech soundsMinematsu, N. et al. | 2004
- I
-
Estimating vocal-tract area functions from vowel sound signals over closed glottal phasesHuiqun Deng, / Ward, R.K. / Beddoes, M.P. / Hodgson, M. et al. | 2004
- I
-
A voice activity detector using the chi-square testAhmed, B. / Holmes, P.H. et al. | 2004
- I
-
Perceptual Kalman filtering for speech enhancement in colored noiseNing Ma, / Bouchard, M. / Goubran, R.A. et al. | 2004
- I
-
New speech harmonic structure measure and its application to post speech enhancementAn-Tze Yu, / Hsiao-chuan Wang, et al. | 2004
- I
-
Model complexity control and compression using discriminative growth functionsLiu, X. / Gales, M.J.F. et al. | 2004
- I
-
Robust speech recognition in additive and channel noise environments using GMM and EM algorithmFujimoto, M. / Ariki, Y. et al. | 2004
- I
-
Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation dataXiaodong Cui, / Alwan, A. et al. | 2004
- I
-
SP-L1.1: NON-PARALLEL TRAINING FOR VOICE CONVERSION BY MAXIMUM LIKELIHOOD CONSTRAINED ADAPTATIONMouchtaris, A. / Van der Spiegel, J. / Mueller, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.2: PARAMETER SHARING AND MINIMUM CLASSIFICATION ERROR TRAINING OF MIXTURES OF FACTOR ANALYZERS FOR SPEAKER IDENTIFICATIONYamamoto, H. / Nankaku, Y. / Miyajima, C. / Tokuda, K. / Kitamura, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.3: DISCOVERING RELATIONS AMONG DISCRIMINATIVE TRAINING OBJECTIVESLi, Q. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.5: DIMENSIONALITY REDUCTION USING MCE-OPTIMIZED LDA TRANSFORMATIONLi, X.-B. / Li, J.-Y. / Wang, R.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.6: THE USE OF A LINGUISTICALLY MOTIVATED LANGUAGE MODEL IN CONVERSATIONAL SPEECH RECOGNITIONWang, W. / Stolcke, A. / Harper, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.4: A MULTIPLE DESCRIPTION SPEECH CODER BASED ON AMR-WB FOR MOBILE AD HOC NETWORKSDong, H. / Gersho, A. / Gibson, J. / Cuperman, V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.8: CONFIDENCE MEASURES IN MULTIPLE PRONUNCIATIONS MODELING FOR SPEAKER VERIFICATIONBenZeghiba, M. F. / Bourlard, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.10: BENEFITS OF PRIOR ACOUSTIC SEGMENTATION FOR AUTOMATIC SPEAKER SEGMENTATIONMeignier, S. / Moraru, D. / Fredouille, C. / Besacier, L. / Bonastre, J.-F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.11: NOISE REDUCTION ON SPEECH CODEC PARAMETERSTaddei, H. / Beaugeant, C. / de Meuleneire, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.10: PREDICTING FOREGROUND SH, SL AND BNH DAM SCORES FOR MULTIDIMENSIONAL OBJECTIVE MEASURE OF SPEECH QUALITYSen, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.4: APPLICATION OF THE MODIFIED GROUP DELAY FUNCTION TO SPEAKER IDENTIFICATION AND DISCRIMINATIONHegde, R. / Murthy, H. / Gadde, V. R. R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.5: ACOUSTIC ANALYSIS OF FRIENDLY SPEECHChen, F. / Li, A. / Wang, H. / Wang, T. / Fang, Q. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.12: MODELING PRONUNCIATION VARIATION FOR SPONTANEOUS SPEECH SYNTHESISWerner, S. / Wolff, M. / Eichner, M. / Hoffmann, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.15: REAL-TIME WORD CONFIDENCE SCORING USING LOCAL POSTERIOR PROBABILITIES ON TREE TRELLIS SEARCHLee, A. / Shikano, K. / Kawahara, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.8: EXPERIMENTS IN KEYPAD-AIDED SPELLING RECOGNITIONParthasarathy, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.5: A MODIFIED EPHRAIM-MALAH NOISE SUPPRESSION RULE FOR AUTOMATIC SPEECH RECOGNITIONGemello, R. / Mana, F. / De Mori, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.13: PCMM-BASED FEATURE COMPENSATION SCHEMES USING MODEL INTERPOLATION AND MIXTURE SHARINGKim, W. / Kwon, O. / Ko, H. / IEEE Signal Processing Society et al. | 2004
- I
-
Speaker identification using supra-segmental pitch pattern dynamicsFarahani, F. / Georgiou, P.G. / Narayanan, S.S. et al. | 2004
- I
-
Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approachSelouani, S.A. / O'Shaughnessy, D. et al. | 2004
- I
-
Cepstral gain normalization for noise robust speech recognitionYoshizawa, S. / Hayasaka, N. / Wada, N. / Miyanaga, Y. et al. | 2004
- I
-
Development of the 2003 CU-HTK conversational telephone speech transcription systemEvermann, G. / Chan, H.Y. / Gales, M.J.F. / Hain, T. / Liu, X. / Mrva, D. / Wang, L. / Woodland, P.C. et al. | 2004
- I
-
On the architecture of the cdma2000® variable-rate multimode wideband (VMR-WB) speech coding standardJelinek, M. / Salami, R. / Ahmadi, S. / Bessette, B. / Gournay, P. / Laflamme, C. et al. | 2004
- I
-
An investigation into front-end signal processing for speaker normalizationUmesh, S. / Sinha, R. / Kumar, S.V.B. et al. | 2004
- I
-
Eigenspace-based MLLR with speaker adaptive training in large vocabulary conversational speech recognitionDoumpiotis, V. / Yonggang Deng, et al. | 2004
- I
-
Bootstrap estimates for confidence intervals in ASR performance evaluationBisani, M. / Ney, H. et al. | 2004
- I
-
Noise-dependent postfilteringGrancharov, V. / Samuelsson, J. / Kleijn, W.B. et al. | 2004
- I
-
Combined estimation/coding of highband spectral envelopes for speech spectrum expansionAgiomyrgiannakis, Y. / Stylianou, Y. et al. | 2004
- I
-
Multisensor MELPe using parameter substitutionBrady, K. / Quatieri, T.F. / Campbell, J.P. / Campbell, W.M. / Brandstein, M. / Weinstein, C.J. et al. | 2004
- I
-
Noise reduction on speech codec parametersTaddei, H. / Beaugeant, C. / de Meuleneire, M. et al. | 2004
- I
-
Sound feature detection using leaky integrate-and-fire neuronsSmith, L.S. / Fraser, D.S. et al. | 2004
- I
-
Minimum segmentation error based discriminative training for speech synthesis applicationYi-Jian Wu, / Hisashi Kawai, / Jinfu Ni, / Ren-Hua Wang, et al. | 2004
- I
-
Probability based prosody model for unit selectionXijun Ma, / Wei Zhang, / Weibin Zhu, / Qin Shi, / Ling Jin, et al. | 2004
- I
-
A strategy to solve data scarcity problems in corpus based intonation modellingCardenoso, V. / Escudero, D. et al. | 2004
- I
-
Speech synthesis from real time ultrasound images of the tongueDenby, B. / Stone, M. et al. | 2004
- I
-
Speech enhancement by perceptual filter with sequential noise parameter estimationTe-Won Lee, / Kaisheng Yao, et al. | 2004
- I
-
Speech enhancement with missing data techniques using recurrent neural networksParveen, S. / Green, P. et al. | 2004
- I
-
A generalized construction of integrated speech recognition transducersAllauzen, C. / Mohri, M. / Riley, M. / Roark, B. et al. | 2004
- I
-
A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMsTamura, S. / Iwano, K. / Furui, S. et al. | 2004
- I
-
Speech enhancement based on multiple directivity patterns using a microphone arraySekiya, T. / Kobayashi, T. et al. | 2004
- I
-
Noise robust speech recognition with a switching linear dynamic modelDroppo, J. / Acero, A. et al. | 2004
- I
-
Minimum mean square error filtering of noisy cepstral coefficients with applications to ASRMyrvoll, T.A. / Nakamura, S. et al. | 2004
- I
-
Combination of hidden Markov models with dynamic time warping for speech recognitionAxelrod, S. / Maison, B. et al. | 2004
- I
-
Exact training of a neural syntactic language modelEmami, A. / Jelinek, F. et al. | 2004
- I
-
A two-step noise reduction techniquePlapous, C. / Marro, C. / Mauuary, L. / Scalart, P. et al. | 2004
- I
-
Enrollment in low-resource speech recognition systemsDeligne, S. / Dharanipragada, S. et al. | 2004
- I
-
A multimedia approach for audio segmentation in TV broadcast newsPerez-Freire, L. / Garcia-Mateo, C. et al. | 2004
- I
-
The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluationMoraru, D. / Meignier, S. / Fredouille, C. / Besacier, L. / Bonastre, J.F. et al. | 2004
- I
-
Fusing language identification systems using performance confidence indexesGutierrez, J. / Rouas, J.L. / Andre-Obrecht, R. et al. | 2004
- I
-
Enhancement of mismatched conditions in speaker recognition for multimedia applicationsFakhr, W. / Abdelsalam, A. / Hamdy, N. et al. | 2004
- I
-
A detection based approach to robust speech understandingKuansan Wang, et al. | 2004
- I
-
Automatic learning of interpretation strategies for spoken dialogue systemsRaymond, C. / Bechet, F. / De Mori, R. / Damnati, G. / Esteve, Y. et al. | 2004
- I
-
Automatically derived units for segment vocodersRamasubramanian, V. / Sreenivas, T.V. et al. | 2004
- I
-
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architectureSchuller, B. / Rigoll, G. / Lang, M. et al. | 2004
- I
-
Clustering and segmenting speakers and their locations in meetingsAjmera, J. / Lathoud, G. / McCowan, I. et al. | 2004
- I
-
Analysis by synthesis of acoustic correlates of British, Australian and American accentsQin Yan, / Vaseghi, S. / Rentzos, D. / Ching-Hsiang Ho, et al. | 2004
- I
-
A low-band spectrum envelope modeling for high quality pitch modificationMochizuki, R. / Kobayashi, A. et al. | 2004
- I
-
Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesisToda, T. / Kawai, H. / Tsuzaki, M. et al. | 2004
- I
-
Corrective language modeling for large vocabulary ASR with the perceptron algorithmRoark, B. / Saraclar, M. / Collins, M. et al. | 2004
- I
-
Improved name recognition with meta-data dependent name networksMaskey, S.R. / Bacchiani, M. / Roark, B. / Sproat, R. et al. | 2004
- I
-
A new voice activity detector using subband order-statistics filters for robust speech recognitionRamirez, J. / Segura, J.C. / Benitez, C. / de la Torre, A. / Rubio, A. et al. | 2004
- I
-
Fusion based speech segmentation in DARPA SPINE2 taskChengyi Zheng, / Yonghong Yan, et al. | 2004
- I
-
Discriminative feature transformation by guided discriminative trainingHsiao, R. / Mak, B. et al. | 2004
- I
-
Decision tree based tone modeling for Chinese speech recognitionPui-Fung WONG, / Man-Hung SIU, et al. | 2004
- I
-
Joint removal of additive and convolutional noise with model-based feature enhancementStouten, V. / Van Hamme, H. / Wambacq, P. et al. | 2004
- I
-
Minimum classification error training of landmark models for real-time continuous speech recognitionMcDermott, E. / Hazen, T.J. et al. | 2004
- I
-
Universal compensation -- an approach to noisy speech recognition assuming no knowledge of noiseJi Ming, et al. | 2004
- I
-
SP-L2.1: DISCRIMINATIVE TRAINING FOR SPEAKER IDENTIFICATION BASED ON MAXIMUM MODEL DISTANCE ALGORITHMHong, Q. Y. / Kwong, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L1.6: VOICE CONVERSION THROUGH TRANSFORMATION OF SPECTRAL AND INTONATION FEATURESRentzos, D. / Vaseghi, S. / Yan, Q. / Ho, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.3: A SUBVECTOR-BASED ERROR CONCEALMENT ALGORITHM FOR SPEECH RECOGNITION OVER MOBILE NETWORKSTan, Z.-H. / Dalsgaard, P. / Lindberg, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.6: EFFICIENT AND ROBUST DISTRIBUTED SPEECH RECOGNITION (DSR) OVER WIRELESS FADING CHANNELS: 2D-DCT COMPRESSION, ITERATIVE BIT ALLOCATION, SHORT BCH CODE AND INTERLEAVINGHsu, W.-h. / Lee, L.-s. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.3: TEXT-INDEPENDENT SPEAKER RECOGNITION BY COMBINING SPEAKER-SPECIFIC GMM WITH SPEAKER ADAPTED SYLLABLE-BASED HMMNakagawa, S. / Zhang, W. / Takahashi, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L4.1: HIGH-LEVEL SPEAKER VERIFICATION USING SUPPORT VECTOR MACHINESCampbell, W. / Campbell, J. / Reynolds, D. / Jones, D. / Leek, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.5: WEIGHTED AUTOCORRELATION-BASED F0 ESTIMATION FOR DISTANT-TALKING INTERACTION WITH A DISTRIBUTED MICROPHONE NETWORKArmani, L. / Omologo, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.6: SPEECH FEATURE EXTRACTION METHOD REPRESENTING PERIODICITY AND APERIODICITY IN SUB BANDS FOR ROBUST SPEECH RECOGNITIONIshizuka, K. / Miyazaki, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.5: IMPROVED QUANTIZATION STRUCTURES USING GENERALIZED HMM MODELLING WITH APPLICATION TO WIDEBAND SPEECH CODINGDuni, E. / Subramaniam, A. / Rao, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.3: OVERDETERMINED BLIND SEPARATION FOR CONVOLUTIVE MIXTURES OF SPEECH BASED ON MULTISTAGE ICA USING SUBARRAY PROCESSINGNishikawa, T. / Abe, H. / Saruwatari, H. / Shikano, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.5: MULTIPLE-MICROPHONE TIME-VARYING FILTERS FOR ROBUST SPEECH RECOGNITIONLai, C. / Aarabi, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.5: CROSS-LINGUAL LATENT SEMANTIC ANALYSIS FOR LANGUAGE MODELINGKim, W. / Khudanpur, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.11: SPEAKER INDEXING AND ADAPTATION USING SPEAKER CLUSTERING BASED ON STATISTICAL MODEL SELECTIONNishida, M. / Kawahara, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.6: LANGUAGE BOUNDARY DETECTION AND IDENTIFICATION OF MIXED-LANGUAGE SPEECH BASED ON MAP ESTIMATIONShia, C.-J. / Chiu, Y.-H. / Hsieh, J.-H. / Wu, C.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.6: MULTISENSOR MELPE USING PARAMETER SUBSTITUTIONBrady, K. / Quatieri, T. / Campbell, J. / Campbell, W. / Brandstein, M. / Weinstein, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.11: AN IMPROVED CORRECTION FORMULA FOR THE ESTIMATION OF HARMONIC MAGNITUDES AND ITS APPLICATION TO OPEN QUOTIENT ESTIMATIONIseli, M. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.6: HMM-BASED FREQUENCY BANDWIDTH EXTENSION FOR SPEECH ENHANCEMENT USING LINE SPECTRAL FREQUENCIESChen, G. / Parsa, V. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.8: PERCEPTUAL KALMAN FILTERING FOR SPEECH ENHANCEMENT IN COLORED NOISEMa, N. / Bouchard, M. / Goubran, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.10: AN MMSE SPEECH ENHANCEMENT APPROACH INCORPORATING MASKING PROPERTIESYou, C. h. / Koh, S. n. / Rahardja, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.1: IMPROVING BROADCAST NEWS TRANSCRIPTION BY LIGHTLY SUPERVISED DISCRIMINATIVE TRAININGChan, H. Y. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.10: SEQUENTIAL CLUSTERING ALGORITHM FOR GAUSSIAN MIXTURE INITIALIZATIONMessina, R. / Jouvet, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.2: A NEW VOICE ACTIVITY DETECTOR USING SUBBAND ORDER-STATISTICS FILTERS FOR ROBUST SPEECH RECOGNITIONRamirez, J. / Segura, J. C. / Benitez, C. / de la Torre, A. / Rubio, A. J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.4: A STREAM-WEIGHT OPTIMIZATION METHOD FOR AUDIO-VISUAL SPEECH RECOGNITION USING MULTI-STREAM HMMSTamura, S. / Iwano, K. / Furui, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.5: A FACTORIAL HMM APPROACH TO SIMULTANEOUS RECOGNITION OF ISOLATED DIGITS SPOKEN BY MULTIPLE TALKERS ON ONE AUDIO CHANNELDeoras, A. / Hasegawa-Johnson, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.10: PARAMETER SHARING IN SUBBAND LIKELIHOOD-MAXIMIZING BEAMFORMING FOR SPEECH RECOGNITION USING MICROPHONE ARRAYSSeltzer, M. / Stern, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P14.7: CHINESE-ENGLISH BILINGUAL PHONE MODELING FOR CROSS-LANGUAGE SPEECH RECOGNITIONYu, S. / Zhang, S. / Xu, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.10: MINIMUM KULLBACK-LEIBLER DISTANCE BASED MULTIVARIATE GAUSSIAN FEATURE ADAPTATION FOR DISTANT-TALKING SPEECH RECOGNITIONPan, Y. / Waibel, A. / IEEE Signal Processing Society et al. | 2004
- I
-
Lightly supervised acoustic model training using consensus networksLangzhou Chen, / Lamel, L. / Gauvain, J.L. et al. | 2004
- I
-
Performance comparisons of all-pass transform adaptation with maximum likelihood linear regressionMcDonough, J. / Waibel, A. et al. | 2004
- I
-
Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognitionXiaodong He, / Yunxin Zhao, et al. | 2004
- I
-
Identifying in-set and out-of-set speakers using neighborhood informationAngkititrakul, P. / Hansen, J.H.L. et al. | 2004
- I
-
A data mining approach to objective speech quality measurementWei Zha, / Wai-Yip Chan, et al. | 2004
- I
-
Efficient spectrum coding for super-wideband speech and its application to 7/10/15 kHz bandwidth scalable codersOshikiri, M. / Ehara, H. / Yoshida, K. et al. | 2004
- I
-
Towards multilingual speech recognition using data driven source/target acoustical units associationBayeh, R. / Lin, S. / Chollet, G. / Mokbel, C. et al. | 2004
- I
-
Formant frequency estimation in noiseBin Chen, / Loizou, P.C. et al. | 2004
- I
-
Watermarking of speech signals using the sinusoidal model and frequency modulation of the partialsGirin, L. / Marchand, S. et al. | 2004
- I
-
Automated lip-reading for improved speech intelligibilityMcClain, M. / Brady, K. / Brandstein, M. / Quatieri, T. et al. | 2004
- I
-
Out-of-domain detection based on confidence measures from multiple topic classificationLane, I.R. / Kawahara, T. / Matsui, T. / Nakamura, S. et al. | 2004
- I
-
Cross-dialectal acoustic data sharing for Arabic speech recognitionKirchhoff, K. / Vergyri, D. et al. | 2004
- I
-
Filler model based confidence measures for spoken dialogue systems: a case study for TurkishAkyol, A. / Erdogan, H. et al. | 2004
- I
-
Rao-Blackwellised Gibbs sampling for switching linear dynamical systemsRosti, A.V.I. / Gales, M.J.F. et al. | 2004
- I
-
Training for polynomial segment model using the expectation maximization algorithmChak-Fai Li, / Man-Hung Siu, et al. | 2004
- I
-
Acoustic model adaptation using first order prediction for reverberant speechTakiguchi, T. / Nishimura, M. et al. | 2004
- I
-
On tracking noise with linear dynamical system modelsRaj, B. / Singh, R. / Stern, R. et al. | 2004
- I
-
Nonlinear noise compensation in feature domain for speech recognition with numerical methodsHui Jiang, / Qi Wang, et al. | 2004
- I
-
Tone articulation modeling for Mandarin spontaneous speech recognitionJian-lai Zhou, / Ye Tian, / Yu Shi, / Chao Huang, / Chang, E. et al. | 2004
- I
-
SP-L1.4: ALGORITHM AMALGAM: MORPHING WAVEFORM BASED METHODS, SINUSOIDAL MODELS AND STRAIGHTKawahara, H. / Banno, H. / Irino, T. / Zolfaghari, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.5: GENERALIZED LOCALLY RECURRENT PROBABILISTIC NEURAL NETWORKS FOR TEXT-INDEPENDENT SPEAKER VERIFICATIONGanchev, T. / Fakotakis, N. / Tasoulis, D. / Vrahatis, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L2.6: DISCRIMINATION POWER WEIGHTED SUBWORD-BASED SPEAKER VERIFICATIONChan, S.-M. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L3.1: SOFT DECODING STRATEGIES FOR DISTRIBUTED SPEECH RECOGNITION OVER IP NETWORKSCardenal-Lopez, A. / Docio-Fernandez, L. / Garcia-Mateo, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L5.6: A NOVEL METHOD FOR COMPUTATION OF PERIODICITY, APERIODICITY AND PITCH OF SPEECH SIGNALSDeshmukh, O. / Singh, J. / Espy-Wilson, C. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.1: LOW-COMPLEXITY PREDICTIVE TRELLIS CODED QUANTIZATION OF WIDEBAND SPEECH LSF PARAMETERSShin, Y. / Kang, S. / Fischer, T. R. / Son, C. / Lee, Y. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.4: SPEECH ENHANCEMENT BASED ON A COMBINED MULTI-CHANNEL ARRAY WITH CONSTRAINED ITERATIVE AND AUDITORY MASKED PROCESSINGZhang, X. / Hansen, J. H. L. / Arehart, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.5: ENHANCEMENT OF MISMATCHED CONDITIONS IN SPEAKER RECOGNITION FOR MULTIMEDIA APPLICATIONSFakhr, W. / Abdelsalam, A. / Hamdy, N. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.12: A PITCH SYNCHRONOUS FEATURE EXTRACTION METHOD FOR SPEAKER RECOGNITIONKim, S. / Eriksson, T. / Kang, H.-G. / Youn, D. H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.3: ROBUST MULTIMODAL UNDERSTANDINGBangalore, S. / Johnston, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P5.9: WIDEBAND AUDIO OVER NARROWBAND LOW-RESOLUTION MEDIADing, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.12: ENTROPY-BASED VARIABLE FRAME RATE ANALYSIS OF SPEECH SIGNALS AND ITS APPLICATION TO ASRYou, H. / Zhu, Q. / Alwan, A. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.6: IMPORTANCE OF WINDOW SHAPE FOR PHASE-ONLY RECONSTRUCTION OF SPEECHAlsteris, L. / Paliwal, K. K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.7: CLOSED-FORM ESTIMATION OF THE AMPLITUDE COMMANDS IN THE AUTOMATIC EXTRACTION OF FUJISAKI'S MODELSilva, S. / Netto, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.5: ESTIMATION OF SHORT-TERM PREDICTOR PARAMETERS FOR CODING AND ENHANCEMENT OF NOISY SPEECHSrinivasan, S. / Samuelsson, J. / Kleijn, W. B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P10.12: SPEECH ENHANCEMENT WITH MISSING DATA TECHNIQUES USING RECURRENT NEURAL NETWORKSParveen, S. / Green, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.5: GENERATING AND EVALUATING SEGMENTATIONS FOR AUTOMATIC SPEECH RECOGNITION OF CONVERSATIONAL TELEPHONE SPEECHTranter, S. / Yu, K. / Evermann, G. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.12: TRAINING FOR POLYNOMIAL SEGMENT MODEL USING THE EXPECTATION MAXIMIZATION ALGORITHMLi, C.-F. / Siu, M.-H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.6: UNIVERSAL COMPENSATION - AN APPROACH TO NOISY SPEECH RECOGNITION ASSUMING NO KNOWLEDGE OF NOISEMing, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.9: SNR-DEPENDENT NON-UNIFORM SPECTRAL COMPRESSION FOR NOISY SPEECH RECOGNITIONChu, K.-k. / Leung, S. H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.2: TONE ARTICULATION MODELING FOR MANDARIN SPONTANEOUS SPEECH RECOGNITIONZhou, J.-L. / Tian, Y. / Shi, Y. / Huang, C. / Chang, E. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.3: SPATIO-TEMPORAL PROCESSING FOR DISTANT SPEECH RECOGNITIONLow, S. Y. / Togneri, R. / Nordholm, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L7.2: MULTIPLE FRAME BLOCK QUANTISATION OF LINE SPECTRAL FREQUENCIES USING GAUSSIAN MIXTURE MODELS | Paliwal, K. K. / So, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.1: EFFECTS OF TRANSCRIPTION ERRORS ON SUPERVISED LEARNING IN SPEECH RECOGNITION | Sundaram, R. / Picone, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.3: MPE-BASED DISCRIMINATIVE LINEAR TRANSFORM FOR SPEAKER ADAPTATION | Wang, L. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.4: A STUDY OF VARIOUS COMPOSITE KERNELS FOR KERNEL EIGENVOICE SPEAKER ADAPTATION | Mak, B. / Kwok, J. / Ho, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.2: A DETECTION BASED APPROACH TO ROBUST SPEECH UNDERSTANDING | Wang, K. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.8: JOINT FREQUENCY DOMAIN AND RECONSTRUCTED PHASE SPACE FEATURES FOR SPEECH RECOGNITION | Lindgren, A. / Johnson, M. / Povinelli, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P7.8: FORMANT FREQUENCY ESTIMATION IN NOISE | Chen, B. / Loizou, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.2: SPEECH DISCRIMINATION BASED ON MULTISCALE SPECTRO-TEMPORAL MODULATIONS | Mesgarani, N. / Shamma, S. / Slaney, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.4: VOICE ACTIVITY DETECTION USING VISUAL INFORMATION | Liu, P. / Wang, Z. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.5: SPEECH MODELING AND VOICED/UNVOICED/MIXED/SILENCE SPEECH SEGMENTATION WITH FRACTIONALLY GAUSSIAN NOISE BASED MODELS | Oveisgharan, S. / Shamsollahi, M. B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P8.6: SOUND FEATURE DETECTION USING LEAKY INTEGRATE-AND-FIRE NEURONS | Smith, L. / Fraser, D. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.7: A REAL-TIME CANTONESE TEXT-TO-AUDIOVISUAL SPEECH SYNTHESIZER | Wang, J.-Q. / Wong, K.-H. / Heng, P.-A. / Meng, H. M.-L. / Wong, T.-T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.15: SPEECH SYNTHESIS FROM REAL TIME ULTRASOUND IMAGES OF THE TONGUE | Denby, B. / Stone, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.12: FILLER MODEL BASED CONFIDENCE MEASURES FOR SPOKEN DIALOGUE SYSTEMS: A CASE STUDY FOR TURKISH | Akyol, A. / Erdogan, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.2: BASIS SUPERPOSITION PRECISION MATRIX MODELLING FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION | Sim, K. C. / Gales, M. J. F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.3: JOINT REMOVAL OF ADDITIVE AND CONVOLUTIONAL NOISE WITH MODEL-BASED FEATURE ENHANCEMENT | Stouten, V. / Van hamme, H. / Wambacq, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P15.7: ON TRACKING NOISE WITH LINEAR DYNAMICAL SYSTEM MODELS | Raj, B. / Singh, R. / Stern, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P16.8: MITIGATION OF CHANNEL ERRORS IN EFR-BASED SPEECH RECOGNITION | Gomez, A. M. / Peinado, A. M. / Sanchez, V. E. / Perez-Cordoba, J. L. / Rubio, A. J. / IEEE Signal Processing Society et al. | 2004
- I
-
Improvement of speaker recognition by combining residual and prosodic features with acoustic features | Shi-Han Chen / Hsiao-Chuan Wang et al. | 2004
- I
-
Dimensionality reduction using MCE-optimized LDA transformation | Xiao-Bing Li / Jin-Yu Li / Ren-Hua Wang et al. | 2004
- I
-
Light supervision in acoustic model training | Long Nguyen / Bing Xiang et al. | 2004
- I
-
Overdetermined blind separation for convolutive mixtures of speech based on multistage ICA using subarray processing | Nishikawa, T. / Abe, H. / Saruwatari, H. / Shikano, K. et al. | 2004
- I
-
A study of design compromises for speech coders in packet networks | Lefebvre, R. / Philippe, G.T. / Salami, R. et al. | 2004
- I
-
Improvement issues on transcoding algorithms: for the flexible usage to the various pairs of speech codec | Jin-Kyu Choi / Chang-Heon Lee / Hong-Goo, K. / Young-Cheol Park / Dae Hee Youn et al. | 2004
- I
-
Parameterization of the score threshold for a text-dependent adaptive speaker verification system | Mirghafori, N. / Hebert, M. et al. | 2004
- I
-
Wideband audio over narrowband low-resolution media | Heping Ding et al. | 2004
- I
-
A differential spectral voice activity detector | Garner, P.N. / Fukada, T. / Komori, Y. et al. | 2004
- I
-
Scaling of waveform segments along the time axis for concatenative speech synthesis | Nishizawa, N. / Kawai, H. et al. | 2004
- I
-
Sequential clustering algorithm for Gaussian mixture initialization | Messina, R. / Jouvet, D. et al. | 2004
- I
-
An analysis of interleavers for robust speech recognition in burst-like packet loss | James, A.B. / Milner, B.P. et al. | 2004
- I
-
A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel | Deoras, A.N. / Hasegawa-Johnson, A. et al. | 2004
- I
-
PCMM-based feature compensation schemes using model interpolation and mixture sharing | Wooil Kim / Ohil Kwon / Hanseok Ko et al. | 2004
- I
-
Asynchronous HMM with applications to speech recognition | Garg, A. / Balakrishnan, S. / Vaithyanathan, S. et al. | 2004
- I
-
SP-L6.1: NON-UNIFORM SPEAKER NORMALIZATION USING AFFINE-TRANSFORMATION | Bharath Kumar, S. V. / Umesh, S. / Sinha, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L6.4: ROBUST SPEECH FEATURE EXTRACTION BY GROWTH TRANSFORMATION IN REPRODUCING KERNEL HILBERT SPACE | Chakrabartty, S. / Deng, Y. / Cauwenberghs, G. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L8.6: LIGHTLY SUPERVISED ACOUSTIC MODEL TRAINING USING CONSENSUS NETWORKS | Chen, L. / Lamel, L. / Gauvain, J.-L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L10.2: MICROPHONE ARRAY POST-FILTER FOR SEPARATION OF SIMULTANEOUS NON-STATIONARY SOURCES | Valin, J.-M. / Rouat, J. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.1: META-DATA CONDITIONAL LANGUAGE MODELING | Bacchiani, M. / Roark, B. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-L11.3: DEVELOPMENT OF THE 2003 CU-HTK CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEM | Evermann, G. / Chan, H. Y. / Gales, M. J. F. / Hain, T. / Liu, X. / Mrva, D. / Wang, L. / Woodland, P. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.1: A STUDY OF DESIGN COMPROMISES FOR SPEECH CODERS IN PACKET NETWORKS | Lefebvre, R. / Gournay, P. / Salami, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P1.2: IMPROVEMENT ISSUES ON TRANSCODING ALGORITHMS: FOR THE FLEXIBLE USAGE TO THE VARIOUS PAIRS OF SPEECH CODEC | Choi, J.-K. / Lee, C.-H. / Kang, H.-G. / Park, Y.-C. / Youn, D. H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P2.6: ONLINE SPEAKER CLUSTERING | Liu, D. / Kubala, F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P3.7: FUSING LANGUAGE IDENTIFICATION SYSTEMS USING PERFORMANCE CONFIDENCE INDEXES | Gutierrez, J. / Rouas, J.-L. / Andre-Obrecht, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P4.1: BOOTSTRAP ESTIMATES FOR CONFIDENCE INTERVALS IN ASR PERFORMANCE EVALUATION | Bisani, M. / Ney, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.10: ON USE OF TASK INDEPENDENT TRAINING DATA IN TANDEM FEATURE EXTRACTION | Sivadas, S. / Hermansky, H. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P6.11: FEATURE GENERATION BASED ON MAXIMUM NORMALIZED ACOUSTIC LIKELIHOOD FOR IMPROVED SPEECH RECOGNITION | Li, X. / Stern, R. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P9.6: PROBABILITY BASED PROSODY MODEL FOR UNIT SELECTION | Ma, X. J. / Zhang, W. / Zhu, W. B. / Shi, Q. / Jin, L. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.3: HYBRID LANGUAGE MODELS FOR OUT OF VOCABULARY WORD DETECTION IN LARGE VOCABULARY CONVERSATIONAL SPEECH RECOGNITION | Yazgan, A. / Saraclar, M. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P11.6: OUT-OF-DOMAIN DETECTION BASED ON CONFIDENCE MEASURES FROM MULTIPLE TOPIC CLASSIFICATION | Lane, I. / Kawahara, T. / Matsui, T. / Nakamura, S. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.1: MODEL COMPLEXITY CONTROL AND COMPRESSION USING DISCRIMINATIVE GROWTH FUNCTIONS | Liu, X. / Gales, M. J. F. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P12.11: A VITERBI ALGORITHM FOR A TRAJECTORY MODEL DERIVED FROM HMM WITH EXPLICIT RELATIONSHIP BETWEEN STATIC AND DYNAMIC FEATURES | Zen, H. / Tokuda, K. / Kitamura, T. / IEEE Signal Processing Society et al. | 2004
- I
-
SP-P13.3: AN ANALYSIS OF INTERLEAVERS FOR ROBUST SPEECH RECOGNITION IN BURST-LIKE PACKET LOSS | James, A. / Milner, B. / IEEE Signal Processing Society et al. | 2004
- I
-
Voice conversion through transformation of spectral and intonation features | Rentzos, D. / Vaseghi, S. / Qin Yan / Ching-Hsiang Ho et al. | 2004
- I
-
Discriminative training for speaker identification based on maximum model distance algorithm | Hong, Q.Y. / Kwong, S. et al. | 2004
- I
-
Extraction of pitch in adverse conditions | Prasanna, S.R.M. / Yegnanarayana, B. et al. | 2004
- I
-
A locally weighted distance measure for example based speech recognition | De Wachter, M. / Demuynck, K. / Wambacq, P. / Van Compernolle, D. et al. | 2004
- I
-
A noise estimation algorithm with rapid adaptation for highly nonstationary environments | Rangachari, S. / Loizou, P.C. / Yi Hu et al. | 2004
- I
-
Feature space Gaussianization | Saon, G. / Dharanipragada, S. / Povey, D. et al. | 2004
- I
-
Desperately seeking impostors: data-mining for competitive impostor testing in a text-dependent speaker verification system | Hebert, M. / Mirghafori, N. et al. | 2004
- I
-
Low-complexity multi-rate lattice vector quantization with application to wideband TCX speech coding at 32 kbit/s | Ragot, S. / Bessette, B. / Lefebvre, R. et al. | 2004
- I
-
Speech discrimination based on multiscale spectro-temporal modulations | Mesgarani, N. / Shamma, S. / Slaney, M. et al. | 2004
- I
-
Voice activity detection using visual information | Peng Liu / Zuoying Wang et al. | 2004
- I
-
Feature selection for improved bandwidth extension of speech signals | Jax, P. / Vary, P. et al. | 2004
- I
-
Speech enhancement using robust weighting factors for critical-band-wavelet-packet transform | Ching-Ta Lu / Hsiao-Chuan Wang et al. | 2004
- I
-
An MMSE speech enhancement approach incorporating masking properties | Chang Huai You / Soo Ngee Koh / Rahardja, S. et al. | 2004
- I
-
Real-time word confidence scoring using local posterior probabilities on tree trellis search | Lee, A. / Shikano, K. / Kawahara, T. et al. | 2004