Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs) (Englisch)

Gupta, Priyanka / Chodingala, Piyushkumar K. / Patil, Hemant A.

In: 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ; 125-130 ; 2023

ISBN:

979-8-3503-0067-3

ISSN:

2640-0103

Aufsatz (Konferenz) / Elektronische Ressource

Wie erhalte ich diesen Titel?

Zugriff prüfen

Download

Kommerziell Vergütung an den Verlag: 30,47 € Grundgebühr: 4,00 € Gesamtpreis: 34,47 €

Akademisch Vergütung an den Verlag: 30,47 € Grundgebühr: 2,00 € Gesamtpreis: 32,47 €

Exportieren, teilen und zitieren

There have been various studies involving Instantaneous Frequency (IF) estimation for Spoofed Speech Detection (SSD) task, such as the derivative of the phase obtained by Hilbert Transform (HT) approach and Energy Separation-based method. However, IF estimation by HT leads to lack of good temporal resolution. On the other hand, ESA-based method leads to excellent time resolution, however, lacks the relative phase information. Therefore, in this paper, we have proposed Cochlear Filter Cepstral Coefficients-based Instantaneous Frequency using Quadrature Energy Separation Algorithm (CFCCIF-QESA) feature set, which merits of having an excellent time resolution as well as inclusion of the relative phase information. Hence, we illustrate the significance of incorporating the quadrature-phase component along with the in-phase component for SSD of replay detection in VAs. To that effect, we perform experiments on the Realistic Replay AttackF Microphone-Array Speech Corpus (ReMASC) dataset. Furthermore, the proposed CFCCIF-QESA feature set gives 28.71 and 29.89 %EER using GMM and CNN respectively, on Eval set. The proposed feature set is evaluated using various performance metrics, including EER and other confusion matrix-based metrics. Finally, the latency for CFCCIF-QESA and CFCCIF-ESA is presented, showing the better suitability of the proposed CFCCIF-QESA feature set w.r.t. practical deployment.

Titel:

Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs)
Beteiligte:

Gupta, Priyanka ( Autor:in ) / Chodingala, Piyushkumar K. ( Autor:in ) / Patil, Hemant A. ( Autor:in )
Erschienen in:

2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ; 125-130
Verlag:

IEEE

Erscheinungsdatum:

31.10.2023
Format / Umfang:

1381644 byte
ISBN:

979-8-3503-0067-3
ISSN:

2640-0103
DOI:

https://doi.org/10.1109/APSIPAASC58517.2023.10317171
Medientyp:

Aufsatz (Konferenz)
Format:

Elektronische Ressource
Sprache:

Englisch
Datenquelle:

IEEE

Inhaltsverzeichnis Konferenzband

Die Inhaltsverzeichnisse werden automatisch erzeugt und basieren auf den im Index des TIB-Portals verfügbaren Einzelnachweisen der enthaltenen Beiträge. Die Anzeige der Inhaltsverzeichnisse kann daher unvollständig oder lückenhaft sein.

1: Mixed Emotion Recognition Based on EEG Signals
Pei, Guanxiong / Li, Bingjie / Li, Taihao / Fan, Cunhang / Zhang, Chao / Lv, Zhao et al. | 2023
Elektronische Ausgabe
8: Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition
Niimura, Yoshiki / Takemoto, Jun / Kai, Atsuhiko / Nakagawa, Seiichi et al. | 2023
Elektronische Ausgabe
15: Manipulation of Neuronal Network Firing Patterns using Temporal Deep Unfolding-based MPC
Aizawa, Jumpei / Ogura, Masaki / Shimono, Masanori / Wakamiya, Naoki et al. | 2023
Elektronische Ausgabe
22: Goodness of Fit to the Convolution Model of fMRI Data and Determination of the Regularization Parameter
Nakamura, Wakako et al. | 2023
Elektronische Ausgabe
27: Detection model of sister chromatid cohesion defects based on Vision Transformer
Matsumoto, Shinya / Okubo, Kan / Abe, Takuya / Nishikawa, Kiyoshi et al. | 2023
Elektronische Ausgabe
32: GRALA: modeling social information for microblog sentiment analysis from the view of balancing sparsity and smoothness of social contexts
Zou, Xiaomei / Hu, Shiyong / Li, Taihao et al. | 2023
Elektronische Ausgabe
38: Adopting Neural Translation Model in Data Generation for Inverse Text Normalization
Jiang, Yufei / Ho, Thi-Nga / Chng, Eng-Siong et al. | 2023
Elektronische Ausgabe
46: Mismatched Semi-supervised Learning with Feature Similarity Consistency
Liang, Zechen / Fan, Qiaosong / Wang, Yuan-Gen et al. | 2023
Elektronische Ausgabe
51: Collaborative Pseudo Labeling for Prompt-Based Learning
Chien, Jen-Tzung / Chen, Chien-Ching et al. | 2023
Elektronische Ausgabe
57: Learning Meta Soft Prompt for Few-Shot Language Models
Chien, Jen-Tzung / Chen, Ming-Yen / Xue, Jing-Hao et al. | 2023
Elektronische Ausgabe
63: MSDF-Net: A Multi-Scale Deep Fusion Network with Dilated Convolutions for Cloud Removal from Sentinel-2 Imagery
Jayakrishnan, A / Venkatesan, M / Prabhavathy, P / Alkha, Mohan et al. | 2023
Elektronische Ausgabe
71: Instance Implant-Aided Non-uniformly Cropping for Person Detection in Aerial Images
Zhang, Xiangqing / Feng, Yan / Zhang, Shun / Wang, Yuning et al. | 2023
Elektronische Ausgabe
84: Unbiased Decision-Making Framework in Long-Video Macro & Micro-Expression Spotting
Tan, Pei-Sze / Rajanala, Sailaja / Pal, Arghya / Phan, Raphael C.-W. / Ong, Huey-Fang et al. | 2023
Elektronische Ausgabe
90: Adaptive Beamforming Based on Interference-Plus-Noise Covariance Matrix Reconstruction for Speech Separation
Xiao, Yongxiong / Zhu, Shiqiang / Li, Te / Wan, Minhong / Song, Wei / Gu, Jason / Fu, Qiang et al. | 2023
Elektronische Ausgabe
96: Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization
Chen, Hang / Du, Jun / Wang, Zhe / Wang, Chenxi / Ren, Yuling / Li, Qinglong / Liu, Ruibo / Lee, Chin-Hui et al. | 2023
Elektronische Ausgabe
102: CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization
Zhou, Haodong / Li, Tao / Wang, Jie / Li, Lin / Hong, Qingyang et al. | 2023
Elektronische Ausgabe
107: Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction
Guo, Aoqi / Wu, Junnan / Gao, Peng / Zhu, Wenbo / Guo, Qinwen / Gao, Dazhi / Wang, Yujun et al. | 2023
Elektronische Ausgabe
114: Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues
Zeng, Bang / Suo, Hongbin / Wan, Yulong / Li, Ming et al. | 2023
Elektronische Ausgabe
119: Modeling Suprasegmental Information Using Finite Difference Network for End-to-End Speaker Verification
Li, Jin / Mak, Man-Wai / Yan, Nan / Wang, Lan et al. | 2023
Elektronische Ausgabe
125: Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs)
Gupta, Priyanka / Chodingala, Piyushkumar K. / Patil, Hemant A. et al. | 2023
Elektronische Ausgabe
131: Exploring Residual Cepstral Features for Spoken Language Identification
Hora, Baveet Singh / Parmar, Krishna / Machhar, Shrey / Patil, Hemant A. / Praveen, Kiran / Radhakrishnan, Balaji et al. | 2023
Elektronische Ausgabe
139: Consideration of Varying Training Lengths for Short-Duration Speaker Verification
Ko, WooSeok / Um, Seyun / Piao, Zhenyu / Kang, Hong-Goo et al. | 2023
Elektronische Ausgabe
145: Adversarial Robustness of Mel Based Speaker Recognition Systems
Srivastava, Ritu / Kosgi, Saiteja / Sivaprasad, Sarath / Sahipjohn, Neha / Gandhi, Vineet et al. | 2023
Elektronische Ausgabe
151: Joint Drum Transcription and Metrical Analysis Based on Periodicity-Aware Multi-Task Learning
Kamakura, Daichi / Nanamura, Eita / Oyama, Takehisa / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
158: CTC2: End-to-End Drum Transcription Based on Connectionist Temporal Classification With Constant Tempo Constraint
Kamakura, Daichi / Nakamura, Eita / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
165: Learning Multifaceted Self-Similarity for Musical Structure Analysis
Chen, Tsung-Ping / Su, Li / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
173: Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials
Kawahara, Hideki / Yatabe, Kohei / Sakakibara, Ken-Ichi / Mizumachi, Mitsunori / Kitamura, Tatsuya et al. | 2023
Elektronische Ausgabe
187: Gait Analysis in Powered Exoskeleton-Assisted Walking in Patients with Stroke: A Case Series Cohort
Huang, Jian-Jia / Chang, Shih-Chieh / Cheng, Cheng-Hsu / Wan, Timothy / Pei, Yu-Cheng et al. | 2023
Elektronische Ausgabe
195: Prediction Model of Postoperative Pain Exacerbation Using a Wearable Electrocardiogram Sensor
Nakanishi, Toshiyuki / Fujiwara, Koichi / Sobue, Kazuya et al. | 2023
Elektronische Ausgabe
199: Directional Neural Connectivity during Robot Mirror Therapy in Patients with Stroke
Kanaizuka, Yuma / Manabe, Takahiro / Huang, Jian-Jia / Hung, Jen-Wen / Ono, Yumie et al. | 2023
Elektronische Ausgabe
206: Evaluation of neural response recorded using scalp EEG in virtual reality environment
Kanayama, Noriaki / Miyakoshi, Makoto / Machizawa, Maro et al. | 2023
Elektronische Ausgabe
211: Machine Learning Based Action Recognition with Modular CNN
Huang, Shi-Zong / Chiu, Ching-Te / Chang, Yu-Jen et al. | 2023
Elektronische Ausgabe
217: Real-Time Processing for Weighted Pulse Decomposition of Photoplethysmography Signals Based on Interior Point Method in Wearable Devices for Hemodynamic State
Wong, Ting-Jui / Tsai, Pei-Yun et al. | 2023
Elektronische Ausgabe
222: QoS-Aware Downlink Beamforming for Joint Transmission in Multi-Cell Networks
Lin, Chen-Yen / Liu, Kuang-Hao Stanley et al. | 2023
Elektronische Ausgabe
230: Deep-Learning-Based Lattice Reduction Preprocessing for Time-Correlated MIMO Systems
Li, Yi-Mei / Chi, Jung-Chun / Huang, Yuan-Hao et al. | 2023
Elektronische Ausgabe
238: Utilizing Unlabeled Data and Synthetic Data for Bird Sound Detection: Consistency Training, Mean Teacher, and Domain Adaptation Techniques
Chen, Fang-Ching / Liu, Yi-Wen et al. | 2023
Elektronische Ausgabe
243: A Comparative Evaluation of Video Codecs for rPPG-based Heart Rate Estimation
Hyanda, Muhammad H. / Ahmadi, Nur / Charlton, Peter H. / Constandinou, Timothy G. / Purwarianti, Ayu / Adiono, Trio et al. | 2023
Elektronische Ausgabe
248: Human Activity Recognition Based on FMCW Radar Using CNN and Transfer Learning
Triani, Listi Restu / Ahmadi, Nur / Adiono, Trio et al. | 2023
Elektronische Ausgabe
254: DQN Algorithm Design for Fast Efficient Shortest Path System
Sumarudin, A / Sutisna, Nana / Syafalni, Infall / Trilaksono, Bambang Riyanto / Adiono, Trio et al. | 2023
Elektronische Ausgabe
261: Comparison of MPPT based on Deep Reinforcement Learning by DQN, DDPG and TD3
Panggabean, Jayandi / Sutisna, Nana / Syafalni, Infall / Adiono, Trio et al. | 2023
Elektronische Ausgabe
267: Signal Quality Assessment for Wearable Multichannel Photoplethysmography Signals
Prihatmoko, Muhammad Dzaky / Ahmadi, Nur / Charlton, Peter H. / Adiono, Trio et al. | 2023
Elektronische Ausgabe
272: After-Fatigue Condition: A Novel Analysis Based on Surface EMG Signals
Nguyen, Van-Hieu / Luu, Gia Thien / Van Luong, Thien / Trang, Mai Xuan / Ravier, Philippe / Buttelli, Olivier et al. | 2023
Elektronische Ausgabe
278: On the Semi-Blind Mutually Referenced Equalizers for MIMO Systems
Son, Do Hai / Abed-Meraim, Karim / Duy, Tran Trong / Trung, Nguyen Linh / Quynh, Tran Thi Thuy et al. | 2023
Elektronische Ausgabe
284: Accurate continuous action and gesture recognition method based on skeleton and sliding windows techniques
Le, Viet-Duc / Nghiem, Thi-Lich / Le, Thi-Lan et al. | 2023
Elektronische Ausgabe
291: Transformer-Based Deep Learning Detector for Dual-Mode Index Modulation 3D-OFDM
Gian, Toan / Nguyen, Tien-Hoa / Nguyen, Trung Tan / Pham, Van-Cuong / Van Luong, Thien et al. | 2023
Elektronische Ausgabe
297: GAFormer: Wearable IMU-Based Human Activity Recognition with Gramian Angular Field and Transformer
Le, Trung-Hieu / Nguyen, Thai-Khanh / Tran, Trung-Kien / Tran, Thanh-Hai / Pham, Cuong et al. | 2023
Elektronische Ausgabe
304: Fatigue Classification and Onset estimation using Surface EMG Signals during Strength Training
Adapa, Eswar / Turlapaty, Anish C / Naidu, Surya et al. | 2023
Elektronische Ausgabe
311: P300 Event-Related Potential in Perception of Multiple Traffic Objects During Vehicle Driving
Yamamoto, Yuki / Nobukawa, Sou / Wagatsuma, Nobuhiko / Inagaki, Keiichiro et al. | 2023
Elektronische Ausgabe
317: Kernel Random Projection Depth for Outlier Detection
Tamamori, Akira et al. | 2023
Elektronische Ausgabe
325: Soft-Sensor Construction Method Based on Adaptive Modeling and Transfer Learning for Manufacturing Process Including Maintenance Periods
Katayama, Kaito / Fujiwara, Koichi / Yamamoto, Kazuki et al. | 2023
Elektronische Ausgabe
329: Detecting Wire Bonding Defects in Point Clouds on Self-Generated Dataset
Yuen, Shang Li / Lau, Phooi Yee / Wong, Chin Wee / Samsuri, Muhammad Hafiz / Hussin, Zarina / Kamarudin, Nur Afiqah / Talib, Muhammad Syukri Mohd / Hon, Hock Woon et al. | 2023
Elektronische Ausgabe
336: Predicting Outcomes of Cognitive Behavioral Therapy for Depression Using Data Driven Approaches
Tyszczuk, Lily / Levita, Liat / Delgadillo, Jaime / Haihong, Zhang / Arvaneh, Mahnaz et al. | 2023
Elektronische Ausgabe
344: Learning Adapters for Code-Switching Speech Recognition
He, Chun-Yi / Chien, Jen-Tzung et al. | 2023
Elektronische Ausgabe
350: FID-RPRGAN-VC: Fréchet Inception Distance Loss based Region-wise Position Normalized Relativistic GAN for Non-Parallel Voice Conversion
Dhar, Sandipan / Akhter, MD. Tousin / Banerjee, Padmanabha / Jana, Nanda Dulal / Das, Swagatam et al. | 2023
Elektronische Ausgabe
357: Deformable Aligned Fusion for Video Super Resolution
Lee, Sin-Hong / Kuo, Chih-Hung / Yu, Tsai-Chun et al. | 2023
Elektronische Ausgabe
365: Learning Single Image Rain Streak Removal Based on Deep Attention Mechanism
Huang, Kuan-Hua / Kang, Li-Wei et al. | 2023
Elektronische Ausgabe
373: A Transformer-Based Framework for Tiny Object Detection
Liao, Yi-Kai / Lin, Gong-Si / Yeh, Mei-Chen et al. | 2023
Elektronische Ausgabe
378: Lightweight Models Distillation with Learnable Teaching Material: An Application for Smart Table Tennis System
Chen, Duan-Yu / Chen, Yu-Hsuan et al. | 2023
Elektronische Ausgabe
384: Selecting Suitable Data Input for Deep-Learning Sign-Language Recognition with a Small Dataset
Chen, Yu-Jen / Su, Po-Chyi et al. | 2023
Elektronische Ausgabe
392: Analysis of the Interaction Effect on Pruning and Transfer Learning in Model Training
Wei, Yu-Jen / Chen, Jia-Hong / Kuo, Tien-Ying et al. | 2023
Elektronische Ausgabe
396: Old Damaged Photo Recovery with Style Transfer-Based Data Augmentation
Wang, Chih-Hao / Wei, Yu-Jen / Chang, Ching Hsiang / Kuo, Tien-Ying et al. | 2023
Elektronische Ausgabe
401: A Deep Learning based Sustainable Energy Scheduling System
Tsai, Kun-Lin / Chen, Yan-Hao / Huang, Choa-Ting / Huang, Guo-Wei / Tseng, Shih-Ting et al. | 2023
Elektronische Ausgabe
408: A Computational Efficient Direct Position Determination Approach of Narrow-band Emitter
Zhao, Yuan / Sheng, Hanmin / Shao, Jinliang et al. | 2023
Elektronische Ausgabe
414: Modeling and Analysis of the Epidemic-Behavior Co-evolution Dynamics with User Irrationality
Dong, Wenxiang / Zhao, H. Vicky et al. | 2023
Elektronische Ausgabe
422: Noise-robust Pitch Detection Based on Super-Resolution Harmonics
Zhu, Dongjie / Zhu, Weibin / Wang, Tianrui / Gao, Yingying / Feng, Junlan / Zhang, Shilei et al. | 2023
Elektronische Ausgabe
427: A Subband Approach to Personal Sound Zone with Joint Optimization of Sound Pressure and Particle Velocity
Zhao, Yingke / Zhang, Wen / Chen, Jingdong et al. | 2023
Elektronische Ausgabe
432: An Multi-evidence Fusion Based on C-Distance with Uncertain Reasoning for Classification
Cheng, Cuiping / Yue, Pengcheng / Li, Taihao et al. | 2023
Elektronische Ausgabe
438: On Uncertainty Principles for Lowband Graph Signals
Li, Na / Shang, Linbo / Zhang, Zhichao et al. | 2023
Elektronische Ausgabe
443: CoA-DLinkNet: Connectivity-Enhanced Dual-Branch Road Extraction Network Based on D-LinkNet
Li, Linghan / Chen, Heliu / He, Renjie / Dai, Yuchao / He, Mingyi et al. | 2023
Elektronische Ausgabe
450: Black-box Lossless Fragile Watermarking Based on Hidden Space Search for DNN Integrity Authentication
Zhao, Gejian / Qin, Chuan et al. | 2023
Elektronische Ausgabe
456: Hiding patient information in medical images:A high-capacity and reversible hiding algorithm for E-healthcare
Zhou, Xiaoyi / Lee, Shuai et al. | 2023
Elektronische Ausgabe
462: A Visually Meaningful Image Encryption Algorithm with Attention Mechanism and Artificial Bee Colony Optimization
Mao, Jiarong / An, Yuting / Zhou, Xiaoyi et al. | 2023
Elektronische Ausgabe
468: High-Quality Triggers Based Fragile Watermarking for Optical Character Recognition Model
Yin, Yujie / Yin, Heng / Yin, Zhaoxia / Lyu, Wanli / Wei, Sha et al. | 2023
Elektronische Ausgabe
476: Coupled Transformed Induced Tensor Nuclear Norm for Robust Tensor Completion
Qin, Mengjie / Lin, Zheyuan / Wan, Minhong / Zhang, Chunlong / Gu, Jason / Li, Te et al. | 2023
Elektronische Ausgabe
484: Multi-Frequency Feature Enhancement for Multi-Granularity Visual Classification
Fu, Meijiang / Zheng, Yixiao / Chang, Dongliang / Li, Wenpan / Ma, Zhanyu et al. | 2023
Elektronische Ausgabe
490: Improving Aspect Sentiment Classification via Retrieving from Training Data
Ling, Tongtao / Chen, Lei / Liao, Chen / Huang, Shilei / Yu, Zhipeng / Liu, Yi et al. | 2023
Elektronische Ausgabe
498: CH-MEAD: A Chinese Multimodal Conversational Emotion Analysis Dataset with Fine-Grained Emotion Taxonomy
Ruan, Yu-Ping / Zheng, Shu-Kai / Huang, Jiantao / Zhang, Xiaoning / Liu, Yulong / Li, Taihao et al. | 2023
Elektronische Ausgabe
506: Evolutionary Analysis and Cultural Transmission Models of Color Style Distributions in Painting Arts
Nakamura, Eita / Saito, Yasuyuki et al. | 2023
Elektronische Ausgabe
514: Ultimatelink Between Characters Having a Certain Meaning in Physical Space to URL in Cyberspace with Robust Print and Scan
Yamadera, Keiji / Niimi, Michiharu et al. | 2023
Elektronische Ausgabe
519: Human Flow Measurement System Using Floor Estimation of Depth Images for Low-End IoT Devices
Nagatoshi, Takuya / Niimi, Michiharu et al. | 2023
Elektronische Ausgabe
523: Holo-QoI: A Human Factor-Based Dataset and Prediction Framework for Assessing Quality of Interaction in Augmented Reality
Kim, Seongjean / Choi, Seonghwa / Lee, Sanghoon et al. | 2023
Elektronische Ausgabe
529: Supervised Single-channel EEG Decomposition using Detector-kernel Networks for Noise Reduction
Higashi, Hiroshi et al. | 2023
Elektronische Ausgabe
535: Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning
Wang, Xinyu / Li, Mingtao / Li, Hao / Pun, Sio Hang / Chen, Fei et al. | 2023
Elektronische Ausgabe
540: Decoding time-course of saliency network of fMRI signals by EEG signals using optimized forward variable selection: a concurrent EEG-fMRI study
Dang, Tung / Ono, Kentaro / Sasaoka, Takafumi / Yamawaki, Shigeto / Machizawa, Maro G et al. | 2023
Elektronische Ausgabe
546: Multimodal recognition of speech and electrocorticogram
Ahuja, Mitali / Komeiji, Shuji / Mitsuhashi, Takumi / Iimura, Yasushi / Suzuki, Hiroharu / Sugano, Hidenori / Shinoda, Koichi / Tanaka, Toshihisa et al. | 2023
Elektronische Ausgabe
551: Enhancing Real-Time Semantic Segmentation with Textual Knowledge of Pre-Trained Vision-Language Model: A Lightweight Approach
Lin, Chia-Yi / Chen, Jun-Cheng / Wu, Ja-Ling et al. | 2023
Elektronische Ausgabe
559: EEG study on anticipation of difficulty for upcoming auditory task
Song, Zichen / Higashi, Hiroshi / Ishii, Shin et al. | 2023
Elektronische Ausgabe
567: Event-Related Potential in Rapid Serial Visual Presentation-based Partial Face Cognition Depends on Visible Face Components
Chanpornpakdi, Ingon / Tanaka, Toshihisa et al. | 2023
Elektronische Ausgabe
575: Residual, Mixer, and Attention: The Three-way Combination for Streaming Wake Word Detection Framework
Singkul, Sattaya / Sakdejayont, Theerat / Chalothorn, Tawunrat et al. | 2023
Elektronische Ausgabe
583: Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss
Deng, Tengyu / Nakamura, Eita / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
591: Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow
Chang, Li-Jen / Liao, Yu-Cheng / Lin, Chia-Hui / Yang-Mao, Shih-Fang / Chen, Hwann-Tzong et al. | 2023
Elektronische Ausgabe
599: A Reversible Image Processing Method for Color Tone Control Using Data Hiding
Nakaya, Daichi / Imaizumi, Shoko et al. | 2023
Elektronische Ausgabe
605: Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation
Shalabi, Fatma / Nguyen, Huy H. / Felouat, Hichem / Chang, Ching-Chun / Echizen, Isao et al. | 2023
Elektronische Ausgabe
613: Gait Recognition Scheme Focusing on Operating Characteristics at Feature Points Detected by OpenPose
Tanaka, Chinatsu / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
Elektronische Ausgabe
620: A Study on Eliminating Biased Node in Federated Learning
Akai, Reon / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
Elektronische Ausgabe
628: Can StArtGAN withstand Image Processing Attacks?
Ng, Koi Yee / Ong, Simying / Loh, Yuen Peng et al. | 2023
Elektronische Ausgabe
635: Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition
Wang, Chang / Du, Jun / Chen, Hang / Wang, Ruoyu / Yang, Chao-Han Huck / Zhao, Jiangjiang / Ren, Yuling / Li, Qinglong / Lee, Chin-Hui et al. | 2023
Elektronische Ausgabe
643: Interpretable Image Recognition in Hyperbolic Space
Lebedeva, Irina / Bah, Mohamed Jaward / Li, Taihao et al. | 2023
Elektronische Ausgabe
651: Low-light is More Than Darkness: An Empirical Study on Illumination Types and Enhancement Methods
Liew, Hui Sze / Loh, Yuen Peng / Ong, Simying et al. | 2023
Elektronische Ausgabe
659: MoMo Strategy: Learn More from More Mistakes
Chulif, Sophia / Lee, Sue Han / Loong Chang, Yang / Kit Tsun, Mark Tee / Chai, Kok Chin / Then, Yi Lung et al. | 2023
Elektronische Ausgabe
666: Unveiling Robust Feature Spaces: Image vs. Embedding-Oriented Approaches for Plant Disease Identification
Ishrat, Hamza Ahmed / Yu Hao Chai, Abel / Lee, Sue Han / Hui Then, Patrick Hang et al. | 2023
Elektronische Ausgabe
674: Facial Expression Recognition as markers of Depression
Gue, Jia Xuan / Chong, Chun Yong / Lim, Mei Kuan et al. | 2023
Elektronische Ausgabe
681: How Transferable are Herbarium-Field Features in Few-Shot Plant Identification with Triplet Loss?
Chulif, Sophia / Lee, Sue Han / Loong Chang, Yang / Kit Tsun, Mark Tee / Chin Chai, Kok / Then, Yi Lung et al. | 2023
Elektronische Ausgabe
688: Resolution-Adaptive Lossless Image Compression Using Frequency Decomposition Network
Rhee, Hochang / Cho, Nam Ik et al. | 2023
Elektronische Ausgabe
696: Implementation and Analysis on Backpropagating Refinement Scheme for Interactive Image Segmentation
Lee, Chaewon / Jang, Won-Dong / Kim, Chang-Su et al. | 2023
Elektronische Ausgabe
703: Implicit Neural Representation for Video Coding Through Progressive Feature Extraction
Lee, Jihoo / Kang, Je-Won et al. | 2023
Elektronische Ausgabe
709: Deep Unfolded Underwater Image Enhancement Based on Extreme Channels Prior
Pham, Thuy Thi / Mai, Truong Thanh Nhat / Lee, Chul et al. | 2023
Elektronische Ausgabe
714: Low-Light Image Enhancement via Distillation of NIR-to-RGB Conversion Knowledge
Jeong, Young-Min / Park, Tae-Sung / Park, Jeong-Hyeok / Kim, Jong-Ok et al. | 2023
Elektronische Ausgabe
719: 3D Human Skeleton Estimation from Single RGB Image Based on Fusion of Predicted Depths from Multiple Virtual-Viewpoints
Lie, Wen-Nung / Vann, Veasna et al. | 2023
Elektronische Ausgabe
726: GNN-Based Small-Data Learning with Area-Control Mechanism for Hyperspectral Satellite Change Detection
Lin, Tzu-Hsuan / Lin, Chia-Hsiang / Young, Si-Sheng et al. | 2023
Elektronische Ausgabe
733: Efficient Constraint-Aware Neural Architecture Search for Object Detection
Poliakov, Egor / Hung, Wei-Jie / Huang, Ching-Chun et al. | 2023
Elektronische Ausgabe
741: A Reliable Feature-Based Framework for Vehicle Tracking in Advanced Driver Assistance Systems
Ha-Phan, Ngoc -Quan / Truong, Thanh-Nguyen / Tran, Vu -Hoang / Huang, Ching-Chun et al. | 2023
Elektronische Ausgabe
748: Light-weight Zero-Reference-based Image Enhancement for Low-Light Images
Chang, Jie-Fan / Lai, Kuan-Ting / Zhuang, Cheng-Xuan / Lin, Guo-Shiang / Chang, Ku-Yaw et al. | 2023
Elektronische Ausgabe
753: Classwise Self-Paced Self-Training for Semi-Supervised Image Classification
Lu, Cheng-Yu / Hsu, Heng-Cheng / Chiang, Chen-Kuo et al. | 2023
Elektronische Ausgabe
759: CapFormer: A Space-Time Video Description Model using Joint-Attention Transformer
Moussa, Mahamat / Lim, Chern Hong / Wong, KokSheik et al. | 2023
Elektronische Ausgabe
765: Local Contrast Enhancement with Multiscale Filtering
Hayashi, Kohei / Maeda, Yoshihiro / Fukushima, Norishige et al. | 2023
Elektronische Ausgabe
771: Marine Snow Removal Benchmarking Dataset
Kaneko, Reina / Sato, Yuya / Ueda, Takumi / Higashi, Hiroshi / Tanaka, Yuichi et al. | 2023
Elektronische Ausgabe
779: Cross-Frame Foreground Structural Similarity Modeling by Convolutional Sparse Representation
Naganuma, Kazuki / Ono, Shunsuke et al. | 2023
Elektronische Ausgabe
784: JPEG Artifact Removal for Hyperspectral Images Based on Spatial-Spectral Regularization
Eguchi, Ryunosuke / Kobayashi, Iori / Ono, Shunsuke / Matsuoka, Ryo et al. | 2023
Elektronische Ausgabe
788: Data Driven Multiband Image Fusion That Preserves Wavelength-Specific Image Features
Lin, Hsuan / Hirakawa, Keigo et al. | 2023
Elektronische Ausgabe
795: Shot-Noise-Aware Image Signal Restoration for Photoelectronic Charge-Based Sensors
Takamura, Seishi et al. | 2023
Elektronische Ausgabe
800: Generative Adversarial Network-Based Frame Interpolation with Multi-Perspective Discrimination
Tran, Quang Nhat / Yang, Shih-Hsuan et al. | 2023
Elektronische Ausgabe
806: ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation
Barua, Hrishav Bakul / Krishnasamy, Ganesh / Wong, KokSheik / Stefanov, Kalin / Dhall, Abhinav et al. | 2023
Elektronische Ausgabe
813: LSR++: An Efficient and Tiny Model for Image Super-Resolution
Wang, Wei / Lei, Xuejing / Chen, Yueru / Lee, Ming-Siu / Kuo, C.-C. Jay et al. | 2023
Elektronische Ausgabe
820: High-Quality Font Generation Based on StyleGAN2 and FSFont Font Generation Model
Shimamura, Yuki / Niimi, Michiharu et al. | 2023
Elektronische Ausgabe
826: Enhanced Residual Fourier Transformation Network for Lightweight Image Super-resolution
Yang, Yunming / Ikehara, Masaaki et al. | 2023
Elektronische Ausgabe
833: ELEGANT: End-to-end Language Grounded Speech Denoiser for Efficient Generation of Talking Face
Chai, Ai-Fang / Rajanala, Sailaja / Pal, Arghya / Phan, Raphael C.-W. / Ting, Chee-Ming et al. | 2023
Elektronische Ausgabe
839: Segmentation Enhancement for Iris Recognition Using Unit Gradient Vectors
Meam, Limhourlaurent / Duangpummet, Suradej / Kongprawechnon, Waree et al. | 2023
Elektronische Ausgabe
846: FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking
Cheung, Tsun-Hin / Lam, Kin-Man et al. | 2023
Elektronische Ausgabe
854: Auditory Representation Effective for Estimating Vocal Tract Information
Irino, Toshio / Doan, Shintaro et al. | 2023
Elektronische Ausgabe
862: Accurate and Practical Query-by-Example Using Multiple Deep Learning Models and Frame Compression Methods
Yamaga, Hikaru / Hatakeyama, Kazuki / Kojima, Kazunori / Lee, Shi-Wook / Itoh, Yoshiaki et al. | 2023
Elektronische Ausgabe
868: Fundamental Frequency Estimation Based on Finite-Order Harmonic Constraint Differential Equation
Yamada, Kenta / Masuyama, Yoshiki / Yamaoka, Kouei / Ono, Nobutaka et al. | 2023
Elektronische Ausgabe
873: Tone Labeling by Deep Learning-based Tone Recognizer for Mandarin Speech
Li, Wu-Hao / Chiang, Chen-Yu / Liu, Te-Hsin et al. | 2023
Elektronische Ausgabe
881: Learning to Enhance the Position Embedding and Coherence
Shu, Ting-Jia / Chien, Jen-Tzung et al. | 2023
Elektronische Ausgabe
887: VLSI Design of Near-Lossless Image Compression using Improved LZW
Zhang, Yao-Zhong / Chen, Chiung-An / Zhang, Jia-Sheng / Wang, Jia-Wen et al. | 2023
Elektronische Ausgabe
892: The color demosaicing and image scaling based on improve Hamilton-Adams
Peng, Yu-Wen / Hu, Chia-Yu / Chin, Yen-Ju / Chou, He-Sheng / Lin, Yuan-Jin / Liu, Yu-Lin / Chen, Shih-Lun / Chen, Tsung-Yi / Li, Kuo-Chen / Chen, Chiung-An et al. | 2023
Elektronische Ausgabe
898: Improving Regularization of Deep Learning Models in Fundus Analysis
Hsu, Wei-Wen / Chang, Yao-Chung / Lee, Wei-Min / Huang, Yu-Chuan / Lu, Da-Wen et al. | 2023
Elektronische Ausgabe
902: Design of Interactive System for Acupoint Analysis Based on Augmented Reality
Wei, Chung-Yen / Xu, Bo-Yuan / Zhao, Yu-Xiang et al. | 2023
Elektronische Ausgabe
910: Dental Positioning Medical Assistance System for BW Radiograph Based on YOLOV4
Lin, Mu-Feng / Li, Yi-Qian / Chen, Tsung-Yi / Liu, Yu-Lin / Lin, Yuan-Jin / Chan, Mei-Ling / Chen, Chiung-An / Li, Kuo-Chen / Chen, Shih-Lun et al. | 2023
Elektronische Ausgabe
918: The Development of an AI-assisted Diagnosis System for Adult Glioma Subtyping Prediction
Hsu, Wei-Wen / Lin, Jia-Yi / Lai, Hsin-Hung / Hsu, Wan-Lin / Jiang, Jeng-Ting / Chang, Yao-Chung / Li, Yao-Feng et al. | 2023
Elektronische Ausgabe
922: Poisoning Attacks against Gait-based Identity Recognition
Dong, Jianmin / Peng, Da-Tian / Pei, Guanxiong / Li, Taihao et al. | 2023
Elektronische Ausgabe
927: STrack: Velocity Estimation Using Single Antenna WiFi Devices
Xu, Jian / Zhang, Dongheng / Li, Jiamu / Sun, Qibin / Chen, Yan et al. | 2023
Elektronische Ausgabe
934: SEformer: Dual-Path Conformer Neural Network is a Good Speech Denoiser
Wang, Kai / Hatzinakos, Dimitrios et al. | 2023
Elektronische Ausgabe
941: Complex Feature Information Enhanced Speech Emotion Recognition
Yue, Pengcheng / Zheng, Shukai / Li, Taihao et al. | 2023
Elektronische Ausgabe
947: Incorporating Pinyin into Pipeline Named Entity Recognition from Chinese Speech
Zhang, Min / Qiao, Xiaosong / Zhao, Yanqing / Su, Chang / Li, Yinglu / Zhu, Ming / Zhu, Junhao / Li, Yuang / Zhao, Xiaofeng / Liu, Yilun et al. | 2023
Elektronische Ausgabe
954: Learning Semantic Information from Machine Translation to Improve Speech-to-Text Translation
Deng, Pan / Zhang, Jie / Zhou, Xinyuan / Ye, Zhongyi / Zhang, Weitai / Cui, Jianwei / Dai, Lirong et al. | 2023
Elektronische Ausgabe
960: Effective Fine-tuning Method for Tibetan Low-resource Dialect Speech Recognition
Yang, Jiahao / Wei, Jianguo / Khysru, Kuntharrgyal / Xu, Junhai / Lu, Wenhuan / Ke, Wenjun / Yang, Xiaokang et al. | 2023
Elektronische Ausgabe
966: Multi-task Piano Transcription with Local Relative Time Attention
Wang, Qi / Liu, Mingkuan / Chen, Xianhong / Xiong, Mengwen et al. | 2023
Elektronische Ausgabe
972: Real and imaginary part interaction network for monaural speech enhancement and de-reverberation
Zhang, Zehua / He, Changjun / Xu, Shiyun / Wang, Mingjiang et al. | 2023
Elektronische Ausgabe
978: Progressive Multi-scale Self-supervised Learning for Speech Recognition
Wan, Genshun / Chen, Hang / Liu, Tan / Wang, Chenxi / Pan, Jia / Ye, Zhongfu et al. | 2023
Elektronische Ausgabe
983: Improved Data2vec with Soft Supervised Hidden Unit for Mandarin Speech Recognition
Wan, Genshun / Chen, Hang / Li, Pengcheng / Pan, Jia / Ye, Zhongfu et al. | 2023
Elektronische Ausgabe
988: Investigation of Ensemble of Self-Supervised Models for Speech Emotion Recognition
Wu, Yanfeng / Yue, Pengcheng / Cheng, Cuiping / Li, Taihao et al. | 2023
Elektronische Ausgabe
996: Single Source Zone Detection in the Spherical Harmonic Domain for Multisource Localization
Tao, Liang / Jia, Maoshen / Bu, Bing / Yao, Dingding et al. | 2023
Elektronische Ausgabe
1002: Robust Representation Learning for Speech Emotion Recognition with Moment Exchange
Cai, Yunrui / Song, Changhe / Tang, Boshi / Dai, Dongyang / Wu, Zhiyong / Meng, Helen et al. | 2023
Elektronische Ausgabe
1008: Few Shot Learning Guided by Emotion Distance for Cross-corpus Speech Emotion Recognition
Yue, Pengcheng / Wu, Yanfeng / Qu, Leyuan / Zheng, Shukai / Zhao, Shuyuan / Li, Taihao et al. | 2023
Elektronische Ausgabe
1013: Speech Emotion Recognition by Late Fusion of Linguistic and Acoustic Features using Deep Learning Models
Sato, Kiyohide / Kishi, Keita / Kosaka, Tetsuo et al. | 2023
Elektronische Ausgabe
1019: Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset
Atmaja, Bagus Tris / Sasou, Akira et al. | 2023
Elektronische Ausgabe
1026: Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech
Atmaja, Bagus Tris / Sasou, Akira et al. | 2023
Elektronische Ausgabe
1030: An Automatic Pipeline For Building Emotional Speech Dataset
Thi, Ngoc-Anh Nguyen / Thang Ta, Bao / Le, Nhat Minh / Hai Do, Van et al. | 2023
Elektronische Ausgabe
1036: Analysis of Emotions in Speech using AESDD
Uthiraa, S. / Patil, Hemant et al. | 2023
Elektronische Ausgabe
1042: Modified Parametric Multichannel Wiener Filter for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers
Guo, Ning / Nakatani, Tomohiro / Araki, Shoko / Moriya, Takehiro et al. | 2023
Elektronische Ausgabe
1050: Blind Source Separation Using Independent Low-Rank Matrix Analysis with Spectrogram-Consistency Regularization
Misawa, Sota / Takamune, Norihiro / Yatabe, Kohei / Kitamura, Daichi / Saruwatari, Hiroshi et al. | 2023
Elektronische Ausgabe
1058: Moving Interference Speaker removal using Geometrically Constrained Independent Vector Analysis
Furunaga, Shinya / Ueda, Tetsuya / Makino, Shoji et al. | 2023
Elektronische Ausgabe
1064: A Dual-Channel Three-Stage Model for DoA and Speech Enhancement
Wu, Meng-Hsuan / Shen, Yih-Liang / Chou, Hsuan-Cheng / Shih, Bo-Wun / Chi, Tai-Shih et al. | 2023
Elektronische Ausgabe
1069: A Weighted Binary Cross-Entropy for Sound Event Representation Learning and Few-Shot Classification
Bai, Zhongxin / Pan, Chao / Chen, Gong / Chen, Jingdong / Benesty, Jacob et al. | 2023
Elektronische Ausgabe
1075: A Reconfigurable Hardware Architecture for Graph Convolution Network in Action Recognition
Tsai, Tsung-Han / Chen, Tzu-Chieh et al. | 2023
Elektronische Ausgabe
1079: Automated Carina Detection in Chest X-ray Images Using Non-Overlapping and Cross-Squeeze Convolutional Neural Networks
Hsu, Chung-Chian / Chen, Chi-Yuan / Salahuddin Morsalin, S. M. / Chang, Arthur / Fan, Wen-Lin et al. | 2023
Elektronische Ausgabe
1085: Identifying the Style of Chatting
Zhang, Manman / Ma, Yuchen / Luo, Ge / Li, Sheng / Qian, Zhenxing / Zhang, Xinpeng et al. | 2023
Elektronische Ausgabe
1093: Pose-Based Visual Servoing with Lightweight Deep-Learning Binarization for Autonomous Mobile Robot Application
Ho, Chian C. / Lin, Cian-Duo et al. | 2023
Elektronische Ausgabe
1100: Real-Time Noise Suppression Using Harmonic/Percussive Separation with Morphological Operations for Hammering Test
Uchiyama, Ryugo / Tanabe, Nari et al. | 2023
Elektronische Ausgabe
1107: ΔΣ Modulators for Discrete-time Closed Loop Control Systems with Quantization and Saturation
Ohno, Shuichi / Wang, Shenjian / Takaba, Kiyotsugu et al. | 2023
Elektronische Ausgabe
1112: Asymptotic Estimation Performance of Linear Regression Model with Sparse Bayesian Learning as Both Samples and Signals Approach Infinity
Murayama, Kazuaki et al. | 2023
Elektronische Ausgabe
1119: Convolutional Multidimensional Amplitude Spectrum Nuclear Norm for Frequency-domain Robust Principal Component Analysis
Harashima, Ryoya / Eguchi, Ryunosuke / Kyochi, Seisuke et al. | 2023
Elektronische Ausgabe
1126: Moreau Envelope ADMM for Decentralized Weakly Convex Optimization
Mirzaeifard, Reza / Venkategowda, Naveen K. D. / Jung, Alexander / Werner, Stefan et al. | 2023
Elektronische Ausgabe
1131: An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids
Chung, Yu-Ching / Han, Ji-Yan / Wang, Bo-Sin / Zheng, Wei-Zhong / Shen, Kung-Yao / Lai, Ying-Hui et al. | 2023
Elektronische Ausgabe
1138: On Joint Dereverberation and Source Separation with Geometrical Constraints and Iterative Source Steering
Mo, Kaien / Wang, Xianrui / Yang, Yichen / Ueda, Tetsuya / Makino, Shoji / Chen, Jingdong et al. | 2023
Elektronische Ausgabe
1143: Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech
Maben, Leander Melroy / Guo, Zixun / Chen, Chen / Chudiwal, Utkarsh / Siong, Chng Eng et al. | 2023
Elektronische Ausgabe
1150: Step Size Control of Shared-error Normalized Least Mean Square Algorithm for Acoustic Echo and Noise Canceller
Iwai, Kenta / Nishiura, Takanobu et al. | 2023
Elektronische Ausgabe
1155: Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer
Xing, Haoran / Zhang, Shiqi / Takeuchi, Daiki / Niizumi, Daisuke / Harada, Noboru / Makino, Shoji et al. | 2023
Elektronische Ausgabe
1161: Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion
Huang, Wen-Chin / Toda, Tomoki et al. | 2023
Elektronische Ausgabe
1167: DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion
Watanabe, Chihiro / Kameoka, Hirokazu et al. | 2023
Elektronische Ausgabe
1172: Speech Synthesis Using Ambiguous Inputs From Wearable Keyboards
Iwasaki, Matsuri / Hara, Sunao / Abe, Masanobu et al. | 2023
Elektronische Ausgabe
1179: Accent-Preserving Voice Conversion between Native-Nonnative Speakers for Second Language Learning
Correa, Iago Lourenco / Ueno, Sei / Lee, Akinobu et al. | 2023
Elektronische Ausgabe
1187: Increasing Speech Intelligibility by Mimicking Professional Announcers’ Voices and Its Physical Correlates
Tran, Dung Kim / Akagi, Masato / Unoki, Masashi et al. | 2023
Elektronische Ausgabe
1193: Robust Networked Federated Learning for Localization
Mirzaeifard, Reza / Venkategowda, Naveen K. D. / Werner, Stefan et al. | 2023
Elektronische Ausgabe
1199: Continual Local Updates for Federated Learning with Enhanced Robustness to Link Noise
Lari, Ehsan / Gogineni, Vinay Chakravarthi / Arablouei, Reza / Werner, Stefan et al. | 2023
Elektronische Ausgabe
1204: Gaussian Process Learning for Location-Based Service Data
Ugurel, Ekin / Huang, Shuai / Chen, Cynthia et al. | 2023
Elektronische Ausgabe
1208: Distributed on-line anomaly detection using kernel methods
Kuh, Anthony / Baguio, Tyler et al. | 2023
Elektronische Ausgabe
1214: Communication-Efficient Design of Learning System for Energy Demand Forecasting of Electrical Vehicles
Xu, Jiacong / Kilfoyle, Riley / Xiong, Zixiang / Lu, Ligang et al. | 2023
Elektronische Ausgabe
1221: Radiated Sound Field Reproduction for Surrounding Loudspeaker Array Based on Higher-order Ambisonics
Naiki, Shota / Miura, Shumpei / Iwai, Kenta / Nishiura, Takanobu / Soeta, Yoshiharu et al. | 2023
Elektronische Ausgabe
1226: Multichannel learning-based spatially extended active noise control via model matching and sensor transfer function interpolation
Zhong, Pei-Lin / Chen, You-Siang / Bai, Mingsian R. et al. | 2023
Elektronische Ausgabe
1234: A Study of the Microphone Protection of Active Noise Control for Axial Fan
Shen, Yi-Tsung / Chang, Cheng-Yuan et al. | 2023
Elektronische Ausgabe
1240: SFANC with Compensation Filter Based on MEFxDCTLMS Algorithm
Doi, Kenya / Kajikawa, Yoshinobu et al. | 2023
Elektronische Ausgabe
1245: Practical Active Noise Control: Restriction of Maximum Output Power
Gan, Woon-Seng / Shi, Dongyuan / Shen, Xiaoyi et al. | 2023
Elektronische Ausgabe
1250: A QoS Throughput Performance Measurement Comparison between UGS and BE Services of a Real-time FPGA Based OFDM Multi-user System Design Implementation
Adiono, Trio / Jonathan, Michael / Setiawan, Erwin / Sutisna, Nana / Mulyawan, Rahmat / Syafalni, Infall et al. | 2023
Elektronische Ausgabe
1257: Algorithm Development for Stepwise Valve Deflation Method in Blood Pressure Measurement
Adiono, Trio / Ramadhani, Reina Puteri / Amadeus, Clarance / Cicilya Sinaga, Sindy Novaria et al. | 2023
Elektronische Ausgabe
1263: SUMO Based Hardware/Software Co-simulation for Two-Intersection Adaptive and Collaborative Traffic Signal Controller
Ginting, Kendrik Emkel / Sutisna, Nana / Syafalni, Infall / Adiono, Trio et al. | 2023
Elektronische Ausgabe
1271: Sparsity Exploration for Structured and Unstructured Weight Formations in CNN Architecture
Endrawati, Devi Noor / Syafalni, Infall / Sutisna, Nana / Adiono, Trio et al. | 2023
Elektronische Ausgabe
1279: 1M parameters are enough? A lightweight CNN-based model for medical image segmentation
Dinh, Binh-Duong / Nguyen, Thanh-Thu / Tran, Thi-Thao / Pham, Van-Truong et al. | 2023
Elektronische Ausgabe
1285: Imaging Ultrasound Scattering Targets using Density-Enhanced Chaotic Compressive Sampling
Theu, Luong Thi / Huy, Tran Quang / Quynh, Tran Thi Thuy / Tran, Duc-Tan et al. | 2023
Elektronische Ausgabe
1291: Segmentation and observation of hand rehabilitation exercises by supporting of acceleration signals
Nguyen, Sinh-Huy / Le, Thi-Thu-Hong / Nguyen, Hoang-Bach / Duong, Ngoc-Bach / Nguyen, Hung-Cuong / Nguyen, Chi-Thanh / Nguyen, Van-Loi / Vu, Hai et al. | 2023
Elektronische Ausgabe
1296: Investigating the Role of Human Action Detector in Visual-guide Audio Source Separation System
Duong, Thanh Thi-Hien / Nguyen, Trung-Hieu / Le, The Thanh-Dat / Nghiem, Thi-Lich / Pham, Duc-Huy / Le, Thi-Lan et al. | 2023
Elektronische Ausgabe
1304: A combination of time and frequency synchronization with Doppler compensation for coded OFDM-based UWA systems
Nguyen Thi, Hoai Linh / Khuong Nguyen, Quoc / Nguyen, Van Duc et al. | 2023
Elektronische Ausgabe
1310: Classification of Normal vs. Pathological Infant Cries Using Morse Wavelets
Gupta, Priyanka / Kachhi, Aastha / Patil, Hemant A. et al. | 2023
Elektronische Ausgabe
1317: Compressive Sensing Based Algorithms for Limited-View PAT Image Reconstruction
John, Mary Josy / Barhumi, Imad et al. | 2023
Elektronische Ausgabe
1323: Towards AST-LLDs for the Analysis of Depression in Speech Signals
Nagappan, Sidharrth / Lim, Chern Hong / Thimali Dharmaratne, Anuja et al. | 2023
Elektronische Ausgabe
1329: ecVoice: Audio Text Extraction Optimization of Video Based on Idioms Similarity Replacement
Lin, Jinwei et al. | 2023
Elektronische Ausgabe
1337: Heart Rate Acquisition and Processing Techniques for a Miniature Wearable Microphone Sensor
Ang, Yi Yang / Boodhoo, Kirish / Ser, Wee / Tan, Rex Xiao et al. | 2023
Elektronische Ausgabe
1343: Detection and Correction of Defective Relative Humidity Data Collected from the Greenhouse Environment Using Nested Kalman Filters with Standard Deviation Analysis
Sirisanwannakul, Kraithep / Siripool, Nutchanon / Suzuki, Kenji / Kongprawechnon, Waree / Karnjana, Jessada et al. | 2023
Elektronische Ausgabe
1349: Pedestrian Crossing Intention Prediction with Multi-Modal Transformer-Based Model
Wang, Ting Wei / Lai, Shang-Hong et al. | 2023
Elektronische Ausgabe
1357: Revolutionizing Formative Assessment in STEM Fields: Leveraging AI and NLP Techniques
Tan, Chi Wee / Lim, Khai Yin et al. | 2023
Elektronische Ausgabe
1365: A Biased Mixed-Precision Convolution Engine for Hardware-Efficient Computational Imaging CNN
Tu, Hao-Jiun / Ou, Yu-Feng / Chen, Yong-Tai / Huang, Chao-Tsung et al. | 2023
Elektronische Ausgabe
1372: A Lightweight Speaker Verification Model For Edge Device
Chen, Ting-Wei / Chen, Chia-Ping / Lu, Chung-Li / Chan, Bo-Cheng / Cheng, Yu-Han / Chuang, Hsiang-Feng / Chen, Wei-Yu et al. | 2023
Elektronische Ausgabe
1378: Efficient Dictionary and Grid-Based Framework for Answering Durable k-Nearest Neighbor Queries on Time Series Data
Santoso, Bagus Jati / Armunanta, Dwi Prasetya / Pratomo, Baskoro Adi / Studiawan, Hudan et al. | 2023
Elektronische Ausgabe
1386: Dual-Path Residual Attention Convolution Networks for Color-Embedded-Grayscale Image
Prasetyo, Heri / Mahdy, Abid Ammar / Nadhif, Abrar Dwi Fairuz / Hidayat, Taufiqurrakhman Nur / Hartono, Rudi et al. | 2023
Elektronische Ausgabe
1392: DOC: A Novel DOuble-Contour-Based Macro Placement Framework for Mixed-Size Designs
Zhuo, Yin-Rong / Chen, Hui-Lin / Chen, Yu-Guang et al. | 2023
Elektronische Ausgabe
1398: Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
MaungMaung, AprilPyone / Echizen, Isao / Kiya, Hitoshi et al. | 2023
Elektronische Ausgabe
1405: Implementation of PLIM on 429MHz LoRa/FSK with improved conversion table
Takeda, Keita / Miyamoto, Ryuji / Takyu, Osamu et al. | 2023
Elektronische Ausgabe
1410: Numerical Performance Evaluation of ℓ1 - ℓ2 Sparse Reconstruction Using Optical Analog Circuit
Furusawa, Soma / Hayashi, Kazunori / Kameda, Kaito / Hayakawa, Ryo et al. | 2023
Elektronische Ausgabe
1417: Assessing the Effects of Filtering Processing on Pulse Wave Transit Time Measured by Photoplethysmography from Earlobe
Liao, Shangdi / Liu, Haipeng / Zheng, Dingchang / Chen, Fei et al. | 2023
Elektronische Ausgabe
1422: Efficient Incremental Text-to-Speech on GPUs
Du, Muyang / Liu, Chuan / Qi, Jiaxing / Lai, Junjie et al. | 2023
Elektronische Ausgabe
1429: Retinex-based Low-Light Image Enhancement
Luo, Rui / Feng, Yan / He, Mingxin / Zhang, Yuliang et al. | 2023
Elektronische Ausgabe
1435: Fine-grained Face Anti-Spoofing based on Recursive Self-Attention and Multi-Scale Fusion
Xie, Shichuang / Wu, Jiasheng / Chen, Yanli / Han, Meng / Wu, Ting / Qiao, Tong et al. | 2023
Elektronische Ausgabe
1443: StyleStegan: Leak-free Style Transfer Based on Feature Steganography
Liang, Xiujian / Liu, Bingshan / Ying, Qichao / Qian, Zhenxing / Cho, Hsunfang / Zhang, Xinpeng et al. | 2023
Elektronische Ausgabe
1451: Robust Watermark Imaging via Graph-signal Optimization
Yang, Ruiguo / Han, Xinhui / Qi, Wenfa / Hu, Wei et al. | 2023
Elektronische Ausgabe
1458: A print-scan-resilient watermarking scheme for trademark images
Qi, Wenfa / Wang, Jiameng / Yuan, Zichen / Li, Xiaolong et al. | 2023
Elektronische Ausgabe
1463: AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network
Xi, Ziyi / Huang, Wenmin / Wei, Kangkang / Luo, Weiqi / Zheng, Peijia et al. | 2023
Elektronische Ausgabe
1471: ResNet-Based Camera Model Identification with Adaptive Preprocessing Module and Weight Fusion of Global Information
Chen, Boru / Abdulla, Waleed et al. | 2023
Elektronische Ausgabe
1479: Structural Quality Assured Global Optimization for CTU-Level Rate Control of Screen Content Coding
Tang, Tong / Tan, Yuan / Ding, Shihang / Li, Zhidu et al. | 2023
Elektronische Ausgabe
1484: Multimodal Emotion Recognition based on 2D Kernel Density Estimation for Multiple Labels Fusion
Luo, Zhaojie / Komatani, Kazunori et al. | 2023
Elektronische Ausgabe
1492: RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Sahipjohn, Neha / Shah, Neil / Tambrahalli, Vishal / Gandhi, Vineet et al. | 2023
Elektronische Ausgabe
1500: Realizing Nipple in Profile Recognition and Nipple Detection Using a Single Classification
Zeng, Yi-Chong et al. | 2023
Elektronische Ausgabe
1506: Exploring a CLIP-Enhanced Automated Approach for Video Description Generation
Zhang, Siang-Ling / Cheng, Huai-Hsun / Chen, Yen-Hsin / Yeh, Mei-Chen et al. | 2023
Elektronische Ausgabe
1512: 3D Point Cloud Denoising Based on Color Attribute
Lin, Wei-Chi / Lee, Ming-Zhan / Chou, He-Sheng / Lin, Yuan-Jin / Li, Kuo-Chen / Lin, Ting-Lan / Chen, Shin-Lun et al. | 2023
Elektronische Ausgabe
1517: The DSP and DDR4 VLSI Design for Multi-Sensor in Biomedical System
Zhang, Jia-Sheng / Chen, Chiung-An / Chen, Shih-Lun / Zhang, Yao-Zhong et al. | 2023
Elektronische Ausgabe
1521: Identification of Victims Wearing Vibrant Clothing using MATLAB
Hao-Cheng, Lu / Chiung-An, Chen / Jia-Sheng, Zhang / Yao-Zhong, Zhang et al. | 2023
Elektronische Ausgabe
1525: Point Cloud Inpainting Based on Delaunay Triangulation
Liu, Yu-Lin / Chou, He-Sheng / Lee, Ming-Zhan / Chan, Mei-Ling / Lin, Ting-Lan / Chen, Chiung-An / Chen, Shin-Lun et al. | 2023
Elektronische Ausgabe
1530: Dense Three-Dimensional Color Reconstruction for Large-Scale Outdoor Scenes
Liu, Zixiao / Guo, Sheng / Pun, Man-On et al. | 2023
Elektronische Ausgabe
1536: Safety Enhancement for Mobility Scooter with Rule-Based Danger Prevention
Chen, Yan-Ru / Tseng, Shih-Wei-Chen / Chen, Yu-Chi / Chang, Yeong-Hwa et al. | 2023
Elektronische Ausgabe
1542: Dictionary-driven Chinese ASR Entity Correction with Controllable Decoding
Li, Rongjun / Peng, Wei et al. | 2023
Elektronische Ausgabe
1549: A Method of Efficient Synthesizing Post-disaster Remote Sensing Image with Diffusion Model and LLM
Ou, Ruizhe / Yan, Haotian / Wu, Ming / Zhang, Chuang et al. | 2023
Elektronische Ausgabe
1556: Privacy-oriented Coded Caching in Mobile Information-centric Networking
Yang, Binchen / Guo, Yu / Chen, Xingyan et al. | 2023
Elektronische Ausgabe
1564: MKTformer: Fine-grained Meter Classification Based on Multi-modal Knowledge Transfer
Zheng, Zhaoye / Zhang, Ke / Shi, Chaojun / Zheng, Fei et al. | 2023
Elektronische Ausgabe
1571: Feature Augmentation Reconstruction Network for Few-Shot Image Classification
Li, Zhen / Wang, Lang / An, Wenjuan / Qi, Song / Li, Xiaoxu / Fei, Xuezhi et al. | 2023
Elektronische Ausgabe
1579: Dual Feature Reconstruction Network For Few-shot Image Classification
Guo, Xiaowei / Wu, Jijie / Ren, Kai / Song, Qi / Li, Xiaoxu et al. | 2023
Elektronische Ausgabe
1585: A Cloud-based Data Platform for Efficient EEG Data Management, Collaboration, and Analysis
Tian, Qi / Wu, Wen / Zhu, Qin / Cai, Tao / Jiang, Siyi / Li, Yaqing / Zhou, Jinrun / Zhu, Nan / Wei, Yina / Tang, Tao et al. | 2023
Elektronische Ausgabe
1593: Incorporating the Digit Triplet Test in A Lightweight Speech Intelligibility Prediction for Hearing Aids
Zhou, Xiajie / Mawalim, Candy Olivia / Angela Titalim, Benita / Unoki, Masashi et al. | 2023
Elektronische Ausgabe
1601: Deep Learning-based MRI Super-Resolution Using Non-uniform Segmented Phase-Scrambling Fourier Transform Signals
Yamato, Kazuki / Fujisawa, Shuntaro / Ito, Satoshi et al. | 2023
Elektronische Ausgabe
1607: An Extreme Gradient Boosting-based Prediction for Depression
Ibrahum, Ahmed / Park, Kwang Ho / Hong, Jang-Eui / Pham, Van-Huy / Ryu, Keun Ho et al. | 2023
Elektronische Ausgabe
1614: An Improved Check Digit-based Participant Identification System for Human Biorepositories
Chu, Minseok / Kang, Gilwon / Ryu, Keun Ho et al. | 2023
Elektronische Ausgabe
1622: Enhancing Snoring Detection with Statistical Analysis of Audio Features
Buaruk, Suphachok / Deepaisarn, Somrudee et al. | 2023
Elektronische Ausgabe
1628: Un-Rectifying in ReLU Networks and Applications
Tung, Shih-Shuo / Chung, Ming-Yu / Ho, Jinn / Hwang, Wen-Liang et al. | 2023
Elektronische Ausgabe
1636: OpenPose Based Yoga Poses Difficulty Estimation for Dynamic and Static Yoga Exercises
Huang, Wan-Chia / Shih, Cheng-Liang / Anggraini, Irin Tri / Xiao, Yanqi / Funabiki, Nobuo / Fan, Chih-Peng et al. | 2023
Elektronische Ausgabe
1641: Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic Features
Zhao, Jiahao / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
1646: Learned String Quartet Music with Variational Auto Encoder
Chen, Young-Long / Huang, Hsin -I / Yen, Tzu-Te et al. | 2023
Elektronische Ausgabe
1652: SOAda-YOLOR: Small Object Adaptive YOLOR Algorithm for Road Object Detection
Huang, Yu-Fang / Liu, Tsung-Jung / Lin, Chun-An / Liu, Kuan-Hsien et al. | 2023
Elektronische Ausgabe
1659: Badminton Self-Training System Based on Virtual Reality
Tai, Wei-Shen / Liu, Kuan-Hsien et al. | 2023
Elektronische Ausgabe
1664: Rotation Angle Detection Using a Pilot Signal from Rotated Stego-Image
Kawano, Rinka / Kawamura, Masaki et al. | 2023
Elektronische Ausgabe
1670: Application for generating re-accessible screenshots of web pages using histogram shrinkage
Sakamoto, Ayaka / Kawano, Rinka / Kawamura, Masaki et al. | 2023
Elektronische Ausgabe
1677: Domain Adaptation for Efficiently Fine-tuning Vision Transformer with Encrypted Images
Nagamori, Teru / Shiota, Sayaka / Kiya, Hitoshi et al. | 2023
Elektronische Ausgabe
1684: Study on Face Landmark-based Analysis for Synthetic Media Identification Generated by Adversarial Generative Networks
Ura, Akinobu / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
Elektronische Ausgabe
1691: HDR Image Watermarking based on Saliency Detection and Quantization Index Modulation
Khan, Ahmed / Kuribayashi, Minoru / Wong, KokSheik / Baskaran, Vishnu Monn et al. | 2023
Elektronische Ausgabe
1697: Quick Response (QR) codes embedding in VVC using Quantisation Parameter Manipulation
Joan, Hau / Tan, Li Peng / Tew, Yiqi et al. | 2023
Elektronische Ausgabe
1705: CPIPS: Learning to Preserve Perceptual Distances in End-to-End Image Compression
Huang, Chen-Hsiu / Wu, Ja-Ling et al. | 2023
Elektronische Ausgabe
1712: Task-Specific Pruning: Efficient Parameter Reduction in Multi-task Object Detection Models
Ke, Wei-Hsun / Tseng, Yu-Wen / Cheng, Wen-Huang et al. | 2023
Elektronische Ausgabe
1718: Transformer-based Image Compression with Variable Image Quality Objectives
Kao, Chia-Hao / Chen, Yi-Hsin / Chien, Cheng / Chiu, Wei-Chen / Peng, Wen-Hsiao et al. | 2023
Elektronische Ausgabe
1726: From Synthetic To Real: Enhancing Deep Learning Models With Generative Adversarial Networks For Efficient Data Utilization In Automatic Retail Stores
Dang, Cong-Ty / Tran, Vu-Hoang / Le, Ngoc-Hoang-Lam / Huang, Ching-Chun et al. | 2023
Elektronische Ausgabe
1732: Virtual Garment Fitting Through Parsing and Context-Aware Generative Adversarial Networks with Discriminator Group
Su, Wei-Hong / Chen, Sze-Ann / Chin, Chen-I / Hsiao, Hsu-Feng et al. | 2023
Elektronische Ausgabe
1739: Sparse Tensor-based point cloud attribute compression using Augmented Normalizing Flows
Lin, Tzu-Po / Yim, Monyneath / Chiang, Jui-Chiu / Peng, Wen-Hsiao / Lie, Wen-Nung et al. | 2023
Elektronische Ausgabe
1745: Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case Studies
Yamamoto, Yuya et al. | 2023
Elektronische Ausgabe
1753: Out-of-Vocabulary Word Detection in Spoken Dialogues Based on Joint Decoding with User Response Patterns
Oshio, Miki / Munakata, Hokuto / Takeda, Ryu / Komatani, Kazunori et al. | 2023
Elektronische Ausgabe
1760: Synthetic Data Augmentation for ASR with Domain Filtering
Vu Ho, Tuan / Horiguchi, Shota / Watanabe, Shinji / Garcia, Paola / Sumiyoshi, Takashi et al. | 2023
Elektronische Ausgabe
1766: Multi-Self-Supervised Learning Model-Based Throat Microphone Speech Recognition
Masuda, Kohta / Ogata, Jun / Nishida, Masafumi / Nishimura, Masafumi et al. | 2023
Elektronische Ausgabe
1771: ASR Model Adaptation for Rare Words Using Synthetic Data Generated by Multiple Text-To-Speech Systems
Yuen, Kwok Chin / Haoyang, Li / Siong, Chng Eng et al. | 2023
Elektronische Ausgabe
1779: Streaming End-to-End ASR Using CTC Decoder and DRA for Linguistic Information Substitution
Takagi, Tatsunari / Ogawa, Atsunori / Kitaoka, Norihide / Wakabayashi, Yukoh et al. | 2023
Elektronische Ausgabe
1784: A Biometric Signature Scheme with Template Protection and Authenticated Sample Recoverability
Nakamura, Wataru / Takahashi, Kenta et al. | 2023
Elektronische Ausgabe
1792: IPFed: Identity protected federated learning for user authentication
Kaga, Yosuke / Suzuki, Yusei / Takahashi, Kenta et al. | 2023
Elektronische Ausgabe
1798: Privacy-Preserving Image Transformation Method for Person Detection and Re-ID
Ouchi, Yumo / Uchida, Hidetsugu / Abe, Narishige et al. | 2023
Elektronische Ausgabe
1804: Eye Biometrics Combined with Periocular and Iris Recognition Using CNN
Tonosaki, Taito / Kawakami, Shokei / Ito, Koichi / Aoki, Takafumi / Yasumura, Yoshiko / Fujio, Masakazu / Kaga, Yosuke / Takahashi, Kenta et al. | 2023
Elektronische Ausgabe
1811: Development of a Robust Ear Recognition Algorithm using Planar Approximation
Arakawa, Takahiko / Sato, Yuya / Sakano, Hitoshi / Ohki, Tetsushi et al. | 2023
Elektronische Ausgabe
1816: Word encoding for word-looking DGA-based Botnet classification
Liew, Sea Ran Cleon / Law, Ngai Fong et al. | 2023
Elektronische Ausgabe
1822: Analysis of Spectro-Temporal Modulation Representation for Deep-Fake Speech Detection
Cheng, Haowei / Mawalim, Candy Olivia / Li, Kai / Wang, Lijun / Unoki, Masashi et al. | 2023
Elektronische Ausgabe
1830: Flexible Evidence Model to Reduce Uncertainty Mismatch Between Speech Enhancement and ASR Based on Encoder-Decoder Architecture
Takeda, Ryu / Sudo, Yui / Komatani, Kazunori et al. | 2023
Elektronische Ausgabe
1838: Investigating the Effectiveness of Speaker Embeddings for Shout Intensity Prediction
Fukumori, Takahiro / Ishida, Taito / Yamashita, Yoichi et al. | 2023
Elektronische Ausgabe
1843: Is the Ideal Ratio Mask Really the Best? — Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
Hiroe, Atsuo / Itoyama, Katsutoshi / Nakadai, Kazuhiro et al. | 2023
Elektronische Ausgabe
1851: Language modeling for spontaneous speech recognition based on disfluency labeling and generation of disfluent text
Horii, Koharu / Ohta, Kengo / Nishimura, Ryota / Ogawa, Atsunori / Kitaoka, Norihide et al. | 2023
Elektronische Ausgabe
1857: Transformer-based Automatic Speech Recognition of Simultaneous Interpretation with Auxiliary Input of Source Language Text
Taniguchi, Shuta / Kato, Tsuneo / Tamura, Akihiro / Yasuda, Keiji et al. | 2023
Elektronische Ausgabe
1862: An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
Violeta, Lester Phillip / Toda, Tomoki et al. | 2023
Elektronische Ausgabe
1868: Classification of Vocal Cord Disorders: Comparison Across Voice Datasets, Speech Tasks, and Machine Learning Methods
Chen, Ching-Chieh / Hsu, Wei-Cheng / Lin, Tzu-Han / Chen, Kuan-Dar / Tsou, Yung-An / Liu, Yi-Wen et al. | 2023
Elektronische Ausgabe
1878: Application of Deep Learning Techniques for Thermal Imagery Analysis in Abnormal Identification of Floor Tiles in Heritage Environments
Yu, Chen-Xin / Chen, Wu-Pei / Ju, Chin-Yen / Chen, Tsung-Yi / Li, Kuo-Chen / Chen, Chiung-An / Chan, Mei-Ling / Chen, Shih-Lun et al. | 2023
Elektronische Ausgabe
1885: Wavelet and Cutout in YOLO Architecture for Road Pothole Detection
Lu, Shao-Hua / Lu, Jia-Teng / Lin, Szu-Yin / Hsia, Chih-Hsien et al. | 2023
Elektronische Ausgabe
1892: Robust Finger Vein Recognition Based on Lightweight Attention Convolutional Neural Networks
Wei, Ming-Yi / Wang, Yu-Chi / Ke, Liang-Ying / Hsia, Chih-Hsien et al. | 2023
Elektronische Ausgabe
1896: Lightweight CNN and Image Enhancement Using in Palm Vein Recognition
Chen, Ping-Han / Hung, Yung-Sheng / Hsia, Chih-Hsien et al. | 2023
Elektronische Ausgabe
1903: Breast Cancer Detection Auxiliary System Leveraging Deep Learning and Mixed Reality
Lin, Szu-Yin / Chien, Ming-Chun / Kwong Meng, Edwin Tiong / Wang, Yu-Chien / Kuo, Yu-Yi / Lin, Che-Hsuan et al. | 2023
Elektronische Ausgabe
1907: Efficient Reversible Data Hiding for 3D Mesh Models Based on Multi-LSB Substitution and Ring-prediction
Lyu, Wanli / Cheng, Lulu / Yin, Zhaoxia / Luo, Bin et al. | 2023
Elektronische Ausgabe
1915: MAEDefense: An Effective Masked AutoEncoder Defense against Adversarial Attacks
Lyu, Wanli / Wu, Mengjiang / Yin, Zhaoxia / Luo, Bin et al. | 2023
Elektronische Ausgabe
1923: Preemptive Image Protection against Steganography
Guo, Yusheng / Zhong, Nan / Qian, Zhenxing / Zhang, Xinpeng / Cho, Hsunfang et al. | 2023
Elektronische Ausgabe
1931: Zero-shot multi-speaker accent TTS with limited accent data
Zhang, Mingyang / Zhou, Yi / Wu, Zhizheng / Li, Haizhou et al. | 2023
Elektronische Ausgabe
1937: Speech Enhancement with Multi-granularity Vector Quantization
Zhao, Xiaoying / Zhu, Qiushi / Zhang, Jie / Zhou, Yeping / Liu, Peiqi et al. | 2023
Elektronische Ausgabe
1943: A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings
Shi, Mohan / Zhang, Jie / Du, Zhihao / Yu, Fan / Chen, Qian / Zhang, Shiliang / Dai, Li-Rong et al. | 2023
Elektronische Ausgabe
1949: Hybrid Syllable and Character Representations for Mandarin ASR
Zhang, Fengrun / Li, Chengfei / Deng, Shuhao / Wang, Yaoping / Bai, Jinfeng et al. | 2023
Elektronische Ausgabe
1955: Enhancing Whisper Model for Pronunciation Assessment with Multi-Adapters
Li, Jing / Li, Rui / Guo, Shen / Wumaier, Aishan et al. | 2023
Elektronische Ausgabe
1960: DoA Estimation of Room Reflections Using NN-Based MUSIC Algorithm
Li, Haowen / Zhang, Wen / Zhang, Lijun et al. | 2023
Elektronische Ausgabe
1966: Hybrid Multi-Task Learning for End-To-End Multimodal Emotion Recognition
Chen, Junjie / Li, Yongwei / Zhao, Ziping / Liu, Xuefei / Wen, Zhengqi / Tao, Jianhua et al. | 2023
Elektronische Ausgabe
1972: It’s What You Say and How You Say It: Exploring Audio and Textual Features for Podcast Data
Shah, Neil / Srivastava, Vivek / Bhardwaj, Mohit / Kadlay, Satej / Agrawal, Dharmeshkumar / Bhat, Savita / Pedanekar, Niranjan et al. | 2023
Elektronische Ausgabe
1978: Improved One-class Learning for Voice Spoofing Detection
Li, Lixiang / Xue, Xiaopeng / Peng, Haipeng / Ren, Yeqing / Zhao, Mengmeng et al. | 2023
Elektronische Ausgabe
1984: Sound Field Estimation around a Rigid Sphere with Physics-informed Neural Network
Chen, Xingyu / Ma, Fei / Bastine, Amy / Samarasinghe, Prasanga / Sun, Huiyuan et al. | 2023
Elektronische Ausgabe
1990: A Controlled Noise Reduction Wiener Filter Based on the Quadratic Eigenvalue Problem
Pan, Ningning / Benesty, Jacob / Chen, Jingdong et al. | 2023
Elektronische Ausgabe
1995: Target Speaker Extraction with Attention Enhancement and Gated Fusion Mechanism
Sijie, Wang / Hamdulla, Askar / Ablimit, Mijit et al. | 2023
Elektronische Ausgabe
2002: Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures
Yip, Jia Qi / Ng, Dianwen / Ma, Bin / Siong, Chng Eng et al. | 2023
Elektronische Ausgabe
2008: Geometrically Constrained Blind Moving Source Extraction based on Constant Separation Vector and Auxiliary Function Technique
Zhang, Ruifeng / Ueda, Tetsuya / Makino, Shoji et al. | 2023
Elektronische Ausgabe
2013: Universal Sound Separation Using Replay-based Data Sampling in Incremental Learning
Shimonishi, Kanta / Fukumori, Takahiro / Yamashita, Yoichi et al. | 2023
Elektronische Ausgabe
2019: Multiple Sound Source Tracking Based on Generative Modeling and Recursive Bayesian Filtering of Spatial Gradient Spectra
Takazawa, Keisuke / Kameoka, Hirokazu / Yukawa, Masahiro et al. | 2023
Elektronische Ausgabe
2024: Spatially-Regularized Switching Independent Vector Analysis
Ueda, Tetsuya / Nakatani, Tomohiro / Ikeshita, Rintaro / Araki, Shoko / Makino, Shoji et al. | 2023
Elektronische Ausgabe
2031: ASF-LLRDA: Locality-regularized Linear Regression Discriminant Analysis with Approximately Symmetrical Face Preprocessing for Face Recognition
Widyadhana, Arya / Hidayati, Shintami Chusnul / Navastara, Dini Adni / Anistyasari, Yeni et al. | 2023
Elektronische Ausgabe
2037: Joint Optimization Algorithm for Adaptive Bit Allocation Based on Temporal-Spatial Information
Wang, Shaokang / Sun, Songlin et al. | 2023
Elektronische Ausgabe
2043: Maximization of 2D Cross-Correlation Based on Auxiliary Function Method for Image Alignment
Kinoshita, Yuma / Yamaoka, Kouei / Kiya, Hitoshi et al. | 2023
Elektronische Ausgabe
2048: Multitask Record for Badminton Match
Guo, Jing-Ming / Huang, Yu-Shun / Chang, Ting-Yu / Ciou, Tai-Cyuan / Yeh, Yun-Ching / Chen, Jeffrey et al. | 2023
Elektronische Ausgabe
2053: Deep Residual and Classified Neural Networks for Inverse Halftoning
Guo, Jing-Ming / Sankarasrinivasan, S. / Hung, Let Viet / Liu, Wei et al. | 2023
Elektronische Ausgabe
2061: DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection
Fujita, Yoto / Bando, Yoshiaki / Imoto, Keisuke / Onishi, Masaki / Yoshii, Kazuyoshi et al. | 2023
Elektronische Ausgabe
2068: Improving Sound Event Localization and Detection with Class-Dependent Sound Separation for Real-World Scenarios
Cheng, Shi / Du, Jun / Wang, Qing / Jiang, Ya / Nian, Zhaoxu / Niu, Shutong / Lee, Chin-Hui / Gao, Yu / Zhang, Wenbin et al. | 2023
Elektronische Ausgabe
2074: Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach
Igarashi, Ami / Tsubaki, Shunsuke / Niizumi, Daisuke / Takeuchi, Daiki / Ohishi, Yasunori / Harada, Noboru / Imoto, Keisuke et al. | 2023
Elektronische Ausgabe
2081: Cross-domain Sound Recognition for Efficient Underwater Data Analysis
Park, Jeongsoo / Han, Dong-Gyun / La, Hyoung Sul / Lee, Sangmin / Han, Yoonchang / Yang, Eun-Jin et al. | 2023
Elektronische Ausgabe
2087: Augmentation of Various Speed Data by Controlling Frame Overlap for Acoustic Traffic Monitoring
Takahashi, Tomohiro / Kinoshita, Yuma / Ueno, Natsuki / Wakabayashi, Yukoh / Ono, Nobutaka / Honda, Jun / Fukuma, Seishi / Kitamori, Aoi / Nakagawa, Hiroshi et al. | 2023
Elektronische Ausgabe
2092: Distributed Computation of Heat Kernel Smoothing Using Series Expansion Method
Tseng, Chien-Cheng / Lee, Su-Ling et al. | 2023
Elektronische Ausgabe
2099: In-Air Handwriting for Chinese Character Recognition from Monocular Camera: A Deep Learning based Approach with Fingertip Detection and Virtual Strokes Elimination
Yu, Chih-Chang / Huang, Zi-Hang / Cheng, Hsu-Yung et al. | 2023
Elektronische Ausgabe
2104: EffSegmentNet: Efficient Design for Real-time Semantic Segmentation
Wang, Cyun-Bo / Ding, Jian-Jiun et al. | 2023
Elektronische Ausgabe
2112: Universal Optimal Parameters of the Closed-Form Linear Canonical Wigner Distribution
Zhang, Zhichao et al. | 2023
Elektronische Ausgabe
2118: Autoencoder-Enhanced Federated Learning with Reduced Overhead and Lower Latency
Hsieh, Chi-Kai / Chien, Feng-Tsun / Chang, Min-Kuan et al. | 2023
Elektronische Ausgabe
2124: Deep Unfolding-based Distributed MIMO Detection
Kumagai, Masaya / Nakai-Kasai, Ayano / Wadayama, Tadashi et al. | 2023
Elektronische Ausgabe
2131: A Comparative Analysis of the Yolo Models for Intelligent Lobster Surveillance Camera
Akhyar, Fityanul / Novamizanti, Ledya / Usman, Koredianto / Aditya, Ghanes Mahesa / Nur Hakim, Farhan / Ilman, Mukhamad Zidni / Ramdhon, Ferdi / Lin, Chih-Yang et al. | 2023
Elektronische Ausgabe
2137: A UAV Indoor Obstacle Avoidance System Based on Deep Reinforcement Learning
Lo, Chun-Huang / Lee, Chung-Nan et al. | 2023
Elektronische Ausgabe
2144: Approximate modeling of malware diffusion on wireless mobile devices
Miura, Hideyoshi / Abukawa, Shoya / Kimura, Tomotaka / Hirata, Kouji et al. | 2023
Elektronische Ausgabe
2149: Impacts of 5G-TDD Time Slot Configurations on the Downlink and Uplink Data Rates
Lai, Wen-Ping / Chen, Wen-Ru / Lai, Hong-Lun / Li, Hong-Yi et al. | 2023
Elektronische Ausgabe
2155: Bearing Fault Diagnosis and Interpretation Based on 2D Images and Convolutional Neural Network
Tian, Zhenzhen / Zhang, Xinyu / Yan, Wei / Wang, Jihua et al. | 2023
Elektronische Ausgabe
2163: Study on Reduction of Background Fringes for Defect Detection of Specular Surface
Wei, An-Chi / Chang, Yi-Cheng / Sze, Jyh-Rou et al. | 2023
Elektronische Ausgabe
2168: On the Optimal Self-Supervised Multi-Fault Detector for Temperature Sensor Data
Harfiya, Latifa Nabila / Hsu, Yan-Cheng / Li, Yung-Hui / Wang, Jia-Ching et al. | 2023
Elektronische Ausgabe
2173: Application of Wafer Defect Pattern Classification Model in the Semiconductor Industry
Lee, Chin-Wei / Hladek, Daniel / Pleva, Matus / Liao, Yuan-Fu / Su, Ming-Hsiang et al. | 2023
Elektronische Ausgabe
2178: Question Answering System Based on Pre-Training Model and Retrieval Reranking for Industry 4.0
Chen, Ta-Fu / Lin, Yi-Xing / Su, Ming-Hsiang / Chen, Po-Kai / Tai, Tzu-Chiang / Wang, Jia-Ching et al. | 2023
Elektronische Ausgabe
2182: Deepfake-speech Detection with Pathological Features and Multilayer Perceptron Neural Network
Chaiwongyen, Anuwat / Duangpummet, Suradej / Karnjana, Jessada / Kongprawechnon, Waree / Unoki, Masashi et al. | 2023
Elektronische Ausgabe
2189: Temporal and Type Correlation in Digital Phenotyping for Bipolar Disorder State Prediction Using Multitask Self-Supervised Learning
Hsu, Jia-Hao / Tseng, Hua-Wei / Wu, Chung-Hsien / Lin, Esther Ching-Lan / See Chen, Po et al. | 2023
Elektronische Ausgabe
2196: Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition
Hsieh, I-Ting / Wu, Chung-Hsien / Tsa, Shu-Wei et al. | 2023
Elektronische Ausgabe
2203: Reduction of Annotation Effort in Medical Image Analysis Based on Self-supervised Learning
Chan, Kai-Hsuan / Zeng, Yi-Chong et al. | 2023
Elektronische Ausgabe
2209: STUA-Net: A Fingerprint Reconstruction with Swin Transformer and Soft Collective Attention
Hakim, Farchan Raswa / Yoga Wicaksana, Prabowo / Putri, Wenny Ramadha / Harjoko, Agus / Wang, Jia-Ching et al. | 2023
Elektronische Ausgabe
2213: Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation
Kitagishi, Yuki / Kamiyama, Hosana / Tawara, Naohiro / Ogawa, Atsunori / Miyazaki, Noboru / Asami, Taichi et al. | 2023
Elektronische Ausgabe
2221: Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditions
Guo, Taiyang / Li, Sixia / Kidani, Shunsuke / Okada, Shogo / Unoki, Masashi et al. | 2023
Elektronische Ausgabe
2228: Vocal Tract Length Perturbation-based Pseudo-Speaker Augmentation for Speaker Embedding Learning
Wakamatsu, Tomoka / Shiota, Sayaka / Kiya, Hitoshi et al. | 2023
Elektronische Ausgabe
2233: Automatic Call Classification of Autism Model Marmosets by Deep Learning and Analysis of Their Vocal Development
Uesaka, Minato / Kawauchi, Hideto / Yamaoka, Kouei / Wakabayashi, Yukoh / Kinoshita, Yuma / Ono, Nobutaka / Noguchi, Jun / Watanabe, Satoshi / Ichinohe, Noritaka / Benner, Seico et al. | 2023
Elektronische Ausgabe
2238: Cross-Domain adaptation in Distance Space for Speaker Verification
Yi, Lu / Mak, Man Wai et al. | 2023
Elektronische Ausgabe
2244: Urban Noise Monitoring using Edge Computing with CNN-LSTM on Jetson Nano
Peng, Bo / Abdulla, Waleed H. / Wang, Kevin I-Kai et al. | 2023
Elektronische Ausgabe
2251: Random forest of Classification and Regression Tree (CART) in the estimation of SWC based on meteorological inputs and hydrodynamics behind
Wu, Tsung-Hsi / Chen, Pei-Yuan / Chen, Chien-Chih / Chung, Meng-Ju / Ye, Zheng-Kai / Li, Ming-Hsu et al. | 2023
Elektronische Ausgabe
2256: A Framework for Reusing Earth Science Data on Data and Model Marketplaces
Huang, Chung-I / Chang, Jih-Sheng / Sun, Chen-Kai / Wang, Taichi / Chen, Wei-Yu / Yu, Hui Hung / Chang, Wen-Yi / Lin, Fang-Pang et al. | 2023
Elektronische Ausgabe
2261: Impact of the weighted loss function on the innovative CMAQ-CNN PM2.5 forecasting model
Lee, Yi-Ju / Cheng, Fang-Yi / Feng, Chih-Yung / Yang, Zhih-Min et al. | 2023
Elektronische Ausgabe
2267: Jointly Modelling Transcriptions and Phonemes with Optimal Features to Detect Dementia from Spontaneous Cantonese
Ke, Xiaoquan / Mak, Man-Wai / Meng, Helen M. et al. | 2023
Elektronische Ausgabe
2274: Combining multiple end-to-end speech recognition models based on density ratio approach
Hojo, Keigo / Mori, Daiki / Wakabayashi, Yukoh / Ohta, Kengo / Ogawa, Atsunori / Kitaoka, Norihide et al. | 2023
Elektronische Ausgabe
2280: Speech-Emotion Control for Text-to-Speech in Spoken Dialogue Systems Using Voice Conversion and x-vector Embedding
Kohara, Shunichi / Abe, Masanobu / Hara, Sunao et al. | 2023
Elektronische Ausgabe
2287: Narrow-edged Acoustical Beamforming Utilizing Phase Inversion for Frequency Modulation-based Parametric Array Loudspeaker
Geng, Yuting / Nakayama, Masato / Nishiura, Takanobu et al. | 2023
Elektronische Ausgabe
2294: Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition
Kobayashi, Akio / Yasu, Keiichi et al. | 2023
Elektronische Ausgabe
2299: Ensemble of Transformer and Convolutional Recurrent Neural Network for Improving Discrimination Accuracy in Automatic Chord Recognition
Yamaga, Hikaru / Momma, Toshifumi / Kojima, Kazunori / Itoh, Yoshiaki et al. | 2023
Elektronische Ausgabe
2306: Construction of Automatic Speech Recognition Model that Recognizes Linguistic Information and Verbal/Non-verbal Phenomena
Shione, Nagito / Wakabayashi, Yukoh / Kitaoka, Norihide et al. | 2023
Elektronische Ausgabe
2312: Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Zhong, Lifan / Cooper, Erica / Yamagishi, Junichi / Minematsu, Nobuaki et al. | 2023
Elektronische Ausgabe
2320: Speech Quality Improvement Utilizing Out-of-Focus Areas in Rolling-Shutter Video on Speech Extraction
Nakano, Hayata / Yoshizawa, Tsubasa / Geng, Yuting / Iwai, Kenta / Nishiura, Takanobu et al. | 2023
Elektronische Ausgabe
2326: Personalized Audio Quality Preference Prediction
Wang, Chung-Che / Lin, Yu-Chun / Hsu, Yu-Teng / Jang, Jyh-Shing Roger et al. | 2023
Elektronische Ausgabe
2331: AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Wang, Yi-Cheng / Yang, Tzu-Ting / Wang, Hsin-Wei / Yan, Bi-Cheng / Chen, Berlin et al. | 2023
Elektronische Ausgabe
2336: Regression-based Sound Event Detection with Semi-supervised Learning
Liu, Chia-Chuan / Chen, Chia-Ping / Lu, Chung-Li / Chan, Bo-Cheng / Cheng, Yu-Han / Chuang, Hsiang-Feng / Chen, Wei-Yu et al. | 2023
Elektronische Ausgabe
2343: Proportionate NLMS with Variable Step-Size for Adaptive Feedback Cancellation in Hearing Aids
Thuc Tran, Linh Thi / Albu, Felix / Nguyen, Hieu Trung / Nordholm, Sven et al. | 2023
Elektronische Ausgabe
2349: Residual Echo Suppression using Spatial Feature for Stereo Acoustic Echo Cancellation
Chou, Hsuan-Cheng / Shen, Yih-Liang / Wu, Meng-Hsuan / Shih, Bo-Wun / Chi, Tai-Shih et al. | 2023
Elektronische Ausgabe
2354: Multitaper Adaptive Time-Frequency Windowed Fourier Transform Based on the Reliable Region of Window Widths
Cheng, Jen-Chieh / Ding, Jian-Jiun et al. | 2023
Elektronische Ausgabe
2362: Enhancing Retinal Disease Classification with Dual Scale Twin Vision Transformers using OCT Imaging
Karn, Prakash Kumar / Abdulla, Waleed H et al. | 2023
Elektronische Ausgabe
2370: Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data
Chang, Kai Chieh / Hasegawa-Johnson, Mark / McElwain, Nancy L. / Islam, Bashima et al. | 2023
Elektronische Ausgabe
2378: Dynamic Characteristics of Electroencephalogram Reflecting Driving-Experience-Dependent Performance Using Microstates
Iinuma, Yuta / Ozawa, Takuto / Nobukawa, Sou / Wagatsuma, Nobuhiko / Inagaki, Keiichiro et al. | 2023
Elektronische Ausgabe
2384: Quefrency Domain Features with Residual Networks for Spoof Speech Detection
Kamble, Madhu R. et al. | 2023
Elektronische Ausgabe
2390: PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification
Piao, Zhenyu / Lim, Hyungseob / Kim, Miseul / Kang, Hong-Goo et al. | 2023
Elektronische Ausgabe
2395: Subjective Evaluation of a Focused Sound Source Reproducing at the positions of a Listener’s Moving Hand
Hirohashi, Miho / Haneda, Yoichi et al. | 2023
Elektronische Ausgabe
2402: Time Sensitive Hash and Adaptive Image Recovery based Self-embedding Fragile Watermarking Scheme in Encrypted Images
Wang, Xin / He, Hongjie / Chen, Fan et al. | 2023
Elektronische Ausgabe
2409: Multi-granularity Semantic and Acoustic Stress Prediction for Expressive TTS
Chi, Wenjiang / Feng, Xiaoqin / Xue, Liumeng / Chen, Yunlin / Xie, Lei / Li, Zhifei et al. | 2023
Elektronische Ausgabe
2416: NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wang, Wen / Yang, Dongchao / Ye, Qichen / Cao, Bowen / Zou, Yuexian et al. | 2023
Elektronische Ausgabe
2424: Multi-accent pronunciation assessment based on domain adversarial training
Lin, Binghuai / Wang, Liyuan et al. | 2023
Elektronische Ausgabe
2429: GAN-Based Time-Domain Packet Loss Concealment Method with Consistent Mapping Approach
Zhao, Yunhao / Bao, Changchun / Yang, Xue / Zhou, Jing et al. | 2023
Elektronische Ausgabe
2441: Feature Selection Based on Clonal Selection Algorithm for Image Steganalysis
Liu, Yu / Wang, Hongxia et al. | 2023
Elektronische Ausgabe
2448: ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain
Wu, Tianci / He, Shulin / Zhang, Hui / Zhang, XueLiang et al. | 2023
Elektronische Ausgabe
2454: UniVR: A Unified Framework for Pitch-Shifted Voice Restoration in Speaker Identification
Li, Yangfu / Lin, Xiaodan et al. | 2023
Elektronische Ausgabe
i: Table of Contents
| 2023
Elektronische Ausgabe
i: Technical Program Committee
| 2023
Elektronische Ausgabe
i: Authors Index
| 2023
Elektronische Ausgabe
i: Cognitive Assessment of Autism Spectrum Disorder Using an EEG-based Social Interaction Platform
Tseng, Yi-Li / Chien, Yi-Ling / Chuang, Tse-Min / Chiu, Yen-Nan / Tsai, Wen-Che et al. | 2023
Elektronische Ausgabe
i: Copyright Page
| 2023
Elektronische Ausgabe

Wie erhalte ich diesen Titel?

Zugriff prüfen

Download

Kommerziell Vergütung an den Verlag: 30,47 € Grundgebühr: 4,00 € Gesamtpreis: 34,47 €

Akademisch Vergütung an den Verlag: 30,47 € Grundgebühr: 2,00 € Gesamtpreis: 32,47 €

Schnellzugriff

Ausleihen & Bestellen

Schnellzugriff

Recherchieren & Entdecken

Schnellzugriff

Lernen & Arbeiten

Schnellzugriff

Publizieren & Archivieren

Schnellzugriff

Über die TIB

Schnellzugriff

Forschung & Entwicklung

Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs) (Englisch)

Wie erhalte ich diesen Titel?

Exportieren, teilen und zitieren

Mehr Angaben zu diesem Treffer

Inhaltsverzeichnis

Inhaltsverzeichnis Konferenzband

Ähnliche Titel

Wie erhalte ich diesen Titel?

Exportieren, teilen und zitieren