Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs) (English)
- Gupta, Priyanka
- Chodingala, Piyushkumar K.
- Patil, Hemant A.
In: 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC); 125-130; 2023
- Conference paper / Electronic resource
- Title: Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs)
- Contributors: Gupta, Priyanka (author) / Chodingala, Piyushkumar K. (author) / Patil, Hemant A. (author)
- Publisher: IEEE
- Publication date: 31 October 2023
- Format / Extent: 1381644 bytes
- Media type: Conference paper
- Format: Electronic resource
- Language: English
Table of contents of the conference proceedings
The tables of contents are generated automatically and are based on the individual records of the contained contributions available in the TIB portal index. The displayed tables of contents may therefore be incomplete or contain gaps.
- 1 | Mixed Emotion Recognition Based on EEG Signals | Pei, Guanxiong / Li, Bingjie / Li, Taihao / Fan, Cunhang / Zhang, Chao / Lv, Zhao et al. | 2023
- 8 | Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition | Niimura, Yoshiki / Takemoto, Jun / Kai, Atsuhiko / Nakagawa, Seiichi et al. | 2023
- 15 | Manipulation of Neuronal Network Firing Patterns using Temporal Deep Unfolding-based MPC | Aizawa, Jumpei / Ogura, Masaki / Shimono, Masanori / Wakamiya, Naoki et al. | 2023
- 22 | Goodness of Fit to the Convolution Model of fMRI Data and Determination of the Regularization Parameter | Nakamura, Wakako et al. | 2023
- 27 | Detection model of sister chromatid cohesion defects based on Vision Transformer | Matsumoto, Shinya / Okubo, Kan / Abe, Takuya / Nishikawa, Kiyoshi et al. | 2023
- 32 | GRALA: modeling social information for microblog sentiment analysis from the view of balancing sparsity and smoothness of social contexts | Zou, Xiaomei / Hu, Shiyong / Li, Taihao et al. | 2023
- 38 | Adopting Neural Translation Model in Data Generation for Inverse Text Normalization | Jiang, Yufei / Ho, Thi-Nga / Chng, Eng-Siong et al. | 2023
- 46 | Mismatched Semi-supervised Learning with Feature Similarity Consistency | Liang, Zechen / Fan, Qiaosong / Wang, Yuan-Gen et al. | 2023
- 51 | Collaborative Pseudo Labeling for Prompt-Based Learning | Chien, Jen-Tzung / Chen, Chien-Ching et al. | 2023
- 57 | Learning Meta Soft Prompt for Few-Shot Language Models | Chien, Jen-Tzung / Chen, Ming-Yen / Xue, Jing-Hao et al. | 2023
- 63 | MSDF-Net: A Multi-Scale Deep Fusion Network with Dilated Convolutions for Cloud Removal from Sentinel-2 Imagery | Jayakrishnan, A / Venkatesan, M / Prabhavathy, P / Alkha, Mohan et al. | 2023
- 71 | Instance Implant-Aided Non-uniformly Cropping for Person Detection in Aerial Images | Zhang, Xiangqing / Feng, Yan / Zhang, Shun / Wang, Yuning et al. | 2023
- 84 | Unbiased Decision-Making Framework in Long-Video Macro & Micro-Expression Spotting | Tan, Pei-Sze / Rajanala, Sailaja / Pal, Arghya / Phan, Raphael C.-W. / Ong, Huey-Fang et al. | 2023
- 90 | Adaptive Beamforming Based on Interference-Plus-Noise Covariance Matrix Reconstruction for Speech Separation | Xiao, Yongxiong / Zhu, Shiqiang / Li, Te / Wan, Minhong / Song, Wei / Gu, Jason / Fu, Qiang et al. | 2023
- 96 | Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization | Chen, Hang / Du, Jun / Wang, Zhe / Wang, Chenxi / Ren, Yuling / Li, Qinglong / Liu, Ruibo / Lee, Chin-Hui et al. | 2023
- 102 | CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization | Zhou, Haodong / Li, Tao / Wang, Jie / Li, Lin / Hong, Qingyang et al. | 2023
- 107 | Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | Guo, Aoqi / Wu, Junnan / Gao, Peng / Zhu, Wenbo / Guo, Qinwen / Gao, Dazhi / Wang, Yujun et al. | 2023
- 114 | Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues | Zeng, Bang / Suo, Hongbin / Wan, Yulong / Li, Ming et al. | 2023
- 119 | Modeling Suprasegmental Information Using Finite Difference Network for End-to-End Speaker Verification | Li, Jin / Mak, Man-Wai / Yan, Nan / Wang, Lan et al. | 2023
- 125 | Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs) | Gupta, Priyanka / Chodingala, Piyushkumar K. / Patil, Hemant A. et al. | 2023
- 131 | Exploring Residual Cepstral Features for Spoken Language Identification | Hora, Baveet Singh / Parmar, Krishna / Machhar, Shrey / Patil, Hemant A. / Praveen, Kiran / Radhakrishnan, Balaji et al. | 2023
- 139 | Consideration of Varying Training Lengths for Short-Duration Speaker Verification | Ko, WooSeok / Um, Seyun / Piao, Zhenyu / Kang, Hong-Goo et al. | 2023
- 145 | Adversarial Robustness of Mel Based Speaker Recognition Systems | Srivastava, Ritu / Kosgi, Saiteja / Sivaprasad, Sarath / Sahipjohn, Neha / Gandhi, Vineet et al. | 2023
- 151 | Joint Drum Transcription and Metrical Analysis Based on Periodicity-Aware Multi-Task Learning | Kamakura, Daichi / Nakamura, Eita / Oyama, Takehisa / Yoshii, Kazuyoshi et al. | 2023
- 158 | CTC2: End-to-End Drum Transcription Based on Connectionist Temporal Classification With Constant Tempo Constraint | Kamakura, Daichi / Nakamura, Eita / Yoshii, Kazuyoshi et al. | 2023
- 165 | Learning Multifaceted Self-Similarity for Musical Structure Analysis | Chen, Tsung-Ping / Su, Li / Yoshii, Kazuyoshi et al. | 2023
- 173 | Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials | Kawahara, Hideki / Yatabe, Kohei / Sakakibara, Ken-Ichi / Mizumachi, Mitsunori / Kitamura, Tatsuya et al. | 2023
- 187 | Gait Analysis in Powered Exoskeleton-Assisted Walking in Patients with Stroke: A Case Series Cohort | Huang, Jian-Jia / Chang, Shih-Chieh / Cheng, Cheng-Hsu / Wan, Timothy / Pei, Yu-Cheng et al. | 2023
- 195 | Prediction Model of Postoperative Pain Exacerbation Using a Wearable Electrocardiogram Sensor | Nakanishi, Toshiyuki / Fujiwara, Koichi / Sobue, Kazuya et al. | 2023
- 199 | Directional Neural Connectivity during Robot Mirror Therapy in Patients with Stroke | Kanaizuka, Yuma / Manabe, Takahiro / Huang, Jian-Jia / Hung, Jen-Wen / Ono, Yumie et al. | 2023
- 206 | Evaluation of neural response recorded using scalp EEG in virtual reality environment | Kanayama, Noriaki / Miyakoshi, Makoto / Machizawa, Maro et al. | 2023
- 211 | Machine Learning Based Action Recognition with Modular CNN | Huang, Shi-Zong / Chiu, Ching-Te / Chang, Yu-Jen et al. | 2023
- 217 | Real-Time Processing for Weighted Pulse Decomposition of Photoplethysmography Signals Based on Interior Point Method in Wearable Devices for Hemodynamic State | Wong, Ting-Jui / Tsai, Pei-Yun et al. | 2023
- 222 | QoS-Aware Downlink Beamforming for Joint Transmission in Multi-Cell Networks | Lin, Chen-Yen / Liu, Kuang-Hao Stanley et al. | 2023
- 230 | Deep-Learning-Based Lattice Reduction Preprocessing for Time-Correlated MIMO Systems | Li, Yi-Mei / Chi, Jung-Chun / Huang, Yuan-Hao et al. | 2023
- 238 | Utilizing Unlabeled Data and Synthetic Data for Bird Sound Detection: Consistency Training, Mean Teacher, and Domain Adaptation Techniques | Chen, Fang-Ching / Liu, Yi-Wen et al. | 2023
- 243 | A Comparative Evaluation of Video Codecs for rPPG-based Heart Rate Estimation | Hyanda, Muhammad H. / Ahmadi, Nur / Charlton, Peter H. / Constandinou, Timothy G. / Purwarianti, Ayu / Adiono, Trio et al. | 2023
- 248 | Human Activity Recognition Based on FMCW Radar Using CNN and Transfer Learning | Triani, Listi Restu / Ahmadi, Nur / Adiono, Trio et al. | 2023
- 254 | DQN Algorithm Design for Fast Efficient Shortest Path System | Sumarudin, A / Sutisna, Nana / Syafalni, Infall / Trilaksono, Bambang Riyanto / Adiono, Trio et al. | 2023
- 261 | Comparison of MPPT based on Deep Reinforcement Learning by DQN, DDPG and TD3 | Panggabean, Jayandi / Sutisna, Nana / Syafalni, Infall / Adiono, Trio et al. | 2023
- 267 | Signal Quality Assessment for Wearable Multichannel Photoplethysmography Signals | Prihatmoko, Muhammad Dzaky / Ahmadi, Nur / Charlton, Peter H. / Adiono, Trio et al. | 2023
- 272 | After-Fatigue Condition: A Novel Analysis Based on Surface EMG Signals | Nguyen, Van-Hieu / Luu, Gia Thien / Van Luong, Thien / Trang, Mai Xuan / Ravier, Philippe / Buttelli, Olivier et al. | 2023
- 278 | On the Semi-Blind Mutually Referenced Equalizers for MIMO Systems | Son, Do Hai / Abed-Meraim, Karim / Duy, Tran Trong / Trung, Nguyen Linh / Quynh, Tran Thi Thuy et al. | 2023
- 284 | Accurate continuous action and gesture recognition method based on skeleton and sliding windows techniques | Le, Viet-Duc / Nghiem, Thi-Lich / Le, Thi-Lan et al. | 2023
- 291 | Transformer-Based Deep Learning Detector for Dual-Mode Index Modulation 3D-OFDM | Gian, Toan / Nguyen, Tien-Hoa / Nguyen, Trung Tan / Pham, Van-Cuong / Van Luong, Thien et al. | 2023
- 297 | GAFormer: Wearable IMU-Based Human Activity Recognition with Gramian Angular Field and Transformer | Le, Trung-Hieu / Nguyen, Thai-Khanh / Tran, Trung-Kien / Tran, Thanh-Hai / Pham, Cuong et al. | 2023
- 304 | Fatigue Classification and Onset estimation using Surface EMG Signals during Strength Training | Adapa, Eswar / Turlapaty, Anish C / Naidu, Surya et al. | 2023
- 311 | P300 Event-Related Potential in Perception of Multiple Traffic Objects During Vehicle Driving | Yamamoto, Yuki / Nobukawa, Sou / Wagatsuma, Nobuhiko / Inagaki, Keiichiro et al. | 2023
- 317 | Kernel Random Projection Depth for Outlier Detection | Tamamori, Akira et al. | 2023
- 325 | Soft-Sensor Construction Method Based on Adaptive Modeling and Transfer Learning for Manufacturing Process Including Maintenance Periods | Katayama, Kaito / Fujiwara, Koichi / Yamamoto, Kazuki et al. | 2023
- 329 | Detecting Wire Bonding Defects in Point Clouds on Self-Generated Dataset | Yuen, Shang Li / Lau, Phooi Yee / Wong, Chin Wee / Samsuri, Muhammad Hafiz / Hussin, Zarina / Kamarudin, Nur Afiqah / Talib, Muhammad Syukri Mohd / Hon, Hock Woon et al. | 2023
- 336 | Predicting Outcomes of Cognitive Behavioral Therapy for Depression Using Data Driven Approaches | Tyszczuk, Lily / Levita, Liat / Delgadillo, Jaime / Haihong, Zhang / Arvaneh, Mahnaz et al. | 2023
- 344 | Learning Adapters for Code-Switching Speech Recognition | He, Chun-Yi / Chien, Jen-Tzung et al. | 2023
- 350 | FID-RPRGAN-VC: Fréchet Inception Distance Loss based Region-wise Position Normalized Relativistic GAN for Non-Parallel Voice Conversion | Dhar, Sandipan / Akhter, MD. Tousin / Banerjee, Padmanabha / Jana, Nanda Dulal / Das, Swagatam et al. | 2023
- 357 | Deformable Aligned Fusion for Video Super Resolution | Lee, Sin-Hong / Kuo, Chih-Hung / Yu, Tsai-Chun et al. | 2023
- 365 | Learning Single Image Rain Streak Removal Based on Deep Attention Mechanism | Huang, Kuan-Hua / Kang, Li-Wei et al. | 2023
- 373 | A Transformer-Based Framework for Tiny Object Detection | Liao, Yi-Kai / Lin, Gong-Si / Yeh, Mei-Chen et al. | 2023
- 378 | Lightweight Models Distillation with Learnable Teaching Material: An Application for Smart Table Tennis System | Chen, Duan-Yu / Chen, Yu-Hsuan et al. | 2023
- 384 | Selecting Suitable Data Input for Deep-Learning Sign-Language Recognition with a Small Dataset | Chen, Yu-Jen / Su, Po-Chyi et al. | 2023
- 392 | Analysis of the Interaction Effect on Pruning and Transfer Learning in Model Training | Wei, Yu-Jen / Chen, Jia-Hong / Kuo, Tien-Ying et al. | 2023
- 396 | Old Damaged Photo Recovery with Style Transfer-Based Data Augmentation | Wang, Chih-Hao / Wei, Yu-Jen / Chang, Ching Hsiang / Kuo, Tien-Ying et al. | 2023
- 401 | A Deep Learning based Sustainable Energy Scheduling System | Tsai, Kun-Lin / Chen, Yan-Hao / Huang, Choa-Ting / Huang, Guo-Wei / Tseng, Shih-Ting et al. | 2023
- 408 | A Computational Efficient Direct Position Determination Approach of Narrow-band Emitter | Zhao, Yuan / Sheng, Hanmin / Shao, Jinliang et al. | 2023
- 414 | Modeling and Analysis of the Epidemic-Behavior Co-evolution Dynamics with User Irrationality | Dong, Wenxiang / Zhao, H. Vicky et al. | 2023
- 422 | Noise-robust Pitch Detection Based on Super-Resolution Harmonics | Zhu, Dongjie / Zhu, Weibin / Wang, Tianrui / Gao, Yingying / Feng, Junlan / Zhang, Shilei et al. | 2023
- 427 | A Subband Approach to Personal Sound Zone with Joint Optimization of Sound Pressure and Particle Velocity | Zhao, Yingke / Zhang, Wen / Chen, Jingdong et al. | 2023
- 432 | An Multi-evidence Fusion Based on C-Distance with Uncertain Reasoning for Classification | Cheng, Cuiping / Yue, Pengcheng / Li, Taihao et al. | 2023
- 438 | On Uncertainty Principles for Lowband Graph Signals | Li, Na / Shang, Linbo / Zhang, Zhichao et al. | 2023
- 443 | CoA-DLinkNet: Connectivity-Enhanced Dual-Branch Road Extraction Network Based on D-LinkNet | Li, Linghan / Chen, Heliu / He, Renjie / Dai, Yuchao / He, Mingyi et al. | 2023
- 450 | Black-box Lossless Fragile Watermarking Based on Hidden Space Search for DNN Integrity Authentication | Zhao, Gejian / Qin, Chuan et al. | 2023
- 456 | Hiding patient information in medical images: A high-capacity and reversible hiding algorithm for E-healthcare | Zhou, Xiaoyi / Lee, Shuai et al. | 2023
- 462 | A Visually Meaningful Image Encryption Algorithm with Attention Mechanism and Artificial Bee Colony Optimization | Mao, Jiarong / An, Yuting / Zhou, Xiaoyi et al. | 2023
- 468 | High-Quality Triggers Based Fragile Watermarking for Optical Character Recognition Model | Yin, Yujie / Yin, Heng / Yin, Zhaoxia / Lyu, Wanli / Wei, Sha et al. | 2023
- 476 | Coupled Transformed Induced Tensor Nuclear Norm for Robust Tensor Completion | Qin, Mengjie / Lin, Zheyuan / Wan, Minhong / Zhang, Chunlong / Gu, Jason / Li, Te et al. | 2023
- 484 | Multi-Frequency Feature Enhancement for Multi-Granularity Visual Classification | Fu, Meijiang / Zheng, Yixiao / Chang, Dongliang / Li, Wenpan / Ma, Zhanyu et al. | 2023
- 490 | Improving Aspect Sentiment Classification via Retrieving from Training Data | Ling, Tongtao / Chen, Lei / Liao, Chen / Huang, Shilei / Yu, Zhipeng / Liu, Yi et al. | 2023
- 498 | CH-MEAD: A Chinese Multimodal Conversational Emotion Analysis Dataset with Fine-Grained Emotion Taxonomy | Ruan, Yu-Ping / Zheng, Shu-Kai / Huang, Jiantao / Zhang, Xiaoning / Liu, Yulong / Li, Taihao et al. | 2023
- 506 | Evolutionary Analysis and Cultural Transmission Models of Color Style Distributions in Painting Arts | Nakamura, Eita / Saito, Yasuyuki et al. | 2023
- 514 | Ultimatelink Between Characters Having a Certain Meaning in Physical Space to URL in Cyberspace with Robust Print and Scan | Yamadera, Keiji / Niimi, Michiharu et al. | 2023
- 519 | Human Flow Measurement System Using Floor Estimation of Depth Images for Low-End IoT Devices | Nagatoshi, Takuya / Niimi, Michiharu et al. | 2023
- 523 | Holo-QoI: A Human Factor-Based Dataset and Prediction Framework for Assessing Quality of Interaction in Augmented Reality | Kim, Seongjean / Choi, Seonghwa / Lee, Sanghoon et al. | 2023
- 529 | Supervised Single-channel EEG Decomposition using Detector-kernel Networks for Noise Reduction | Higashi, Hiroshi et al. | 2023
- 535 | Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning | Wang, Xinyu / Li, Mingtao / Li, Hao / Pun, Sio Hang / Chen, Fei et al. | 2023
- 540 | Decoding time-course of saliency network of fMRI signals by EEG signals using optimized forward variable selection: a concurrent EEG-fMRI study | Dang, Tung / Ono, Kentaro / Sasaoka, Takafumi / Yamawaki, Shigeto / Machizawa, Maro G et al. | 2023
- 546 | Multimodal recognition of speech and electrocorticogram | Ahuja, Mitali / Komeiji, Shuji / Mitsuhashi, Takumi / Iimura, Yasushi / Suzuki, Hiroharu / Sugano, Hidenori / Shinoda, Koichi / Tanaka, Toshihisa et al. | 2023
- 551 | Enhancing Real-Time Semantic Segmentation with Textual Knowledge of Pre-Trained Vision-Language Model: A Lightweight Approach | Lin, Chia-Yi / Chen, Jun-Cheng / Wu, Ja-Ling et al. | 2023
- 559 | EEG study on anticipation of difficulty for upcoming auditory task | Song, Zichen / Higashi, Hiroshi / Ishii, Shin et al. | 2023
- 567 | Event-Related Potential in Rapid Serial Visual Presentation-based Partial Face Cognition Depends on Visible Face Components | Chanpornpakdi, Ingon / Tanaka, Toshihisa et al. | 2023
- 575 | Residual, Mixer, and Attention: The Three-way Combination for Streaming Wake Word Detection Framework | Singkul, Sattaya / Sakdejayont, Theerat / Chalothorn, Tawunrat et al. | 2023
- 583 | Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss | Deng, Tengyu / Nakamura, Eita / Yoshii, Kazuyoshi et al. | 2023
- 591 | Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow | Chang, Li-Jen / Liao, Yu-Cheng / Lin, Chia-Hui / Yang-Mao, Shih-Fang / Chen, Hwann-Tzong et al. | 2023
- 599 | A Reversible Image Processing Method for Color Tone Control Using Data Hiding | Nakaya, Daichi / Imaizumi, Shoko et al. | 2023
- 605 | Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation | Shalabi, Fatma / Nguyen, Huy H. / Felouat, Hichem / Chang, Ching-Chun / Echizen, Isao et al. | 2023
- 613 | Gait Recognition Scheme Focusing on Operating Characteristics at Feature Points Detected by OpenPose | Tanaka, Chinatsu / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
- 620 | A Study on Eliminating Biased Node in Federated Learning | Akai, Reon / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
- 628 | Can StArtGAN withstand Image Processing Attacks? | Ng, Koi Yee / Ong, Simying / Loh, Yuen Peng et al. | 2023
- 635 | Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition | Wang, Chang / Du, Jun / Chen, Hang / Wang, Ruoyu / Yang, Chao-Han Huck / Zhao, Jiangjiang / Ren, Yuling / Li, Qinglong / Lee, Chin-Hui et al. | 2023
- 643 | Interpretable Image Recognition in Hyperbolic Space | Lebedeva, Irina / Bah, Mohamed Jaward / Li, Taihao et al. | 2023
- 651 | Low-light is More Than Darkness: An Empirical Study on Illumination Types and Enhancement Methods | Liew, Hui Sze / Loh, Yuen Peng / Ong, Simying et al. | 2023
- 659 | MoMo Strategy: Learn More from More Mistakes | Chulif, Sophia / Lee, Sue Han / Loong Chang, Yang / Kit Tsun, Mark Tee / Chai, Kok Chin / Then, Yi Lung et al. | 2023
- 666 | Unveiling Robust Feature Spaces: Image vs. Embedding-Oriented Approaches for Plant Disease Identification | Ishrat, Hamza Ahmed / Yu Hao Chai, Abel / Lee, Sue Han / Hui Then, Patrick Hang et al. | 2023
- 674 | Facial Expression Recognition as markers of Depression | Gue, Jia Xuan / Chong, Chun Yong / Lim, Mei Kuan et al. | 2023
- 681 | How Transferable are Herbarium-Field Features in Few-Shot Plant Identification with Triplet Loss? | Chulif, Sophia / Lee, Sue Han / Loong Chang, Yang / Kit Tsun, Mark Tee / Chin Chai, Kok / Then, Yi Lung et al. | 2023
- 688 | Resolution-Adaptive Lossless Image Compression Using Frequency Decomposition Network | Rhee, Hochang / Cho, Nam Ik et al. | 2023
- 696 | Implementation and Analysis on Backpropagating Refinement Scheme for Interactive Image Segmentation | Lee, Chaewon / Jang, Won-Dong / Kim, Chang-Su et al. | 2023
- 703 | Implicit Neural Representation for Video Coding Through Progressive Feature Extraction | Lee, Jihoo / Kang, Je-Won et al. | 2023
- 709 | Deep Unfolded Underwater Image Enhancement Based on Extreme Channels Prior | Pham, Thuy Thi / Mai, Truong Thanh Nhat / Lee, Chul et al. | 2023
- 714 | Low-Light Image Enhancement via Distillation of NIR-to-RGB Conversion Knowledge | Jeong, Young-Min / Park, Tae-Sung / Park, Jeong-Hyeok / Kim, Jong-Ok et al. | 2023
- 719 | 3D Human Skeleton Estimation from Single RGB Image Based on Fusion of Predicted Depths from Multiple Virtual-Viewpoints | Lie, Wen-Nung / Vann, Veasna et al. | 2023
- 726 | GNN-Based Small-Data Learning with Area-Control Mechanism for Hyperspectral Satellite Change Detection | Lin, Tzu-Hsuan / Lin, Chia-Hsiang / Young, Si-Sheng et al. | 2023
- 733 | Efficient Constraint-Aware Neural Architecture Search for Object Detection | Poliakov, Egor / Hung, Wei-Jie / Huang, Ching-Chun et al. | 2023
- 741 | A Reliable Feature-Based Framework for Vehicle Tracking in Advanced Driver Assistance Systems | Ha-Phan, Ngoc -Quan / Truong, Thanh-Nguyen / Tran, Vu -Hoang / Huang, Ching-Chun et al. | 2023
- 748 | Light-weight Zero-Reference-based Image Enhancement for Low-Light Images | Chang, Jie-Fan / Lai, Kuan-Ting / Zhuang, Cheng-Xuan / Lin, Guo-Shiang / Chang, Ku-Yaw et al. | 2023
- 753 | Classwise Self-Paced Self-Training for Semi-Supervised Image Classification | Lu, Cheng-Yu / Hsu, Heng-Cheng / Chiang, Chen-Kuo et al. | 2023
- 759 | CapFormer: A Space-Time Video Description Model using Joint-Attention Transformer | Moussa, Mahamat / Lim, Chern Hong / Wong, KokSheik et al. | 2023
- 765 | Local Contrast Enhancement with Multiscale Filtering | Hayashi, Kohei / Maeda, Yoshihiro / Fukushima, Norishige et al. | 2023
- 771 | Marine Snow Removal Benchmarking Dataset | Kaneko, Reina / Sato, Yuya / Ueda, Takumi / Higashi, Hiroshi / Tanaka, Yuichi et al. | 2023
- 779 | Cross-Frame Foreground Structural Similarity Modeling by Convolutional Sparse Representation | Naganuma, Kazuki / Ono, Shunsuke et al. | 2023
- 784 | JPEG Artifact Removal for Hyperspectral Images Based on Spatial-Spectral Regularization | Eguchi, Ryunosuke / Kobayashi, Iori / Ono, Shunsuke / Matsuoka, Ryo et al. | 2023
- 788 | Data Driven Multiband Image Fusion That Preserves Wavelength-Specific Image Features | Lin, Hsuan / Hirakawa, Keigo et al. | 2023
- 795 | Shot-Noise-Aware Image Signal Restoration for Photoelectronic Charge-Based Sensors | Takamura, Seishi et al. | 2023
- 800 | Generative Adversarial Network-Based Frame Interpolation with Multi-Perspective Discrimination | Tran, Quang Nhat / Yang, Shih-Hsuan et al. | 2023
- 806 | ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation | Barua, Hrishav Bakul / Krishnasamy, Ganesh / Wong, KokSheik / Stefanov, Kalin / Dhall, Abhinav et al. | 2023
- 813 | LSR++: An Efficient and Tiny Model for Image Super-Resolution | Wang, Wei / Lei, Xuejing / Chen, Yueru / Lee, Ming-Siu / Kuo, C.-C. Jay et al. | 2023
- 820 | High-Quality Font Generation Based on StyleGAN2 and FSFont Font Generation Model | Shimamura, Yuki / Niimi, Michiharu et al. | 2023
- 826 | Enhanced Residual Fourier Transformation Network for Lightweight Image Super-resolution | Yang, Yunming / Ikehara, Masaaki et al. | 2023
- 833 | ELEGANT: End-to-end Language Grounded Speech Denoiser for Efficient Generation of Talking Face | Chai, Ai-Fang / Rajanala, Sailaja / Pal, Arghya / Phan, Raphael C.-W. / Ting, Chee-Ming et al. | 2023
- 839 | Segmentation Enhancement for Iris Recognition Using Unit Gradient Vectors | Meam, Limhourlaurent / Duangpummet, Suradej / Kongprawechnon, Waree et al. | 2023
- 846 | FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking | Cheung, Tsun-Hin / Lam, Kin-Man et al. | 2023
- 854 | Auditory Representation Effective for Estimating Vocal Tract Information | Irino, Toshio / Doan, Shintaro et al. | 2023
- 862 | Accurate and Practical Query-by-Example Using Multiple Deep Learning Models and Frame Compression Methods | Yamaga, Hikaru / Hatakeyama, Kazuki / Kojima, Kazunori / Lee, Shi-Wook / Itoh, Yoshiaki et al. | 2023
- 868 | Fundamental Frequency Estimation Based on Finite-Order Harmonic Constraint Differential Equation | Yamada, Kenta / Masuyama, Yoshiki / Yamaoka, Kouei / Ono, Nobutaka et al. | 2023
- 873 | Tone Labeling by Deep Learning-based Tone Recognizer for Mandarin Speech | Li, Wu-Hao / Chiang, Chen-Yu / Liu, Te-Hsin et al. | 2023
- 881 | Learning to Enhance the Position Embedding and Coherence | Shu, Ting-Jia / Chien, Jen-Tzung et al. | 2023
- 887 | VLSI Design of Near-Lossless Image Compression using Improved LZW | Zhang, Yao-Zhong / Chen, Chiung-An / Zhang, Jia-Sheng / Wang, Jia-Wen et al. | 2023
- 892 | The color demosaicing and image scaling based on improve Hamilton-Adams | Peng, Yu-Wen / Hu, Chia-Yu / Chin, Yen-Ju / Chou, He-Sheng / Lin, Yuan-Jin / Liu, Yu-Lin / Chen, Shih-Lun / Chen, Tsung-Yi / Li, Kuo-Chen / Chen, Chiung-An et al. | 2023
- 898 | Improving Regularization of Deep Learning Models in Fundus Analysis | Hsu, Wei-Wen / Chang, Yao-Chung / Lee, Wei-Min / Huang, Yu-Chuan / Lu, Da-Wen et al. | 2023
- 902 | Design of Interactive System for Acupoint Analysis Based on Augmented Reality | Wei, Chung-Yen / Xu, Bo-Yuan / Zhao, Yu-Xiang et al. | 2023
- 910 | Dental Positioning Medical Assistance System for BW Radiograph Based on YOLOV4 | Lin, Mu-Feng / Li, Yi-Qian / Chen, Tsung-Yi / Liu, Yu-Lin / Lin, Yuan-Jin / Chan, Mei-Ling / Chen, Chiung-An / Li, Kuo-Chen / Chen, Shih-Lun et al. | 2023
- 918 | The Development of an AI-assisted Diagnosis System for Adult Glioma Subtyping Prediction | Hsu, Wei-Wen / Lin, Jia-Yi / Lai, Hsin-Hung / Hsu, Wan-Lin / Jiang, Jeng-Ting / Chang, Yao-Chung / Li, Yao-Feng et al. | 2023
- 922 | Poisoning Attacks against Gait-based Identity Recognition | Dong, Jianmin / Peng, Da-Tian / Pei, Guanxiong / Li, Taihao et al. | 2023
- 927 | STrack: Velocity Estimation Using Single Antenna WiFi Devices | Xu, Jian / Zhang, Dongheng / Li, Jiamu / Sun, Qibin / Chen, Yan et al. | 2023
- 934 | SEformer: Dual-Path Conformer Neural Network is a Good Speech Denoiser | Wang, Kai / Hatzinakos, Dimitrios et al. | 2023
- 941 | Complex Feature Information Enhanced Speech Emotion Recognition | Yue, Pengcheng / Zheng, Shukai / Li, Taihao et al. | 2023
- 947 | Incorporating Pinyin into Pipeline Named Entity Recognition from Chinese Speech | Zhang, Min / Qiao, Xiaosong / Zhao, Yanqing / Su, Chang / Li, Yinglu / Zhu, Ming / Zhu, Junhao / Li, Yuang / Zhao, Xiaofeng / Liu, Yilun et al. | 2023
- 954 | Learning Semantic Information from Machine Translation to Improve Speech-to-Text Translation | Deng, Pan / Zhang, Jie / Zhou, Xinyuan / Ye, Zhongyi / Zhang, Weitai / Cui, Jianwei / Dai, Lirong et al. | 2023
- 960 | Effective Fine-tuning Method for Tibetan Low-resource Dialect Speech Recognition | Yang, Jiahao / Wei, Jianguo / Khysru, Kuntharrgyal / Xu, Junhai / Lu, Wenhuan / Ke, Wenjun / Yang, Xiaokang et al. | 2023
- 966 | Multi-task Piano Transcription with Local Relative Time Attention | Wang, Qi / Liu, Mingkuan / Chen, Xianhong / Xiong, Mengwen et al. | 2023
- 972 | Real and imaginary part interaction network for monaural speech enhancement and de-reverberation | Zhang, Zehua / He, Changjun / Xu, Shiyun / Wang, Mingjiang et al. | 2023
- 978 | Progressive Multi-scale Self-supervised Learning for Speech Recognition | Wan, Genshun / Chen, Hang / Liu, Tan / Wang, Chenxi / Pan, Jia / Ye, Zhongfu et al. | 2023
- 983 | Improved Data2vec with Soft Supervised Hidden Unit for Mandarin Speech Recognition | Wan, Genshun / Chen, Hang / Li, Pengcheng / Pan, Jia / Ye, Zhongfu et al. | 2023
- 988 | Investigation of Ensemble of Self-Supervised Models for Speech Emotion Recognition | Wu, Yanfeng / Yue, Pengcheng / Cheng, Cuiping / Li, Taihao et al. | 2023
- 996 | Single Source Zone Detection in the Spherical Harmonic Domain for Multisource Localization | Tao, Liang / Jia, Maoshen / Bu, Bing / Yao, Dingding et al. | 2023
- 1002 | Robust Representation Learning for Speech Emotion Recognition with Moment Exchange | Cai, Yunrui / Song, Changhe / Tang, Boshi / Dai, Dongyang / Wu, Zhiyong / Meng, Helen et al. | 2023
- 1008 | Few Shot Learning Guided by Emotion Distance for Cross-corpus Speech Emotion Recognition | Yue, Pengcheng / Wu, Yanfeng / Qu, Leyuan / Zheng, Shukai / Zhao, Shuyuan / Li, Taihao et al. | 2023
- 1013 | Speech Emotion Recognition by Late Fusion of Linguistic and Acoustic Features using Deep Learning Models | Sato, Kiyohide / Kishi, Keita / Kosaka, Tetsuo et al. | 2023
- 1019 | Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset | Atmaja, Bagus Tris / Sasou, Akira et al. | 2023
- 1026 | Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech | Atmaja, Bagus Tris / Sasou, Akira et al. | 2023
- 1030 | An Automatic Pipeline For Building Emotional Speech Dataset | Thi, Ngoc-Anh Nguyen / Thang Ta, Bao / Le, Nhat Minh / Hai Do, Van et al. | 2023
- 1036 | Analysis of Emotions in Speech using AESDD | Uthiraa, S. / Patil, Hemant et al. | 2023
- 1042 | Modified Parametric Multichannel Wiener Filter for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers | Guo, Ning / Nakatani, Tomohiro / Araki, Shoko / Moriya, Takehiro et al. | 2023
- 1050 | Blind Source Separation Using Independent Low-Rank Matrix Analysis with Spectrogram-Consistency Regularization | Misawa, Sota / Takamune, Norihiro / Yatabe, Kohei / Kitamura, Daichi / Saruwatari, Hiroshi et al. | 2023
- 1058 | Moving Interference Speaker removal using Geometrically Constrained Independent Vector Analysis | Furunaga, Shinya / Ueda, Tetsuya / Makino, Shoji et al. | 2023
- 1064 | A Dual-Channel Three-Stage Model for DoA and Speech Enhancement | Wu, Meng-Hsuan / Shen, Yih-Liang / Chou, Hsuan-Cheng / Shih, Bo-Wun / Chi, Tai-Shih et al. | 2023
- 1069 | A Weighted Binary Cross-Entropy for Sound Event Representation Learning and Few-Shot Classification | Bai, Zhongxin / Pan, Chao / Chen, Gong / Chen, Jingdong / Benesty, Jacob et al. | 2023
- 1075 | A Reconfigurable Hardware Architecture for Graph Convolution Network in Action Recognition | Tsai, Tsung-Han / Chen, Tzu-Chieh et al. | 2023
- 1079 | Automated Carina Detection in Chest X-ray Images Using Non-Overlapping and Cross-Squeeze Convolutional Neural Networks | Hsu, Chung-Chian / Chen, Chi-Yuan / Salahuddin Morsalin, S. M. / Chang, Arthur / Fan, Wen-Lin et al. | 2023
- 1085 | Identifying the Style of Chatting | Zhang, Manman / Ma, Yuchen / Luo, Ge / Li, Sheng / Qian, Zhenxing / Zhang, Xinpeng et al. | 2023
- 1093 | Pose-Based Visual Servoing with Lightweight Deep-Learning Binarization for Autonomous Mobile Robot Application | Ho, Chian C. / Lin, Cian-Duo et al. | 2023
- 1100 | Real-Time Noise Suppression Using Harmonic/Percussive Separation with Morphological Operations for Hammering Test | Uchiyama, Ryugo / Tanabe, Nari et al. | 2023
- 1107 | ΔΣ Modulators for Discrete-time Closed Loop Control Systems with Quantization and Saturation | Ohno, Shuichi / Wang, Shenjian / Takaba, Kiyotsugu et al. | 2023
- 1112 | Asymptotic Estimation Performance of Linear Regression Model with Sparse Bayesian Learning as Both Samples and Signals Approach Infinity | Murayama, Kazuaki et al. | 2023
- 1119 | Convolutional Multidimensional Amplitude Spectrum Nuclear Norm for Frequency-domain Robust Principal Component Analysis | Harashima, Ryoya / Eguchi, Ryunosuke / Kyochi, Seisuke et al. | 2023
- 1126 | Moreau Envelope ADMM for Decentralized Weakly Convex Optimization | Mirzaeifard, Reza / Venkategowda, Naveen K. D. / Jung, Alexander / Werner, Stefan et al. | 2023
- 1131 | An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids | Chung, Yu-Ching / Han, Ji-Yan / Wang, Bo-Sin / Zheng, Wei-Zhong / Shen, Kung-Yao / Lai, Ying-Hui et al. | 2023
- 1138 | On Joint Dereverberation and Source Separation with Geometrical Constraints and Iterative Source Steering | Mo, Kaien / Wang, Xianrui / Yang, Yichen / Ueda, Tetsuya / Makino, Shoji / Chen, Jingdong et al. | 2023
- 1143 | Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech | Maben, Leander Melroy / Guo, Zixun / Chen, Chen / Chudiwal, Utkarsh / Siong, Chng Eng et al. | 2023
- 1150 | Step Size Control of Shared-error Normalized Least Mean Square Algorithm for Acoustic Echo and Noise Canceller | Iwai, Kenta / Nishiura, Takanobu et al. | 2023
- 1155 | Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer | Xing, Haoran / Zhang, Shiqi / Takeuchi, Daiki / Niizumi, Daisuke / Harada, Noboru / Makino, Shoji et al. | 2023
- 1161 | Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion | Huang, Wen-Chin / Toda, Tomoki et al. | 2023
- 1167
-
DisC-VC: Disentangled and F0-Controllable Neural Voice ConversionWatanabe, Chihiro / Kameoka, Hirokazu et al. | 2023
- 1172
-
Speech Synthesis Using Ambiguous Inputs From Wearable KeyboardsIwasaki, Matsuri / Hara, Sunao / Abe, Masanobu et al. | 2023
- 1179
-
Accent-Preserving Voice Conversion between Native-Nonnative Speakers for Second Language LearningCorrea, Iago Lourenco / Ueno, Sei / Lee, Akinobu et al. | 2023
- 1187
-
Increasing Speech Intelligibility by Mimicking Professional Announcers’ Voices and Its Physical CorrelatesTran, Dung Kim / Akagi, Masato / Unoki, Masashi et al. | 2023
- 1193
-
Robust Networked Federated Learning for LocalizationMirzaeifard, Reza / Venkategowda, Naveen K. D. / Werner, Stefan et al. | 2023
- 1199
-
Continual Local Updates for Federated Learning with Enhanced Robustness to Link NoiseLari, Ehsan / Gogineni, Vinay Chakravarthi / Arablouei, Reza / Werner, Stefan et al. | 2023
- 1204
-
Gaussian Process Learning for Location-Based Service DataUgurel, Ekin / Huang, Shuai / Chen, Cynthia et al. | 2023
- 1208
-
Distributed on-line anomaly detection using kernel methodsKuh, Anthony / Baguio, Tyler et al. | 2023
- 1214
-
Communication-Efficient Design of Learning System for Energy Demand Forecasting of Electrical VehiclesXu, Jiacong / Kilfoyle, Riley / Xiong, Zixiang / Lu, Ligang et al. | 2023
- 1221
-
Radiated Sound Field Reproduction for Surrounding Loudspeaker Array Based on Higher-order AmbisonicsNaiki, Shota / Miura, Shumpei / Iwai, Kenta / Nishiura, Takanobu / Soeta, Yoshiharu et al. | 2023
- 1226
-
Multichannel learning-based spatially extended active noise control via model matching and sensor transfer function interpolationZhong, Pei-Lin / Chen, You-Siang / Bai, Mingsian R. et al. | 2023
- 1234
-
A Study of the Microphone Protection of Active Noise Control for Axial FanShen, Yi-Tsung / Chang, Cheng-Yuan et al. | 2023
- 1240
-
SFANC with Compensation Filter Based on MEFxDCTLMS AlgorithmDoi, Kenya / Kajikawa, Yoshinobu et al. | 2023
- 1245
-
Practical Active Noise Control: Restriction of Maximum Output PowerGan, Woon-Seng / Shi, Dongyuan / Shen, Xiaoyi et al. | 2023
- 1250
-
A QoS Throughput Performance Measurement Comparison between UGS and BE Services of a Real-time FPGA Based OFDM Multi-user System Design ImplementationAdiono, Trio / Jonathan, Michael / Setiawan, Erwin / Sutisna, Nana / Mulyawan, Rahmat / Syafalni, Infall et al. | 2023
- 1257
-
Algorithm Development for Stepwise Valve Deflation Method in Blood Pressure MeasurementAdiono, Trio / Ramadhani, Reina Puteri / Amadeus, Clarance / Cicilya Sinaga, Sindy Novaria et al. | 2023
- 1263
-
SUMO Based Hardware/Software Co-simulation for Two-Intersection Adaptive and Collaborative Traffic Signal ControllerGinting, Kendrik Emkel / Sutisna, Nana / Syafalni, Infall / Adiono, Trio et al. | 2023
- 1271
-
Sparsity Exploration for Structured and Unstructured Weight Formations in CNN ArchitectureEndrawati, Devi Noor / Syafalni, Infall / Sutisna, Nana / Adiono, Trio et al. | 2023
- 1279
-
1M parameters are enough? A lightweight CNN-based model for medical image segmentationDinh, Binh-Duong / Nguyen, Thanh-Thu / Tran, Thi-Thao / Pham, Van-Truong et al. | 2023
- 1285
-
Imaging Ultrasound Scattering Targets using Density-Enhanced Chaotic Compressive SamplingTheu, Luong Thi / Huy, Tran Quang / Quynh, Tran Thi Thuy / Tran, Duc-Tan et al. | 2023
- 1291
-
Segmentation and observation of hand rehabilitation exercises by supporting of acceleration signalsNguyen, Sinh-Huy / Le, Thi-Thu-Hong / Nguyen, Hoang-Bach / Duong, Ngoc-Bach / Nguyen, Hung-Cuong / Nguyen, Chi-Thanh / Nguyen, Van-Loi / Vu, Hai et al. | 2023
- 1296
-
Investigating the Role of Human Action Detector in Visual-guide Audio Source Separation SystemDuong, Thanh Thi-Hien / Nguyen, Trung-Hieu / Le, The Thanh-Dat / Nghiem, Thi-Lich / Pham, Duc-Huy / Le, Thi-Lan et al. | 2023
- 1304
-
A combination of time and frequency synchronization with Doppler compensation for coded OFDM-based UWA systemsNguyen Thi, Hoai Linh / Khuong Nguyen, Quoc / Nguyen, Van Duc et al. | 2023
- 1310
-
Classification of Normal vs. Pathological Infant Cries Using Morse WaveletsGupta, Priyanka / Kachhi, Aastha / Patil, Hemant A. et al. | 2023
- 1317
-
Compressive Sensing Based Algorithms for Limited-View PAT Image ReconstructionJohn, Mary Josy / Barhumi, Imad et al. | 2023
- 1323
-
Towards AST-LLDs for the Analysis of Depression in Speech SignalsNagappan, Sidharrth / Lim, Chern Hong / Thimali Dharmaratne, Anuja et al. | 2023
- 1329
-
ecVoice: Audio Text Extraction Optimization of Video Based on Idioms Similarity ReplacementLin, Jinwei et al. | 2023
- 1337
-
Heart Rate Acquisition and Processing Techniques for a Miniature Wearable Microphone SensorAng, Yi Yang / Boodhoo, Kirish / Ser, Wee / Tan, Rex Xiao et al. | 2023
- 1343
-
Detection and Correction of Defective Relative Humidity Data Collected from the Greenhouse Environment Using Nested Kalman Filters with Standard Deviation AnalysisSirisanwannakul, Kraithep / Siripool, Nutchanon / Suzuki, Kenji / Kongprawechnon, Waree / Karnjana, Jessada et al. | 2023
- 1349
-
Pedestrian Crossing Intention Prediction with Multi-Modal Transformer-Based ModelWang, Ting Wei / Lai, Shang-Hong et al. | 2023
- 1357
-
Revolutionizing Formative Assessment in STEM Fields: Leveraging AI and NLP TechniquesTan, Chi Wee / Lim, Khai Yin et al. | 2023
- 1365
-
A Biased Mixed-Precision Convolution Engine for Hardware-Efficient Computational Imaging CNNTu, Hao-Jiun / Ou, Yu-Feng / Chen, Yong-Tai / Huang, Chao-Tsung et al. | 2023
- 1372
-
A Lightweight Speaker Verification Model For Edge DeviceChen, Ting-Wei / Chen, Chia-Ping / Lu, Chung-Li / Chan, Bo-Cheng / Cheng, Yu-Han / Chuang, Hsiang-Feng / Chen, Wei-Yu et al. | 2023
- 1378
-
Efficient Dictionary and Grid-Based Framework for Answering Durable k-Nearest Neighbor Queries on Time Series DataSantoso, Bagus Jati / Armunanta, Dwi Prasetya / Pratomo, Baskoro Adi / Studiawan, Hudan et al. | 2023
- 1386
-
Dual-Path Residual Attention Convolution Networks for Color-Embedded-Grayscale ImagePrasetyo, Heri / Mahdy, Abid Ammar / Nadhif, Abrar Dwi Fairuz / Hidayat, Taufiqurrakhman Nur / Hartono, Rudi et al. | 2023
- 1392
-
DOC: A Novel DOuble-Contour-Based Macro Placement Framework for Mixed-Size DesignsZhuo, Yin-Rong / Chen, Hui-Lin / Chen, Yu-Guang et al. | 2023
- 1398
-
Hindering Adversarial Attacks with Multiple Encrypted Patch EmbeddingsMaungMaung, AprilPyone / Echizen, Isao / Kiya, Hitoshi et al. | 2023
- 1405
-
Implementation of PLIM on 429MHz LoRa/FSK with improved conversion tableTakeda, Keita / Miyamoto, Ryuji / Takyu, Osamu et al. | 2023
- 1410
-
Numerical Performance Evaluation of ℓ1 - ℓ2 Sparse Reconstruction Using Optical Analog CircuitFurusawa, Soma / Hayashi, Kazunori / Kameda, Kaito / Hayakawa, Ryo et al. | 2023
- 1417
-
Assessing the Effects of Filtering Processing on Pulse Wave Transit Time Measured by Photoplethysmography from EarlobeLiao, Shangdi / Liu, Haipeng / Zheng, Dingchang / Chen, Fei et al. | 2023
- 1422
-
Efficient Incremental Text-to-Speech on GPUsDu, Muyang / Liu, Chuan / Qi, Jiaxing / Lai, Junjie et al. | 2023
- 1429
-
Retinex-based Low-Light Image EnhancementLuo, Rui / Feng, Yan / He, Mingxin / Zhang, Yuliang et al. | 2023
- 1435
-
Fine-grained Face Anti-Spoofing based on Recursive Self-Attention and Multi-Scale FusionXie, Shichuang / Wu, Jiasheng / Chen, Yanli / Han, Meng / Wu, Ting / Qiao, Tong et al. | 2023
- 1443
-
StyleStegan: Leak-free Style Transfer Based on Feature SteganographyLiang, Xiujian / Liu, Bingshan / Ying, Qichao / Qian, Zhenxing / Cho, Hsunfang / Zhang, Xinpeng et al. | 2023
- 1451
-
Robust Watermark Imaging via Graph-signal OptimizationYang, Ruiguo / Han, Xinhui / Qi, Wenfa / Hu, Wei et al. | 2023
- 1458
-
A print-scan-resilient watermarking scheme for trademark imagesQi, Wenfa / Wang, Jiameng / Yuan, Zichen / Li, Xiaolong et al. | 2023
- 1463
-
AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream NetworkXi, Ziyi / Huang, Wenmin / Wei, Kangkang / Luo, Weiqi / Zheng, Peijia et al. | 2023
- 1471
-
ResNet-Based Camera Model Identification with Adaptive Preprocessing Module and Weight Fusion of Global InformationChen, Boru / Abdulla, Waleed et al. | 2023
- 1479
-
Structural Quality Assured Global Optimization for CTU-Level Rate Control of Screen Content CodingTang, Tong / Tan, Yuan / Ding, Shihang / Li, Zhidu et al. | 2023
- 1484
-
Multimodal Emotion Recognition based on 2D Kernel Density Estimation for Multiple Labels FusionLuo, Zhaojie / Komatani, Kazunori et al. | 2023
- 1492
-
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised RepresentationsSahipjohn, Neha / Shah, Neil / Tambrahalli, Vishal / Gandhi, Vineet et al. | 2023
- 1500
-
Realizing Nipple in Profile Recognition and Nipple Detection Using a Single ClassificationZeng, Yi-Chong et al. | 2023
- 1506
-
Exploring a CLIP-Enhanced Automated Approach for Video Description GenerationZhang, Siang-Ling / Cheng, Huai-Hsun / Chen, Yen-Hsin / Yeh, Mei-Chen et al. | 2023
- 1512
-
3D Point Cloud Denoising Based on Color AttributeLin, Wei-Chi / Lee, Ming-Zhan / Chou, He-Sheng / Lin, Yuan-Jin / Li, Kuo-Chen / Lin, Ting-Lan / Chen, Shin-Lun et al. | 2023
- 1517
-
The DSP and DDR4 VLSI Design for Multi-Sensor in Biomedical SystemZhang, Jia-Sheng / Chen, Chiung-An / Chen, Shih-Lun / Zhang, Yao-Zhong et al. | 2023
- 1521
-
Identification of Victims Wearing Vibrant Clothing using MATLABHao-Cheng, Lu / Chiung-An, Chen / Jia-Sheng, Zhang / Yao-Zhong, Zhang et al. | 2023
- 1525
-
Point Cloud Inpainting Based on Delaunay TriangulationLiu, Yu-Lin / Chou, He-Sheng / Lee, Ming-Zhan / Chan, Mei-Ling / Lin, Ting-Lan / Chen, Chiung-An / Chen, Shin-Lun et al. | 2023
- 1530
-
Dense Three-Dimensional Color Reconstruction for Large-Scale Outdoor ScenesLiu, Zixiao / Guo, Sheng / Pun, Man-On et al. | 2023
- 1536
-
Safety Enhancement for Mobility Scooter with Rule-Based Danger PreventionChen, Yan-Ru / Tseng, Shih-Wei-Chen / Chen, Yu-Chi / Chang, Yeong-Hwa et al. | 2023
- 1542
-
Dictionary-driven Chinese ASR Entity Correction with Controllable DecodingLi, Rongjun / Peng, Wei et al. | 2023
- 1549
-
A Method of Efficient Synthesizing Post-disaster Remote Sensing Image with Diffusion Model and LLMOu, Ruizhe / Yan, Haotian / Wu, Ming / Zhang, Chuang et al. | 2023
- 1556
-
Privacy-oriented Coded Caching in Mobile Information-centric NetworkingYang, Binchen / Guo, Yu / Chen, Xingyan et al. | 2023
- 1564
-
MKTformer: Fine-grained Meter Classification Based on Multi-modal Knowledge TransferZheng, Zhaoye / Zhang, Ke / Shi, Chaojun / Zheng, Fei et al. | 2023
- 1571
-
Feature Augmentation Reconstruction Network for Few-Shot Image ClassificationLi, Zhen / Wang, Lang / An, Wenjuan / Qi, Song / Li, Xiaoxu / Fei, Xuezhi et al. | 2023
- 1579
-
Dual Feature Reconstruction Network For Few-shot Image ClassificationGuo, Xiaowei / Wu, Jijie / Ren, Kai / Song, Qi / Li, Xiaoxu et al. | 2023
- 1585
-
A Cloud-based Data Platform for Efficient EEG Data Management, Collaboration, and AnalysisTian, Qi / Wu, Wen / Zhu, Qin / Cai, Tao / Jiang, Siyi / Li, Yaqing / Zhou, Jinrun / Zhu, Nan / Wei, Yina / Tang, Tao et al. | 2023
- 1593
-
Incorporating the Digit Triplet Test in A Lightweight Speech Intelligibility Prediction for Hearing AidsZhou, Xiajie / Mawalim, Candy Olivia / Angela Titalim, Benita / Unoki, Masashi et al. | 2023
- 1601
-
Deep Learning-based MRI Super-Resolution Using Non-uniform Segmented Phase-Scrambling Fourier Transform SignalsYamato, Kazuki / Fujisawa, Shuntaro / Ito, Satoshi et al. | 2023
- 1607
-
An Extreme Gradient Boosting-based Prediction for DepressionIbrahum, Ahmed / Park, Kwang Ho / Hong, Jang-Eui / Pham, Van-Huy / Ryu, Keun Ho et al. | 2023
- 1614
-
An Improved Check Digit-based Participant Identification System for Human BiorepositoriesChu, Minseok / Kang, Gilwon / Ryu, Keun Ho et al. | 2023
- 1622
-
Enhancing Snoring Detection with Statistical Analysis of Audio FeaturesBuaruk, Suphachok / Deepaisarn, Somrudee et al. | 2023
- 1628
-
Un-Rectifying in ReLU Networks and ApplicationsTung, Shih-Shuo / Chung, Ming-Yu / Ho, Jinn / Hwang, Wen-Liang et al. | 2023
- 1636
-
OpenPose Based Yoga Poses Difficulty Estimation for Dynamic and Static Yoga ExercisesHuang, Wan-Chia / Shih, Cheng-Liang / Anggraini, Irin Tri / Xiao, Yanqi / Funabiki, Nobuo / Fan, Chih-Peng et al. | 2023
- 1641
-
Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic FeaturesZhao, Jiahao / Yoshii, Kazuyoshi et al. | 2023
- 1646
-
Learned String Quartet Music with Variational Auto EncoderChen, Young-Long / Huang, Hsin-I / Yen, Tzu-Te et al. | 2023
- 1652
-
SOAda-YOLOR: Small Object Adaptive YOLOR Algorithm for Road Object DetectionHuang, Yu-Fang / Liu, Tsung-Jung / Lin, Chun-An / Liu, Kuan-Hsien et al. | 2023
- 1659
-
Badminton Self-Training System Based on Virtual RealityTai, Wei-Shen / Liu, Kuan-Hsien et al. | 2023
- 1664
-
Rotation Angle Detection Using a Pilot Signal from Rotated Stego-ImageKawano, Rinka / Kawamura, Masaki et al. | 2023
- 1670
-
Application for generating re-accessible screenshots of web pages using histogram shrinkageSakamoto, Ayaka / Kawano, Rinka / Kawamura, Masaki et al. | 2023
- 1677
-
Domain Adaptation for Efficiently Fine-tuning Vision Transformer with Encrypted ImagesNagamori, Teru / Shiota, Sayaka / Kiya, Hitoshi et al. | 2023
- 1684
-
Study on Face Landmark-based Analysis for Synthetic Media Identification Generated by Adversarial Generative NetworksUra, Akinobu / Kuribayashi, Minoru / Funabiki, Nobuo et al. | 2023
- 1691
-
HDR Image Watermarking based on Saliency Detection and Quantization Index ModulationKhan, Ahmed / Kuribayashi, Minoru / Wong, KokSheik / Baskaran, Vishnu Monn et al. | 2023
- 1697
-
Quick Response (QR) codes embedding in VVC using Quantisation Parameter ManipulationJoan, Hau / Tan, Li Peng / Tew, Yiqi et al. | 2023
- 1705
-
CPIPS: Learning to Preserve Perceptual Distances in End-to-End Image CompressionHuang, Chen-Hsiu / Wu, Ja-Ling et al. | 2023
- 1712
-
Task-Specific Pruning: Efficient Parameter Reduction in Multi-task Object Detection ModelsKe, Wei-Hsun / Tseng, Yu-Wen / Cheng, Wen-Huang et al. | 2023
- 1718
-
Transformer-based Image Compression with Variable Image Quality ObjectivesKao, Chia-Hao / Chen, Yi-Hsin / Chien, Cheng / Chiu, Wei-Chen / Peng, Wen-Hsiao et al. | 2023
- 1726
-
From Synthetic To Real: Enhancing Deep Learning Models With Generative Adversarial Networks For Efficient Data Utilization In Automatic Retail StoresDang, Cong-Ty / Tran, Vu-Hoang / Le, Ngoc-Hoang-Lam / Huang, Ching-Chun et al. | 2023
- 1732
-
Virtual Garment Fitting Through Parsing and Context-Aware Generative Adversarial Networks with Discriminator GroupSu, Wei-Hong / Chen, Sze-Ann / Chin, Chen-I / Hsiao, Hsu-Feng et al. | 2023
- 1739
-
Sparse Tensor-based point cloud attribute compression using Augmented Normalizing FlowsLin, Tzu-Po / Yim, Monyneath / Chiang, Jui-Chiu / Peng, Wen-Hsiao / Lie, Wen-Nung et al. | 2023
- 1745
-
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case StudiesYamamoto, Yuya et al. | 2023
- 1753
-
Out-of-Vocabulary Word Detection in Spoken Dialogues Based on Joint Decoding with User Response PatternsOshio, Miki / Munakata, Hokuto / Takeda, Ryu / Komatani, Kazunori et al. | 2023
- 1760
-
Synthetic Data Augmentation for ASR with Domain FilteringVu Ho, Tuan / Horiguchi, Shota / Watanabe, Shinji / Garcia, Paola / Sumiyoshi, Takashi et al. | 2023
- 1766
-
Multi-Self-Supervised Learning Model-Based Throat Microphone Speech RecognitionMasuda, Kohta / Ogata, Jun / Nishida, Masafumi / Nishimura, Masafumi et al. | 2023
- 1771
-
ASR Model Adaptation for Rare Words Using Synthetic Data Generated by Multiple Text-To-Speech SystemsYuen, Kwok Chin / Haoyang, Li / Siong, Chng Eng et al. | 2023
- 1779
-
Streaming End-to-End ASR Using CTC Decoder and DRA for Linguistic Information SubstitutionTakagi, Tatsunari / Ogawa, Atsunori / Kitaoka, Norihide / Wakabayashi, Yukoh et al. | 2023
- 1784
-
A Biometric Signature Scheme with Template Protection and Authenticated Sample RecoverabilityNakamura, Wataru / Takahashi, Kenta et al. | 2023
- 1792
-
IPFed: Identity protected federated learning for user authenticationKaga, Yosuke / Suzuki, Yusei / Takahashi, Kenta et al. | 2023
- 1798
-
Privacy-Preserving Image Transformation Method for Person Detection and Re-IDOuchi, Yumo / Uchida, Hidetsugu / Abe, Narishige et al. | 2023
- 1804
-
Eye Biometrics Combined with Periocular and Iris Recognition Using CNNTonosaki, Taito / Kawakami, Shokei / Ito, Koichi / Aoki, Takafumi / Yasumura, Yoshiko / Fujio, Masakazu / Kaga, Yosuke / Takahashi, Kenta et al. | 2023
- 1811
-
Development of a Robust Ear Recognition Algorithm using Planar ApproximationArakawa, Takahiko / Sato, Yuya / Sakano, Hitoshi / Ohki, Tetsushi et al. | 2023
- 1816
-
Word encoding for word-looking DGA-based Botnet classificationLiew, Sea Ran Cleon / Law, Ngai Fong et al. | 2023
- 1822
-
Analysis of Spectro-Temporal Modulation Representation for Deep-Fake Speech DetectionCheng, Haowei / Mawalim, Candy Olivia / Li, Kai / Wang, Lijun / Unoki, Masashi et al. | 2023
- 1830
-
Flexible Evidence Model to Reduce Uncertainty Mismatch Between Speech Enhancement and ASR Based on Encoder-Decoder ArchitectureTakeda, Ryu / Sudo, Yui / Komatani, Kazunori et al. | 2023
- 1838
-
Investigating the Effectiveness of Speaker Embeddings for Shout Intensity PredictionFukumori, Takahiro / Ishida, Taito / Yamashita, Yoichi et al. | 2023
- 1843
-
Is the Ideal Ratio Mask Really the Best? — Exploring the Best Extraction Performance and Optimal Mask of Mask-based BeamformersHiroe, Atsuo / Itoyama, Katsutoshi / Nakadai, Kazuhiro et al. | 2023
- 1851
-
Language modeling for spontaneous speech recognition based on disfluency labeling and generation of disfluent textHorii, Koharu / Ohta, Kengo / Nishimura, Ryota / Ogawa, Atsunori / Kitaoka, Norihide et al. | 2023
- 1857
-
Transformer-based Automatic Speech Recognition of Simultaneous Interpretation with Auxiliary Input of Source Language TextTaniguchi, Shuta / Kato, Tsuneo / Tamura, Akihiro / Yasuda, Keiji et al. | 2023
- 1862
-
An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-HearingVioleta, Lester Phillip / Toda, Tomoki et al. | 2023
- 1868
-
Classification of Vocal Cord Disorders: Comparison Across Voice Datasets, Speech Tasks, and Machine Learning MethodsChen, Ching-Chieh / Hsu, Wei-Cheng / Lin, Tzu-Han / Chen, Kuan-Dar / Tsou, Yung-An / Liu, Yi-Wen et al. | 2023
- 1878
-
Application of Deep Learning Techniques for Thermal Imagery Analysis in Abnormal Identification of Floor Tiles in Heritage EnvironmentsYu, Chen-Xin / Chen, Wu-Pei / Ju, Chin-Yen / Chen, Tsung-Yi / Li, Kuo-Chen / Chen, Chiung-An / Chan, Mei-Ling / Chen, Shih-Lun et al. | 2023
- 1885
-
Wavelet and Cutout in YOLO Architecture for Road Pothole DetectionLu, Shao-Hua / Lu, Jia-Teng / Lin, Szu-Yin / Hsia, Chih-Hsien et al. | 2023
- 1892
-
Robust Finger Vein Recognition Based on Lightweight Attention Convolutional Neural NetworksWei, Ming-Yi / Wang, Yu-Chi / Ke, Liang-Ying / Hsia, Chih-Hsien et al. | 2023
- 1896
-
Lightweight CNN and Image Enhancement Using in Palm Vein RecognitionChen, Ping-Han / Hung, Yung-Sheng / Hsia, Chih-Hsien et al. | 2023
- 1903
-
Breast Cancer Detection Auxiliary System Leveraging Deep Learning and Mixed RealityLin, Szu-Yin / Chien, Ming-Chun / Kwong Meng, Edwin Tiong / Wang, Yu-Chien / Kuo, Yu-Yi / Lin, Che-Hsuan et al. | 2023
- 1907
-
Efficient Reversible Data Hiding for 3D Mesh Models Based on Multi-LSB Substitution and Ring-predictionLyu, Wanli / Cheng, Lulu / Yin, Zhaoxia / Luo, Bin et al. | 2023
- 1915
-
MAEDefense: An Effective Masked AutoEncoder Defense against Adversarial AttacksLyu, Wanli / Wu, Mengjiang / Yin, Zhaoxia / Luo, Bin et al. | 2023
- 1923
-
Preemptive Image Protection against SteganographyGuo, Yusheng / Zhong, Nan / Qian, Zhenxing / Zhang, Xinpeng / Cho, Hsunfang et al. | 2023
- 1931
-
Zero-shot multi-speaker accent TTS with limited accent dataZhang, Mingyang / Zhou, Yi / Wu, Zhizheng / Li, Haizhou et al. | 2023
- 1937
-
Speech Enhancement with Multi-granularity Vector QuantizationZhao, Xiaoying / Zhu, Qiushi / Zhang, Jie / Zhou, Yeping / Liu, Peiqi et al. | 2023
- 1943
-
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party MeetingsShi, Mohan / Zhang, Jie / Du, Zhihao / Yu, Fan / Chen, Qian / Zhang, Shiliang / Dai, Li-Rong et al. | 2023
- 1949
-
Hybrid Syllable and Character Representations for Mandarin ASRZhang, Fengrun / Li, Chengfei / Deng, Shuhao / Wang, Yaoping / Bai, Jinfeng et al. | 2023
- 1955
-
Enhancing Whisper Model for Pronunciation Assessment with Multi-AdaptersLi, Jing / Li, Rui / Guo, Shen / Wumaier, Aishan et al. | 2023
- 1960
-
DoA Estimation of Room Reflections Using NN-Based MUSIC AlgorithmLi, Haowen / Zhang, Wen / Zhang, Lijun et al. | 2023
- 1966
-
Hybrid Multi-Task Learning for End-To-End Multimodal Emotion RecognitionChen, Junjie / Li, Yongwei / Zhao, Ziping / Liu, Xuefei / Wen, Zhengqi / Tao, Jianhua et al. | 2023
- 1972
-
It’s What You Say and How You Say It: Exploring Audio and Textual Features for Podcast DataShah, Neil / Srivastava, Vivek / Bhardwaj, Mohit / Kadlay, Satej / Agrawal, Dharmeshkumar / Bhat, Savita / Pedanekar, Niranjan et al. | 2023
- 1978
-
Improved One-class Learning for Voice Spoofing DetectionLi, Lixiang / Xue, Xiaopeng / Peng, Haipeng / Ren, Yeqing / Zhao, Mengmeng et al. | 2023
- 1984
-
Sound Field Estimation around a Rigid Sphere with Physics-informed Neural NetworkChen, Xingyu / Ma, Fei / Bastine, Amy / Samarasinghe, Prasanga / Sun, Huiyuan et al. | 2023
- 1990
-
A Controlled Noise Reduction Wiener Filter Based on the Quadratic Eigenvalue ProblemPan, Ningning / Benesty, Jacob / Chen, Jingdong et al. | 2023
- 1995
-
Target Speaker Extraction with Attention Enhancement and Gated Fusion MechanismSijie, Wang / Hamdulla, Askar / Ablimit, Mijit et al. | 2023
- 2002
-
Analysis of Speech Separation Performance Degradation on Emotional Speech MixturesYip, Jia Qi / Ng, Dianwen / Ma, Bin / Siong, Chng Eng et al. | 2023
- 2008
-
Geometrically Constrained Blind Moving Source Extraction based on Constant Separation Vector and Auxiliary Function TechniqueZhang, Ruifeng / Ueda, Tetsuya / Makino, Shoji et al. | 2023
- 2013
-
Universal Sound Separation Using Replay-based Data Sampling in Incremental LearningShimonishi, Kanta / Fukumori, Takahiro / Yamashita, Yoichi et al. | 2023
- 2019
-
Multiple Sound Source Tracking Based on Generative Modeling and Recursive Bayesian Filtering of Spatial Gradient SpectraTakazawa, Keisuke / Kameoka, Hirokazu / Yukawa, Masahiro et al. | 2023
- 2024
-
Spatially-Regularized Switching Independent Vector AnalysisUeda, Tetsuya / Nakatani, Tomohiro / Ikeshita, Rintaro / Araki, Shoko / Makino, Shoji et al. | 2023
- 2031
-
ASF-LLRDA: Locality-regularized Linear Regression Discriminant Analysis with Approximately Symmetrical Face Preprocessing for Face RecognitionWidyadhana, Arya / Hidayati, Shintami Chusnul / Navastara, Dini Adni / Anistyasari, Yeni et al. | 2023
- 2037
-
Joint Optimization Algorithm for Adaptive Bit Allocation Based on Temporal-Spatial InformationWang, Shaokang / Sun, Songlin et al. | 2023
- 2043
-
Maximization of 2D Cross-Correlation Based on Auxiliary Function Method for Image AlignmentKinoshita, Yuma / Yamaoka, Kouei / Kiya, Hitoshi et al. | 2023
- 2048
-
Multitask Record for Badminton MatchGuo, Jing-Ming / Huang, Yu-Shun / Chang, Ting-Yu / Ciou, Tai-Cyuan / Yeh, Yun-Ching / Chen, Jeffrey et al. | 2023
- 2053
-
Deep Residual and Classified Neural Networks for Inverse HalftoningGuo, Jing-Ming / Sankarasrinivasan, S. / Hung, Let Viet / Liu, Wei et al. | 2023
- 2061
-
DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and DetectionFujita, Yoto / Bando, Yoshiaki / Imoto, Keisuke / Onishi, Masaki / Yoshii, Kazuyoshi et al. | 2023
- 2068
-
Improving Sound Event Localization and Detection with Class-Dependent Sound Separation for Real-World ScenariosCheng, Shi / Du, Jun / Wang, Qing / Jiang, Ya / Nian, Zhaoxu / Niu, Shutong / Lee, Chin-Hui / Gao, Yu / Zhang, Wenbin et al. | 2023
- 2074
-
Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised ApproachIgarashi, Ami / Tsubaki, Shunsuke / Niizumi, Daisuke / Takeuchi, Daiki / Ohishi, Yasunori / Harada, Noboru / Imoto, Keisuke et al. | 2023
- 2081
-
Cross-domain Sound Recognition for Efficient Underwater Data AnalysisPark, Jeongsoo / Han, Dong-Gyun / La, Hyoung Sul / Lee, Sangmin / Han, Yoonchang / Yang, Eun-Jin et al. | 2023
- 2087
-
Augmentation of Various Speed Data by Controlling Frame Overlap for Acoustic Traffic MonitoringTakahashi, Tomohiro / Kinoshita, Yuma / Ueno, Natsuki / Wakabayashi, Yukoh / Ono, Nobutaka / Honda, Jun / Fukuma, Seishi / Kitamori, Aoi / Nakagawa, Hiroshi et al. | 2023
- 2092
-
Distributed Computation of Heat Kernel Smoothing Using Series Expansion MethodTseng, Chien-Cheng / Lee, Su-Ling et al. | 2023
- 2099
-
In-Air Handwriting for Chinese Character Recognition from Monocular Camera: A Deep Learning based Approach with Fingertip Detection and Virtual Strokes EliminationYu, Chih-Chang / Huang, Zi-Hang / Cheng, Hsu-Yung et al. | 2023
- 2104
-
EffSegmentNet: Efficient Design for Real-time Semantic SegmentationWang, Cyun-Bo / Ding, Jian-Jiun et al. | 2023
- 2112
-
Universal Optimal Parameters of the Closed-Form Linear Canonical Wigner DistributionZhang, Zhichao et al. | 2023
- 2118
-
Autoencoder-Enhanced Federated Learning with Reduced Overhead and Lower LatencyHsieh, Chi-Kai / Chien, Feng-Tsun / Chang, Min-Kuan et al. | 2023
- 2124
-
Deep Unfolding-based Distributed MIMO DetectionKumagai, Masaya / Nakai-Kasai, Ayano / Wadayama, Tadashi et al. | 2023
- 2131
-
A Comparative Analysis of the Yolo Models for Intelligent Lobster Surveillance CameraAkhyar, Fityanul / Novamizanti, Ledya / Usman, Koredianto / Aditya, Ghanes Mahesa / Nur Hakim, Farhan / Ilman, Mukhamad Zidni / Ramdhon, Ferdi / Lin, Chih-Yang et al. | 2023
- 2137
-
A UAV Indoor Obstacle Avoidance System Based on Deep Reinforcement LearningLo, Chun-Huang / Lee, Chung-Nan et al. | 2023
- 2144
-
Approximate modeling of malware diffusion on wireless mobile devicesMiura, Hideyoshi / Abukawa, Shoya / Kimura, Tomotaka / Hirata, Kouji et al. | 2023
- 2149
-
Impacts of 5G-TDD Time Slot Configurations on the Downlink and Uplink Data RatesLai, Wen-Ping / Chen, Wen-Ru / Lai, Hong-Lun / Li, Hong-Yi et al. | 2023
- 2155
-
Bearing Fault Diagnosis and Interpretation Based on 2D Images and Convolutional Neural NetworkTian, Zhenzhen / Zhang, Xinyu / Yan, Wei / Wang, Jihua et al. | 2023
- 2163
-
Study on Reduction of Background Fringes for Defect Detection of Specular SurfaceWei, An-Chi / Chang, Yi-Cheng / Sze, Jyh-Rou et al. | 2023
- 2168
-
On the Optimal Self-Supervised Multi-Fault Detector for Temperature Sensor DataHarfiya, Latifa Nabila / Hsu, Yan-Cheng / Li, Yung-Hui / Wang, Jia-Ching et al. | 2023
- 2173
-
Application of Wafer Defect Pattern Classification Model in the Semiconductor IndustryLee, Chin-Wei / Hladek, Daniel / Pleva, Matus / Liao, Yuan-Fu / Su, Ming-Hsiang et al. | 2023
- 2178
-
Question Answering System Based on Pre-Training Model and Retrieval Reranking for Industry 4.0Chen, Ta-Fu / Lin, Yi-Xing / Su, Ming-Hsiang / Chen, Po-Kai / Tai, Tzu-Chiang / Wang, Jia-Ching et al. | 2023
- 2182
-
Deepfake-speech Detection with Pathological Features and Multilayer Perceptron Neural NetworkChaiwongyen, Anuwat / Duangpummet, Suradej / Karnjana, Jessada / Kongprawechnon, Waree / Unoki, Masashi et al. | 2023
- 2189
-
Temporal and Type Correlation in Digital Phenotyping for Bipolar Disorder State Prediction Using Multitask Self-Supervised LearningHsu, Jia-Hao / Tseng, Hua-Wei / Wu, Chung-Hsien / Lin, Esther Ching-Lan / See Chen, Po et al. | 2023
- 2196
-
Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech RecognitionHsieh, I-Ting / Wu, Chung-Hsien / Tsa, Shu-Wei et al. | 2023
- 2203
-
Reduction of Annotation Effort in Medical Image Analysis Based on Self-supervised LearningChan, Kai-Hsuan / Zeng, Yi-Chong et al. | 2023
- 2209
-
STUA-Net: A Fingerprint Reconstruction with Swin Transformer and Soft Collective AttentionHakim, Farchan Raswa / Yoga Wicaksana, Prabowo / Putri, Wenny Ramadha / Harjoko, Agus / Wang, Jia-Ching et al. | 2023
- 2213
-
Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age EstimationKitagishi, Yuki / Kamiyama, Hosana / Tawara, Naohiro / Ogawa, Atsunori / Miyazaki, Noboru / Asami, Taichi et al. | 2023
- 2221
-
Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditionsGuo, Taiyang / Li, Sixia / Kidani, Shunsuke / Okada, Shogo / Unoki, Masashi et al. | 2023
- 2228
-
Vocal Tract Length Perturbation-based Pseudo-Speaker Augmentation for Speaker Embedding LearningWakamatsu, Tomoka / Shiota, Sayaka / Kiya, Hitoshi et al. | 2023
- 2233
-
Automatic Call Classification of Autism Model Marmosets by Deep Learning and Analysis of Their Vocal DevelopmentUesaka, Minato / Kawauchi, Hideto / Yamaoka, Kouei / Wakabayashi, Yukoh / Kinoshita, Yuma / Ono, Nobutaka / Noguchi, Jun / Watanabe, Satoshi / Ichinohe, Noritaka / Benner, Seico et al. | 2023
- 2238
-
Cross-Domain adaptation in Distance Space for Speaker VerificationYi, Lu / Mak, Man Wai et al. | 2023
- 2244
-
Urban Noise Monitoring using Edge Computing with CNN-LSTM on Jetson NanoPeng, Bo / Abdulla, Waleed H. / Wang, Kevin I-Kai et al. | 2023
- 2251
-
Random forest of Classification and Regression Tree (CART) in the estimation of SWC based on meteorological inputs and hydrodynamics behindWu, Tsung-Hsi / Chen, Pei-Yuan / Chen, Chien-Chih / Chung, Meng-Ju / Ye, Zheng-Kai / Li, Ming-Hsu et al. | 2023
- 2256
-
A Framework for Reusing Earth Science Data on Data and Model MarketplacesHuang, Chung-I / Chang, Jih-Sheng / Sun, Chen-Kai / Wang, Taichi / Chen, Wei-Yu / Yu, Hui Hung / Chang, Wen-Yi / Lin, Fang-Pang et al. | 2023
- 2261
-
Impact of the weighted loss function on the innovative CMAQ-CNN PM2.5 forecasting modelLee, Yi-Ju / Cheng, Fang-Yi / Feng, Chih-Yung / Yang, Zhih-Min et al. | 2023
- 2267
-
Jointly Modelling Transcriptions and Phonemes with Optimal Features to Detect Dementia from Spontaneous CantoneseKe, Xiaoquan / Mak, Man-Wai / Meng, Helen M. et al. | 2023
- 2274
-
Combining multiple end-to-end speech recognition models based on density ratio approachHojo, Keigo / Mori, Daiki / Wakabayashi, Yukoh / Ohta, Kengo / Ogawa, Atsunori / Kitaoka, Norihide et al. | 2023
- 2280
-
Speech-Emotion Control for Text-to-Speech in Spoken Dialogue Systems Using Voice Conversion and x-vector EmbeddingKohara, Shunichi / Abe, Masanobu / Hara, Sunao et al. | 2023
- 2287
-
Narrow-edged Acoustical Beamforming Utilizing Phase Inversion for Frequency Modulation-based Parametric Array LoudspeakerGeng, Yuting / Nakayama, Masato / Nishiura, Takanobu et al. | 2023
- 2294
-
Corpus Construction for Deaf Speakers and Analysis by Automatic Speech RecognitionKobayashi, Akio / Yasu, Keiichi et al. | 2023
- 2299
-
Ensemble of Transformer and Convolutional Recurrent Neural Network for Improving Discrimination Accuracy in Automatic Chord RecognitionYamaga, Hikaru / Momma, Toshifumi / Kojima, Kazunori / Itoh, Yoshiaki et al. | 2023
- 2306
-
Construction of Automatic Speech Recognition Model that Recognizes Linguistic Information and Verbal/Non-verbal PhenomenaShione, Nagito / Wakabayashi, Yukoh / Kitaoka, Norihide et al. | 2023
- 2312
-
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic MusicZhong, Lifan / Cooper, Erica / Yamagishi, Junichi / Minematsu, Nobuaki et al. | 2023
- 2320
-
Speech Quality Improvement Utilizing Out-of-Focus Areas in Rolling-Shutter Video on Speech ExtractionNakano, Hayata / Yoshizawa, Tsubasa / Geng, Yuting / Iwai, Kenta / Nishiura, Takanobu et al. | 2023
- 2326
-
Personalized Audio Quality Preference Prediction | Wang, Chung-Che / Lin, Yu-Chun / Hsu, Yu-Teng / Jang, Jyh-Shing Roger et al. | 2023
- 2331
-
AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning | Wang, Yi-Cheng / Yang, Tzu-Ting / Wang, Hsin-Wei / Yan, Bi-Cheng / Chen, Berlin et al. | 2023
- 2336
-
Regression-based Sound Event Detection with Semi-supervised Learning | Liu, Chia-Chuan / Chen, Chia-Ping / Lu, Chung-Li / Chan, Bo-Cheng / Cheng, Yu-Han / Chuang, Hsiang-Feng / Chen, Wei-Yu et al. | 2023
- 2343
-
Proportionate NLMS with Variable Step-Size for Adaptive Feedback Cancellation in Hearing Aids | Thuc Tran, Linh Thi / Albu, Felix / Nguyen, Hieu Trung / Nordholm, Sven et al. | 2023
- 2349
-
Residual Echo Suppression using Spatial Feature for Stereo Acoustic Echo Cancellation | Chou, Hsuan-Cheng / Shen, Yih-Liang / Wu, Meng-Hsuan / Shih, Bo-Wun / Chi, Tai-Shih et al. | 2023
- 2354
-
Multitaper Adaptive Time-Frequency Windowed Fourier Transform Based on the Reliable Region of Window Widths | Cheng, Jen-Chieh / Ding, Jian-Jiun et al. | 2023
- 2362
-
Enhancing Retinal Disease Classification with Dual Scale Twin Vision Transformers using OCT Imaging | Karn, Prakash Kumar / Abdulla, Waleed H et al. | 2023
- 2370
-
Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data | Chang, Kai Chieh / Hasegawa-Johnson, Mark / McElwain, Nancy L. / Islam, Bashima et al. | 2023
- 2378
-
Dynamic Characteristics of Electroencephalogram Reflecting Driving-Experience-Dependent Performance Using Microstates | Iinuma, Yuta / Ozawa, Takuto / Nobukawa, Sou / Wagatsuma, Nobuhiko / Inagaki, Keiichiro et al. | 2023
- 2384
-
Quefrency Domain Features with Residual Networks for Spoof Speech Detection | Kamble, Madhu R. et al. | 2023
- 2390
-
PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification | Piao, Zhenyu / Lim, Hyungseob / Kim, Miseul / Kang, Hong-Goo et al. | 2023
- 2395
-
Subjective Evaluation of a Focused Sound Source Reproducing at the positions of a Listener’s Moving Hand | Hirohashi, Miho / Haneda, Yoichi et al. | 2023
- 2402
-
Time Sensitive Hash and Adaptive Image Recovery based Self-embedding Fragile Watermarking Scheme in Encrypted Images | Wang, Xin / He, Hongjie / Chen, Fan et al. | 2023
- 2409
-
Multi-granularity Semantic and Acoustic Stress Prediction for Expressive TTS | Chi, Wenjiang / Feng, Xiaoqin / Xue, Liumeng / Chen, Yunlin / Xie, Lei / Li, Zhifei et al. | 2023
- 2416
-
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement | Wang, Wen / Yang, Dongchao / Ye, Qichen / Cao, Bowen / Zou, Yuexian et al. | 2023
- 2424
-
Multi-accent pronunciation assessment based on domain adversarial training | Lin, Binghuai / Wang, Liyuan et al. | 2023
- 2429
-
GAN-Based Time-Domain Packet Loss Concealment Method with Consistent Mapping Approach | Zhao, Yunhao / Bao, Changchun / Yang, Xue / Zhou, Jing et al. | 2023
- 2441
-
Feature Selection Based on Clonal Selection Algorithm for Image Steganalysis | Liu, Yu / Wang, Hongxia et al. | 2023
- 2448
-
ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain | Wu, Tianci / He, Shulin / Zhang, Hui / Zhang, XueLiang et al. | 2023
- 2454
-
UniVR: A Unified Framework for Pitch-Shifted Voice Restoration in Speaker Identification | Li, Yangfu / Lin, Xiaodan et al. | 2023
- i
-
Table of Contents | 2023
- i
-
Technical Program Committee | 2023
- i
-
Authors Index | 2023
- i
-
Cognitive Assessment of Autism Spectrum Disorder Using an EEG-based Social Interaction Platform | Tseng, Yi-Li / Chien, Yi-Ling / Chuang, Tse-Min / Chiu, Yen-Nan / Tsai, Wen-Che et al. | 2023
- i
-
Copyright Page | 2023