A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection (Englisch)
- Neue Suche nach: Duan, Yuxin
- Neue Suche nach: Yang, Chenyu
- Neue Suche nach: Zhao, Zihan
- Neue Suche nach: Jiang, Yiyang
- Neue Suche nach: Wang, Yanfeng
- Neue Suche nach: Wang, Yu
- Neue Suche nach: Jia, Jia
- Neue Suche nach: Ling, Zhenhua
- Neue Suche nach: Chen, Xie
- Neue Suche nach: Li, Ya
- Neue Suche nach: Zhang, Zixing
- Neue Suche nach: Duan, Yuxin
- Neue Suche nach: Yang, Chenyu
- Neue Suche nach: Zhao, Zihan
- Neue Suche nach: Jiang, Yiyang
- Neue Suche nach: Wang, Yanfeng
- Neue Suche nach: Wang, Yu
In:
Man-Machine Speech Communication
: 18th National Conference, NCMMSC 2023, Suzhou, China, December 8–10, 2023, Proceedings
;
Kapitel: 25
;
287-301
;
2024
- Aufsatz/Kapitel (Buch) / Elektronische Ressource
-
Titel:A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection
-
Weitere Titelangaben:Communic.Comp.Inf.Science
-
Beteiligte:Jia, Jia ( Herausgeber:in ) / Ling, Zhenhua ( Herausgeber:in ) / Chen, Xie ( Herausgeber:in ) / Li, Ya ( Herausgeber:in ) / Zhang, Zixing ( Herausgeber:in ) / Duan, Yuxin ( Autor:in ) / Yang, Chenyu ( Autor:in ) / Zhao, Zihan ( Autor:in ) / Jiang, Yiyang ( Autor:in ) / Wang, Yanfeng ( Autor:in )
-
Kongress:National Conference on Man-Machine Speech Communication ; 2023 ; Suzhou, China
-
Erschienen in:Man-Machine Speech Communication : 18th National Conference, NCMMSC 2023, Suzhou, China, December 8–10, 2023, Proceedings ; Kapitel: 25 ; 287-301Communications in Computer and Information Science ; 2006 ; 287-301
-
Verlag:
- Neue Suche nach: Springer Nature Singapore
-
Erscheinungsort:Singapore
-
Erscheinungsdatum:15.02.2024
-
Format / Umfang:15 pages
-
ISBN:
-
ISSN:
-
DOI:
-
Medientyp:Aufsatz/Kapitel (Buch)
-
Format:Elektronische Ressource
-
Sprache:Englisch
-
Schlagwörter:
-
Datenquelle:
Inhaltsverzeichnis E-Book
Die Inhaltsverzeichnisse werden automatisch erzeugt und basieren auf den im Index des TIB-Portals verfügbaren Einzelnachweisen der enthaltenen Beiträge. Die Anzeige der Inhaltsverzeichnisse kann daher unvollständig oder lückenhaft sein.
- 1
-
Ultra-Low Complexity Residue Echo and Noise Suppression Based on Recurrent Neural NetworkZhou, Jianquan / Gao, Yi / Zhang, Siyu et al. | 2024
- 2
-
Semi-End-to-End Nested Named Entity Recognition from SpeechZhang, Min / Qiao, XiaoSong / Zhao, Yanqing / Su, Chang / Li, Yuang / Li, Yinglu / Piao, Mengyao / Peng, Song / Tao, Shimin / Yang, Hao et al. | 2024
- 3
-
A Lightweight Music Source Separation Model with Graph Convolution NetworkZhu, Mengying / Wang, Liusong / Hu, Ying et al. | 2024
- 4
-
Joint Time-Domain and Frequency-Domain Progressive Learning for Single-Channel Speech Enhancement and RecognitionZou, Gongzhen / Du, Jun / Niu, Shutong / Chen, Hang / Ren, Yuling / Li, Qinglong / Liu, Ruibo / Lee, Chin-Hui et al. | 2024
- 5
-
A Study on Domain Adaptation for Audio-Visual Speech EnhancementWang, Chenxi / Chen, Hang / Du, Jun / Zhang, Chenyue / Ren, Yuling / Li, Qinglong / Liu, Ruibo / Lee, Chin-Hui et al. | 2024
- 6
-
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase SpectraDu, Hui-Peng / Lu, Ye-Xin / Ai, Yang / Ling, Zhen-Hua et al. | 2024
- 7
-
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker VerificationZhang, Jian-Tao / Song, Hao-Yu / Guo, Wu / Song, Yan / Dai, Li-Rong et al. | 2024
- 8
-
Joint Speech and Noise Estimation Using SNR-Adaptive Target Learning for Deep-Learning-Based Speech EnhancementLi, Xiaoran / Guo, Zilu / Du, Jun / Lee, Chin-Hui / Gao, Yu / Zhang, Wenbin et al. | 2024
- 9
-
Data Augmentation by Finite Element Analysis for Enhanced Machine Anomalous Sound DetectionZhang, Zhixian / Zhang, Yucong / Li, Ming et al. | 2024
- 10
-
A Fast Sampling Method in Diffusion-Based Dance Generation ModelsGuo, Puyuan / Han, Yichen / Gao, Yingming / Li, Ya et al. | 2024
- 11
-
End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural SearchYang, Baochen / Guo, Jiaqi / Li, Haoyu / Xi, Yu / Zhuo, Qing / Yu, Kai et al. | 2024
- 12
-
The Production of Successive Addition Boundary Tone in Mandarin PreschoolersLi, Aijun / Gao, Jun / Wang, Zhiwei et al. | 2024
- 13
-
Emotional Support Dialog System Through Recursive Interactions Among Large Language ModelsChen, Keqi / Lian, Huijun / Gao, Yingming / Li, Ya et al. | 2024
- 14
-
Task-Adaptive Generative Adversarial Network Based Speech Dereverberation for Robust Speech RecognitionLiu, Ji / Li, Nan / Ge, Meng / Fu, Yanjie / Wang, Longbiao / Dang, Jianwu et al. | 2024
- 15
-
Real-Time Automotive Engine Sound Simulation with Deep Neural NetworkLi, Hao / Wang, Weiqing / Li, Ming et al. | 2024
- 16
-
A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech EnhancementPan, Qiaoyi / Jiang, Wenbing / Zhuo, Qing / Yu, Kai et al. | 2024
- 17
-
Accent-VITS: Accent Transfer for End-to-End TTSMa, Linhan / Zhang, Yongmao / Zhu, Xinfa / Lei, Yi / Ning, Ziqian / Zhu, Pengcheng / Xie, Lei et al. | 2024
- 18
-
Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound DetectionFang, Wenjie / Fan, Xin / Hu, Ying et al. | 2024
- 19
-
A Packet Loss Concealment Method Based on the Demucs Network StructureLi, Wenwen / Bao, Changchun et al. | 2024
- 20
-
Improving Speech Perceptual Quality and Intelligibility Through Sub-band Temporal Envelope CharacteristicsWu, Ruilin / Huang, Zhihua / Song, Jingyi / Liang, Xiaoming et al. | 2024
- 21
-
Adaptive Deep Graph Convolutional Network for Dialogical Speech Emotion RecognitionLiu, Jiaxing / Wu, Sheng / Wang, Longbiao / Dang, Jianwu et al. | 2024
- 22
-
Iterative Noisy-Target Approach: Speech Enhancement Without Clean SpeechZhang, Yifan / Jiang, Wenbin / Zhuo, Qing / Yu, Kai et al. | 2024
- 23
-
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker DiarizationZhao, Huan / Zhang, Li / Li, Yue / Wang, Yannan / Wang, Hongji / Rao, Wei / Wang, Qing / Xie, Lei et al. | 2024
- 24
-
Zero-Shot Singing Voice Conversion Based on Timbre Space Modeling and Excitation Signal ControlJiang, Yuan / Chen, Yan-Nian / Liu, Li-Juan / Hu, Ya-Jun / Fang, Xin / Ling, Zhen-Hua et al. | 2024
- 25
-
A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound DetectionDuan, Yuxin / Yang, Chenyu / Zhao, Zihan / Jiang, Yiyang / Wang, Yanfeng / Wang, Yu et al. | 2024
- 26
-
CAM-GUI: A Conversational Assistant on Mobile GUIZhu, Zichen / Sun, Liangtai / Yang, Jingkai / Peng, Yifan / Zou, Weilin / Li, Ziyuan / Li, Wutao / Chen, Lu / Ma, Yingzi / Zhang, Danyang et al. | 2024
- 27
-
A Pilot Study on the Prosodic Factors Influencing Voice Attractiveness of AI SpeechWang, Yihui / Lu, Haocheng / Wang, Gaowu et al. | 2024
- 28
-
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023Cheng, Ming / Wang, Weiqing / Qin, Xiaoyi / Lin, Yuke / Jiang, Ning / Zhao, Guoqing / Li, Ming et al. | 2024
- 29
-
Chinese EFL Learners’ Auditory and Visual Perception of English Statement and Question Intonations: The Effect of Lexical StressXu, Qiunan / Tang, Ping et al. | 2024
- 30
-
An Improved System for Partially Fake Audio Detection Using Pre-trained ModelZhang, Jianqian / Liu, Hanyue / Deng, Mengyuan / Wang, Jing / Sun, Yi / Xu, Liang / Li, Jiahao et al. | 2024
- 31
-
Leveraging Synthetic Speech for CIF-Based Customized Keyword SpottingLiu, Shuiyun / Zhang, Ao / Huang, Kaixun / Xie, Lei et al. | 2024