Trinet: Stabilizing Self-Supervised Learning From Complete or Slow Collapse (English)
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 1-5; 2023
- Conference paper / Electronic Resource
- Title: Trinet: Stabilizing Self-Supervised Learning From Complete or Slow Collapse
- Contributors: Cao, Lixin (author) / Wang, Jun (author) / Yang, Ben (author) / Su, Dan (author) / Yu, Dong (author)
- Publisher: IEEE
- Publication date: 2023-06-04
- Size: 1359906 bytes
- Type of media: Conference paper
- Type of material: Electronic Resource
- Language: English
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- Learning ASR Pathways: A Sparse Multilingual ASR Model | Yang, Mu / Tjandra, Andros / Liu, Chunxi / Zhang, David / Le, Duc / Kalinli, Ozlem et al. | 2023
- Real-Time Target Sound Extraction | Veluri, Bandhav / Chan, Justin / Itani, Malek / Chen, Tuochao / Yoshioka, Takuya / Gollakota, Shyamnath et al. | 2023
- Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations | Wei, Jie / Hu, Guanyu / Tuan, Luu Anh / Yang, Xinyu / Zhu, Wenjing et al. | 2023
- Twitter Stance Detection via Neural Production Systems | Zhang, Bowen / Ding, Daijun / Xu, Guangning / Guo, Jinjin / Huang, Zhichao / Huang, Xu et al. | 2023
- Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation | Bhandari, Neel / Chen, Pin-Yu et al. | 2023
- LDTSF: A Label-Decoupling Teacher-Student Framework for Semi-Supervised Echocardiography Segmentation | Zhang, Jiapeng / Wang, Yongxiong / Pan, Zhiqun / Tang, Zhenhui / Chen, Lijun / Liu, Jinlong et al. | 2023
- SLBERT: A Novel Pre-Training Framework for Joint Speech and Language Modeling | Susladkar, Onkar / Gatti, Prajwal / Kumar Yadav, Santosh et al. | 2023
- Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition | Ogawa, Atsunori / Moriya, Takafumi / Kamo, Naoyuki / Tawara, Naohiro / Delcroix, Marc et al. | 2023
- Seri: Sketching-Reasoning-Integrating Progressive Workflow for Empathetic Response Generation | Bi, Guanqun / Cao, Yanan / Li, Piji / Xie, Yuqiang / Fang, Fang / Lin, Zheng et al. | 2023
- Vitasd: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis | Cao, Xu / Ye, Wenqian / Sizikova, Elena / Bai, Xue / Coffee, Megan / Zeng, Hongwu / Cao, Jianguo et al. | 2023
- The Role of Initial Entanglement in Adaptive Gibbs State Preparation on Quantum Computers | Economou, Sophia E. / Warren, Ada / Barnes, Edwin et al. | 2023
- Multilevel FISTA for Image Restoration | Lauga, Guillaume / Riccietti, Elisa / Pustelnik, Nelly / Goncalves, Paulo et al. | 2023
- JPEG Pleno Call for Proposals Responses Quality Assessment | Prazeres, Joao / Luo, Zhe / Pinheiro, Antonio M. G. / da Silva Cruz, Luis A. / Perry, Stuart et al. | 2023
- Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism | Li, Dichucheng / Che, Mingjin / Meng, Wenwu / Wu, Yulun / Yu, Yi / Xia, Fan / Li, Wei et al. | 2023
- WITT: A Wireless Image Transmission Transformer for Semantic Communications | Yang, Ke / Wang, Sixian / Dai, Jincheng / Tan, Kailin / Niu, Kai / Zhang, Ping et al. | 2023
- Kernel Estimation and Deconvolution for Blind Image Super-Resolution | Gong, Jiali / Gao, Hongfan / Chao, Jiahao / Zhou, Zhou / Yang, Zhengfeng / Zeng, Zhenbing et al. | 2023
- Learned Video Coding with Motion Compensation Mixture Model | Dinh, Khanh Quoc / Pyo Choi, Kwang et al. | 2023
- Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation | Chen, Qi / Ma, Ziyang / Liu, Tao / Tan, Xu / Lu, Qu / Yu, Kai / Chen, Xie et al. | 2023
- A Synthetic Corpus Generation Method for Neural Vocoder Training | Wang, Zilin / Liu, Peng / Chen, Jun / Li, Sipan / Bai, Jinfeng / He, Gang / Wu, Zhiyong / Meng, Helen et al. | 2023
- HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones | Shashaank, N / Banar, Berker / Izadi, Mohammad Rasool / Kemmerer, Jeremy / Zhang, Shuo / Huang, Chuan-Che Jeff et al. | 2023
- Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition | Fu, Xuandi / Sathyendra, Kanthashree Mysore / Gandhe, Ankur / Liu, Jing / Strimel, Grant P. / McGowan, Ross / Mouchtaris, Athanasios et al. | 2023
- Multi-Task Bias-Variance Trade-Off Through Functional Constraints | Cervino, Juan / Bazerque, Juan Andres / Calvo-Fullana, Miguel / Ribeiro, Alejandro et al. | 2023
- Towards a More Stable and General Subgraph Information Bottleneck | Liu, Hongzhi / Zheng, Kaizhong / Yu, Shujian / Chen, Badong et al. | 2023
- Unsupervised Domain Adaptation via Subspace Interpolating Deep Dictionary Learning: A Case Study in Machine Inspection | Kumar, Kriti / Majumdar, Angshul / Kumar, A Anil / Girish Chandra, M et al. | 2023
- Adaptive Filtering Algorithms For Set-Valued Observations-Symmetric Measurement Approach To Unlabeled And Anonymized Data | Krishnamurthy, Vikram et al. | 2023
- Classification of Synthetic Facial Attributes by Means of Hybrid Classification/Localization Patch-Based Analysis | Wang, Jun / Tondi, Benedetta / Barni, Mauro et al. | 2023
- A Point is A Wave: Point-Wave Network for Place Recognition | Li, Ge / Zhang, Ruonan et al. | 2023
- Robust and Globally Sparse Pca via Majorization-Minimization and Variable Splitting | Brehier, Hugo / Breloy, Arnaud / El Korso, Mohammed Nabil / Kumar, Sandeep et al. | 2023
- Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes | Xu, Xinzhou / Deng, Jun / Zhang, Zixing / Yang, Zhen / Schuller, Bjorn W. et al. | 2023
- Multi-Task Transformer with Relation-Attention and Type-Attention for Named Entity Recognition | Mo, Ying / Tang, Hongyin / Liu, Jiahao / Wang, Qifan / Xu, Zenglin / Wang, Jingang / Wu, Wei / Li, Zhoujun et al. | 2023
- Self-Supervised Representations in Speech-Based Depression Detection | Wu, Wen / Zhang, Chao / Woodland, Philip C. et al. | 2023
- A Simple Yet Effective Approach to Structured Knowledge Distillation | Lin, Wenye / Li, Yangming / Liu, Lemao / Shi, Shuming / Zheng, Hai-Tao et al. | 2023
- Leveraging Neural Koopman Operators to Learn Continuous Representations of Dynamical Systems from Scarce Data | Frion, Anthony / Drumetz, Lucas / Mura, Mauro Dalla / Tochon, Guillaume / Aissa-El-Bey, Abdeldjalil et al. | 2023
- WUDA: Unsupervised Domain Adaptation Based on Weak Source Domain Labels | Liu, Shengjie / Zhu, Chuang / Li, Yuan / Tang, Wenqi et al. | 2023
- A Memory-Free Evolving Bipolar Neural Network for Efficient Multi-Label Stream Learning | Mishra, Sourav / Sundaram, Suresh et al. | 2023
- Prototype Knowledge Distillation for Medical Segmentation with Missing Modality | Wang, Shuai / Yan, Zipei / Zhang, Daoan / Wei, Haining / Li, Zhongsen / Li, Rui et al. | 2023
- A Novel Efficient Multi-View Traffic-Related Object Detection Framework | Yang, Kun / Liu, Jing / Yang, Dingkang / Wang, Hanqi / Sun, Peng / Zhang, Yanni / Liu, Yan / Song, Liang et al. | 2023
- Learning with Multigraph Convolutional Filters | Butler, Landon / Parada-Mayorga, Alejandro / Ribeiro, Alejandro et al. | 2023
- Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation | Zhang, Jing-Xuan / Wan, Genshun / Ling, Zhen-Hua / Pan, Jia / Gao, Jianqing / Liu, Cong et al. | 2023
- Exploring Wav2vec 2.0 Fine Tuning for Improved Speech Emotion Recognition | Chen, Li-Wei / Rudnicky, Alexander et al. | 2023
- Reducing the GAP Between Streaming and Non-Streaming Transducer-Based ASR by Adaptive Two-Stage Knowledge Distillation | Tang, Haitao / Fu, Yu / Sun, Lei / Xue, Jiabin / Liu, Dan / Li, Yongchao / Ma, Zhiqiang / Wu, Minghui / Pan, Jia / Wan, Genshun et al. | 2023
- Generalized Invariant Matching Property Via Lasso | Du, Kang / Xiang, Yu et al. | 2023
- Efficient Feature Extraction for Non-Maximum Suppression in Visual Person Detection | Symeonidis, Charalampos / Mademlis, Ioannis / Pitas, Ioannis / Nikolaidis, Nikos et al. | 2023
- Visual-Aware Text-to-Speech | Zhou, Mohan / Bai, Yalong / Zhang, Wei / Yao, Ting / Zhao, Tiejun / Mei, Tao et al. | 2023
- Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples | Ryu, Hyeonggon / Senocak, Arda / So Kweon, In / Son Chung, Joon et al. | 2023
- Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition | Chen, Xie / Ma, Ziyang / Tang, Changli / Wang, Yujin / Zheng, Zhisheng et al. | 2023
- Do Prosody Transfer Models Transfer Prosody? | Sigurgeirsson, Atli Thor / King, Simon et al. | 2023
- Rate Splitting and Precoding Strategies for Multi-User MIMO Broadcast Channels with Common and Private Streams | Khamidullina, Liana / de Almeida, Andre L. F. / Haardt, Martin et al. | 2023
- A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition | Yang, Chao-Han Huck / Li, Bo / Zhang, Yu / Chen, Nanxin / Sainath, Tara N. / Marco Siniscalchi, Sabato / Lee, Chin-Hui et al. | 2023
- Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition | Vander Eeckt, Steven / Van Hamme, Hugo et al. | 2023
- VPPT: Visual Pre-Trained Prompt Tuning Framework for Few-Shot Image Classification | Song, Zhao / Yang, Ke / Guan, Naiyang / Zhu, Junjie / Qiao, Peng / Hu, Qingyong et al. | 2023
- Test Your Samples Jointly: Pseudo-Reference for Image Quality Evaluation | Tworski, Marcelin / Lathuiliere, Stephane et al. | 2023
- Waveform Design to Improve the Estimation of Target Parameters Using the Fourier Transform Method in a MIMO OFDM DFRC System | Bhogavalli, Satwika / Grivel, Eric / Hari, K.V.S. / Corretja, Vincent et al. | 2023
- Modify: Model-Driven Face Stylization Without Style Images | Ding, Yuhe / Liang, Jian / Cao, Jie / Zheng, Aihua / He, Ran et al. | 2023
- TINYCOD: Tiny and Effective Model for Camouflaged Object Detection | Xing, Haozhe / Gao, Shuyong / Tang, Hao / Mok, Tsui Qin / Kang, Yanlan / Zhang, Wenqiang et al. | 2023
- Automatic Segmentation of Nasopharyngeal Carcinoma in CT Images Using Dual Attention and Edge Detection | Wang, Qizhi / Huang, Wei / Zhang, Yuan / Li, Xuanya / Ye, Xiongjun / Hu, Kai et al. | 2023
- Fast and Efficient Speech Enhancement with Variational Autoencoders | Sadeghi, Mostafa / Serizel, Romain et al. | 2023
- Representation of Vocal Tract Length Transformation Based on Group Theory | Miyashita, Atsushi / Toda, Tomoki et al. | 2023
- Sandformer: CNN and Transformer under Gated Fusion for Sand Dust Image Restoration | Shi, Jun / Wei, Bingcai / Zhou, Gang / Zhang, Liye et al. | 2023
- Utility Pole Localization by Learning from Ambient Traces on Distributed Acoustic Sensing | Jiang, Zhuocheng / Tian, Yue / Ding, Yangmin / Ozharar, Sarper / Wang, Ting et al. | 2023
- Multi-User Methods for Vibrational Radar Backscatter Communications | Centers, Jessica / Krolik, Jeffrey et al. | 2023
- Target Sound Extraction with Variable Cross-Modality Clues | Li, Chenda / Qian, Yao / Chen, Zhuo / Wang, Dongmei / Yoshioka, Takuya / Liu, Shujie / Qian, Yanmin / Zeng, Michael et al. | 2023
- Model-Free Learning of Optimal Beamformers for Passive IRS-Assisted Sumrate Maximization | Hashmi, Hassaan / Pougkakiotis, Spyridon / Kalogerias, Dionysios S. et al. | 2023
- Strategies for Enhanced Signal Modulation Classifications Under Unknown Symbol Rates and Noise Conditions | Wang, Ruixuan / Qi, Yue / Vaezi, Mojtaba / Jiao, Xun / Amin, Moeness et al. | 2023
- Target Velocity Estimation for Quantization-Based Cooperative MIMO Radar and Communications System | Wang, Zhen / Yan, Xuedan / He, Qian / Blum, Rick S. et al. | 2023
- Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker Audio | Thienpondt, Jenthe / Madhu, Nilesh / Demuynck, Kris et al. | 2023
- Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure Priors | Zhang, Yaqi / Lu, Yan / Liu, Bin / Zhao, Zhiwei / Chu, Qi / Yu, Nenghai et al. | 2023
- Subspace-Based Detector For Distributed Mmwave Mimo Radar Sensors | Ahmadi, Moein / Alaee-Kerahroodi, Mohammad / M. R., Bhavani Shankar / Ottersten, Bjorn et al. | 2023
- A Unitary Transform Based Generalized Approximate Message Passing | Zhu, Jiang / Meng, Xiangming / Lei, Xupeng / Guo, Qinghua et al. | 2023
- Adaptive Data Augmentation for Contrastive Learning | Zhang, Yuhan / Zhu, He / Yu, Shan et al. | 2023
- E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model | Huang, W. Ronny / Chang, Shuo-Yiin / Sainath, Tara N. / He, Yanzhang / Rybach, David / David, Robert / Prabhavalkar, Rohit / Allauzen, Cyril / Peyser, Cal / Strohman, Trevor D. et al. | 2023
- Binary Sequence Set Optimization for CDMA Applications via Mixed-Integer Quadratic Programming | Yang, Alan / Mina, Tara / Gao, Grace et al. | 2023
- Blind Polynomial Regression | Natali, Alberto / Leus, Geert et al. | 2023
- ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement | Li, Chaojian / Chen, Wenwan / Yuan, Jiayi / Lin, Yingyan Celine / Sabharwal, Ashutosh et al. | 2023
- Statistical Analysis of Speech Disorder Specific Features to Characterise Dysarthria Severity Level | Joshy, Amlu Anna / Parameswaran, P. N. / Nair, Siddharth R. / Rajan, Rajeev et al. | 2023
- Generalized Relative Harmonic Coefficients | Hu, Yonggang / Gannot, Sharon / Abhayapala, Thushara D. et al. | 2023
- Perceptual–Neural–Physical Sound Matching | Han, Han / Lostanlen, Vincent / Lagrange, Mathieu et al. | 2023
- Improved Training Of Mixture-Of-Experts Language GANs | Chai, Yekun / Yin, Qiyue / Zhang, Junge et al. | 2023
- Spatial-Domain Object Detection Under Mimo-Fmcw Automotive Radar Interference | Jin, Sian / Wang, Pu / Boufounos, Petros / Takahashi, Ryuhei / Roy, Sumit et al. | 2023
- I See What You Hear: A Vision-Inspired Method to Localize Words | Samragh, Mohammad / Kundu, Arnav / Hu, Ting-Yao / Chadha, Aman / Srivastava, Ashish / Cho, Minsik / Tuzel, Oncel / Naik, Devang et al. | 2023
- Lightweight Fisher Vector Transfer Learning for Video Deduplication | Henry, Chris / Liao, Rijun / Lin, Ruiyuan / Zhang, Zhebin / Sun, Hongyu / Li, Zhu et al. | 2023
- Difference Coarrays of Rational Arrays | Kulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- SIGVIC: Spatial Importance Guided Variable-Rate Image Compression | Liang, Jiaming / Liu, Meiqin / Yao, Chao / Lin, Chunyu / Zhao, Yao et al. | 2023
- UCONV-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition | Andrusenko, Andrei / Nasretdinov, Rauf / Romanenko, Aleksei et al. | 2023
- Unsupervised Noise Adaptation Using Data Simulation | Chen, Chen / Hu, Yuchen / Zou, Heqing / Sun, Linhui / Chng, Eng Siong et al. | 2023
- Logo-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition | Ma, Fuyan / Sun, Bin / Li, Shutao et al. | 2023
- Adaptive Time-Scale Modification for Improving Speech Intelligibility Based On Phoneme Clustering For Streaming Services | Jang, Sohee / Kim, Jiye / Kim, Yeon-Ju / Chang, Joon-Hyuk et al. | 2023
- Learning to Reconnect Interrupted Trajectories for Weakly Supervised Multi-Object Tracking | Li, Yu-Lei / Lu, Yang / Li, Jie / Wang, Hanzi et al. | 2023
- Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR | Botros, Rami / Prabhavalkar, Rohit / Schalkwyk, Johan / Chelba, Ciprian / Sainath, Tara N. / Beaufays, Francoise et al. | 2023
- Deepspace: Dynamic Spatial and Source CUE Based Source Separation for Dialog Enhancement | Master, Aaron / Lu, Lie / Samuelsson, Jonas / Lehtonen, Heidi-Maria / Norcross, Scott / Swedlow, Nathan / Howard, Audrey et al. | 2023
- Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution Detection | Chen, Xiongjie / Li, Yunpeng / Yang, Yongxin et al. | 2023
- Cross-Lingual Alzheimer’s Disease Detection Based on Paralinguistic and Pre-Trained Features | Chen, Xuchu / Pu, Yu / Li, Jinpeng / Zhang, Wei-Qiang et al. | 2023
- Multi-Carrier Wideband OCDM-Based THZ Automotive Radar | Bhattacharjee, Sangeeta / Mishra, Kumar Vijay / Annavajjala, Ramesh / Murthy, Chandra R. et al. | 2023
- Low Precision Representations for High Dimensional Models | Saha, Rajarshi / Pilanci, Mert / Goldsmith, Andrea J. et al. | 2023
- Hypernetwork-Based Adaptive Image Restoration | Aharon, Shai / Ben-Artzi, Gil et al. | 2023
- Your Camera Improves Your Point Cloud Compression | Lin, Yuhuan / Xu, Tongda / Zhu, Ziyu / Li, Yanghao / Wang, Zhe / Wang, Yan et al. | 2023
- Pseudo-Query Generation For Semi-Supervised Visual Grounding With Knowledge Distillation | Jin, Jianglin / Ye, Jiabo / Lin, Xin / He, Liang et al. | 2023
- 2DSBG: A 2d Semi Bi-Gaussian Filter Adapted for Adjacent and Multi-Scale Line Feature Detection | Magnier, Baptiste / Shokouh, Ghulam Sakhi / Berthier, Louis / Pie, Marcel / Ruggiero, Adrien et al. | 2023
- Estimation of High-Dimensional Differential Graphs from Multi-Attribute Data | Tugnait, Jitendra K. et al. | 2023
- Joint Unsupervised and Supervised Learning for Context-Aware Language Identification | Park, Jinseok / Kim, Hyung Yong / Park, Jihwan / Kim, Byeong-Yeol / Choi, Shukjae / Lim, Yunkyu et al. | 2023
- Improving Transformer-Based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads | Jeoung, Ye-Rin / Yang, Joon-Young / Choi, Jeong-Hwan / Chang, Joon-Hyuk et al. | 2023
- On the Value of Stochastic Side Information in Online Learning | Jia, Junzhang / Wu, Xuetong / Evans, Jamie / Zhu, Jingge et al. | 2023
- Learning Task-Aligned Mask Query for Instance Segmentation | Fu, Bin / He, Hongliang / Wei, Pengxu / Chen, Jie et al. | 2023
- On The Primal and Dual Formulations Of The Discrete Mumford-Shah Functional | Pustelnik, Nelly et al. | 2023
- Robust Angle Estimation for Hybrid mmWave Systems | Lin, Yuan-Pei / Yang, Ting-Ming et al. | 2023
- On The Fairness of Multitask Representation Learning | Li, Yingcong / Oymak, Samet et al. | 2023
- VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation | Liu, Yuhao / Gong, Cheng / Wang, Longbiao / Wu, Xixin / Liu, Qiuyu / Dang, Jianwu et al. | 2023
- Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language | Soky, Kak / Li, Sheng / Chu, Chenhui / Kawahara, Tatsuya et al. | 2023
- Pop2Piano: Pop Audio-Based Piano Cover Generation | Choi, Jongho / Lee, Kyogu et al. | 2023
- Multi-Lingual Pronunciation Assessment with Unified Phoneme Set and Language-Specific Embeddings | Lin, Binghuai / Wang, Liyuan et al. | 2023
- Interpolation Filter Model For Ramanujan Subspace Signals | Kulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- Online Binaural Speech Separation Of Moving Speakers With A Wavesplit Network | Han, Cong / Mesgarani, Nima et al. | 2023
- A Hybrid Deep Neural Network for Nonlinear Causality Analysis in Complex Industrial Control System | Feng, Tian / Chen, Qiming / Shi, Yao / Lang, Xun / Xie, Lei / Su, Hongye et al. | 2023
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing | Webber, Jacob J / Valentini-Botinhao, Cassia / Williams, Evelyn / Henter, Gustav Eje / King, Simon et al. | 2023
- Self-Sufficient Framework for Continuous Sign Language Recognition | Jang, Youngjoon / Oh, Youngtaek / Cho, Jae Won / Kim, Myungchul / Kim, Dong-Jin / Kweon, In So / Son Chung, Joon et al. | 2023
- Signal Processing On Product Spaces | Roddenberry, T. Mitchell / Grande, Vincent P. / Frantzen, Florian / Schaub, Michael T. / Segarra, Santiago et al. | 2023
- On the Effectiveness of Monoaural Target Source Extraction for Distant end-to-end Automatic Speech Recognition | Zorila, Catalin / Doddipatla, Rama et al. | 2023
- MAID: A Conditional Diffusion Model for Long Music Audio Inpainting | Liu, Kaiyang / Gan, Wendong / Yuan, Chenchen et al. | 2023
- Semi-Federated Learning for Edge Intelligence with Imperfect SIC | Ni, Wanli / Zheng, Jingheng / Eldar, Yonina C. / You, Changsheng / Huang, Kaibin et al. | 2023
- Dual Collaborative Visual-Semantic Mapping for Multi-Label Zero-Shot Image Recognition | Hu, Yunqing / Jin, Xuan / Chen, Xi / Zhang, Yin et al. | 2023
- Topological Slepians: Maximally Localized Representations of Signals Over Simplicial Complexes | Battiloro, Claudio / Di Lorenzo, Paolo / Barbarossa, Sergio et al. | 2023
- Efficient Feature Fusion for Learning-Based Photometric Stereo | Ju, Yakun / Lam, Kin-Man / Xiao, Jun / Zhang, Cong / Yang, Cuixin / Dong, Junyu et al. | 2023
- Improving Scheduled Sampling for Neural Transducer-Based ASR | Moriya, Takafumi / Ashihara, Takanori / Sato, Hiroshi / Matsuura, Kohei / Tanaka, Tomohiro / Masumura, Ryo et al. | 2023
- Unobtrusive Respiratory Monitoring System for Intensive Care | Tan, Xudong / Hu, Menghan / Zhai, Guangtao / Zhu, Yan / Li, Wenfang / Zhang, XiaoPing et al. | 2023
- Integrating the Sensing and Radio Communications Channel Modelling From Radar Mutual Interference | Cardona, Narcis / Romero, J. Samuel / Yang, Wenfei / Li, Jian et al. | 2023
- TDMA-Based Multi-User Binary Computation Offloading in the Finite-Block-Length Regime | Manouchehrpour, M. Amin / Lehal, Harvinder / Salmani, Mahsa / Davidson, Timothy N. et al. | 2023
- Multispectral Image Fusion based on Super Pixel Segmentation | Ofir, Nati et al. | 2023
- Optimal Transport with a Diversified Memory Bank for Cross-Domain Speaker Verification | Zhang, Ruiteng / Wei, Jianguo / Lu, Xugang / Lu, Wenhuan / Jin, Di / Zhang, Lin / Xu, Junhai et al. | 2023
- Fast Low-Latency Convolution by Low-Rank Tensor Approximation | Jalmby, Martin / Elvander, Filip / van Waterschoot, Toon et al. | 2023
- A Controllable Lifestyle Simulator for Use in Deep Reinforcement Learning Algorithms | Braz, Libio Goncalves / Susaiyah, Allmin et al. | 2023
- BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder | Doan, Thien-Phuc / Nguyen-Vu, Long / Jung, Souhwan / Hong, Kihun et al. | 2023
- Study of Manifold Geometry Using Multiscale Non-Negative Kernel Graphs | Hurtado, Carlos / Shekkizhar, Sarath / Ruiz-Hidalgo, Javier / Ortega, Antonio et al. | 2023
- Learning Silhouettes with Group Sparse Autoencoders | Theodosis, Emmanouil / Ba, Demba et al. | 2023
- ScaleMix: Intra- And Inter-Layer Multiscale Feature Combination for Change Detection | Huang, Rui / Zhao, Qingyi / Wang, Ruofei / Liu, Caihua / Gao, Sihua / Zhang, Yuxiang / Fan, Wei et al. | 2023
- Is Multi-Task Learning an Upper Bound for Continual Learning? | Wu, Zihao / Tran, Huy / Pirsiavash, Hamed / Kolouri, Soheil et al. | 2023
- Local Graph-Homomorphic Processing for Privatized Distributed Systems | Rizk, Elsa / Vlaski, Stefan / Sayed, Ali H. et al. | 2023
- MASKED-AP: Attention Pyramid Convolutional Neural Network with Mask for Cervical Cell Classification | Jin, Yu / Liu, Juan / Chen, Hua / Duan, Wensi / Cao, Dehua / Pang, Baochuan et al. | 2023
- Pondering About Task Spatial Misalignment: Classification-Localization Equilibrated Object Detection | Zhang, Yudong / Lu, Wei / Wang, Xu / Wang, Pengkun / Wang, Yang et al. | 2023
- Multiple Access Computation Offloading for the K-User Case | Liu, Xiaomeng / Schaible, Christian / Davidson, Timothy N. et al. | 2023
- Movienet-PS: A Large-Scale Person Search Dataset in the Wild | Qin, Jie / Zheng, Peng / Yan, Yichao / Quan, Rong / Cheng, Xiaogang / Ni, Bingbing et al. | 2023
- Spatial Similarity Guidance for Few-Shot Segmentation | Luo, Xiaoliu / Duan, Zhao / Zhang, Taiping et al. | 2023
- Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN | Yu, Jianwei / Luo, Yi et al. | 2023
- Code-Switching Speech Synthesis Based on Self-Supervised Learning and Domain Adaptive Speaker Encoder | Lin, Yi-Xing / Pai, Cheng-Hsun / Le, Phuong Thi / Prihasto, Bima / Huang, Chien-Ling / Wang, Jia Ching et al. | 2023
- Mixed Sample Augmentation for Online Distillation | Shen, Yiqing / Xu, Liwu / Yang, Yuzhe / Li, Yaqian / Guo, Yandong et al. | 2023
- Meeting Action Item Detection with Regularized Context Modeling | Liu, Jiaqing / Deng, Chong / Zhang, Qinglin / Chen, Qian / Wang, Wen et al. | 2023
- CLMAE: A Liter and Faster Masked Autoencoders | Song, Yiran / Ma, Lizhuang et al. | 2023
- Graph Signal Processing for Narrowband Direction of Arrival Estimation | Li, Disheng / Liu, Wei / Zakharov, Yuriy / Mitchell, Paul D et al. | 2023
- Privacy-Preserving Automatic Speaker Diarization | Teixeira, Francisco / Abad, Alberto / Raj, Bhiksha / Trancoso, Isabel et al. | 2023
- An End-to-End Neural Network for Image-to-Audio Transformation | Chen, Liu / Deisher, Michael / Georges, Munir et al. | 2023
- Joint Multi-Level Feature Network for Lightweight Person Re-Identification | Zhang, Yunzuo / Kang, Weili / Liu, Yameng / Zhu, Pengfei et al. | 2023
- Learning Cross-Modal Audiovisual Representations with Ladder Networks for Emotion Recognition | Goncalves, Lucas / Busso, Carlos et al. | 2023
- Quantized Precoding and RIS-Assisted Modulation for Integrated Sensing and Communications Systems | Prasobh Sankar, R. S. / Prabhakar Chepuri, Sundeep et al. | 2023
- Towards Adversarially Robust Continual Learning | Bai, Tao / Chen, Chen / Lyu, Lingjuan / Zhao, Jun / Wen, Bihan et al. | 2023
- Ultimate Negative Sampling for Contrastive Learning | Guo, Huijie / Shi, Lei et al. | 2023
- A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation | Huang, Wen-Chin / Peloquin, Benjamin / Kao, Justine / Wang, Changhan / Gong, Hongyu / Salesky, Elizabeth / Adi, Yossi / Lee, Ann / Chen, Peng-Jen et al. | 2023
- T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5 | Hsu, Chan-Jan / Chung, Ho-Lam / Lee, Hung-Yi / Tsao, Yu et al. | 2023
- CD-FSOD: A Benchmark For Cross-Domain Few-Shot Object Detection | Xiong, Wuti et al. | 2023
- Elliptical Wishart Distribution: Maximum Likelihood Estimator from Information Geometry | Ayadi, Imen / Bouchard, Florent / Pascal, Frederic et al. | 2023
- Distributed Bayesian Tracking on the Special Euclidean Group Using Lie Algebra Parametric Approximations | Bordin, Claudio J. / de Figueredo, Caio G. / Bruno, Marcelo G. S. et al. | 2023
- Asynchronous Social Learning | Cemri, Mert / Bordignon, Virginia / Kayaalp, Mert / Shumovskaia, Valentina / Sayed, Ali H. et al. | 2023
- Cramér-Rao Bound on Lie Groups with Observations on Lie Groups: Application to SE(2) | Labsir, Samy / Renaux, Alexandre / Vila-Valls, Jordi / Chaumette, Eric et al. | 2023
- D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement | Zhao, Shengkui / Ma, Bin et al. | 2023
- Extended Kalman Filter for Graph Signals in Nonlinear Dynamic Systems | Sagi, Guy / Shlezinger, Nir / Routtenberg, Tirza et al. | 2023
- Perspective Projection-Based 3d CT Reconstruction from Biplanar X-Rays | Kyung, Daeun / Jo, Kyungmin / Choo, Jaegul / Lee, Joonseok / Choi, Edward et al. | 2023
- Tg-Critic: A Timbre-Guided Model For Reference-Independent Singing Evaluation | Sun, Xiaoheng / Gao, Yuejie / Lin, Hanyao / Liu, Huaping et al. | 2023
- Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models | Ashihara, Takanori / Moriya, Takafumi / Matsuura, Kohei / Tanaka, Tomohiro et al. | 2023
- Frequency Bin-Wise Single Channel Speech Presence Probability Estimation Using Multiple DNNS | Tao, Shuai / Reddy, Himavanth / Jensen, Jesper Rindom / Christensen, Mads Grasboll et al. | 2023
- Structural Optimization of Factor Graphs for Symbol Detection via Continuous Clustering and Machine Learning | Rapp, Lukas / Schmid, Luca / Rode, Andrej / Schmalen, Laurent et al. | 2023
- Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement | Yang, Da-Hee / Chang, Joon-Hyuk et al. | 2023
- Egocentric Action Anticipation for Personal Health | Rodin, Ivan / Furnari, Antonino / Mavroeidis, Dimitrios / Farinella, Giovanni Maria et al. | 2023
- Enhanced Low-Resolution LiDAR-Camera Calibration via Depth Interpolation and Supervised Contrastive Learning | Zhang, Zhikang / Yu, Zifan / You, Suya / Rao, Raghuveer / Agarwal, Sanjeev / Ren, Fengbo et al. | 2023
- SCSGNet: Spatial-Correlated and Shape-Guided Network for Breast Mass Segmentation | Li, Qingqiu / Xu, Jilan / Yuan, Runtian / Zhang, Yuejie / Feng, Rui et al. | 2023
- A Progressive Neural Network for Acoustic Echo Cancellation | Chen, Zhuangqi / Xia, Xianjun / Sun, Siyu / Wang, Ziqian / Chen, Cheng / Xie, Guoliang / Zhang, Pingjian / Xiao, Yijian et al. | 2023
- Ensemble Knowledge Distillation of Self-Supervised Speech Models | Huang, Kuan-Po / Feng, Tzu-Hsun / Fu, Yu-Kuan / Hsu, Tsu-Yuan / Yen, Po-Chieh / Tseng, Wei-Cheng / Chang, Kai-Wei / Lee, Hung-Yi et al. | 2023
- On Crowdsourcing-Design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms | Suarez, Angelica S. Z. / Laroche, Clement / Clemmensen, Line H. / Das, Sneha et al. | 2023
- Rate-Distortion Optimization with Alternative References for UGC Video Compression | Xiong, Xin / Pavez, Eduardo / Ortega, Antonio / Adsumilli, Balu et al. | 2023
- Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec | Wu, Yi-Chiao / Gebru, Israel D. / Markovic, Dejan / Richard, Alexander et al. | 2023
- Image Reconstruction without Explicit Priors | Gao, Angela F. / Leong, Oscar / Sun, He / Bouman, Katherine L. et al. | 2023
- Classification via Subspace Learning Machine (SLM): Methodology and Performance Evaluation | Fu, Hongyu / Yang, Yijing / Mishra, Vinod K. / Jay Kuo, C.-C. et al. | 2023
- A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement | Xu, Haitao / Wei, Liangfa / Zhang, Jie / Yang, Jianming / Wang, Yannan / Gao, Tian / Fang, Xin / Dai, Lirong et al. | 2023
- 1
-
Multi-Scale Compositional Constraints for Representation Learning on VideosParaskevopoulos, Georgios / Lavania, Chandrashekhar / Chum, Lovish / Sundaram, Shiva et al. | 2023
- 1
-
Enhanced GM-PHD Filter for Real Time Satellite Multi-Target TrackingAguilar, Camilo / Ortner, Mathias / Zerubia, Josiane et al. | 2023
- 1
-
De’hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech RecognitionNg, Dianwen / Zhang, Ruixi / Yip, Jia Qi / Yang, Zhao / Ni, Jinjie / Zhang, Chong / Ma, Yukun / Ni, Chongjia / Chng, Eng Siong / Ma, Bin et al. | 2023
- 1
-
Weakly- and Semi-Supervised Object LocalizationHuang, Zhen-Tang / Chen, Yan-He / Yeh, Mei-Chen et al. | 2023
- 1
-
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in TorchaudioKumar, Anurag / Tan, Ke / Ni, Zhaoheng / Manocha, Pranay / Zhang, Xiaohui / Henderson, Ethan / Xu, Buye et al. | 2023
- 1
-
Coarse-to-Fine Covid-19 Segmentation via Vision-Language AlignmentShan, Dandan / Li, Zihan / Chen, Wentao / Li, Qingde / Tian, Jie / Hong, Qingqi et al. | 2023
- 1
-
EMC2-Net: Joint Equalization and Modulation Classification Based on Constellation NetworkRyu, Hyun / Choi, Junil et al. | 2023
- 1
-
Ripple Sparse Self-Attention for Monaural Speech EnhancementZhang, Qiquan / Zhu, Hongxu / Song, Qi / Qian, Xinyuan / Ni, Zhaoheng / Li, Haizhou et al. | 2023
- 1
-
A Physically Explainable Framework for Human-Related Anomaly DetectionJiang, Yalong / Li, Huining / Li, Changkang et al. | 2023
- 1
-
Noncoherent Multiuser Grassmannian Constellations for the MIMO Multiple Access ChannelAlvarez-Vizoso, Javier / Cuevas, Diego / Beltran, Carlos / Santamaria, Ignacio / Tucek, Vit / Peters, Gunnar et al. | 2023
- 1
-
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification SystemsCai, Danwei / Cai, Zexin / Li, Ming et al. | 2023
- 1
-
A Compensated Shrinkage Affine Projection Algorithm for Debiased Sparse Adaptive FilteringZhang, Yi / Yamada, Isao et al. | 2023
- 1
-
Cross-Domain Object Classification Via Successive Subspace AlignmentChen, Kecheng / Li, Haoliang / Yan, Hong et al. | 2023
- 1
-
Textless Direct Speech-to-Speech Translation with Discrete Speech RepresentationLi, Xinjian / Jia, Ye / Chiu, Chung-Cheng et al. | 2023
- 1
-
Speaker-Independent Acoustic-to-Articulatory Speech InversionWu, Peter / Chen, Li-Wei / Cho, Cheol Jun / Watanabe, Shinji / Goldstein, Louis / Black, Alan W / Anumanchipalli, Gopala K. et al. | 2023
- 1
-
Single-Photon Image Super-Resolution via Self-Supervised LearningChen, Yiwei / Jiang, Chen / Pan, Yu et al. | 2023
- 1
-
TSPTQ-ViT: Two-Scaled Post-Training Quantization for Vision TransformerTai, Yu-Shan / Lin, Ming-Guang / Wu, An-Yeu Andy et al. | 2023
- 1
-
Sparse Error Correction for Power Network ParametersSenaratne, Dilan / Kim, Jinsub et al. | 2023
- 1
-
An Evaluation Platform to Scope Performance of Synthetic Environments in Autonomous Ground Vehicles SimulationBai, Xiangyu / Jiang, Le / Luo, Yedi / Gupta, Aniket / Kaveti, Pushyami / Singh, Hanumant / Ostadabbas, Sarah et al. | 2023
- 1
-
Quaternion Orthogonal Transformer for Facial Expression Recognition in the WildZhou, Yu / Guo, Liyuan / Jin, Lianghai et al. | 2023
- 1
-
HQP-MVS: High-Quality Plane Priors Assisted Multi-View Stereo for Low-Textured AreasTian, Zefan / Wang, Rongjie / Wang, Zhenyu / Wang, Ronggang et al. | 2023
- 1
-
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning AnalysisSong, Meishu / Triantafyllopoulos, Andreas / Yang, Zijiang / Takeuchi, Hiroki / Nakamura, Toru / Kishi, Akifumi / Ishizawa, Tetsuro / Yoshiuchi, Kazuhiro / Jing, Xin / Karas, Vincent et al. | 2023
- 1
-
ICCRN: Inplace Cepstral Convolutional Recurrent Neural Network for Monaural Speech EnhancementLiu, Jinjiang / Zhang, Xueliang et al. | 2023
- 1
-
CROSSSPEECH: Speaker-Independent Acoustic Representation for Cross-Lingual Speech SynthesisKim, Ji-Hoon / Yang, Hong-Sun / Ju, Yoon-Cheol / Kim, Il-Hwan / Kim, Byeong-Yeol et al. | 2023
- 1
-
Ensemble Prosody Prediction For Expressive Speech SynthesisTeh, Tian Huey / Hu, Vivian / Ram Mohan, Devang S / Hodari, Zack / Wallis, Christopher G. R. / Gomez Ibarrondo, Tomas / Torresquintero, Alexandra / Leoni, James / Gales, Mark / King, Simon et al. | 2023
- 1
-
Progressive Meta-Pooling Learning for Lightweight Image Classification ModelDong, Peijie / Niu, Xin / Tian, Zhiliang / Li, Lujun / Wang, Xiaodong / Wei, Zimian / Pan, Hengyue / Li, Dongsheng et al. | 2023
- 1
-
Euro: Espnet Unsupervised ASR Open-Source ToolkitGao, Dongji / Shi, Jiatong / Chuang, Shun-Po / Garcia, Leibny Paola / Lee, Hung-Yi / Watanabe, Shinji / Khudanpur, Sanjeev et al. | 2023
- 1
-
Learning Generalizable Light Field Networks from Few ImagesLi, Qian / Multon, Franck / Boukhayma, Adnane et al. | 2023
- 1
-
Cross-Domain Diffusion Based Speech Enhancement for Very Noisy SpeechWang, Heming / Wang, DeLiang et al. | 2023
- 1
-
A Few Shot Learning of Singing Technique Conversion Based on Cycle Consistency Generative Adversarial NetworksChen, Po-Wei / Soo, Von-Wun et al. | 2023
- 1
-
Compressed Distributed Regression over Adaptive NetworksCarpentiero, Marco / Matta, Vincenzo / Sayed, Ali H. et al. | 2023
- 1
-
An Approach to Ontological Learning from Weak LabelsShah, Ankit / Tang, Larry / Chou, Po Hao / Zheng, Yi Yu / Ge, Ziqian / Raj, Bhiksha et al. | 2023
- 1
-
Sequential Datum-Wise Joint Feature Selection and Classification in the Presence of External ClassifierEkanayake, Sachini Piyoni / Zois, Daphney-Stavroula / Chelmis, Charalampos et al. | 2023
- 1
-
Learning From Label Proportion with Online Pseudo-Label Decision by Regret MinimizationMatsuo, Shinnosuke / Bise, Ryoma / Uchida, Seiichi / Suehiro, Daiki et al. | 2023
- 1
-
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech SeparationLi, Chenda / Wu, Yifei / Qian, Yanmin et al. | 2023
- 1
-
Fine-Grained Emotional Control of Text-to-Speech: Learning to Rank Inter- and Intra-Class Emotion IntensitiesWang, Shijun / Gudnason, Jon / Borth, Damian et al. | 2023
- 1
-
Role of Bias Terms in Dot-Product AttentionNamazifar, Mahdi / Hazarika, Devamanyu / Hakkani-Tur, Dilek et al. | 2023
- 1
-
Learning Interpretable Filters In Wav-UNet For Speech EnhancementMathieu, Felix / Courtat, Thomas / Richard, Gael / Peeters, Geoffroy et al. | 2023
- 1
-
Cochlear Decomposition: A Novel Bio-Inspired Multiscale Analysis FrameworkAlfalahi, Hessa / Khandoker, Ahsan / Alhussein, Ghada / Hadjileontiadis, Leontios et al. | 2023
- 1
-
Contrastive Learning of Sentence Embeddings in Product SearchZhang, Bo-Wen / Yan, Yan / Yu, Jiapei et al. | 2023
- 1
-
Leveraging Sparsity with Spiking Recurrent Neural Networks for Energy-Efficient Keyword SpottingDampfhoffer, Manon / Mesquida, Thomas / Hardy, Emmanuel / Valentian, Alexandre / Anghel, Lorena et al. | 2023
- 1
-
A Quantum Approach for Stochastic Constrained Binary OptimizationGupta, Sarthak / Kekatos, Vassilis et al. | 2023
- 1
-
Joint Antenna Selection and Beamforming in Integrated Automotive Radar Sensing-Communications with Quantized Double Phase ShiftersXu, Lifan / Sun, Shunqiao / Zhang, Yimin D. / Petropulu, Athina et al. | 2023
- 1
-
MODEFORMER: Modality-Preserving Embedding For Audio-Video Synchronization Using TransformersGupta, Akash / Tripathi, Rohun / Jang, Wondong et al. | 2023
- 1
-
Semi-Supervised Learning with Per-Class Adaptive Confidence Scores for Acoustic Environment Classification with Imbalanced DataFiorio, Luan Vinicius / Karanov, Boris / David, Johan / Houtum, Wim van / Widdershoven, Frans / Aarts, Ronald M. et al. | 2023
- 1
-
Database-Aware ASR Error Correction for Speech-to-SQL ParsingShao, Yutong / Kumar, Arun / Nakashole, Ndapa et al. | 2023
- 1
-
Convolutional Filtering on Sampled ManifoldsWang, Zhiyang / Ruiz, Luana / Ribeiro, Alejandro et al. | 2023
- 1
-
A Database for Multi-Modal Short Video Quality AssessmentZhang, Yukun / Wang, Chuan / Zhang, Sanyi / Cao, Xiaochun et al. | 2023
- 1
-
E-Prevention: The ICASSP-2023 Challenge on Person Identification and Relapse Detection from Continuous Recordings of BiosignalsZlatintsi, A. / Filntisis, P. P. / Efthymiou, N. / Garoufis, C. / Retsinas, G. / Sounapoglou, T. / Maglogiannis, I. / Tsanakas, P. / Smyrnis, N. / Maragos, P. et al. | 2023
- 1
-
Coarse-To-Fine Knowledge Selection for Document Grounded DialogsZhang, Yeqin / Fu, Haomin / Fu, Cheng / Yu, Haiyang / Li, Yongbin / Nguyen, Cam-Tu et al. | 2023
- 1
-
PRIME: 3D Human Pose and Body Shape Recovery with Perspective ProjectionXu, Baobei / Fang, Shukai / Li, Zhaoyang / Yang, Shicai / Xie, Di / Pu, Shiliang et al. | 2023
- 1
-
Parafac2-Based Coupled Matrix and Tensor FactorizationsSchenker, Carla / Wang, Xiulin / Acar, Evrim et al. | 2023
- 1
-
Single-Anchor UWB Localization Using Channel Impulse Response DistributionsLi, Sitian / Balatsoukas-Stimming, Alexios / Burg, Andreas et al. | 2023
- 1
-
Sparsity-Driven Joint Blind Deconvolution-Demodulation with Application to Motor Fault DetectionKelkar, Varun A. / Liu, Dehong / Inoue, Hiroshi / Kanemaru, Makoto et al. | 2023
- 1
-
Syngen: A Syntactic Plug-And-Play Module for Generative Aspect-Based Sentiment AnalysisYu, Chengze / Wu, Taiqiang / Li, Jiayi / Bai, Xingyu / Yang, Yujiu et al. | 2023
- 1
-
AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text NormalizationNguyen, Thai-Binh / Nhat, Le Duc Minh / Nguyen, Quang Minh / Do, Quoc Truong / Luong, Chi Mai / Waibel, Alexander et al. | 2023
- 1
-
Raising The Limit of Image Rescaling Using Auxiliary EncodingYin, Chenzhong / Pan, Zhihong / Zhou, Xin / Kang, Le / Bogdan, Paul et al. | 2023
- 1
-
Improved Appliance Transient Feature Extraction Via Template MatchingLiu, Bo / Chang, Fenglei / Luan, Wenpeng / Zhao, Bochao et al. | 2023
- 1
-
Deep Proximal Gradient Method for Learned Convex RegularizersBerk, Aaron / Ma, Yanting / Boufounos, Petros / Wang, Pu / Mansour, Hassan et al. | 2023
- 1
-
Joint Millimeter-Wave AoD and AoA Estimation Using one OFDM Symbol and Frequency-Dependent BeamsBoljanovic, Veljko / Cabric, Danijela et al. | 2023
- 1
-
MPS-AMS: Masked Patches Selection and Adaptive Masking Strategy Based Self-Supervised Medical Image SegmentationWang, Xiangtao / Wang, Ruizhi / Tian, Biao / Zhang, Jiaojiao / Zhang, Shuo / Chen, Junyang / Lukasiewicz, Thomas / Xu, Zhenghua et al. | 2023
- 1
-
An Efficient Beam-Sharing Algorithm for RIS-aided Simultaneous Wireless Information and Power Transfer ApplicationsTran, Nguyen Minh / Amri, Muhammad Miftahul / Park, Je Hyeon / Kim, Dong In / Choi, Kae Won et al. | 2023
- 1
-
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label SmoothingKakouros, Sofoklis / Stafylakis, Themos / Mosner, Ladislav / Burget, Lukas et al. | 2023
- 1
-
Unsupervised Video Anomaly Detection For Stereotypical Behaviours in AutismGao, Jiaqi / Jiang, Xinyang / Yang, Yuqing / Li, Dongsheng / Qiu, Lili et al. | 2023
- 1
-
Self-Supervised Learning for Speech Enhancement Through SynthesisIrvin, Bryce / Stamenovic, Marko / Kegler, Mikolaj / Yang, Li-Chia et al. | 2023
- 1
-
Group Personalized Federated LearningLiu, Zhe / Hui, Yue / Peng, Fuchun et al. | 2023
- 1
-
RD-NAS: Enhancing One-Shot Supernet Ranking Ability Via Ranking Distillation From Zero-Cost ProxiesDong, Peijie / Niu, Xin / Li, Lujun / Tian, Zhiliang / Wang, Xiaodong / Wei, Zimian / Pan, Hengyue / Li, Dongsheng et al. | 2023
- 1
-
Detection of Real-Time Deepfakes in Video Conferencing with Active Probing and Corneal ReflectionGuo, Hui / Wang, Xin / Lyu, Siwei et al. | 2023
- 1
-
Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter ManipulationWang, Kai / Yang, Yuhang / Huang, Hao / Hu, Ying / Li, Sheng et al. | 2023
- 1
-
TT-Net: Dual-Path Transformer Based Sound Field Translation in the Spherical Harmonic DomainWang, Yiwen / Lan, Zijian / Wu, Xihong / Qu, Tianshu et al. | 2023
- 1
-
Low-Rank Tensor Decompositions for Quaternion Multiway ArraysImhogiemhe, Osimone / Flamant, Julien / Luciani, Xavier / Zniyed, Yassine / Miron, Sebastian et al. | 2023
- 1
-
A Novel State Connection Strategy for Quantum Computing to Represent and Compress Digital ImagesHaque, Md Ershadul / Paul, Manoranjan / Ulhaq, Anwar / Debnath, Tanmoy et al. | 2023
- 1
-
Line Segment Matching Based on Intersection-Enhanced Point CorrespondencesLiu, Zhiyu / Zhong, Baojiang et al. | 2023
- 1
-
Centroid Distance Distillation for Effective Rehearsal in Continual LearningLiu, Daofeng / Lyu, Fan / Li, Linyan / Xia, Zhenping / Hu, Fuyuan et al. | 2023
- 1
-
Deep Network Series for Large-Scale High-Dynamic Range ImagingAghabiglou, Amir / Terris, Matthieu / Jackson, Adrian / Wiaux, Yves et al. | 2023
- 1
-
Longshortnet: Exploring Temporal and Semantic Features Fusion In Streaming PerceptionLi, Chenyang / Cheng, Zhi-Qi / He, Jun-Yan / Li, Pengyu / Luo, Bin / Chen, Hanyuan / Geng, Yifeng / Lan, Jin-Peng / Xie, Xuansong et al. | 2023
- 1
-
Wav2vec-Based Detection and Severity Level Classification of Dysarthria From SpeechJavanmardi, Farhad / Tirronen, Saska / Kodali, Manila / Kadiri, Sudarsana Reddy / Alku, Paavo et al. | 2023
- 1
-
Efficient Protein Structural Class Prediction Via Chaos Game Representation and Recurrent Neural NetworksZervou, Michaela Areti / Doutsi, Effrosyni / Tsakalides, Panagiotis et al. | 2023
- 1
-
Multi-Layer Seasonal Perception Network for Time Series ForecastingWang, Ruoshu / Miao, Shengfa / Liu, Di / Jin, Xin / Zhang, Weisheng et al. | 2023
- 1
-
Wiener Filtering Without Covariance Matrix InversionDamale, Pranav U. / Chong, Edwin K. P. / Scharf, Louis L. et al. | 2023
- 1
-
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech RecognitionYang, Chao-Han Huck / Li, Bo / Zhang, Yu / Chen, Nanxin / Prabhavalkar, Rohit / Sainath, Tara N. / Strohman, Trevor et al. | 2023
- 1
-
Symbol-Level Precoding is Related to Parameter Estimation from Quantized DataShao, Mingjie / Ma, Wing-Kin / Liu, Yatao et al. | 2023
- 1
-
Direction Aware Positional and Structural Encoding for Directed Graph Neural NetworksSium, Yonas / Kollias, Georgios / Ide, Tsuyoshi / Das, Payel / Abe, Naoki / Lozano, Aurelie / Li, Qi et al. | 2023
- 1
-
Nowcasting of Extreme Precipitation Using Deep Generative ModelsBi, Haoran / Kyryliuk, Maksym / Wang, Zhiyi / Meo, Cristian / Wang, Yanbo / Imhoff, Ruben / Uijlenhoet, Remko / Dauwels, Justin et al. | 2023
- 1
-
KEPS-NET: Robust Parking Slot Detection Based Keypoint Estimation for High Localization AccuracyLee, Jaewoo / Sung, Kapje / Park, Daeul / Jeon, Younghan et al. | 2023
- 1
-
Randmasking Augment: A Simple and Randomized Data Augmentation For Acoustic Scene ClassificationHan, Jubum / Matuszewski, Mateusz / Sikorski, Olaf / Sung, Hosang / Cho, Hoonyoung et al. | 2023
- 1
-
Structured Errors-in-Variables Modelling for Cortico-Muscular Coherence EnhancementGuo, Zhenghao / McClelland, Verity M. / Dai, Wei / Cvetkovic, Zoran et al. | 2023
- 1
-
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition ModelsHernandez, Steven M. / Zhao, Ding / Ding, Shaojin / Bruguier, Antoine / Prabhavalkar, Rohit / Sainath, Tara N. / He, Yanzhang / McGraw, Ian et al. | 2023
- 1
-
Abstract Representation for Multi-Intent Spoken Language UnderstandingAbrougui, Rim / Damnati, Geraldine / Heinecke, Johannes / Bechet, Frederic et al. | 2023
- 1
-
Learning to Locate the Text Forgery in Smartphone ScreenshotsYu, Zeqin / Li, Bin / Lin, Yuzhen / Zeng, Jinhua / Zeng, Jishen et al. | 2023
- 1
-
Hankel Structured Low Rank and Sparse Representation Via L0-Norm Optimization for Compressed Ultrasound Plane Wave Signal ReconstructionZhang, Miaomiao / Chen, Ji / Fu, Xiaoyan / Xin, Ge / Zhang, Jingzhi / Jiang, Na / D'Hooge, Jan et al. | 2023
- 1
-
Heterogeneous Graph Learning for Acoustic Event ClassificationShirian, Amir / Ahmadian, Mona / Somandepalli, Krishna / Guha, Tanaya et al. | 2023
- 1
-
Accelerating Matrix Trace Estimation by Aitken’s Δ² ProcessKalantzis, Vassilis / Kollias, Georgios / Ubaru, Shashanka / Salonidis, Theodoros et al. | 2023
- 1
-
Predicting Brain Age Using Transferable Covariance Neural NetworksSihag, Saurabh / Mateos, Gonzalo / McMillan, Corey / Ribeiro, Alejandro et al. | 2023
- 1
-
Knowledge-Aware Graph Convolutional Network with Utterance-Specific Window Search for Emotion Recognition In ConversationsZhang, Xiaotong / He, Peng / Liu, Han / Yin, Zhengxi / Liu, Xinyue / Zhang, Xianchao et al. | 2023
- 1
-
DiffVoice: Text-to-Speech with Latent DiffusionLiu, Zhijun / Guo, Yiwei / Yu, Kai et al. | 2023
- 1
-
Optimal Condition Training for Target Source SeparationTzinis, Efthymios / Wichern, Gordon / Smaragdis, Paris / Roux, Jonathan Le et al. | 2023
- 1
-
Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech RecognitionLiu, Qianying / Gong, Zhuo / Yang, Zhengdong / Yang, Yuhang / Li, Sheng / Ding, Chenchen / Minematsu, Nobuaki / Huang, Hao / Cheng, Fei / Chu, Chenhui et al. | 2023
- 1
-
Multi-Task Sub-Band Network For Deep Residual Echo SuppressionSun, Jiayao / Luo, Dawei / Li, Zhaoxia / Li, Jindong / Ju, Yukai / Li, Yang et al. | 2023
- 1
-
Deep Adaptive Superpixels For Hadamard Single Pixel Imaging In Near-Infrared SpectrumMonroy, Brayan / Bacca, Jorge / Arguello, Henry et al. | 2023
- 1
-
On The Detection of Synthetic Images Generated by Diffusion ModelsCorvi, Riccardo / Cozzolino, Davide / Zingarini, Giada / Poggi, Giovanni / Nagano, Koki / Verdoliva, Luisa et al. | 2023
- 1
-
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech SeparationRavenscroft, William / Goetze, Stefan / Hain, Thomas et al. | 2023
- 1
-
Robust Hyperspectral Anomaly Detection with Simultaneous Mixed Noise Removal via Constrained Convex OptimizationSato, Koyo / Ono, Shunsuke et al. | 2023
- 1
-
Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion RecognitionPraveen, R Gnana / Granger, Eric / Cardinal, Patrick et al. | 2023
- 1
-
Robust Dominant Periodicity Detection for Time Series with Missing DataWen, Qingsong / Yang, Linxiao / Sun, Liang et al. | 2023
- 1
-
Self Supervised Bert for Legal Text ClassificationPal, Arghya / Rajanala, Sailaja / Phan, Raphael C.-W. / Wong, Koksheik et al. | 2023
- 1
-
Make Your Enemy Your Friend: Improving Image Rotation Angle Estimation with HarmonicsYu, Kun / Hosseini, Morteza Darvish Morshedi / Peng, Anjie / Zeng, Hui / Goljan, Miroslav et al. | 2023
- 1
-
Multilingual End-To-End Spoken Language Understanding For Ultra-Low Footprint ApplicationsMuller, Markus / Alexandridis, Anastasios / Trozenski, Zach / Whiteman, Joel / Strimel, Grant / Susanj, Nathan / Mouchtaris, Athanasios / Kunzmann, Siegfried et al. | 2023
- 1
-
Self-Convolution for Automatic Speech RecognitionZhang, Tian-Hao / Liu, Qi / Qian, Xinyuan / Chen, Song-Lu / Chen, Feng / Yin, Xu-Cheng et al. | 2023
- 1
-
Avoid Overthinking in Self-Supervised Models for Speech RecognitionBerrebbi, Dan / Yan, Brian / Watanabe, Shinji et al. | 2023
- 1
-
Building Blocks for a Complex-Valued Transformer ArchitectureEilers, Florian / Jiang, Xiaoyi et al. | 2023
- 1
-
FedPrompt: Communication-Efficient and Privacy-Preserving Prompt Tuning in Federated LearningZhao, Haodong / Du, Wei / Li, Fangqi / Li, Peixuan / Liu, Gongshen et al. | 2023
- 1
-
Lexicon-injected Semantic Parsing for Task-Oriented DialogMeng, Xiaojun / Dai, Wenlin / Wang, Yasheng / Wang, Baojun / Wu, Zhiyong / Jiang, Xin / Liu, Qun et al. | 2023
- 1
-
Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting DiarizationCord-Landwehr, Tobias / Boeddeker, Christoph / Zorila, Catalin / Doddipatla, Rama / Haeb-Umbach, Reinhold et al. | 2023
- 1
-
Autotts: End-to-End Text-to-Speech Synthesis Through Differentiable Duration ModelingNguyen, Bac / Cardinaux, Fabien / Uhlich, Stefan et al. | 2023
- 1
-
Heuristic Masking for Text Representation PretrainingZhuang, Yimeng et al. | 2023
- 1
-
Speech Privacy Leakage from Shared Gradients in Distributed LearningLi, Zhuohang / Zhang, Jiaxin / Liu, Jian et al. | 2023
- 1
-
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword SpottingLabrador, Beltran / Zhao, Guanlong / Moreno, Ignacio Lopez / Scorza Scarpati, Angelo / Fowl, Liam / Wang, Quan et al. | 2023
- 1
-
Audio-Visual Inpainting: Reconstructing Missing Visual Information with SoundSanguineti, Valentina / Thakur, Sanket / Morerio, Pietro / Del Bue, Alessio / Murino, Vittorio et al. | 2023
- 1
-
Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank InitialisationAnderson, Mark / Kinnunen, Tomi / Harte, Naomi et al. | 2023
- 1
-
Background-Weakening Consistency Regularization for Semi-Supervised Video Action DetectionZhong, Xian / Yi, Aoyu / Liu, Wenxuan / Huang, Wenxin / Zou, Chengming / Wang, Zheng et al. | 2023
- 1
-
Biologically-Inspired Continual Learning of Human Motion SequencesOtt, Joachim / Liu, Shih-Chii et al. | 2023
- 1
-
Deep Architecture for DOA Trajectory LocalizationJaiswal, Shreyas / Pandey, Ruchi / Nannuru, Santosh et al. | 2023
- 1
-
Cross-Subject Mental Fatigue Detection based on Separable Spatio-Temporal Feature AggregationYe, Yalan / He, Yutuo / Huang, Wanjing / Dong, Qiaosen / Wang, Chong / Wang, Guoqing et al. | 2023
- 1
-
Reliability Estimation for Synthetic Speech DetectionSalvi, Davide / Bestagini, Paolo / Tubaro, Stefano et al. | 2023
- 1
-
Personalizing Federated Learning with Over-The-Air ComputationsChen, Zihan / Li, Zeshen / Yang, Howard H. / Quek, Tony Q. S. et al. | 2023
- 1
-
Unified Keyword Spotting and Audio Tagging on Mobile Devices with TransformersDinkel, Heinrich / Wang, Yongqing / Yan, Zhiyong / Zhang, Junbo / Wang, Yujun et al. | 2023
- 1
-
Identifying Entrainment in Task-Oriented ConversationsChen, Run / Kim, Seokhwan / Papangelis, Alexandros / Hirschberg, Julia / Liu, Yang / Hakkani-Tur, Dilek et al. | 2023
- 1
-
Element Selection with Wide Class of Optimization Criteria Using Non-Convex Sparse OptimizationKawamura, Taiga / Ueno, Natsuki / Ono, Nobutaka et al. | 2023
- 1
-
Cardiac Disease Diagnosis on Imbalanced Electrocardiography Data Through Optimal Transport AugmentationQiu, Jielin / Zhu, Jiacheng / Xu, Mengdi / Huang, Peide / Rosenberg, Michael / Weber, Douglas / Liu, Emerson / Zhao, Ding et al. | 2023
- 1
-
A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech UnitsChen, Li-Wei / Watanabe, Shinji / Rudnicky, Alexander et al. | 2023
- 1
-
Exploring Language-Agnostic Speech Representations Using Domain Knowledge for Detecting Alzheimer’s DementiaShah, Zehra / Qi, Shi-Ang / Wang, Fei / Farrokh, Mahtab / Tasnim, Mashrura / Stroulia, Eleni / Greiner, Russell / Plitsis, Manos / Katsamanis, Athanasios et al. | 2023
- 1
-
Using Emotion Embeddings to Transfer Knowledge between Emotions, Languages, and Annotation FormatsChochlakis, Georgios / Mahajan, Gireesh / Baruah, Sabyasachee / Burghardt, Keith / Lerman, Kristina / Narayanan, Shrikanth et al. | 2023
- 1
-
MDR-MFI: Multi-Branch Decoupled Regression and Multi-Scale Feature Interaction for Partial-to-Partial Cloud RegistrationDai, Weidong / Yan, Xuejun / Wang, Jingjing / Xie, Di / Pu, Shiliang et al. | 2023
- 1
-
Semantically-Informed Deep Neural Networks For Sound RecognitionEsposito, Michele / Valente, Giancarlo / Plasencia-Calana, Yenisel / Dumontier, Michel / Giordano, Bruno L. / Formisano, Elia et al. | 2023
- 1
-
X-SEPFORMER: End-To-End Speaker Extraction Network with Explicit Optimization on Speaker ConfusionLiu, Kai / Du, Ziqing / Wan, Xucheng / Zhou, Huan et al. | 2023
- 1
-
QI-TTS: Questioning Intonation Control for Emotional Speech SynthesisTang, Haobin / Zhang, Xulong / Wang, Jianzong / Cheng, Ning / Xiao, Jing et al. | 2023
- 1
-
Cross-Head Supervision for Crowd Counting with Noisy AnnotationsDai, Mingliang / Huang, Zhizhong / Gao, Jiaqi / Shan, Hongming / Zhang, Junping et al. | 2023
- 1
-
UNeXt: a Low-Dose CT denoising UNet model with the modified ConvNeXt blockMazandarani, Farzan Niknejad / Babyn, Paul / Alirezaie, Javad et al. | 2023
- 1
-
Compensatory Debiasing For Gender Imbalances In Language ModelsWoo, Tae-Jin / Nam, Woo-Jeoung / Ju, Yeong-Joon / Lee, Seong-Whan et al. | 2023
- 1
-
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech UnderstandingLi, Yingting / Mehrish, Ambuj / Bhardwaj, Rishabh / Majumder, Navonil / Cheng, Bo / Zhao, Shuai / Zadeh, Amir / Mihalcea, Rada / Poria, Soujanya et al. | 2023
- 1
-
Semantics-Disentangled Contrastive Embedding for Generalized Zero-Shot LearningNi, Jian / Liao, Yong et al. | 2023
- 1
-
Scalable Multi-Task Semantic Communication System with Feature Importance RankingHu, Jiangjing / Wang, Fengyu / Xu, Wenjun / Gao, Hui / Zhang, Ping et al. | 2023
- 1
-
Contextual Similarity is More Valuable Than Character Similarity: An Empirical Study for Chinese Spell CheckingZhang, Ding / Li, Yinghui / Zhou, Qingyu / Ma, Shirong / Li, Yangning / Cao, Yunbo / Zheng, Hai-Tao et al. | 2023
- 1
-
Cov Loss: Covariance-Based Loss for Deep Face RecognitionAlkanhal, Ibrahim / Almansour, Abdullah / Alsalloom, Lamia / Aljadaany, Raied / Savvides, Marios et al. | 2023
- 1
-
Space-Time Graph Neural Networks with Stochastic Graph PerturbationsHadou, Samar / Kanatsoulis, Charilaos I. / Ribeiro, Alejandro et al. | 2023
- 1
-
Automatic Error Detection in Integrated Circuits Image Segmentation: A Data-Driven ApproachZhang, Zhikang / Trindade, Bruno Machado / Green, Michael / Yu, Zifan / Pawlowicz, Christopher / Ren, Fengbo et al. | 2023
- 1
-
Deep Autoencoding One-Class Time Series Anomaly DetectionMou, Xudong / Wang, Rui / Wang, Tiejun / Sun, Jie / Li, Bo / Wo, Tianyu / Liu, Xudong et al. | 2023
- 1
-
Variational Message Passing-Based Respiratory Motion Estimation and Detection Using Radar SignalsModerl, Jakob / Leitinger, Erik / Pernkopf, Franz / Witrisal, Klaus et al. | 2023
- 1
-
Overlay Cognitive Radio Using Symbol Level Precoding With Quantized CSILiu, Lu / Swindlehurst, A. Lee et al. | 2023
- 1
-
Self-Paced Partial Domain-Aware Learning for Face Anti-SpoofingChen, Zhiyi / Lu, Yao / Deng, Xinzhe / Meng, Jia / Zhang, Shengchuan / Cao, Liujuan et al. | 2023
- 1
-
Free-View Expressive Talking Head Video EditingHuang, Yuantian / Iizuka, Satoshi / Fukui, Kazuhiro et al. | 2023
- 1
-
Towards Domain Generalisation in ASR with Elitist Sampling and Ensemble Knowledge DistillationAhmad, Rehan / Jalal, Md Asif / Umar Farooq, Muhammad / Ollerenshaw, Anna / Hain, Thomas et al. | 2023
- 1
-
Personalized Task Load Prediction in Speech CommunicationSpang, Robert P. / El Hajal, Karl / Moller, Sebastian / Cernak, Milos et al. | 2023
- 1
-
Cutting Through the Noise: An Empirical Comparison of Psycho-Acoustic and Envelope-based Features for Machinery Fault DetectionWibbrock, Peter / Richter, Yvonne / Pelkmann, David / Ren, Zhao / Palmer, Gregory et al. | 2023
- 1
-
Radio Sensing with Large Intelligent Surface for 6GVaca-Rubio, Cristian J. / Ramirez-Espinosa, Pablo / Kansanen, Kimmo / Tan, Zheng-Hua / Carvalho, Elisabeth de et al. | 2023
- 1
-
Boosting the Accuracy of SRAM-Based in-Memory Architectures Via Maximum Likelihood-Based Error Compensation MethodKim, Hyungyo / Shanbhag, Naresh et al. | 2023
- 1
-
DocRED-FE: A Document-Level Fine-Grained Entity and Relation Extraction DatasetWang, Hongbo / Xiong, Weimin / Song, Yifan / Zhu, Dawei / Xia, Yu / Li, Sujian et al. | 2023
- 1
-
Affinity Learning With Blind-Spot Self-Supervision for Image DenoisingZhou, Yuhongze / Zhou, Liguang / Laradji, Issam Hadj / Lun Lam, Tin / Xu, Yangsheng et al. | 2023
- 1
-
Learning Audio-Visual DereverberationChen, Changan / Sun, Wei / Harwath, David / Grauman, Kristen et al. | 2023
- 1
-
Boosting Person Re-Identification with Viewpoint Contrastive Learning and Adversarial TrainingShi, Xingyue / Liu, Hong / Shi, Wei / Zhou, Zihui / Li, Yidi et al. | 2023
- 1
-
Lightweight Prosody-TTS for Multi-Lingual Multi-Speaker ScenarioPamisetty, Giridhar / Varun, S Chaitanya / Murty, K Sri Rama et al. | 2023
- 1
-
Dialogue Context Modelling for Action Item Detection: Solution for ICASSP 2023 Mug Challenge Track 5Huang, Jie / Feng, Xiachong / Ye, Yangfan / Zhao, Liang / Feng, Xiaocheng / Qin, Bing / Liu, Ting et al. | 2023
- 1
-
A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive FiltersXuan, Yu / Zhang, Xiangyu / Li, Shuyue Stella / Shen, Zihan / Xie, Xin / Garcia, Leibny Paola / Togneri, Roberto et al. | 2023
- 1
-
Towards Learning Emotion Information from Short Segments of SpeechPurohit, Tilak / Yadav, Sarthak / Vlasenko, Bogdan / Dubagunta, S. Pavankumar / Magimai.-Doss, Mathew et al. | 2023
- 1
-
Knowledge-Aware Few Shot Learning for Event Detection from Short TextsGuo, Jinjin / Huang, Zhichao / Xu, Guangning / Zhang, Bowen / Duan, Chaoqun et al. | 2023
- 1
-
Rethink Pair-Wise Self-Supervised Cross-Modal Retrieval From A Contrastive Learning PerspectiveGong, Tiantian / Wang, Junsheng / Zhang, Liyan et al. | 2023
- 1
-
BHE-DARTS: Bilevel Optimization Based on Hypergradient Estimation for Differentiable Architecture SearchCai, Zicheng / Chen, Lei / Liu, Hai-Lin et al. | 2023
- 1
-
Optimizing Distributed Multi-Sensor Multi-Target Tracking Algorithm Based On Labeled Multi-Bernoulli FilterLiu, Honggang / Yang, Jinlong / Xu, Yue / Yang, Le et al. | 2023
- 1
-
Multi-Microphone Speaker Separation by Spatial RegionsWechsler, Julian / Chetupalli, Srikanth Raj / Mack, Wolfgang / Habets, Emanuel A. P. et al. | 2023
- 1
-
Learning To Generate 3D Representations of Building Roofs Using Single-View Aerial ImageryKhomiakov, Maxim / Mahou, Alejandro Valverde / Sanchez, Alba Reinders / Frellsen, Jes / Andersen, Michael Riis et al. | 2023
- 1
-
Single-Shot Fractional Fourier Phase RetrievalYang, Yixiao / Tao, Ran et al. | 2023
- 1
-
A Perceptual Neural Audio Coder with a Mean-Scale HyperpriorByun, Joon / Shin, Seungmin / Park, Youngcheol / Sung, Jongmo / Beack, Seungkwon et al. | 2023
- 1
-
Joint Compression and Demosaicking For Satellite ImagesBacchus, Pascal / Fraisse, Renaud / Roumy, Aline / Guillemot, Christine et al. | 2023
- 1
-
On the Role of Lip Articulation in Visual Speech PerceptionAldeneh, Zakaria / Fedzechkina, Masha / Seto, Skyler / Metcalf, Katherine / Sarabia, Miguel / Apostoloff, Nicholas / Theobald, Barry-John et al. | 2023
- 1
-
Graphit: Iterative Reweighted ℓ1 Algorithm for Sparse Graph Inference in State-Space ModelsChouzenoux, Emilie / Elvira, Victor et al. | 2023
- 1
-
DialogMI: A Dialogue Model Based on Enhancing Dialogue Mutual InformationZhang, Yibo / Gong, Ping / Wang, Zelin / Li, Zhe / Yang, Xuanyuan et al. | 2023
- 1
-
Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech CodingYang, Haici / Lim, Wootaek / Kim, Minje et al. | 2023
- 1
-
Learning Hybrid Representations of Semantics and Distortion for Blind Image Quality AssessmentWang, Xiaoqi / Xiong, Jian / Li, Bo / Suo, Jinli / Gao, Hao et al. | 2023
- 1
-
Joint Symbol-Level Precoding and Sub-Block-Level RIS Design for Dual-Function Radar-CommunicationsWu, Linlong / Wang, Bowen / Cheng, Ziyang / M. R, Bhavani Shankar / Ottersten, Bjorn et al. | 2023
- 1
-
Simplicial Vector Autoregressive Model For Streaming Edge FlowsKrishnan, Joshin / Money, Rohan / Beferull-Lozano, Baltasar / Isufi, Elvin et al. | 2023
- 1
-
Spatially Selective Deep Non-Linear Filters For Speaker ExtractionTesch, Kristina / Gerkmann, Timo et al. | 2023
- 1
-
Lit the Darkness: Three-Stage Zero-Shot Learning for Low-Light Enhancement with Multi-Neighbor Enhancement FactorsSaeed, Mariam / Torki, Marwan et al. | 2023
- 1
-
Monocular 3D Human Pose Estimation Based on Global Temporal-Attentive and Joints-Attention In VideoHe, Ruhan / Xiang, Shanshan / Tao, Peng / Yu, Yongsheng et al. | 2023
- 1
-
From Easy to Hard: Two-Stage Selector and Reader for Multi-Hop Question AnsweringLi, Xin-Yi / Lei, Wei-Jun / Yang, Yu-Bin et al. | 2023
- 1
-
MHLAT: Multi-Hop Label-Wise Attention Model for Automatic ICD CodingDuan, Junwen / Jiang, Han / Yu, Ying et al. | 2023
- 1
-
A Multi-Stage Low-Latency Enhancement System for Hearing AidsOuyang, Chengwei / Fei, Kexin / Zhou, Haoshuai / Lu, Congxi / Li, Linkai et al. | 2023
- 1
-
DAIS: The Delft Database of EEG Recordings of Dutch Articulated and Imagined SpeechDekker, Bo / Schouten, Alfred C. / Scharenborg, Odette et al. | 2023
- 1
-
A Two-Branch Network for Video Anomaly Detection with Spatio-Temporal Feature LearningLi, Guoqiu / Chen, Shengjie / Yang, Yujiu / Guo, Zhenhua et al. | 2023
- 1
-
Mitigating Domain Dependency for Improved Speech Enhancement Via SNR Loss BoostingYin, Lili / Wu, Di / Qiu, Zhibin / Huang, Hao et al. | 2023
- 1
-
The Role of Memory in Social Learning When Sharing Partial OpinionsCirillo, Michele / Bordignon, Virginia / Matta, Vincenzo / Sayed, A. H. et al. | 2023
- 1
-
Autonomous Navigation of a Robotic Swarm in Space Exploration MissionsZhang, Siwei / Baumgartner, Tobias / Staudinger, Emanuel / Pohlmann, Robert / Broghammer, Fabio / Dammann, Armin et al. | 2023
- 1
-
Masked Autoencoders are Articulatory LearnersAttia, Ahmed Adel / Espy-Wilson, Carol Y. et al. | 2023
- 1
-
Learn Topological Representation with Flexible Manifold LayerJiao, Ziheng / Zhang, Hongyuan / Li, Xuelong et al. | 2023
- 1
-
FED-3DA: A Dynamic and Personalized Federated Learning FrameworkWang, Hui / Sun, Jie / Wo, Tianyu / Liu, Xudong et al. | 2023
- 1
-
MCNeT: Measurement-Consistent Networks Via A Deep Implicit Layer For Solving Inverse ProblemsMourya, Rahul / Mota, Joao F. C. et al. | 2023
- 1
-
JNDMix: Jnd-Based Data Augmentation for No-Reference Image Quality Assessment | Sheng, Jiamu / Fan, Jiayuan / Ye, Peng / Cao, Jianjian et al. | 2023
- 1
-
Improving Speech Prosody of Audiobook Text-To-Speech Synthesis with Acoustic and Textual Contexts | Xin, Detai / Adavanne, Sharath / Ang, Federico / Kulkarni, Ashish / Takamichi, Shinnosuke / Saruwatari, Hiroshi et al. | 2023
- 1
-
Continuous Action Space-Based Spoken Language Acquisition Agent Using Residual Sentence Embedding and Transformer Decoder | Komatsu, Ryota / Kimura, Yusuke / Okamoto, Takuma / Shinozaki, Takahiro et al. | 2023
- 1
-
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks | Wang, Weiran / Zhao, Ding / Ding, Shaojin / Zhang, Hao / Chang, Shuo-Yiin / Rybach, David / Sainath, Tara N. / He, Yanzhang / McGraw, Ian / Kumar, Shankar et al. | 2023
- 1
-
Prompttts: Controllable Text-To-Speech With Text Descriptions | Guo, Zhifang / Leng, Yichong / Wu, Yihan / Zhao, Sheng / Tan, Xu et al. | 2023
- 1
-
Audio-Visual Speaker Diarization in the Framework of Multi-User Human-Robot Interaction | Dhaussy, Timothee / Jabaian, Bassam / Lefevre, Fabrice / Horaud, Radu et al. | 2023
- 1
-
EBEN: Extreme Bandwidth Extension Network Applied To Speech Signals Captured With Noise-Resilient Body-Conduction Microphones | Hauret, Julien / Joubaud, Thomas / Zimpfer, Veronique / Bavu, Eric et al. | 2023
- 1
-
Improving Speech Enhancement via Event-Based Query | Xin, Yifei / Peng, Xiulian / Lu, Yan et al. | 2023
- 1
-
FEW-Shot Continual Learning with Weight Alignment and Positive Enhancement for Bioacoustic Event Detection | Wu, Xiaoxiao / Xu, Dongxing / Wei, Haoran / Long, Yanhua et al. | 2023
- 1
-
Active IRS-Assisted MIMO Channel Estimation and Prediction | Haider, Mirza A. / Pavel, Saidur R. / Zhang, Yimin D. / Aboutanios, Elias et al. | 2023
- 1
-
Audio-Driven Facial Landmark Generation in Violin Performance using 3DCNN Network with Self Attention Model | Lin, Ting-Wei / Liu, Chao-Lin / Su, Li et al. | 2023
- 1
-
Towards Explainable Recommendation Via Bert-Guided Explanation Generator | Zhan, Huijing / Li, Ling / Li, Shaohua / Liu, Weide / Gupta, Manas / Kot, Alex C. et al. | 2023
- 1
-
Articulatory Representation Learning via Joint Factor Analysis and Neural Matrix Factorization | Lian, Jiachen / Black, Alan W / Lu, Yijing / Goldstein, Louis / Watanabe, Shinji / Anumanchipalli, Gopala K. et al. | 2023
- 1
-
Mask the Bias: Improving Domain-Adaptive Generalization of CTC-Based ASR with Internal Language Model Estimation | Das, Nilaksh / Sunkara, Monica / Bodapati, Sravan / Cai, Jinglun / Kulshreshtha, Devang / Farris, Jeff / Kirchhoff, Katrin et al. | 2023
- 1
-
Comparison of Soft and Hard Target RNN-T Distillation for Large-Scale ASR | Hwang, Dongseong / Chai Sim, Khe / Zhang, Yu / Strohman, Trevor et al. | 2023
- 1
-
Test-Time Training-Free Domain Adaptation | Feng, Yongxiang / He, Weihua / You, Kaichao / Liu, Bing / Zhang, Ziyang / Wang, Yaoyuan / Li, Minglei / Lou, Yihang / Li, Jiawei / Li, Guoqi et al. | 2023
- 1
-
Rumor Detection Via Assessing the Spreading Propensity of Users | Zheng, Peng / Huang, Zhen / Dou, Yong / Yan, YeQing et al. | 2023
- 1
-
Multi-Layer Feature Division Transferable Adversarial Attack | Jin, Zikang / Yin, Changchun / Li, Piji / Zhou, Lu / Fang, Liming / Chang, Xiangmao / Liu, Zhe et al. | 2023
- 1
-
Hybridformer: Improving Squeezeformer with Hybrid Attention and NSR Mechanism | Yang, Yuguang / Pan, Yu / Yin, Jingjing / Han, Jiangyu / Ma, Lei / Lu, Heng et al. | 2023
- 1
-
FedAudio: A Federated Learning Benchmark for Audio Tasks | Zhang, Tuo / Feng, Tiantian / Alam, Samiul / Lee, Sunwoo / Zhang, Mi / Narayanan, Shrikanth S. / Avestimehr, Salman et al. | 2023
- 1
-
Signal Analysis-Synthesis Using the Quantum Fourier Transform | Sharma, Aradhita / Uehara, Glen / Narayanaswamy, Vivek / Miller, Leslie / Spanias, Andreas et al. | 2023
- 1
-
Improved Wordpcfg for Passwords with Maximum Probability Segmentation | Li, Wenting / Yang, Jiahong / Cheng, Haibo / Wang, Ping / Liang, Kaitai et al. | 2023
- 1
-
Uncer2Natural: Uncertainty-Aware Unsupervised Image Denoising | Huang, Chenyu / Tan, Weimin / Shi, Jiaxing / Xing, Zhen / Yan, Bo et al. | 2023
- 1
-
A Lightweight Convolutional Neural Network using Feature Filtering Module | Jing, Nan / Zhang, Yu et al. | 2023
- 1
-
Simultaneous Estimation of Direction of Arrival and Sound Speed Using a Non-Uniform Sensor Array | Nishimura, Ryouichi / Takizawa, Kenichi et al. | 2023
- 1
-
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses | Ai, Yang / Ling, Zhen-Hua et al. | 2023
- 1
-
Boosting Transferability of Adversarial Example via an Enhanced Euler’s Method | Peng, Anjie / Lin, Zhi / Zeng, Hui / Yu, Wenxin / Kang, Xiangui et al. | 2023
- 1
-
Motion Matters: A Novel Motion Modeling for Cross-View Gait Feature Learning | Li, Jingqi / Gao, Jiaqi / Zhang, Yuzhen / Shan, Hongming / Zhang, Junping et al. | 2023
- 1
-
Disambiguation of Cognitive Impairment Diagnosis with EEG-Based Dual-Contrastive Learning | Song, Zhenxi / Pei, Zian / Ren, Huixia / Zhu, Lin / Guo, Yi / Zhang, Zhiguo et al. | 2023
- 1
-
Multi-View Graph Regularized Deep Autoencoder-Like NMF Framework | Zhao, Liang / Wang, Zihao / Wang, Ziyue / Chen, Zhikui et al. | 2023
- 1
-
Jazznet: A Dataset of Fundamental Piano Patterns for Music Audio Machine Learning Research | Adegbija, Tosiron et al. | 2023
- 1
-
Robust Self-Guided Deep Image Prior | Bell, Evan / Liang, Shijun / Qu, Qing / Ravishankar, Saiprasad et al. | 2023
- 1
-
Vehicle View Synthesis by Generative Adversarial Network | Hu, Chan-Shuo / Tseng, Sung-Wei / Fan, Xin-Yun / Chiang, Chen-Kuo et al. | 2023
- 1
-
Refined Pseudo Labeling for Source-Free Domain Adaptive Object Detection | Zhang, Siqi / Zhang, Lu / Liu, Zhiyong et al. | 2023
- 1
-
MossFormer: Pushing the Performance Limit of Monaural Speech Separation Using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions | Zhao, Shengkui / Ma, Bin et al. | 2023
- 1
-
Semantic-Preserving Augmentation for Robust Image-Text Retrieval | Kim, Sunwoo / Shim, Kyuhong / Nguyen, Luong Trung / Shim, Byonghyo et al. | 2023
- 1
-
Choice Fusion As Knowledge For Zero-Shot Dialogue State Tracking | Su, Ruolin / Yang, Jingfeng / Wu, Ting-Wei / Juang, Biing-Hwang et al. | 2023
- 1
-
Introducing Topography in Convolutional Neural Networks | Poli, Maxime / Dupoux, Emmanuel / Riad, Rachid et al. | 2023
- 1
-
Dynamic Multi-View Scene Reconstruction Using Neural Implicit Surface | Chen, Decai / Lu, Haofei / Feldmann, Ingo / Schreer, Oliver / Eisert, Peter et al. | 2023
- 1
-
Adaptive Semantic Fusion Framework for Unsupervised Monocular Depth Estimation | Li, Ruoqi / Yu, Huimin / Du, Kaiyang / Xiao, Zhuoling / Yan, Bo / Yuan, Zhengxi et al. | 2023
- 1
-
Speaker Diaphragm Excursion Prediction: Deep Attention and Online Adaptation | Ren, Yuwei / Zivney, Matt / Huang, Yin / Choy, Eddie / Patel, Chirag / Xu, Hao et al. | 2023
- 1
-
Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition | Yang, Yuhang / Xu, Haihua / Huang, Hao / Chng, Eng Siong / Li, Sheng et al. | 2023
- 1
-
Joint Data Association, NLOS Mitigation, and Clutter Suppression for Networked Device-Free Sensing in 6G Cellular Network | Shi, Qin / Liu, Liang / Zhang, Shuowen et al. | 2023
- 1
-
Attention Mixup: An Accurate Mixup Scheme Based On Interpretable Attention Mechanism for Multi-Label Audio Classification | Liu, Wuyang / Ren, Yanzhen / Wang, Jingru et al. | 2023
- 1
-
Receptive Field Reliant Zero-Cost Proxies for Neural Architecture Search | Keserwani, Prateek / Miriyala, Srinivas Soumitri / Rajendiran, Vikram N. / Shivamurthappa, Pradeep N. et al. | 2023
- 1
-
Full-Band General Audio Synthesis with Score-Based Diffusion | Pascual, Santiago / Bhattacharya, Gautam / Yeh, Chunghsin / Pons, Jordi / Serra, Joan et al. | 2023
- 1
-
Cross-Lingual Transfer Learning for Alzheimer’s Detection from Spontaneous Speech | Tamm, Bastiaan / Vandenberghe, Rik / Van Hamme, Hugo et al. | 2023
- 1
-
Compressive Channel Estimation for IRS-Aided Millimeter-Wave Systems via Two-Stage Lamp Network | Tsai, Wen-Chiao / Chen, Chi-Wei / Wu, An-Yeu Andy et al. | 2023
- 1
-
RDO Candidate Selection for Maximizing Coding Efficiency in a Practical HEVC Encoder | Sainio, Joose / Mercat, Alexandre / Vanne, Jarno et al. | 2023
- 1
-
Dynamic Selection of p-norm in Linear Adaptive Filtering via online Kernel-based Reinforcement Learning | Vu, Minh / Akiyama, Yuki / Slavakis, Konstantinos et al. | 2023
- 1
-
Order Reduction of Multi-Channel FIR Filters by Balanced Truncation | Hilgemann, Florian / Jax, Peter et al. | 2023
- 1
-
HDNet: Hierarchical Dynamic Network for Gait Recognition using Millimeter-wave radar | Huang, Yanyan / Wang, Yong / Shi, Kun / Gu, Chaojie / Fu, Yu / Zhuo, Cheng / Shi, Zhiguo et al. | 2023
- 1
-
Gct: Gated Contextual Transformer for Sequential Audio Tagging | Hou, Yuanbo / Wang, Yun / Wang, Wenwu / Botteldooren, Dick et al. | 2023
- 1
-
Joint Channel and Direction Estimation for Ground-to-UAV Communications Enabled by a Simultaneous Reflecting and Sensing RIS | He, Jiguang / Fakhreddine, Aymen / Alexandropoulos, George C. et al. | 2023
- 1
-
Transaudio: Towards the Transferable Adversarial Audio Attack Via Learning Contextualized Perturbations | Qi, Gege / Chen, Yuefeng / Zhu, Yao / Hui, Binyuan / Li, Xiaodan / Mao, Xiaofeng / Zhang, Rong / Xue, Hui et al. | 2023
- 1
-
SyncNet: Correlating Objective for Time Delay Estimation in Audio Signals | Raina, Akshay / Arora, Vipul et al. | 2023
- 1
-
Robust Data-Driven Accelerated Mirror Descent | Tan, Hong Ye / Mukherjee, Subhadip / Tang, Junqi / Hauptmann, Andreas / Schonlieb, Carola-Bibiane et al. | 2023
- 1
-
Speech Separation with Large-Scale Self-Supervised Learning | Chen, Zhuo / Kanda, Naoyuki / Wu, Jian / Wu, Yu / Wang, Xiaofei / Yoshioka, Takuya / Li, Jinyu / Sivasankaran, Sunit / Eskimez, Sefik Emre et al. | 2023
- 1
-
Robust Audio-Visual ASR with Unified Cross-Modal Attention | Li, Jiahong / Li, Chenda / Wu, Yifei / Qian, Yanmin et al. | 2023
- 1
-
Multi-Object Localization and Irrelevant-Semantic Separation for Nuclei Segmentation in Histopathology Images | Tang, Ya / Ye, Xiongjun / Li, Xuanya / Chen, Zhineng et al. | 2023
- 1
-
Unlimited Sampling of FRI Signals Independent of Sampling Rate | Guo, Ruiming / Bhandari, Ayush et al. | 2023
- 1
-
Pyramid Dynamic Inference: Encouraging Faster Inference Via Early Exit Boosting | Banijamali, Ershad / Kharazmi, Pegah / Eghbali, Sepehr / Wang, Jixuan / Chung, Clement / Choudhary, Samridhi et al. | 2023
- 1
-
Self-Supervised Facial Action Unit Detection with Region and Relation Learning | Song, Juan / Liu, Zhilei et al. | 2023
- 1
-
Two-Step Band-Split Neural Network Approach For Full-Band Residual Echo Suppression | Zhang, Zihan / Zhang, Shimin / Liu, Mingshuai / Leng, Yanhong / Han, Zhe / Chen, Li / Xie, Lei et al. | 2023
- 1
-
Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification | Wang, Qiongqiong / Lee, Kong Aik / Liu, Tianchi et al. | 2023
- 1
-
Improving Occluded Human Pose Estimation Via Linked Joints | Ye, Suhang / Hong, Zebo / Zheng, Jiawen / Zhang, Shengchuan et al. | 2023
- 1
-
Improved Belief Propagation Decoding of Turbo Codes | Shen, Yifei / Ren, Yuqing / Kristensen, Andreas Toftegaard / You, Xiaohu / Zhang, Chuan / Burg, Andreas et al. | 2023
- 1
-
LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM | Zhang, Letian / Wang, Jinping / Jie, Lu / Chen, Nanjie / Tan, Xiaojun / Duan, Zhifei et al. | 2023
- 1
-
Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision | Feng, Shuo / Li, Piji et al. | 2023
- 1
-
Spatially Informed Independent vector analysis for Source Extraction based on the convolutive Transfer Function Model | Wang, Xianrui / Brendel, Andreas / Huang, Gongping / Yang, Yichen / Kellermann, Walter / Chen, Jingdong et al. | 2023
- 1
-
Multimodal Knowledge Distillation for Arbitrary-Oriented Object Detection in Aerial Images | Huang, Zhanchao / Li, Wei / Tao, Ran et al. | 2023
- 1
-
Deep Triple-Supervision Learning Unannotated Surgical Endoscopic Video Data for Monocular Dense Depth Estimation | Fan, Wenkang / Zhang, Kaiyun / Shi, Hong / Chen, Jianhua / Chen, Yinran / Luo, Xiongbiao et al. | 2023
- 1
-
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning | Huang, Sung-Feng / Chen, Chia-Ping / Chen, Zhi-Sheng / Tsai, Yu-Pao / Lee, Hung-Yi et al. | 2023
- 1
-
WAVELET2VEC: A Filter Bank Masked Autoencoder for EEG-Based Seizure Subtype Classification | Peng, Ruimin / Zhao, Changming / Xu, Yifan / Jiang, Jun / Kuang, Guangtao / Shao, Jianbo / Wu, Dongrui et al. | 2023
- 1
-
Enhance Transferability of Adversarial Examples with Model Architecture | Fan, Mingyuan / Guo, Wenzhong / Ying, Zuobin / Liu, Ximeng et al. | 2023
- 1
-
Unsupervised Voice Type Discrimination Score Adaptation Using X-Vector Clusters | Lindsey, Mark / Vuong, Tyler / Stern, Richard M. et al. | 2023
- 1
-
An Analysis of Degenerating Speech Due to Progressive Dysarthria on ASR Performance | Tomanek, Katrin / Seaver, Katie / Jiang, Pan-Pan / Cave, Richard / Harrell, Lauren / Green, Jordan R. et al. | 2023
- 1
-
Music Rearrangement Using Hierarchical Segmentation | Plachouras, Christos / Miron, Marius et al. | 2023
- 1
-
Backdoor Defense via Suppressing Model Shortcuts | Yang, Sheng / Li, Yiming / Jiang, Yong / Xia, Shu-Tao et al. | 2023
- 1
-
Learning Dynamic Graphs under Partial Observability | Cirillo, Michele / Matta, Vincenzo / Sayed, Ali H. et al. | 2023
- 1
-
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input | Niizumi, Daisuke / Takeuchi, Daiki / Ohishi, Yasunori / Harada, Noboru / Kashino, Kunio et al. | 2023
- 1
-
Asynchronous Federated Learning for Real-Time Multiple Licence Plate Recognition Through Semantic Communication | Xie, Renyou / Li, Chaojie / Zhou, Xiaojun / Dong, Zhaoyang et al. | 2023
- 1
-
Sparse Non-Contact Multiple People Localization and Vital Signs Monitoring Via FMCW Radar | Eder, Yonathan / Liu, Zhuoyang / Eldar, Yonina C. et al. | 2023
- 1
-
Enhancing Ontology Translation Through Cross-Lingual Agreement | Tian, Mingjie / Giunchiglia, Fausto / Song, Rui / Chen, Xing / Xu, Hao et al. | 2023
- 1
-
Improving Image Captioning with Control Signal of Sentence Quality | Zhu, Zhangzi / Wang, Shuai / Qu, Hong et al. | 2023
- 1
-
DyLiteRADHAR: Dynamic Lightweight Slowfast Network for Human Activity Recognition Using MMWAVE Radar | Sheng, Biyun / Bao, Yan / Xiao, Fu / Gui, Linqing et al. | 2023
- 1
-
Native Multi-Band Audio Coding Within Hyper-Autoencoded Reconstruction Propagation Networks | Petermann, Darius / Jang, Inseon / Kim, Minje et al. | 2023
- 1
-
Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers | Yang, Zijian / Zhou, Wei / Schluter, Ralf / Ney, Hermann et al. | 2023
- 1
-
Explanations for Automatic Speech Recognition | Wu, Xiaoliang / Bell, Peter / Rajan, Ajitha et al. | 2023
- 1
-
Step restriction for improving adversarial attacks | Goto, Keita / Otake, Shinta / Kawakami, Rei / Inoue, Nakamasa et al. | 2023
- 1
-
Multi-Stream Facial Adaptive Network for Expression Recognition from a Single Image | Zhang, Baichuan / Meng, Fanyang / Ding, Runwei / Liu, Mengyuan et al. | 2023
- 1
-
Brainnetformer: Decoding Brain Cognitive States with Spatial-Temporal Cross Attention | Sheng, Leheng / Wang, Wenhan / Shi, Zhiyi / Zhan, Jichao / Kong, Youyong et al. | 2023
- 1
-
Passive Detection of Rank-One Gaussian Signals for Known Channel Subspaces and Arbitrary Noise | Ramirez, D. / Santamaria, I. / Scharf, L. L. et al. | 2023
- 1
-
Next-Speaker Prediction Based on Non-Verbal Information in Multi-Party Video Conversation | Mizuno, Saki / Hojo, Nobukatsu / Kobashikawa, Satoshi / Masumura, Ryo et al. | 2023
- 1
-
Improved Indoor Localization With NLOS Signal Propagations | Huang, Wei / Zhao, Yixin / Wu, Xuechao / Yin, Le et al. | 2023
- 1
-
Self-Supervised Learning of Audio Representations using Angular Contrastive Loss | Wang, Shanshan / Tripathy, Soumya / Mesaros, Annamaria et al. | 2023
- 1
-
Data2vec-Aqc: Search for the Right Teaching Assistant in the Teacher-Student Training Setup | Lodagala, Vasista Sai / Ghosh, Sreyan / Umesh, S. et al. | 2023
- 1
-
Estimation of Time-Varying Graph Topologies from Graph Signals | Liu, Yuhao / Cui, Chen / Ajirak, Marzieh / Djuric, Petar M. et al. | 2023
- 1
-
Extreme Audio Time Stretching Using Neural Synthesis | Fierro, Leonardo / Wright, Alec / Valimaki, Vesa / Hamalainen, Matti et al. | 2023
- 1
-
Data-Aware Zero-Shot Neural Architecture Search for Image Recognition | Fan, Yi / Niu, Zhong-Han / Yang, Yu-Bin et al. | 2023
- 1
-
ITER-SIS: Robust Unlimited Sampling Via Iterative Signal Sieving | Guo, Ruiming / Bhandari, Ayush et al. | 2023
- 1
-
Transformer-Based Deep Hashing Method for Multi-Scale Feature Fusion | He, Chao / Wei, Hongxi et al. | 2023
- 1
-
Real-Time MRI Video Synthesis from Time Aligned Phonemes with Sequence-to-Sequence Networks | Udupa, Sathvik / Ghosh, Prasanta Kumar et al. | 2023
- 1
-
AutoGCF: Personalized Aggregation on Neural Graph Collaborative Filtering | You, Xiaoyu / Li, Chi / Xu, Jianwei / Zhang, Mi et al. | 2023
- 1
-
Learnt Mutual Feature Compression for Machine Vision | Liu, Tie / Xu, Mai / Li, Shengxi / Chen, Chaoran / Yang, Li / Lv, Zhuoyi et al. | 2023
- 1
-
Robust multi-modal speech emotion recognition with ASR error adaptation | Lin, Binghuai / Wang, Liyuan et al. | 2023
- 1
-
Non-Convex Approaches for Low-Rank Tensor Completion under Tubal Sampling | Tan, Zheng / Huang, Longxiu / Cai, HanQin / Lou, Yifei et al. | 2023
- 1
-
Vision, Deduction and Alignment: An Empirical Study on Multi-Modal Knowledge Graph Alignment | Li, Yangning / Chen, Jiaoyan / Li, Yinghui / Xiang, Yuejia / Chen, Xi / Zheng, Hai-Tao et al. | 2023
- 1
-
Towards Improved Sonar Performance Using Environment-Informed Sparse Sub-Array Processing | L'Her, Alexandre / Dremeau, Angelique / Courtois, Florent Le / Real, Gaultier / Cristol, Xavier / Stephan, Yann et al. | 2023
- 1
-
Zone Plate Virtual Lenses for Memory-Constrained NLOS Imaging | Luesia-Lahoz, Pablo / Gutierrez, Diego / Munoz, Adolfo et al. | 2023
- 1
-
Robust Subspace Tracking with Contamination Mitigation via α-Divergence | Thanh, Le Trung / Rekavandi, Aref Miri / Seghouane, Abd-Krim / Abed-Meraim, Karim et al. | 2023
- 1
-
Using Modified Adult Speech as Data Augmentation for Child Speech Recognition | Fan, Zijian / Cao, Xinwei / Salvi, Giampiero / Svendsen, Torbjorn et al. | 2023
- 1
-
Graph-Graph Context Dependency Attention for Graph Edit Distance | Jia, Ruiqi / Feng, Xianbing / Lyu, Xiaoqing / Tang, Zhi et al. | 2023
- 1
-
Stuart: Individualized Classroom Observation of Students with Automatic Behavior Recognition And Tracking | Zhou, Huayi / Jiang, Fei / Si, Jiaxin / Xiong, Lili / Lu, Hongtao et al. | 2023
- 1
-
Benchmarking Convolutional Neural Network Inference on Low-Power Edge Devices | Ferraz, Oscar / Araujo, Helder / Silva, Vitor / Falcao, Gabriel et al. | 2023
- 1
-
Disentangling the Horowitz Factor: Learning Content and Style From Expressive Piano Performance | Zhang, Huan / Dixon, Simon et al. | 2023
- 1
-
Compressive Sensing with Tensorized Autoencoder | Hyder, Rakib / Asif, M. Salman et al. | 2023
- 1
-
The Uniqueness Problem of Physical Law Learning | Scholl, Philipp / Bacho, Aras / Boche, Holger / Kutyniok, Gitta et al. | 2023
- 1
-
ASSD: Synthetic Speech Detection in the AAC Compressed Domain | Singh Yadav, Amit Kumar / Xiang, Ziyue / Bartusiak, Emily R. / Bestagini, Paolo / Tubaro, Stefano / Delp, Edward J. et al. | 2023
- 1
-
Bias Reduced Semidefinite Relaxation Method for Multistatic Localization in the Absence of Transmitter Position And Its Synchronization | Pei, Jian / Wang, Gang / Ho, K. C. / Huang, Lei et al. | 2023
- 1
-
A Radar-Jammer Zero-Sum Repeated Bayesian Game | Suvorova, Sofia / Pezeshki, Ali / Kyprianou, Ross / Moran, Bill et al. | 2023
- 1
-
Inverse Quadratic Transform for Minimizing A Sum of Ratios | Chen, Yannan / Zhao, Licheng / Zhang, Yaowen / Shen, Kaiming et al. | 2023
- 1
-
Dynamic Vehicle Graph Interaction for Trajectory Prediction Based on Video Signals | Chen, Jian / Wang, Wei / Chen, Junxin / Cai, Ming et al. | 2023
- 1
-
SENER: Sentiment Element Named Entity Recognition for Aspect-Based Sentiment Analysis | Lee, Sun-Kyung / Kim, Jong-Hwan et al. | 2023