Test Your Samples Jointly: Pseudo-Reference for Image Quality Evaluation (English)
- Tworski, Marcelin
- Lathuiliere, Stephane
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 1-5; 2023
- Conference paper / Electronic Resource
- Title: Test Your Samples Jointly: Pseudo-Reference for Image Quality Evaluation
- Contributors: Tworski, Marcelin (author) / Lathuiliere, Stephane (author)
- Publisher: IEEE
- Publication date: 2023-06-04
- Size: 1116609 byte
- Type of media: Conference paper
- Type of material: Electronic Resource
- Language: English
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- Learning ASR Pathways: A Sparse Multilingual ASR Model | Yang, Mu / Tjandra, Andros / Liu, Chunxi / Zhang, David / Le, Duc / Kalinli, Ozlem et al. | 2023
- Real-Time Target Sound Extraction | Veluri, Bandhav / Chan, Justin / Itani, Malek / Chen, Tuochao / Yoshioka, Takuya / Gollakota, Shyamnath et al. | 2023
- Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations | Wei, Jie / Hu, Guanyu / Tuan, Luu Anh / Yang, Xinyu / Zhu, Wenjing et al. | 2023
- Twitter Stance Detection via Neural Production Systems | Zhang, Bowen / Ding, Daijun / Xu, Guangning / Guo, Jinjin / Huang, Zhichao / Huang, Xu et al. | 2023
- Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation | Bhandari, Neel / Chen, Pin-Yu et al. | 2023
- LDTSF: A Label-Decoupling Teacher-Student Framework for Semi-Supervised Echocardiography Segmentation | Zhang, Jiapeng / Wang, Yongxiong / Pan, Zhiqun / Tang, Zhenhui / Chen, Lijun / Liu, Jinlong et al. | 2023
- SLBERT: A Novel Pre-Training Framework for Joint Speech and Language Modeling | Susladkar, Onkar / Gatti, Prajwal / Kumar Yadav, Santosh et al. | 2023
- Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition | Ogawa, Atsunori / Moriya, Takafumi / Kamo, Naoyuki / Tawara, Naohiro / Delcroix, Marc et al. | 2023
- Seri: Sketching-Reasoning-Integrating Progressive Workflow for Empathetic Response Generation | Bi, Guanqun / Cao, Yanan / Li, Piji / Xie, Yuqiang / Fang, Fang / Lin, Zheng et al. | 2023
- Vitasd: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis | Cao, Xu / Ye, Wenqian / Sizikova, Elena / Bai, Xue / Coffee, Megan / Zeng, Hongwu / Cao, Jianguo et al. | 2023
- The Role of Initial Entanglement in Adaptive Gibbs State Preparation on Quantum Computers | Economou, Sophia E. / Warren, Ada / Barnes, Edwin et al. | 2023
- Multilevel FISTA for Image Restoration | Lauga, Guillaume / Riccietti, Elisa / Pustelnik, Nelly / Goncalves, Paulo et al. | 2023
- JPEG Pleno Call for Proposals Responses Quality Assessment | Prazeres, Joao / Luo, Zhe / Pinheiro, Antonio M. G. / da Silva Cruz, Luis A. / Perry, Stuart et al. | 2023
- Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism | Li, Dichucheng / Che, Mingjin / Meng, Wenwu / Wu, Yulun / Yu, Yi / Xia, Fan / Li, Wei et al. | 2023
- WITT: A Wireless Image Transmission Transformer for Semantic Communications | Yang, Ke / Wang, Sixian / Dai, Jincheng / Tan, Kailin / Niu, Kai / Zhang, Ping et al. | 2023
- Kernel Estimation and Deconvolution for Blind Image Super-Resolution | Gong, Jiali / Gao, Hongfan / Chao, Jiahao / Zhou, Zhou / Yang, Zhengfeng / Zeng, Zhenbing et al. | 2023
- Learned Video Coding with Motion Compensation Mixture Model | Dinh, Khanh Quoc / Pyo Choi, Kwang et al. | 2023
- Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation | Chen, Qi / Ma, Ziyang / Liu, Tao / Tan, Xu / Lu, Qu / Yu, Kai / Chen, Xie et al. | 2023
- A Synthetic Corpus Generation Method for Neural Vocoder Training | Wang, Zilin / Liu, Peng / Chen, Jun / Li, Sipan / Bai, Jinfeng / He, Gang / Wu, Zhiyong / Meng, Helen et al. | 2023
- HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones | Shashaank, N / Banar, Berker / Izadi, Mohammad Rasool / Kemmerer, Jeremy / Zhang, Shuo / Huang, Chuan-Che Jeff et al. | 2023
- Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition | Fu, Xuandi / Sathyendra, Kanthashree Mysore / Gandhe, Ankur / Liu, Jing / Strimel, Grant P. / McGowan, Ross / Mouchtaris, Athanasios et al. | 2023
- Multi-Task Bias-Variance Trade-Off Through Functional Constraints | Cervino, Juan / Bazerque, Juan Andres / Calvo-Fullana, Miguel / Ribeiro, Alejandro et al. | 2023
- Towards a More Stable and General Subgraph Information Bottleneck | Liu, Hongzhi / Zheng, Kaizhong / Yu, Shujian / Chen, Badong et al. | 2023
- Unsupervised Domain Adaptation via Subspace Interpolating Deep Dictionary Learning: A Case Study in Machine Inspection | Kumar, Kriti / Majumdar, Angshul / Kumar, A Anil / Girish Chandra, M et al. | 2023
- Adaptive Filtering Algorithms For Set-Valued Observations-Symmetric Measurement Approach To Unlabeled And Anonymized Data | Krishnamurthy, Vikram et al. | 2023
- Classification of Synthetic Facial Attributes by Means of Hybrid Classification/Localization Patch-Based Analysis | Wang, Jun / Tondi, Benedetta / Barni, Mauro et al. | 2023
- A Point is A Wave: Point-Wave Network for Place Recognition | Li, Ge / Zhang, Ruonan et al. | 2023
- Robust and Globally Sparse PCA via Majorization-Minimization and Variable Splitting | Brehier, Hugo / Breloy, Arnaud / El Korso, Mohammed Nabil / Kumar, Sandeep et al. | 2023
- Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes | Xu, Xinzhou / Deng, Jun / Zhang, Zixing / Yang, Zhen / Schuller, Bjorn W. et al. | 2023
- Multi-Task Transformer with Relation-Attention and Type-Attention for Named Entity Recognition | Mo, Ying / Tang, Hongyin / Liu, Jiahao / Wang, Qifan / Xu, Zenglin / Wang, Jingang / Wu, Wei / Li, Zhoujun et al. | 2023
- Self-Supervised Representations in Speech-Based Depression Detection | Wu, Wen / Zhang, Chao / Woodland, Philip C. et al. | 2023
- A Simple Yet Effective Approach to Structured Knowledge Distillation | Lin, Wenye / Li, Yangming / Liu, Lemao / Shi, Shuming / Zheng, Hai-Tao et al. | 2023
- Leveraging Neural Koopman Operators to Learn Continuous Representations of Dynamical Systems from Scarce Data | Frion, Anthony / Drumetz, Lucas / Mura, Mauro Dalla / Tochon, Guillaume / Aissa-El-Bey, Abdeldjalil et al. | 2023
- WUDA: Unsupervised Domain Adaptation Based on Weak Source Domain Labels | Liu, Shengjie / Zhu, Chuang / Li, Yuan / Tang, Wenqi et al. | 2023
- A Memory-Free Evolving Bipolar Neural Network for Efficient Multi-Label Stream Learning | Mishra, Sourav / Sundaram, Suresh et al. | 2023
- Prototype Knowledge Distillation for Medical Segmentation with Missing Modality | Wang, Shuai / Yan, Zipei / Zhang, Daoan / Wei, Haining / Li, Zhongsen / Li, Rui et al. | 2023
- A Novel Efficient Multi-View Traffic-Related Object Detection Framework | Yang, Kun / Liu, Jing / Yang, Dingkang / Wang, Hanqi / Sun, Peng / Zhang, Yanni / Liu, Yan / Song, Liang et al. | 2023
- Learning with Multigraph Convolutional Filters | Butler, Landon / Parada-Mayorga, Alejandro / Ribeiro, Alejandro et al. | 2023
- Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation | Zhang, Jing-Xuan / Wan, Genshun / Ling, Zhen-Hua / Pan, Jia / Gao, Jianqing / Liu, Cong et al. | 2023
- Exploring Wav2vec 2.0 Fine Tuning for Improved Speech Emotion Recognition | Chen, Li-Wei / Rudnicky, Alexander et al. | 2023
- Reducing the Gap Between Streaming and Non-Streaming Transducer-Based ASR by Adaptive Two-Stage Knowledge Distillation | Tang, Haitao / Fu, Yu / Sun, Lei / Xue, Jiabin / Liu, Dan / Li, Yongchao / Ma, Zhiqiang / Wu, Minghui / Pan, Jia / Wan, Genshun et al. | 2023
- Generalized Invariant Matching Property Via Lasso | Du, Kang / Xiang, Yu et al. | 2023
- Efficient Feature Extraction for Non-Maximum Suppression in Visual Person Detection | Symeonidis, Charalampos / Mademlis, Ioannis / Pitas, Ioannis / Nikolaidis, Nikos et al. | 2023
- Visual-Aware Text-to-Speech* | Zhou, Mohan / Bai, Yalong / Zhang, Wei / Yao, Ting / Zhao, Tiejun / Mei, Tao et al. | 2023
- Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples | Ryu, Hyeonggon / Senocak, Arda / So Kweon, In / Son Chung, Joon et al. | 2023
- Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition | Chen, Xie / Ma, Ziyang / Tang, Changli / Wang, Yujin / Zheng, Zhisheng et al. | 2023
- Do Prosody Transfer Models Transfer Prosody? | Sigurgeirsson, Atli Thor / King, Simon et al. | 2023
- Rate Splitting and Precoding Strategies for Multi-User MIMO Broadcast Channels with Common and Private Streams | Khamidullina, Liana / de Almeida, Andre L. F. / Haardt, Martin et al. | 2023
- A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition | Yang, Chao-Han Huck / Li, Bo / Zhang, Yu / Chen, Nanxin / Sainath, Tara N. / Marco Siniscalchi, Sabato / Lee, Chin-Hui et al. | 2023
- Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition | Vander Eeckt, Steven / Van Hamme, Hugo et al. | 2023
- VPPT: Visual Pre-Trained Prompt Tuning Framework for Few-Shot Image Classification | Song, Zhao / Yang, Ke / Guan, Naiyang / Zhu, Junjie / Qiao, Peng / Hu, Qingyong et al. | 2023
- Test Your Samples Jointly: Pseudo-Reference for Image Quality Evaluation | Tworski, Marcelin / Lathuiliere, Stephane et al. | 2023
- Waveform Design to Improve the Estimation of Target Parameters Using the Fourier Transform Method in a MIMO OFDM DFRC System | Bhogavalli, Satwika / Grivel, Eric / Hari, K.V.S. / Corretja, Vincent et al. | 2023
- Modify: Model-Driven Face Stylization Without Style Images | Ding, Yuhe / Liang, Jian / Cao, Jie / Zheng, Aihua / He, Ran et al. | 2023
- TINYCOD: Tiny and Effective Model for Camouflaged Object Detection | Xing, Haozhe / Gao, Shuyong / Tang, Hao / Mok, Tsui Qin / Kang, Yanlan / Zhang, Wenqiang et al. | 2023
- Automatic Segmentation of Nasopharyngeal Carcinoma in CT Images Using Dual Attention and Edge Detection | Wang, Qizhi / Huang, Wei / Zhang, Yuan / Li, Xuanya / Ye, Xiongjun / Hu, Kai et al. | 2023
- Fast and Efficient Speech Enhancement with Variational Autoencoders | Sadeghi, Mostafa / Serizel, Romain et al. | 2023
- Representation of Vocal Tract Length Transformation Based on Group Theory | Miyashita, Atsushi / Toda, Tomoki et al. | 2023
- Sandformer: CNN and Transformer under Gated Fusion for Sand Dust Image Restoration | Shi, Jun / Wei, Bingcai / Zhou, Gang / Zhang, Liye et al. | 2023
- Utility Pole Localization by Learning from Ambient Traces on Distributed Acoustic Sensing | Jiang, Zhuocheng / Tian, Yue / Ding, Yangmin / Ozharar, Sarper / Wang, Ting et al. | 2023
- Multi-User Methods for Vibrational Radar Backscatter Communications | Centers, Jessica / Krolik, Jeffrey et al. | 2023
- Target Sound Extraction with Variable Cross-Modality Clues | Li, Chenda / Qian, Yao / Chen, Zhuo / Wang, Dongmei / Yoshioka, Takuya / Liu, Shujie / Qian, Yanmin / Zeng, Michael et al. | 2023
- Model-Free Learning of Optimal Beamformers for Passive IRS-Assisted Sumrate Maximization | Hashmi, Hassaan / Pougkakiotis, Spyridon / Kalogerias, Dionysios S. et al. | 2023
- Strategies for Enhanced Signal Modulation Classifications Under Unknown Symbol Rates and Noise Conditions | Wang, Ruixuan / Qi, Yue / Vaezi, Mojtaba / Jiao, Xun / Amin, Moeness et al. | 2023
- Target Velocity Estimation for Quantization-Based Cooperative MIMO Radar and Communications System | Wang, Zhen / Yan, Xuedan / He, Qian / Blum, Rick S. et al. | 2023
- Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker Audio | Thienpondt, Jenthe / Madhu, Nilesh / Demuynck, Kris et al. | 2023
- Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure Priors | Zhang, Yaqi / Lu, Yan / Liu, Bin / Zhao, Zhiwei / Chu, Qi / Yu, Nenghai et al. | 2023
- Subspace-Based Detector For Distributed mmWave MIMO Radar Sensors | Ahmadi, Moein / Alaee-Kerahroodi, Mohammad / M. R., Bhavani Shankar / Ottersten, Bjorn et al. | 2023
- A Unitary Transform Based Generalized Approximate Message Passing | Zhu, Jiang / Meng, Xiangming / Lei, Xupeng / Guo, Qinghua et al. | 2023
- Adaptive Data Augmentation for Contrastive Learning | Zhang, Yuhan / Zhu, He / Yu, Shan et al. | 2023
- E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model | Huang, W. Ronny / Chang, Shuo-Yiin / Sainath, Tara N. / He, Yanzhang / Rybach, David / David, Robert / Prabhavalkar, Rohit / Allauzen, Cyril / Peyser, Cal / Strohman, Trevor D. et al. | 2023
- Binary Sequence Set Optimization for CDMA Applications via Mixed-Integer Quadratic Programming | Yang, Alan / Mina, Tara / Gao, Grace et al. | 2023
- Blind Polynomial Regression | Natali, Alberto / Leus, Geert et al. | 2023
- ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance Measurement | Li, Chaojian / Chen, Wenwan / Yuan, Jiayi / Lin, Yingyan Celine / Sabharwal, Ashutosh et al. | 2023
- Statistical Analysis of Speech Disorder Specific Features to Characterise Dysarthria Severity Level | Joshy, Amlu Anna / Parameswaran, P. N. / Nair, Siddharth R. / Rajan, Rajeev et al. | 2023
- Generalized Relative Harmonic Coefficients | Hu, Yonggang / Gannot, Sharon / Abhayapala, Thushara D. et al. | 2023
- Perceptual–Neural–Physical Sound Matching | Han, Han / Lostanlen, Vincent / Lagrange, Mathieu et al. | 2023
- Improved Training Of Mixture-Of-Experts Language GANs | Chai, Yekun / Yin, Qiyue / Zhang, Junge et al. | 2023
- Spatial-Domain Object Detection Under MIMO-FMCW Automotive Radar Interference | Jin, Sian / Wang, Pu / Boufounos, Petros / Takahashi, Ryuhei / Roy, Sumit et al. | 2023
- I See What You Hear: A Vision-Inspired Method to Localize Words | Samragh, Mohammad / Kundu, Arnav / Hu, Ting-Yao / Chadha, Aman / Srivastava, Ashish / Cho, Minsik / Tuzel, Oncel / Naik, Devang et al. | 2023
- Lightweight Fisher Vector Transfer Learning for Video Deduplication | Henry, Chris / Liao, Rijun / Lin, Ruiyuan / Zhang, Zhebin / Sun, Hongyu / Li, Zhu et al. | 2023
- Difference Coarrays of Rational Arrays | Kulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- SIGVIC: Spatial Importance Guided Variable-Rate Image Compression | Liang, Jiaming / Liu, Meiqin / Yao, Chao / Lin, Chunyu / Zhao, Yao et al. | 2023
- UCONV-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition | Andrusenko, Andrei / Nasretdinov, Rauf / Romanenko, Aleksei et al. | 2023
- Unsupervised Noise Adaptation Using Data Simulation | Chen, Chen / Hu, Yuchen / Zou, Heqing / Sun, Linhui / Chng, Eng Siong et al. | 2023
- Logo-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition | Ma, Fuyan / Sun, Bin / Li, Shutao et al. | 2023
- Adaptive Time-Scale Modification for Improving Speech Intelligibility Based On Phoneme Clustering For Streaming Services | Jang, Sohee / Kim, Jiye / Kim, Yeon-Ju / Chang, Joon-Hyuk et al. | 2023
- Learning to Reconnect Interrupted Trajectories for Weakly Supervised Multi-Object Tracking | Li, Yu-Lei / Lu, Yang / Li, Jie / Wang, Hanzi et al. | 2023
- Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR | Botros, Rami / Prabhavalkar, Rohit / Schalkwyk, Johan / Chelba, Ciprian / Sainath, Tara N. / Beaufays, Francoise et al. | 2023
- Deepspace: Dynamic Spatial and Source Cue Based Source Separation for Dialog Enhancement | Master, Aaron / Lu, Lie / Samuelsson, Jonas / Lehtonen, Heidi-Maria / Norcross, Scott / Swedlow, Nathan / Howard, Audrey et al. | 2023
- Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution Detection | Chen, Xiongjie / Li, Yunpeng / Yang, Yongxin et al. | 2023
- Cross-Lingual Alzheimer’s Disease Detection Based on Paralinguistic and Pre-Trained Features | Chen, Xuchu / Pu, Yu / Li, Jinpeng / Zhang, Wei-Qiang et al. | 2023
- Multi-Carrier Wideband OCDM-Based THz Automotive Radar | Bhattacharjee, Sangeeta / Mishra, Kumar Vijay / Annavajjala, Ramesh / Murthy, Chandra R. et al. | 2023
- Low Precision Representations for High Dimensional Models | Saha, Rajarshi / Pilanci, Mert / Goldsmith, Andrea J. et al. | 2023
- Hypernetwork-Based Adaptive Image Restoration | Aharon, Shai / Ben-Artzi, Gil et al. | 2023
- Your Camera Improves Your Point Cloud Compression | Lin, Yuhuan / Xu, Tongda / Zhu, Ziyu / Li, Yanghao / Wang, Zhe / Wang, Yan et al. | 2023
- Pseudo-Query Generation For Semi-Supervised Visual Grounding With Knowledge Distillation | Jin, Jianglin / Ye, Jiabo / Lin, Xin / He, Liang et al. | 2023
- 2DSBG: A 2D Semi Bi-Gaussian Filter Adapted for Adjacent and Multi-Scale Line Feature Detection | Magnier, Baptiste / Shokouh, Ghulam Sakhi / Berthier, Louis / Pie, Marcel / Ruggiero, Adrien et al. | 2023
- Estimation of High-Dimensional Differential Graphs from Multi-Attribute Data | Tugnait, Jitendra K. et al. | 2023
- Joint Unsupervised and Supervised Learning for Context-Aware Language Identification | Park, Jinseok / Kim, Hyung Yong / Park, Jihwan / Kim, Byeong-Yeol / Choi, Shukjae / Lim, Yunkyu et al. | 2023
- Improving Transformer-Based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads | Jeoung, Ye-Rin / Yang, Joon-Young / Choi, Jeong-Hwan / Chang, Joon-Hyuk et al. | 2023
- On the Value of Stochastic Side Information in Online Learning | Jia, Junzhang / Wu, Xuetong / Evans, Jamie / Zhu, Jingge et al. | 2023
- Learning Task-Aligned Mask Query for Instance Segmentation | Fu, Bin / He, Hongliang / Wei, Pengxu / Chen, Jie et al. | 2023
- On The Primal and Dual Formulations Of The Discrete Mumford-Shah Functional | Pustelnik, Nelly et al. | 2023
- Robust Angle Estimation for Hybrid mmWave Systems | Lin, Yuan-Pei / Yang, Ting-Ming et al. | 2023
- On The Fairness of Multitask Representation Learning | Li, Yingcong / Oymak, Samet et al. | 2023
- VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation | Liu, Yuhao / Gong, Cheng / Wang, Longbiao / Wu, Xixin / Liu, Qiuyu / Dang, Jianwu et al. | 2023
- Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language | Soky, Kak / Li, Sheng / Chu, Chenhui / Kawahara, Tatsuya et al. | 2023
- Pop2Piano: Pop Audio-Based Piano Cover Generation | Choi, Jongho / Lee, Kyogu et al. | 2023
- Multi-Lingual Pronunciation Assessment with Unified Phoneme Set and Language-Specific Embeddings | Lin, Binghuai / Wang, Liyuan et al. | 2023
- Interpolation Filter Model For Ramanujan Subspace Signals | Kulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- Online Binaural Speech Separation Of Moving Speakers With A Wavesplit Network | Han, Cong / Mesgarani, Nima et al. | 2023
- A Hybrid Deep Neural Network for Nonlinear Causality Analysis in Complex Industrial Control System | Feng, Tian / Chen, Qiming / Shi, Yao / Lang, Xun / Xie, Lei / Su, Hongye et al. | 2023
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing | Webber, Jacob J / Valentini-Botinhao, Cassia / Williams, Evelyn / Henter, Gustav Eje / King, Simon et al. | 2023
- Self-Sufficient Framework for Continuous Sign Language Recognition | Jang, Youngjoon / Oh, Youngtaek / Cho, Jae Won / Kim, Myungchul / Kim, Dong-Jin / Kweon, In So / Son Chung, Joon et al. | 2023
- Signal Processing On Product Spaces | Roddenberry, T. Mitchell / Grande, Vincent P. / Frantzen, Florian / Schaub, Michael T. / Segarra, Santiago et al. | 2023
- On the Effectiveness of Monoaural Target Source Extraction for Distant end-to-end Automatic Speech Recognition | Zorila, Catalin / Doddipatla, Rama et al. | 2023
- MAID: A Conditional Diffusion Model for Long Music Audio Inpainting | Liu, Kaiyang / Gan, Wendong / Yuan, Chenchen et al. | 2023
- Semi-Federated Learning for Edge Intelligence with Imperfect SIC | Ni, Wanli / Zheng, Jingheng / Eldar, Yonina C. / You, Changsheng / Huang, Kaibin et al. | 2023
- Dual Collaborative Visual-Semantic Mapping for Multi-Label Zero-Shot Image Recognition | Hu, Yunqing / Jin, Xuan / Chen, Xi / Zhang, Yin et al. | 2023
- Topological Slepians: Maximally Localized Representations of Signals Over Simplicial Complexes | Battiloro, Claudio / Di Lorenzo, Paolo / Barbarossa, Sergio et al. | 2023
- Efficient Feature Fusion for Learning-Based Photometric Stereo | Ju, Yakun / Lam, Kin-Man / Xiao, Jun / Zhang, Cong / Yang, Cuixin / Dong, Junyu et al. | 2023
- Improving Scheduled Sampling for Neural Transducer-Based ASR | Moriya, Takafumi / Ashihara, Takanori / Sato, Hiroshi / Matsuura, Kohei / Tanaka, Tomohiro / Masumura, Ryo et al. | 2023
- Unobtrusive Respiratory Monitoring System for Intensive Care | Tan, Xudong / Hu, Menghan / Zhai, Guangtao / Zhu, Yan / Li, Wenfang / Zhang, XiaoPing et al. | 2023
- Integrating the Sensing and Radio Communications Channel Modelling From Radar Mutual Interference | Cardona, Narcis / Romero, J. Samuel / Yang, Wenfei / Li, Jian et al. | 2023
- TDMA-Based Multi-User Binary Computation Offloading in the Finite-Block-Length Regime | Manouchehrpour, M. Amin / Lehal, Harvinder / Salmani, Mahsa / Davidson, Timothy N. et al. | 2023
- Multispectral Image Fusion based on Super Pixel Segmentation | Ofir, Nati et al. | 2023
- Optimal Transport with a Diversified Memory Bank for Cross-Domain Speaker Verification | Zhang, Ruiteng / Wei, Jianguo / Lu, Xugang / Lu, Wenhuan / Jin, Di / Zhang, Lin / Xu, Junhai et al. | 2023
- Fast Low-Latency Convolution by Low-Rank Tensor Approximation | Jalmby, Martin / Elvander, Filip / van Waterschoot, Toon et al. | 2023
- A Controllable Lifestyle Simulator for Use in Deep Reinforcement Learning Algorithms | Braz, Libio Goncalves / Susaiyah, Allmin et al. | 2023
- BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder | Doan, Thien-Phuc / Nguyen-Vu, Long / Jung, Souhwan / Hong, Kihun et al. | 2023
- Study of Manifold Geometry Using Multiscale Non-Negative Kernel Graphs | Hurtado, Carlos / Shekkizhar, Sarath / Ruiz-Hidalgo, Javier / Ortega, Antonio et al. | 2023
- Learning Silhouettes with Group Sparse Autoencoders | Theodosis, Emmanouil / Ba, Demba et al. | 2023
- ScaleMix: Intra- And Inter-Layer Multiscale Feature Combination for Change Detection | Huang, Rui / Zhao, Qingyi / Wang, Ruofei / Liu, Caihua / Gao, Sihua / Zhang, Yuxiang / Fan, Wei et al. | 2023
- Is Multi-Task Learning an Upper Bound for Continual Learning? | Wu, Zihao / Tran, Huy / Pirsiavash, Hamed / Kolouri, Soheil et al. | 2023
- Local Graph-Homomorphic Processing for Privatized Distributed Systems | Rizk, Elsa / Vlaski, Stefan / Sayed, Ali H. et al. | 2023
- MASKED-AP: Attention Pyramid Convolutional Neural Network with Mask for Cervical Cell Classification | Jin, Yu / Liu, Juan / Chen, Hua / Duan, Wensi / Cao, Dehua / Pang, Baochuan et al. | 2023
- Pondering About Task Spatial Misalignment: Classification-Localization Equilibrated Object Detection | Zhang, Yudong / Lu, Wei / Wang, Xu / Wang, Pengkun / Wang, Yang et al. | 2023
- Multiple Access Computation Offloading for the K-User Case | Liu, Xiaomeng / Schaible, Christian / Davidson, Timothy N. et al. | 2023
- Movienet-PS: A Large-Scale Person Search Dataset in the Wild | Qin, Jie / Zheng, Peng / Yan, Yichao / Quan, Rong / Cheng, Xiaogang / Ni, Bingbing et al. | 2023
- Spatial Similarity Guidance for Few-Shot Segmentation | Luo, Xiaoliu / Duan, Zhao / Zhang, Taiping et al. | 2023
- Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN | Yu, Jianwei / Luo, Yi et al. | 2023
- Code-Switching Speech Synthesis Based on Self-Supervised Learning and Domain Adaptive Speaker Encoder | Lin, Yi-Xing / Pai, Cheng-Hsun / Le, Phuong Thi / Prihasto, Bima / Huang, Chien-Ling / Wang, Jia Ching et al. | 2023
- Mixed Sample Augmentation for Online Distillation | Shen, Yiqing / Xu, Liwu / Yang, Yuzhe / Li, Yaqian / Guo, Yandong et al. | 2023
- Meeting Action Item Detection with Regularized Context Modeling | Liu, Jiaqing / Deng, Chong / Zhang, Qinglin / Chen, Qian / Wang, Wen et al. | 2023
- CLMAE: A Liter and Faster Masked Autoencoders | Song, Yiran / Ma, Lizhuang et al. | 2023
- Graph Signal Processing for Narrowband Direction of Arrival Estimation | Li, Disheng / Liu, Wei / Zakharov, Yuriy / Mitchell, Paul D et al. | 2023
- Privacy-Preserving Automatic Speaker Diarization | Teixeira, Francisco / Abad, Alberto / Raj, Bhiksha / Trancoso, Isabel et al. | 2023
- An End-to-End Neural Network for Image-to-Audio Transformation | Chen, Liu / Deisher, Michael / Georges, Munir et al. | 2023
- Joint Multi-Level Feature Network for Lightweight Person Re-Identification | Zhang, Yunzuo / Kang, Weili / Liu, Yameng / Zhu, Pengfei et al. | 2023
- Learning Cross-Modal Audiovisual Representations with Ladder Networks for Emotion Recognition | Goncalves, Lucas / Busso, Carlos et al. | 2023
- Quantized Precoding and RIS-Assisted Modulation for Integrated Sensing and Communications Systems | Prasobh Sankar, R. S. / Prabhakar Chepuri, Sundeep et al. | 2023
- Towards Adversarially Robust Continual Learning | Bai, Tao / Chen, Chen / Lyu, Lingjuan / Zhao, Jun / Wen, Bihan et al. | 2023
- Ultimate Negative Sampling for Contrastive Learning | Guo, Huijie / Shi, Lei et al. | 2023
- A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation | Huang, Wen-Chin / Peloquin, Benjamin / Kao, Justine / Wang, Changhan / Gong, Hongyu / Salesky, Elizabeth / Adi, Yossi / Lee, Ann / Chen, Peng-Jen et al. | 2023
- T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5 | Hsu, Chan-Jan / Chung, Ho-Lam / Lee, Hung-Yi / Tsao, Yu et al. | 2023
- CD-FSOD: A Benchmark For Cross-Domain Few-Shot Object Detection | Xiong, Wuti et al. | 2023
- Elliptical Wishart Distribution: Maximum Likelihood Estimator from Information Geometry | Ayadi, Imen / Bouchard, Florent / Pascal, Frederic et al. | 2023
- Distributed Bayesian Tracking on the Special Euclidean Group Using Lie Algebra Parametric Approximations | Bordin, Claudio J. / de Figueredo, Caio G. / Bruno, Marcelo G. S. et al. | 2023
- Asynchronous Social Learning | Cemri, Mert / Bordignon, Virginia / Kayaalp, Mert / Shumovskaia, Valentina / Sayed, Ali H. et al. | 2023
- Cramér-Rao Bound on Lie Groups with Observations on Lie Groups: Application to SE(2) | Labsir, Samy / Renaux, Alexandre / Vila-Valls, Jordi / Chaumette, Eric et al. | 2023
- D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement | Zhao, Shengkui / Ma, Bin et al. | 2023
- Extended Kalman Filter for Graph Signals in Nonlinear Dynamic Systems | Sagi, Guy / Shlezinger, Nir / Routtenberg, Tirza et al. | 2023
- Perspective Projection-Based 3D CT Reconstruction from Biplanar X-Rays | Kyung, Daeun / Jo, Kyungmin / Choo, Jaegul / Lee, Joonseok / Choi, Edward et al. | 2023
- Tg-Critic: A Timbre-Guided Model For Reference-Independent Singing Evaluation | Sun, Xiaoheng / Gao, Yuejie / Lin, Hanyao / Liu, Huaping et al. | 2023
- Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models | Ashihara, Takanori / Moriya, Takafumi / Matsuura, Kohei / Tanaka, Tomohiro et al. | 2023
- Frequency Bin-Wise Single Channel Speech Presence Probability Estimation Using Multiple DNNs | Tao, Shuai / Reddy, Himavanth / Jensen, Jesper Rindom / Christensen, Mads Grasboll et al. | 2023
- Structural Optimization of Factor Graphs for Symbol Detection via Continuous Clustering and Machine Learning | Rapp, Lukas / Schmid, Luca / Rode, Andrej / Schmalen, Laurent et al. | 2023
- Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement | Yang, Da-Hee / Chang, Joon-Hyuk et al. | 2023
- Egocentric Action Anticipation for Personal Health | Rodin, Ivan / Furnari, Antonino / Mavroeidis, Dimitrios / Farinella, Giovanni Maria et al. | 2023
- Enhanced Low-Resolution LiDAR-Camera Calibration via Depth Interpolation and Supervised Contrastive Learning | Zhang, Zhikang / Yu, Zifan / You, Suya / Rao, Raghuveer / Agarwal, Sanjeev / Ren, Fengbo et al. | 2023
- SCSGNet: Spatial-Correlated and Shape-Guided Network for Breast Mass Segmentation | Li, Qingqiu / Xu, Jilan / Yuan, Runtian / Zhang, Yuejie / Feng, Rui et al. | 2023
- A Progressive Neural Network for Acoustic Echo Cancellation | Chen, Zhuangqi / Xia, Xianjun / Sun, Siyu / Wang, Ziqian / Chen, Cheng / Xie, Guoliang / Zhang, Pingjian / Xiao, Yijian et al. | 2023
- Ensemble Knowledge Distillation of Self-Supervised Speech Models | Huang, Kuan-Po / Feng, Tzu-Hsun / Fu, Yu-Kuan / Hsu, Tsu-Yuan / Yen, Po-Chieh / Tseng, Wei-Cheng / Chang, Kai-Wei / Lee, Hung-Yi et al. | 2023
- On Crowdsourcing-Design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms | Suarez, Angelica S. Z. / Laroche, Clement / Clemmensen, Line H. / Das, Sneha et al. | 2023
- Rate-Distortion Optimization with Alternative References for UGC Video Compression | Xiong, Xin / Pavez, Eduardo / Ortega, Antonio / Adsumilli, Balu et al. | 2023
- Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec | Wu, Yi-Chiao / Gebru, Israel D. / Markovic, Dejan / Richard, Alexander et al. | 2023
- Image Reconstruction without Explicit Priors | Gao, Angela F. / Leong, Oscar / Sun, He / Bouman, Katherine L. et al. | 2023
- Classification via Subspace Learning Machine (SLM): Methodology and Performance Evaluation | Fu, Hongyu / Yang, Yijing / Mishra, Vinod K. / Jay Kuo, C.-C. et al. | 2023
- A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement | Xu, Haitao / Wei, Liangfa / Zhang, Jie / Yang, Jianming / Wang, Yannan / Gao, Tian / Fang, Xin / Dai, Lirong et al. | 2023
- 1
-
Multi-Scale Compositional Constraints for Representation Learning on VideosParaskevopoulos, Georgios / Lavania, Chandrashekhar / Chum, Lovish / Sundaram, Shiva et al. | 2023
- 1
-
Enhanced GM-PHD Filter for Real Time Satellite Multi-Target TrackingAguilar, Camilo / Ortner, Mathias / Zerubia, Josiane et al. | 2023
- 1
-
De’hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech RecognitionNg, Dianwen / Zhang, Ruixi / Yip, Jia Qi / Yang, Zhao / Ni, Jinjie / Zhang, Chong / Ma, Yukun / Ni, Chongjia / Chng, Eng Siong / Ma, Bin et al. | 2023
- 1
-
Weakly- and Semi-Supervised Object LocalizationHuang, Zhen-Tang / Chen, Yan-He / Yeh, Mei-Chen et al. | 2023
- 1
-
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in TorchaudioKumar, Anurag / Tan, Ke / Ni, Zhaoheng / Manocha, Pranay / Zhang, Xiaohui / Henderson, Ethan / Xu, Buye et al. | 2023
- 1
-
Coarse-to-Fine Covid-19 Segmentation via Vision-Language AlignmentShan, Dandan / Li, Zihan / Chen, Wentao / Li, Qingde / Tian, Jie / Hong, Qingqi et al. | 2023
- 1
-
EMC2-Net: Joint Equalization and Modulation Classification Based on Constellation NetworkRyu, Hyun / Choi, Junil et al. | 2023
- 1
-
Ripple Sparse Self-Attention for Monaural Speech EnhancementZhang, Qiquan / Zhu, Hongxu / Song, Qi / Qian, Xinyuan / Ni, Zhaoheng / Li, Haizhou et al. | 2023
- 1
-
A Physically Explainable Framework for Human-Related Anomaly DetectionJiang, Yalong / Li, Huining / Li, Changkang et al. | 2023
- 1
-
Noncoherent Multiuser Grassmannian Constellations for the Mimo Multiple Access ChannelAlvarez-Vizoso, Javier / Cuevas, Diego / Beltran, Carlos / Santamaria, Ignacio / Tucek, Vit / Peters, Gunnar et al. | 2023
- 1
-
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification SystemsCai, Danwei / Cai, Zexin / Li, Ming et al. | 2023
- 1
-
A Compensated Shrinkage Affine Projection Algorithm for Debiased Sparse Adaptive FilteringZhang, Yi / Yamada, Isao et al. | 2023
- 1
-
Cross-Domain Object Classification Via Successive Subspace AlignmentChen, Kecheng / Li, Haoliang / Yan, Hong et al. | 2023
- 1
-
Textless Direct Speech-to-Speech Translation with Discrete Speech RepresentationLi, Xinjian / Jia, Ye / Chiu, Chung-Cheng et al. | 2023
- 1
-
Speaker-Independent Acoustic-to-Articulatory Speech InversionWu, Peter / Chen, Li-Wei / Cho, Cheol Jun / Watanabe, Shinji / Goldstein, Louis / Black, Alan W / Anumanchipalli, Gopala K. et al. | 2023
- 1
-
Single-Photon Image Super-Resolution via Self-Supervised LearningChen, Yiwei / Jiang, Chen / Pan, Yu et al. | 2023
- 1
-
TSPTQ-ViT: Two-Scaled Post-Training Quantization for Vision TransformerTai, Yu-Shan / Lin, Ming-Guang / Wu, An-Yeu Andy et al. | 2023
- 1
-
Sparse Error Correction for Power Network ParametersSenaratne, Dilan / Kim, Jinsub et al. | 2023
- 1
-
An Evaluation Platform to Scope Performance of Synthetic Environments in Autonomous Ground Vehicles SimulationBai, Xiangyu / Jiang, Le / Luo, Yedi / Gupta, Aniket / Kaveti, Pushyami / Singh, Hanumant / Ostadabbas, Sarah et al. | 2023
- 1
-
Quaternion Orthogonal Transformer for Facial Expression Recognition in the WildZhou, Yu / Guo, Liyuan / Jin, Lianghai et al. | 2023
- 1
-
HQP-MVS: High-Quality Plane Priors Assisted Multi-View Stereo for Low-Textured AreasTian, Zefan / Wang, Rongjie / Wang, Zhenyu / Wang, Ronggang et al. | 2023
- 1
-
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning AnalysisSong, Meishu / Triantafyllopoulos, Andreas / Yang, Zijiang / Takeuchi, Hiroki / Nakamura, Toru / Kishi, Akifumi / Ishizawa, Tetsuro / Yoshiuchi, Kazuhiro / Jing, Xin / Karas, Vincent et al. | 2023
- 1
-
ICCRN: Inplace Cepstral Convolutional Recurrent Neural Network for Monaural Speech EnhancementLiu, Jinjiang / Zhang, Xueliang et al. | 2023
- 1
-
CROSSSPEECH: Speaker-Independent Acoustic Representation for Cross-Lingual Speech SynthesisKim, Ji-Hoon / Yang, Hong-Sun / Ju, Yoon-Cheol / Kim, Il-Hwan / Kim, Byeong-Yeol et al. | 2023
- 1
-
Ensemble Prosody Prediction For Expressive Speech SynthesisTeh, Tian Huey / Hu, Vivian / Ram Mohan, Devang S / Hodari, Zack / Wallis, Christopher G. R. / Gomez Ibarrondo, Tomas / Torresquintero, Alexandra / Leoni, James / Gales, Mark / King, Simon et al. | 2023
- 1
-
Progressive Meta-Pooling Learning for Lightweight Image Classification ModelDong, Peijie / Niu, Xin / Tian, Zhiliang / Li, Lujun / Wang, Xiaodong / Wei, Zimian / Pan, Hengyue / Li, Dongsheng et al. | 2023
- 1
-
Euro: Espnet Unsupervised ASR Open-Source ToolkitGao, Dongji / Shi, Jiatong / Chuang, Shun-Po / Garcia, Leibny Paola / Lee, Hung-Yi / Watanabe, Shinji / Khudanpur, Sanjeev et al. | 2023
- 1
-
Learning Generalizable Light Field Networks from Few ImagesLi, Qian / Multon, Franck / Boukhayma, Adnane et al. | 2023
- 1
-
Cross-Domain Diffusion Based Speech Enhancement for Very Noisy SpeechWang, Heming / Wang, DeLiang et al. | 2023
- 1
-
A Few Shot Learning of Singing Technique Conversion Based on Cycle Consistency Generative Adversarial NetworksChen, Po-Wei / Soo, Von-Wun et al. | 2023
- 1
-
Compressed Distributed Regression over Adaptive NetworksCarpentiero, Marco / Matta, Vincenzo / Sayed, Ali H. et al. | 2023
- 1
-
An Approach to Ontological Learning from Weak LabelsShah, Ankit / Tang, Larry / Chou, Po Hao / Zheng, Yi Yu / Ge, Ziqian / Raj, Bhiksha et al. | 2023
- 1
-
Sequential Datum–Wise Joint Feature Selection and Classification in the Presence of External ClassifierEkanayake, Sachini Piyoni / Zois, Daphney-Stavroula / Chelmis, Charalampos et al. | 2023
- 1
-
Learning From Label Proportion with Online Pseudo-Label Decision by Regret MinimizationMatsuo, Shinnosuke / Bise, Ryoma / Uchida, Seiichi / Suehiro, Daiki et al. | 2023
- 1
-
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech SeparationLi, Chenda / Wu, Yifei / Qian, Yanmin et al. | 2023
- 1
-
Fine-Grained Emotional Control of Text-to-Speech: Learning to Rank Inter- and Intra-Class Emotion IntensitiesWang, Shijun / Guenason, Jon / Borth, Damian et al. | 2023
- 1
-
Role of Bias Terms in Dot-Product AttentionNamazifar, Mahdi / Hazarika, Devamanyu / Hakkani-Tur, Dilek et al. | 2023
- 1
-
Learning Interpretable Filters In Wav-UNet For Speech EnhancementMathieu, Felix / Courtat, Thomas / Richard, Gael / Peeters, Geoffroy et al. | 2023
- 1
-
Cochlear Decomposition: A Novel Bio-Inspired Multiscale Analysis FrameworkAlfalahi, Hessa / Khandoker, Ahsan / Alhussein, Ghada / Hadjileontiadis, Leontios et al. | 2023
- 1
-
Contrastive Learning of Sentence Embeddings in Product SearchZhang, Bo-Wen / Yan, Yan / Yu, Jiapei et al. | 2023
- 1
-
Leveraging Sparsity with Spiking Recurrent Neural Networks for Energy-Efficient Keyword SpottingDampfhoffer, Manon / Mesquida, Thomas / Hardy, Emmanuel / Valentian, Alexandre / Anghel, Lorena et al. | 2023
- 1
-
A Quantum Approach for Stochastic Constrained Binary OptimizationGupta, Sarthak / Kekatos, Vassilis et al. | 2023
- 1
-
Joint Antenna Selection and Beamforming in Integrated Automotive Radar Sensing-Communications with Quantized Double Phase ShiftersXu, Lifan / Sun, Shunqiao / Zhang, Yimin D. / Petropulu, Athina et al. | 2023
- 1
-
MODEFORMER: Modality-Preserving Embedding For Audio-Video Synchronization Using TransformersGupta, Akash / Tripathi, Rohun / Jang, Wondong et al. | 2023
- 1
-
Semi-Supervised Learning with Per-Class Adaptive Confidence Scores for Acoustic Environment Classification with Imbalanced DataFiorio, Luan Vinicius / Karanov, Boris / David, Johan / Houtum, Wim van / Widdershoven, Frans / Aarts, Ronald M. et al. | 2023
- 1
-
Database-Aware ASR Error Correction for Speech-to-SQL ParsingShao, Yutong / Kumar, Arun / Nakashole, Ndapa et al. | 2023
- 1
-
Convolutional Filtering on Sampled ManifoldsWang, Zhiyang / Ruiz, Luana / Ribeiro, Alejandro et al. | 2023
- 1
-
A Database for Multi-Modal Short Video Quality AssessmentZhang, Yukun / Wang, Chuan / Zhang, Sanyi / Cao, Xiaochun et al. | 2023
- 1
-
Diagonal State Space Augmented Transformers for Speech RecognitionSaon, George / Gupta, Ankit / Cui, Xiaodong et al. | 2023
- 1
-
Unrestricted Anchor Graph Based GCN for Incomplete Multi-View ClusteringZhao, Liang / Wang, Zihao / Yuan, Yukun / Ding, Feng et al. | 2023
- 1
-
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech SynthesisKaneko, Takuhiro / Kameoka, Hirokazu / Tanaka, Kou / Seki, Shogo et al. | 2023
- 1
-
High-Dimensional Confidence Regions in Sparse MRIHoppe, Frederik / Krahmer, Felix / Mayrink Verdun, Claudio / Menzel, Marion I. / Rauhut, Holger et al. | 2023
- 1
-
Towards Efficient and Optimal Joint Beamforming and Antenna Selection: A Machine Learning ApproachShrestha, Sagar / Fu, Xiao / Hong, Mingyi et al. | 2023
- 1
-
Quantum Graph TransformersKollias, Georgios / Kalantzis, Vassilis / Salonidis, Theodoros / Ubaru, Shashanka et al. | 2023
- 1
-
Deep3DSketch: 3D Modeling from Free-Hand Sketches with View- and Structural-Aware Adversarial TrainingChen, Tianrun / Fu, Chenglong / Zhu, Lanyun / Mao, Papa / Zhang, Jia / Zang, Ying / Sun, Lingyun et al. | 2023
- 1
-
PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many MappingLee, Junhyeok / Han, Seungu / Cho, Hyunjae / Jung, Wonbin et al. | 2023
- 1
-
A Method of Constructing and Automatically Labeling Radio Frequency Signal Training Dataset for UAVLiu, Chao / Ma, Ruipeng / Si, Zheng / Chi, Mingmin et al. | 2023
- 1
-
An Online Algorithm for Contrastive Principal Component AnalysisGolkar, Siavash / Lipshutz, David / Tesileanu, Tiberiu / Chklovskii, Dmitri B. et al. | 2023
- 1
-
Small-Footprint Slimmable Networks for Keyword SpottingAkhtar, Zuhaib / Khursheed, Mohammad Omar / Du, Dongsu / Liu, Yuzong et al. | 2023
- 1
-
UFO2: A Unified Pre-Training Framework for Online and Offline Speech RecognitionFu, Li / Li, Siqi / Li, Qingtao / Deng, Liping / Li, Fangzhu / Fan, Lu / Chen, Meng / He, Xiaodong et al. | 2023
- 1
-
Audio Coding With Unified Noise Shaping And Phase Contrast ControlJo, Byeongho / Beack, Seungkwon / Lee, Taejin et al. | 2023
- 1
-
Learning To Locate Visual Answer In Video Corpus Using QuestionLi, Bin / Weng, Yixuan / Sun, Bin / Li, Shutao et al. | 2023
- 1
-
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional NetworksWang, Kuan-Chen / Liu, Kai-Chun / Peng, Sheng-Yu / Tsao, Yu et al. | 2023
- 1
-
K2NN: Self-Supervised Learning with Hierarchical Nearest Neighbors for Remote SensingYuan, Jianlong / Xu, Yuanhong / Wang, Zhibin et al. | 2023
- 1
-
Approximation Error Back-Propagation for Q-Function in Scalable Reinforcement Learning with Tree Dependence StructureYan, Yuzi / Dong, Yu / Ma, Kai / Shen, Yuan et al. | 2023
- 1
-
Multi-Resolution Sequence Aggregation and Model-Agnostic Framework for Time-Series ForecastingLyu, Juhyun / Yang, Jinseok / Kim, Junghee / Lim, Woohyung / Ahn, Wonbin / Kang, Dongwan / Kim, Minjae / Kim, Nam Soo et al. | 2023
- 1
-
DMSA: Dynamic Multi-Scale Unsupervised Semantic Segmentation Based On Adaptive AffinityYang, Kun / Lu, Jun et al. | 2023
- 1
-
A Discriminative Multi-Channel Noise Feature Representation Model for Image Manipulation LocalizationZhou, Yang / Wang, Hongxia / Zeng, Qiang / Zhang, Rui / Meng, Sijiang et al. | 2023
- 1
-
Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech EnhancementZhang, Chen-Yue / Chen, Hang / Du, Jun / Yin, Bao-Cai / Pan, Jia / Lee, Chin-Hui et al. | 2023
- 1
-
Equivalence of Aperture Reduction in Element Space and Constrained Combination of DFT Beams in BeamspaceRakhimov, Damir / Haardt, Martin et al. | 2023
- 1
-
Contrastive Learning at the Relation and Event Level for Rumor DetectionXu, Yingrui / Hu, Jingyuan / Ge, Jingguo / Wu, Yulei / Li, Tong / Li, Hui et al. | 2023
- 1
-
Beamforming Optimization in RIS-Aided Mimo Systems Under Multiple-Reflection EffectsWijekoon, Dilki / Mezghani, Amine / Hossain, Ekram et al. | 2023
- 1
-
EEG2IMAGE: Image Reconstruction from EEG Brain SignalsSingh, Prajwal / Pandey, Pankaj / Miyapuram, Krishna / Raman, Shanmuganathan et al. | 2023
- 1
-
Dual Meta Calibration Mix for Improving Generalization in Meta-LearningMi, Ze-Yu / Yang, Yu-Bin et al. | 2023
- 1
-
Implicit Bayes Adaptation: A Collaborative Transport ApproachJiang, Bo / Krim, Hamid / Wu, Tianfu / Cansever, Derya et al. | 2023
- 1
-
Blind Source Counting and Separation with Relative Harmonic CoefficientsSun, Huiyuan / Samarasinghe, Prasanga / Abhayapala, Thushara et al. | 2023
- 1
-
YOLOX-B: A Better Yolox Model for Real-Time Driver Behavior DetectionGuo, Xu / Ma, Ming / Zhang, Jiaqiang / Li, Shaojie et al. | 2023
- 1
-
Active Noise Control over 3D Space: A Realistic Error Microphone Geometry DesignSun, Huiyuan / Samarasinghe, Prasanga / Abhayapala, Thushara et al. | 2023
- 1
-
A Multi-Stage Hierarchical Relational Graph Neural Network for Multimodal Sentiment AnalysisGong, Peizhu / Liu, Jin / Zhang, Xiliang / Li, Xingye et al. | 2023
- 1
-
Single-Sample Direction-of-Arrival Estimation for Fast and Robust 3D Localization With Real Measurements from a Massive MIMO SystemMazokha, Stepan / Naderi, Sanaz / Orfanidis, Georgios I. / Sklivanitis, George / Pados, Dimitris A. / Hallstrom, Jason O. et al. | 2023
- 1
-
Low in Resolution, High in Precision: UAV Detection with Super-Resolution and Motion Information ExtractionWang, Hanzhuo / Wang, Xingjian / Zhou, Chengwei / Meng, Wenchao / Shi, Zhiguo et al. | 2023
- 1
-
Continuous Descriptor-Based Control for Deep Audio SynthesisDevis, Ninon / Demerle, Nils / Nabi, Sarah / Genova, David / Esling, Philippe et al. | 2023
- 1
-
SSGD: A Smartphone Screen Glass Dataset for Defect DetectionHan, Haonan / Yang, Rui / Li, Shuyan / Hu, Runze / Li, Xiu et al. | 2023
- 1
-
Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation ScoringLiu, Wei / Fu, Kaiqi / Tian, Xiaohai / Shi, Shuju / Li, Wei / Ma, Zejun / Lee, Tan et al. | 2023
- 1
-
Learning Unbiased Rewards with Mutual Information in Adversarial Imitation LearningZhang, Lihua / Liu, Quan / Huang, Zhigang / Wu, Lan et al. | 2023
- 1
-
Parasympathetic-Sympathetic Causal Interactions and Perceived Workload for Varying Difficulty Affective Computing TasksLavanuru, Pravallika / Pratiher, Sawon / Sahoo, Karuna P. / Acharya, Mrinal / S, Sreejith / Ghosh, Nirmalya / Patra, Amit et al. | 2023
- 1
-
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference InformationLiu, Zhengyuan / Chen, Nancy F. et al. | 2023
- 1
-
Deep Plug-and-Play for Tensor Robust Principal Component AnalysisTan, Hao / Wang, Jianjun / Kong, Weichao et al. | 2023
- 1
-
Contrastive Learning-Based Audio to Lyrics Alignment for Multiple LanguagesDurand, Simon / Stoller, Daniel / Ewert, Sebastian et al. | 2023
- 1
-
Robust Knowledge Distillation from RNN-T Models with Noisy Training Labels Using Full-Sum LossZeineldeen, Mohammad / Audhkhasi, Kartik / Baskar, Murali Karthick / Ramabhadran, Bhuvana et al. | 2023
- 1
-
Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis PipelineNoe, Paul-Gauthier / Miao, Xiaoxiao / Wang, Xin / Yamagishi, Junichi / Bonastre, Jean-Francois / Matrouf, Driss et al. | 2023
- 1
-
ICEL: Learning with Inconsistent ExplanationsLiu, Biao / Wu, Xiaoyu / Yuan, Bo et al. | 2023
- 1
-
Facial Texure Perceiver: Towards High-Fidelity Facial Texture Recovery with Input-Level Inductive Biased Perceiver IOLee, Seungeun et al. | 2023
- 1
-
Single-Shot Domain Adaptation via Target-Aware Generative AugmentationsSubramanyam, Rakshith / Thopalli, Kowshik / Berman, Spring / Turaga, Pavan / Thiagarajan, Jayaraman J. et al. | 2023
- 1
-
Distance-Based Weight Transfer for Fine-Tuning From Near-Field to Far-Field Speaker VerificationZhang, Li / Wang, Qing / Wang, Hongji / Li, Yue / Rao, Wei / Wang, Yannan / Xie, Lei et al. | 2023
- 1
-
Efficient and Effective Multi-Camera Pose Estimation with Weighted M-Estimate Sample ConsensusLin, Xinyu / Zhou, Yingjie / Zhang, Xun / Liu, Yipeng / Zhu, Ce et al. | 2023
- 1
-
Paaploss: A Phonetic-Aligned Acoustic Parameter Loss for Speech EnhancementYang, Muqiao / Konan, Joseph / Bick, David / Zeng, Yunyang / Han, Shuo / Kumar, Anurag / Watanabe, Shinji / Raj, Bhiksha et al. | 2023
- 1
-
A Novel Extrapolation Technique to Accelerate WMMSEZhou, Kaiwen / Chen, Zhilin / Liu, Guochen / Chen, Zhitang et al. | 2023
- 1
-
Improving Non-Autoregressive Speech Recognition with Autoregressive PretrainingLi, Yanjia / Samarakoon, Lahiru / Fung, Ivan et al. | 2023
- 1
-
CORSD: Class-Oriented Relational Self DistillationYu, Muzhou / Tan, Sia Huat / Wu, Kailu / Dong, Runpei / Zhang, Linfeng / Ma, Karsheng et al. | 2023
- 1
-
Short-Segment Speaker Verification Using ECAPA-TDNN with Multi-Resolution EncoderHan, Sangwook / Ahn, Youngdo / Kang, Kyeongmuk / Shin, Jong Won et al. | 2023
- 1
-
Prefix Tuning for Automated Audio CaptioningKim, Minkyu / Sung-Bin, Kim / Oh, Tae-Hyun et al. | 2023
- 1
-
Real-Time Multichannel Speech Separation and Enhancement Using a Beamspace-Domain-Based Lightweight CNNOlivieri, Marco / Comanducci, Luca / Pezzoli, Mirco / Balsarri, Davide / Menescardi, Luca / Buccoli, Michele / Pecorino, Simone / Grosso, Antonio / Antonacci, Fabio / Sarti, Augusto et al. | 2023
- 1
-
LongFNT: Long-Form Speech Recognition with Factorized Neural TransducerGong, Xun / Wu, Yu / Li, Jinyu / Liu, Shujie / Zhao, Rui / Chen, Xie / Qian, Yanmin et al. | 2023
- 1
-
WIFI-Based Robust Child Presence Detection for Smart CarsJayaweera, Sakila S. / Wang, Beibei / Zeng, Xiaolu / Wang, Wei-Hsiang / Ray Liu, K. J. et al. | 2023
- 1
-
CANDY: Category-Kernelized Dynamic Convolution for Instance SegmentationLu, Yao / Chen, Zhiyi / Chen, Zehui / Hu, Jie / Cao, Liujuan / Zhang, Shengchuan et al. | 2023
- 1
-
Distance-Based Online Label Inference Attacks Against Split LearningLiu, Junlin / Lyu, Xinchen et al. | 2023
- 1
-
Combining the Silhouette and Skeleton Data for Gait RecognitionWang, Likai / Han, Ruize / Feng, Wei et al. | 2023
- 1
-
Comparing Decentralized Gradient Descent Approaches and GuaranteesMoothedath, Shana / Vaswani, Namrata et al. | 2023
- 1
-
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural DiarizationLandini, Federico / Diez, Mireia / Lozano-Diez, Alicia / Burget, Lukas et al. | 2023
- 1
-
D-CONFORMER: Deformable Sparse Transformer Augmented Convolution for Voxel-Based 3D Object DetectionZhao, Xiao / Su, Liuzhen / Zhang, Xukun / Yang, Dingkang / Sun, Mingyang / Wang, Shunli / Zhai, Peng / Zhang, Lihua et al. | 2023
- 1
-
Spatial Inference Using Censored Multiple Testing with Fdr ControlGolz, Martin / Zoubir, Abdelhak M. / Koivunen, Visa et al. | 2023
- 1
-
Runtime Prediction of Machine Learning Algorithms in Automl SystemsDube, Parijat / Salonidis, Theodoros / Ram, Parikshit / Verma, Ashish et al. | 2023
- 1
-
Transformer-Based Bioacoustic Sound Event Detection on Few-Shot Learning TasksYou, Liwen / Coyotl, Erika Pelaez / Gunturu, Suren / Van Segbroeck, Maarten et al. | 2023
- 1
-
Unlimited Sampling in Phase SpaceZhang, Peiyu / Bhandari, Ayush et al. | 2023
- 1
-
Integrated Sensing and Full-Duplex Communication: Joint Transceiver Beamforming and Power AllocationHe, Zhenyao / Xu, Wei / Shen, Hong / Kwan Ng, Derrick Wing / Eldar, Yonina C. / You, Xiaohu et al. | 2023
- 1
-
Online Model Compression for Federated Learning with Large ModelsYang, Tien-Ju / Xiao, Yonghui / Motta, Giovanni / Beaufays, Francoise / Mathews, Rajiv / Chen, Mingqing et al. | 2023
- 1
-
Active Beam Tracking with Reconfigurable Intelligent SurfaceHan, Han / Jiang, Tao / Yu, Wei et al. | 2023
- 1
-
A Magnetic Framelet-Based Convolutional Neural Network for Directed GraphsLin, Lequan / Gao, Junbin et al. | 2023
- 1
-
An Edge Alignment-Based Orientation Selection Method for Neutron TomographyYang, Diyu / Tang, Shimin / Venkatakrishnan, Singanallur V. / Chowdhury, Mohammad S. N. / Zhang, Yuxuan / Bilheux, Hassina Z. / Buzzard, Gregery T. / Bouman, Charles A. et al. | 2023
- 1
-
SMUG: Towards Robust Mri Reconstruction by Smoothed UnrollingLi, Hui / Jia, Jinghan / Liang, Shijun / Yao, Yuguang / Ravishankar, Saiprasad / Liu, Sijia et al. | 2023
- 1
-
Weavspeech: Data Augmentation Strategy For Automatic Speech Recognition Via Semantic-Aware WeavingSeo, Kyusung / Park, Joonhyung / Song, Jaeyun / Yang, Eunho et al. | 2023
- 1
-
CTTSR: A Hybrid CNN-Transformer Network for Scene Text Image Super-ResolutionDai, Kaiwei / Kang, Nan / Kuang, Li et al. | 2023
- 1
-
M22: Rate-Distortion Inspired Gradient CompressionLiu, Yangyi / Salehkalaibar, Sadaf / Rini, Stefano / Chen, Jun et al. | 2023
- 1
-
Joint Training of Hierarchical GANs and Semantic Segmentation for Expression TranslationBodur, Rumeysa / Bhattarai, Binod / Kim, Tae-Kyun et al. | 2023
- 1
-
Performance Comparison of TTS Models for Brazilian Portuguese to Establish a BaselineLobato, Wilmer / Farias, Felipe / Cruz, William / Amadeus, Marcellus et al. | 2023
- 1
-
On Adversarial Robustness of Audio ClassifiersLu, Kangkang / Nguyen, Manh Cuong / Xu, Xun / Foo, Chuan Sheng et al. | 2023
- 1
-
Audio-Driven High Definetion and Lip-Synchronized Talking Face Generation Based on Face ReenactmentWang, Xianyu / Zhang, Yuhan / He, Weihua / Wang, Yaoyuan / Li, Minglei / Wang, Yuchen / Zhang, Jingyi / Zhou, Shunbo / Zhang, Ziyang et al. | 2023
- 1
-
Text-To-Speech Synthesis Based on Latent Variable Conversion Using Diffusion Probabilistic Model and Variational AutoencoderYasuda, Yusuke / Toda, Tomoki et al. | 2023
- 1
-
Representation Learning of Clinical Multivariate Time Series with Random Filter BanksKeshavarzian, Alireza / Salehinejad, Hojjat / Valaee, Shahrokh et al. | 2023
- 1
-
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and TranslationLam, Tsz Kin / Schamoni, Shigehiko / Riezler, Stefan et al. | 2023
- 1
-
SDRNet: Shape Decoupled Regression Network for 3d face ReconstructionZhang, Shikun / Song, Fengyi / Song, Ge / Yang, Ming et al. | 2023
- 1
-
IR-ECG: Invertible Reconstruction of ECGWang, Peng / Huang, Xi / Cui, Li et al. | 2023
- 1
-
Data Leakage in Cross-Modal Retrieval Training: A Case StudyWeck, Benno / Serra, Xavier et al. | 2023
- 1
-
EfficientSpeech: An On-Device Text to Speech ModelAtienza, Rowel et al. | 2023
- 1
-
Subband Dependency Modeling for Sound Event DetectionGuan, Yadong / Zheng, Guibin / Han, Jiqing / Wang, Huanliang et al. | 2023
- 1
-
Tracking Targets in Hyper-Scale Cameras Using Movement PredicationYu, Jiaping / Zhou, Tongqing / Cai, Zhiping / Kuang, Wenyuan et al. | 2023
- 1
-
Revisit Out-Of-Vocabulary Problem For Slot Filling: A Unified Contrastive Framework With Multi-Level Data AugmentationsGuo, Daichi / Dong, Guanting / Fu, Dayuan / Wu, Yuxiang / Zeng, Chen / Hui, Tingfeng / Wang, Liwen / Li, Xuefeng / Wang, Zechen / He, Keqing et al. | 2023
- 1
-
End-to-End Amp Modeling: from Data to Controllable Guitar Amplifier ModelsJuvela, Lauri / Damskagg, Eero-Pekka / Peussa, Aleksi / Makinen, Jaakko / Sherson, Thomas / Mimilakis, Stylianos I. / Rauhanen, Kimmo / Gotsopoulos, Athanasios et al. | 2023
- 1
-
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementZeng, Yunyang / Konan, Joseph / Han, Shuo / Bick, David / Yang, Muqiao / Kumar, Anurag / Watanabe, Shinji / Raj, Bhiksha et al. | 2023
- 1
-
Decaying Contrast for Fine-Grained Video Representation LearningZhang, Heng / Su, Bing et al. | 2023
- 1
-
EMCLR: Expectation Maximization Contrastive Learning RepresentationsLiu, Meng / Yi, Ran / Ma, Lizhuang et al. | 2023
- 1
-
Difference Guided VHR Remote Sensing Image Change DetectionSun, Jiukai / Liu, Ganchao / Li, Xuelong / Yuan, Yuan et al. | 2023
- 1
-
Topology Uncertainty Modeling For Imbalanced Node Classification on GraphsGao, Jiayi / Li, Jiaxing / Zhang, Ke / Kong, Youyong et al. | 2023
- 1
-
SSI-Net: A Multi-Stage Speech Signal Improvement System for ICASSP 2023 SSI ChallengeZhu, Weixin / Wang, Zilin / Lin, Jiuxin / Zeng, Chang / Yu, Tao et al. | 2023
- 1
-
Blind Acoustic Room Parameter Estimation Using Phase FeaturesIck, Christopher / Mehrabi, Adib / Jin, Wenyu et al. | 2023
- 1
-
Exploiting Speaker Embeddings for Improved Microphone Clustering and Speech Separation in ad-hoc Microphone ArraysKindt, Stijn / Thienpondt, Jenthe / Madhu, Nilesh et al. | 2023
- 1
-
Classification of the Cervical Vertebrae Maturation (CVM) Stages Using the Tripod NetworkAtici, Salih / Pan, Hongyi / Elnagar, Mohammed H. / Allareddy, Veerasathpurush / Suhaym, Omar / Ansari, Rashid / Cetin, Ahmet Enis et al. | 2023
- 1
-
A Deep Fusion Rule for Infrared and Visible Image Fusion: Feature Communication for Importance AssessmentLv, Xuran / Cheng, Jinyong / Lv, Guohua / Wei, Zhonghe et al. | 2023
- 1
-
On the Role of Visual Context in Enriching Music RepresentationsAvramidis, Kleanthis / Stewart, Shanti / Narayanan, Shrikanth et al. | 2023
- 1
-
Designing A 3d-Aware Stylenerf Encoder for Face EditingYang, Songlin / Wang, Wei / Peng, Bo / Dong, Jing et al. | 2023
- 1
-
Sensor Selection for Angle of Arrival Estimation Based on the Two-Target Cramér-Rao BoundKokke, Costas A. / Coutino, Mario / Anitori, Laura / Heusdens, Richard / Leus, Geert et al. | 2023
- 1
-
A Meta-Gnn Approach to Personalized Seizure Detection and ClassificationRahmani, Abdellah / Venkitaraman, Arun / Frossard, Pascal et al. | 2023
- 1
-
Does a Quieter City Mean Fewer Complaints? The Sounds of New York City During Covid-19 LockdownCartwright, Mark / Fuentes, Magdalena / Mydlarz, Charlie / Miranda, Fabio / Bello, Juan Pablo et al. | 2023
- 1
-
ECGT2T: Towards Synthesizing Twelve-Lead Electrocardiograms from Two Asynchronous LeadsJo, Yong-Yeon / Choi, Young Sang / Jang, Jong-Hwan / Kwon, Joon-Myoung et al. | 2023
- 1
-
Once-for-All Sequence Compression for Self-Supervised Speech ModelsChen, Hsuan-Jui / Meng, Yen / Lee, Hung-yi et al. | 2023
- 1
-
UX-Net: Filter-and-Process-Based Improved U-Net for real-time time-domain audio SeparationPatel, Kashyap / Kovalyov, Anton / Panahi, Issa et al. | 2023
- 1
-
Dasformer: Deep Alternating Spectrogram Transformer For Multi/Single-Channel Speech SeparationWang, Shuo / Kong, Xiangyu / Peng, Xiulian / Movassagh, Hesam / Prakash, Vinod / Lu, Yan et al. | 2023
- 1
-
Audio Barlow Twins: Self-Supervised Audio Representation LearningAnton, Jonah / Coppock, Harry / Shukla, Pancham / Schuller, Bjorn W. et al. | 2023
- 1
-
Confidence-Based Event-Centric Online Video Question Answering on a Newly Constructed ATBS DatasetKong, Weikai / Ye, Shuhong / Yao, Chenglin / Ren, Jianfeng et al. | 2023
- 1
-
Mcrood: Multi-Class Radar Out-Of-Distribution DetectionKahya, Sabri Mustafa / Sami Yavuz, Muhammet / Steinbach, Eckehard et al. | 2023
- 1
-
Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and SimilarityAlonso-Jimenez, Pablo / Favory, Xavier / Foroughmand, Hadrien / Bourdalas, Grigoris / Serra, Xavier / Lidy, Thomas / Bogdanov, Dmitry et al. | 2023
- 1
-
Multimodal Dyadic Impression Recognition via Listener Adaptive Cross-Domain FusionLi, Yuanchao / Bell, Peter / Lai, Catherine et al. | 2023
- 1
-
Forensics for Adversarial Machine Learning Through Attack Mapping IdentificationYan, Allen / Kim, Jinsub / Raich, Raviv et al. | 2023
- 1
-
Sketch Less Face Image Retrieval: A New ChallengeDai, Dawei / Li, Yutang / Wang, Liang / Fu, Shiyu / Xia, Shuyin / Wang, Guoyin et al. | 2023
- 1
-
Sample-Adapt Fusion Network for RGB-D Hand Detection in the WildLiu, Xingyu / Ren, Pengfei / Chen, Yuchen / Liu, Cong / Wang, Jing / Sun, Haifeng / Qi, Qi / Wang, Jingyu et al. | 2023
- 1
-
Semantic Preserving Learning for Task-Oriented Point Cloud DownsamplingXiong, Jianyu / Dai, Tao / Zha, Yaohua / Wang, Xin / Xia, Shu-Tao et al. | 2023
- 1
-
Subgradient Descent Learning with Over-the-Air ComputationGez, Tamir L. S. / Cohen, Kobi et al. | 2023
- 1
-
Rigid-Body Sound Synthesis with Differentiable Modal ResonatorsDiaz, Rodrigo / Hayes, Ben / Saitis, Charalampos / Fazekas, Gyorgy / Sandler, Mark et al. | 2023
- 1
-
Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TVTorcoli, Matteo / Habets, Emanuel A. P. et al. | 2023
- 1
-
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument ClassificationZhong, Zhi / Hirano, Masato / Shimada, Kazuki / Tateishi, Kazuya / Takahashi, Shusuke / Mitsufuji, Yuki et al. | 2023
- 1
-
Hadamard Layer to Improve Semantic SegmentationHoyos, Angello / Rivera, Mariano et al. | 2023
- 1
-
Decoding Musical Pitch from Human Brain Activity with Automatic Voxel-Wise Whole-Brain FMRI Feature SelectionCheung, Vincent K.M. / Peng, Yueh-Po / Lin, Jing-Hua / Su, Li et al. | 2023
- 1
-
Graph Wavelet-Based Point Cloud Geometric Denoising with Surface-Consistent Non-Negative Kernel RegressionWatanabe, Ryosuke / Nonaka, Keisuke / Pavez, Eduardo / Kobayashi, Tatsuya / Ortega, Antonio et al. | 2023
- 1
-
Semi-Swinderain: Semi-Supervised Image Deraining Network Using SWIN TransformerRen, Chun / Yan, Danfeng / Cai, Yuanqiang / Li, Yangchun et al. | 2023
- 1
-
Hierarchical Multi-Task Learning for Fabric Component Analysis Based on NIR Spectral SignalsKim, Joseph / Wu, Dong / Chi, Mingmin / Xu, Gaoqi et al. | 2023
- 1
-
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer’s Disease Using ForestnetsPerez-Toro, P. A. / Rodriguez-Salas, D. / Arias-Vergara, T. / Bayerl, S. P. / Klumpp, P. / Riedhammer, K. / Schuster, M. / Noth, E. / Maier, A. / Orozco-Arroyave, J. R. et al. | 2023
- 1
-
End-to-End Classification of Cell-Cycle Stages with Center-Cell Focus Tracker Using Recurrent Neural NetworksJose, Abin / Roy, Rijo / Eschweiler, Dennis / Laube, Ina / Azad, Reza / Moreno-Andres, Daniel / Stegmaier, Johannes et al. | 2023
- 1
-
Client Selection for Generalization in Accelerated Federated Learning: A Bandit ApproachAmi, Dan Ben / Cohen, Kobi / Zhao, Qing et al. | 2023
- 1
-
Efficient Speech Translation with Dynamic Latent PerceiversTsiamas, Ioannis / Gallego, Gerard I. / Fonollosa, Jose A. R. / Costa-jussa, Marta R. et al. | 2023
- 1
-
Towards Privacy and Utility in Tourette TIC Detection Through Pretraining Based on Publicly Available Video Data of Healthy SubjectsSophie Brugge, Nele / Mohammadi, Esfandiar / Munchau, Alexander / Baumer, Tobias / Frings, Christian / Beste, Christian / Roessner, Veit / Handels, Heinz et al. | 2023
- 1
-
Mixer: DNN Watermarking using Image MixupKallas, Kassem / Furon, Teddy et al. | 2023
- 1
-
Targeted Adversarial Attacks Against Neural Machine TranslationSadrizadeh, Sahar / Aghdam, AmirHossein Dabiri / Dolamic, Ljiljana / Frossard, Pascal et al. | 2023
- 1
-
Supervised Hierarchical Clustering Using Graph Neural Networks for Speaker DiarizationSingh, Prachi / Kaul, Amrit / Ganapathy, Sriram et al. | 2023
- 1
-
FindAdaptNet: Find and Insert Adapters by Learned Layer ImportanceHuang, Junwei / Ganesan, Karthik / Maiti, Soumi / Min Kim, Young / Chang, Xuankai / Liang, Paul / Watanabe, Shinji et al. | 2023
- 1
-
An Effective Anomalous Sound Detection Method Based on Representation Learning with Simulated AnomaliesChen, Han / Song, Yan / Zhuo, Zhu / Zhou, Yu / Li, Yu-Hong / Xue, Hui / McLoughlin, Ian et al. | 2023
- 1
-
Batch Normalization Damages Federated Learning on NON-IID Data: Analysis and RemedyWang, Yanmeng / Shi, Qingjiang / Chang, Tsung-Hui et al. | 2023
- 1
-
Convolution-Based Channel-Frequency Attention for Text-Independent Speaker VerificationLi, Jingyu / Tian, Yusheng / Lee, Tan et al. | 2023
- 1
-
Learning Properties of Holomorphic Neural Networks of Dual VariablesKozlov, Dmitry / Bakulin, Mikhail / Pavlov, Stanislav / Zuev, Aleksandr / Krylova, Mariya / Kharchikov, Igor et al. | 2023
- 1
-
Recursive/Iterative Unique Projection-Aggregation Decoding of Reed-Muller CodesHashemipour-Nazari, Marzieh / Debets, Renate / Goossens, Kees / Balatsoukas-Stimming, Alexios et al. | 2023
- 1
-
Improved Deep Speaker Localization and Tracking: Revised Training Paradigm and Controlled LatencyBohlender, Alexander / Roelens, Liesbeth / Madhu, Nilesh et al. | 2023
- 1
-
Static-Scene Constrained Optimization for Matrix/Tensor-Decomposition-free Foreground-Background SeparationNaganuma, Kazuki / Ono, Shunsuke et al. | 2023
- 1
-
Image Inpainting with Semantic-Aware TransformerChen, Shiyu / Yu, Wenxin / Wang, Qi / Gong, Jun / Chen, Peng et al. | 2023
- 1
-
MCNET: Fuse Multiple Cues for Multichannel Speech EnhancementYang, Yujie / Quan, Changsheng / Li, Xiaofei et al. | 2023
- 1
-
Co-Operative CNN for Visual Saliency Prediction on WCE ImagesDimas, George / Koulaouzidis, Anastasios / Iakovidis, Dimitris K. et al. | 2023
- 1
-
ISmallNet: Densely Nested Network with Label Decoupling for Infrared Small Target DetectionHu, Zhiheng / Wang, Yongzhen / Li, Peng / Qin, Jie / Xie, Haoran / Wei, Mingqiang et al. | 2023
- 1
-
Improved Projection Learning for Lower Dimensional Feature MapsPrice, Ilan / Tanner, Jared et al. | 2023
- 1
-
Wordreg: Mitigating the Gap between Training and Inference with Worst-Case Drop RegularizationXia, Jun / Wang, Ge / Hu, Bozhen / Tan, Cheng / Zheng, Jiangbin / Xu, Yongjie / Li, Stan Z. et al. | 2023
- 1
-
The NIO System for Audio-Visual Diarization and Recognition in MISP Challenge 2022Xu, Gaopeng / Wang, Xianliang / Wang, Sang / Yuan, Junfeng / Guo, Wei / Li, Wei / Gao, Jie et al. | 2023
- 1
-
Vision Transformer with Progressive Tokenization for CT Metal Artifact ReductionZheng, Songwei / Zhang, Dong / Yu, Chunyan / Zhu, Danhong / Zhu, Longlong / Liu, Hao / Huang, Zhongzheng et al. | 2023
- 1
-
A Critical Look at Recent Trends in Compression of Channel State InformationOrnhag, Marcus Valtonen / Adalbjornsson, Stefan / Guler, Puren / Mahdavi, Mojtaba et al. | 2023
- 1
-
Speech Emotion Recognition Via Two-Stream Pooling Attention With Discriminative Channel WeightingLiu, Ke / Wang, Dekui / Wu, Dongya / Feng, Jun et al. | 2023
- 1
-
DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation TasksWang, Heming / Qian, Yao / Yang, Hemin / Kanda, Naoyuki / Wang, Peidong / Yoshioka, Takuya / Wang, Xiaofei / Wang, Yiming / Liu, Shujie / Chen, Zhuo et al. | 2023
- 1
-
Spherical Vector Quantization for Spatial Direction CodingRagot, Stephane / Vasilache, Adriana et al. | 2023
- 1
-
Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech EnhancementClose, George / Ravenscroft, William / Hain, Thomas / Goetze, Stefan et al. | 2023
- 1
-
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial ExpressionsHwang, Geumbyeol / Hong, Sunwon / Lee, Seunghyun / Park, Sungwoo / Chae, Gyeongsu et al. | 2023
- 1
-
Naturalistic Head Motion Generation from SpeechMittal, Trisha / Aldeneh, Zakaria / Fedzechkina, Masha / Ranjan, Anurag / Theobald, Barry-John et al. | 2023
- 1
-
Bayesian Methods for Optical Flow Estimation Using a Variational Approximation, with Applications to UltrasoundDorazil, Jan / Fleury, Bernard H. / Hlawatsch, Franz et al. | 2023
- 1
-
Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated full- and sub-band ModelingWang, Zhong-Qiu / Cornell, Samuele / Choi, Shukjae / Lee, Younglo / Kim, Byeong-Yeol / Watanabe, Shinji et al. | 2023
- 1
-
LSTM-Based Video Quality Prediction Accounting for Temporal Distortions in Videoconferencing CallsMittag, Gabriel / Naderi, Babak / Gopal, Vishak / Cutler, Ross et al. | 2023
- 1
-
Applying Independent Vector Analysis on EEG-Based Motor Imagery ClassificationMoraes, Caroline P. A. / Aristimunha, Bruno / Dos Santos, Lucas Heck / Pinaya, Walter Hugo Lopez / de Camargo, Raphael Yokoingawa / Fantinato, Denis G. / Neves, Aline et al. | 2023
- 1
-
Hierarchical Pronunciation Assessment with Multi-Aspect AttentionDo, Heejin / Kim, Yunsu / Lee, Gary Geunbae et al. | 2023
- 1
-
Zero-Shot Anomalous Sound Detection in Domestic Environments Using Large-Scale Pretrained Audio Pattern Recognition ModelsIlic Mezza, Alessandro / Zanetti, Giulio / Cobos, Maximo / Antonacci, Fabio et al. | 2023
- 1
-
Improving BERT Fine-Tuning via Stabilizing Cross-Layer Mutual InformationLi, Jicun / Li, Xingjian / Wang, Tianyang / Wang, Shi / Cao, Yanan / Xu, Chengzhong / Dou, Dejing et al. | 2023
- 1
-
A Model-Based Hearing Compensation Method Using a Self-Supervised FrameworkNiu, Yadong / Li, Nan / Wu, Xihong / Chen, Jing et al. | 2023
- 1
-
Structured Pruning of Self-Supervised Pre-Trained Models for Speech Recognition and UnderstandingPeng, Yifan / Kim, Kwangyoun / Wu, Felix / Sridhar, Prashant / Watanabe, Shinji et al. | 2023
- 1
-
Contrastive Domain Adaptation Via Delimitation DiscriminatorWei, Xing / Wen, Bin / Chen, Lei / Liu, Yujie / Zhao, Chong / Lu, Yang et al. | 2023
- 1
-
Efficient Siamese Network for UAV TrackingZhang, Xiaohan / Wang, Dong / Ma, Xiaohong et al. | 2023
- 1
-
Counterfactual Explanation for Multivariate Times Series Using A Contrastive Variational AutoencoderTodo, William / Selmani, Merwann / Laurent, Beatrice / Loubes, Jean-Michel et al. | 2023
- 1
-
Long-Term Synchronization of Wireless Acoustic Sensor Networks with Nonpersistent Acoustic Activity Using Coherence StateChinaev, Aleksej / Knaepper, Niklas / Enzner, Gerald et al. | 2023
- 1
-
CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech SynthesisChen, Chen / Wang, Dong / Zheng, Thomas Fang et al. | 2023
- 1
-
Real-Time Speech Enhancement with Dynamic Attention SpanZheng, Chengyu / Zhou, Yuan / Peng, Xiulian / Zhang, Yuan / Lu, Yan et al. | 2023
- 1
-
Neurally Augmented State Space Model for Simultaneous Communication and Tracking with Low Complexity ReceiversPedraza, Fernando / Caire, Giuseppe et al. | 2023
- 1
-
Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple CitiesAngulo, Florian / Essid, Slim / Peeters, Geoffroy / Mietlicki, Christophe et al. | 2023
- 1
-
NC-WAMKD: Neighborhood Correction Weight-Adaptive Multi-Teacher Knowledge Distillation for Graph-Based Semi-Supervised Node ClassificationLiu, Jiahao / Guo, Pengcheng / Song, Yonghong et al. | 2023
- 1
-
F-PABEE: Flexible-Patience-Based Early Exiting For Single-Label and Multi-Label Text Classification TasksGao, Xiangxiang / Zhu, Wei / Gao, Jiasheng / Yin, Congrui et al. | 2023
- 1
-
Speech and Noise Dual-Stream Spectrogram Refine Network With Speech Distortion Loss For Robust Speech RecognitionLu, Haoyu / Li, Nan / Song, Tongtong / Wang, Longbiao / Dang, Jianwu / Wang, Xiaobao / Zhang, Shiliang et al. | 2023
- 1
-
Streaming Stroke Classification of Online HandwritingLiu, Jing-Yu / Zhang, Yan-Ming / Yin, Fei / Liu, Cheng-Lin et al. | 2023
- 1
-
Reducing Language Confusion for Code-Switching Speech Recognition with Token-Level Language DiarizationLiu, Hexin / Xu, Haihua / Garcia, Leibny Paola / Khong, Andy W. H. / He, Yi / Khudanpur, Sanjeev et al. | 2023
- 1
-
Cross-Modal Audio-Visual Co-Learning for Text-Independent Speaker VerificationLiu, Meng / Lee, Kong Aik / Wang, Longbiao / Zhang, Hanyi / Zeng, Chang / Dang, Jianwu et al. | 2023
- 1
-
Egocentric Audio-Visual Noise SuppressionSharma, Roshan / He, Weipeng / Lin, Ju / Lakomkin, Egor / Liu, Yang / Kalgaonkar, Kaustubh et al. | 2023
- 1
-
Sparse Graph Learning with Spectrum Prior for Deep Graph Convolutional NetworksZeng, Jin / Liu, Yang / Cheung, Gene / Hu, Wei et al. | 2023
- 1
-
A Game of Snakes and GANsAsokan, Siddarth / Mohammed, Fatwir Sheikh / Sekhar Seelamantula, Chandra et al. | 2023
- 1
-
Enabling Large-Scale Image Search with Co-Attention MechanismHu, Zechao / Bors, Adrian G. et al. | 2023
- 1
-
Deep Manifold Graph Auto-Encoder For Attributed Graph EmbeddingHu, Bozhen / Zang, Zelin / Xia, Jun / Wu, Lirong / Tan, Cheng / Li, Stan Z. et al. | 2023
- 1
-
Learning Expressive And Generalizable Motion Features For Face Forgery DetectionZhang, Jingyi / Zhang, Peng / Wang, Jingjing / Xie, Di / Pu, Shiliang et al. | 2023
- 1
-
Self-Supervised Speech Representation Learning for Keyword-Spotting With Light-Weight TransformersGao, Chenyang / Gu, Yue / Caliva, Francesco / Liu, Yuzong et al. | 2023
- 1
-
UPGLADE: Unplugged Plug-and-Play Audio Declipper Based on Consensus Equilibrium of DNN and Sparse OptimizationTanaka, Tomoro / Yatabe, Kohei / Oikawa, Yasuhiro et al. | 2023
- 1
-
Spatio-Temporal Structure Consistency for Semi-Supervised Medical Image ClassificationLei, Wentao / Liu, Lei / Liu, Li et al. | 2023
- 1
-
A Bandit Online Convex Optimization Approach To Distributed Energy Management In Networked SystemsTsetis, Ioannis / Cheng, Xiaotong / Maghsudi, Setareh et al. | 2023
- 1
-
Efficiently Fusing Sparse Lidar for Enhanced Self-Supervised Monocular Depth EstimationWang, Yue / Gong, Mingrong / Xia, Lei / Zhang, Qieshi / Cheng, Jun et al. | 2023
- 1
-
Exploiting Prompt Learning with Pre-Trained Language Models for Alzheimer’s Disease DetectionWang, Yi / Deng, Jiajun / Wang, Tianzi / Zheng, Bo / Hu, Shoukang / Liu, Xunying / Meng, Helen et al. | 2023
- 1
-
Sparse Bayesian Learning Assisted Decision Fusion in Millimeter Wave Massive MIMO Sensor NetworksChawla, Apoorva / Ciuonzo, Domenico / Rossi, Pierluigi Salvo et al. | 2023
- 1
-
FedVMR: A New Federated Learning Method for Video Moment RetrievalWang, Yan / Luo, Xin / Chen, Zhen-Duo / Zhang, Peng-Fei / Liu, Meng / Xu, Xin-Shun et al. | 2023
- 1
-
Context-Aware Face Clustering with Graph Convolutional NetworksZhang, Dafeng / Guo, Jiangbo / Jin, Zhezhu et al. | 2023
- 1
-
Constrained non-negative PARAFAC2 for electromyogram separationMagbonde, Abile / Quaine, Franck / Rivet, Bertrand et al. | 2023
- 1
-
Continuous Learning for Blind Image Quality Assessment with Contrastive TransformerYang, Jifan / Wang, Zhongyuan / Huang, Baojin / Deng, Lianbing et al. | 2023
- 1
-
Surface-Sampling Based Objective Quality Assessment Metrics for MeshesFu, Chunyang / Zhang, Xiang / Nguyen-Canh, Thuong / Xu, Xiaozhong / Li, Ge / Liu, Shan et al. | 2023
- 1
-
Exploration Into Translation-Equivariant Image QuantizationShin, Woncheol / Lee, Gyubok / Lee, Jiyoung / Lyou, Eunyi / Lee, Joonseok / Choi, Edward et al. | 2023
- 1
-
Deep Subband Network for Joint Suppression of Echo, Noise and Reverberation in Real-Time Fullband Speech CommunicationXiong, Feifei / Dong, Minya / Zhou, Kechenying / Zhu, Houwei / Feng, Jinwei et al. | 2023
- 1
-
More Speaking or More Speakers?Berrebbi, Dan / Collobert, Ronan / Jaitly, Navdeep / Likhomanenko, Tatiana et al. | 2023
- 1
-
Neighborhood Information-Based Label Refinement for Person Re-Identification with Label NoiseZhong, Xian / Su, Shuaipeng / Liu, Wenxuan / Jia, Xuemei / Huang, Wenxin / Wang, Mengdie et al. | 2023
- 1
-
Universal Speaker Recognition Encoders for Different Speech Segments DurationNovoselov, Sergey / Volokhov, Vladimir / Lavrentyeva, Galina et al. | 2023
- 1
-
Joint Neural Representation for Multiple Light FieldsGuludec, Guillaume Le / Guillemot, Christine et al. | 2023
- 1
-
Semi-Supervised Speech Enhancement Based On Speech PurityCui, Zihao / Zhang, Shilei / Chen, Yanan / Gao, Yingying / Deng, Chao / Feng, Junlan et al. | 2023
- 1
-
Continuous Interaction with A Smart Speaker via Low-Dimensional Embeddings of Dynamic Hand PoseXu, Songpei / Kaul, Chaitanya / Ge, Xuri / Murray-Smith, Roderick et al. | 2023
- 1
-
Analyzing Acoustic Word Embeddings from Pre-Trained Self-Supervised Speech ModelsSanabria, Ramon / Tang, Hao / Goldwater, Sharon et al. | 2023
- 1
-
Scalable Weight Reparametrization for Efficient Transfer LearningKim, Byeonggeun / Lee, Jun-Tae / Yang, Seunghan / Chang, Simyung et al. | 2023
- 1
-
Efficient Large-Scale Audio Tagging Via Transformer-to-CNN Knowledge DistillationSchmid, Florian / Koutini, Khaled / Widmer, Gerhard et al. | 2023
- 1
-
Weight-Sharing Supernet for Searching Specialized Acoustic Event Classification Networks Across Device ConstraintsLin, Guan-Ting / Tang, Qingming / Kao, Chieh-Chi / Rozgic, Viktor / Wang, Chao et al. | 2023
- 1
-
Building Change Detection Using Cross-Temporal Feature Interaction NetworkFeng, Yuchao / Jiang, Jiawei / Xu, Honghui / Zheng, Jianwei et al. | 2023
- 1
-
RCDPT: Radar-Camera Fusion Dense Prediction TransformerLo, Chen-Chou / Vandewalle, Patrick et al. | 2023
- 1
-
Global HRTF Interpolation Via Learned Affine Transformation of Hyper-Conditioned FeaturesLee, Jin Woo / Lee, Sungho / Lee, Kyogu et al. | 2023
- 1
-
Wireless Power Transfer Using Chirp WaveformsRoy, Arijit / Psomas, Constantinos / Krikidis, Ioannis et al. | 2023
- 1
-
Analysing the Masked Predictive Coding Training Criterion for Pre-Training a Speech Representation ModelYadav, Hemant / Sitaram, Sunayana / Shah, Rajiv Ratn et al. | 2023
- 1
-
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation TypesRudovic, Oggi / Chang, Wonil / Garg, Vineet / Dighe, Pranay / Simha, Pramod / Berkowitz, Jack / Abdelaziz, Ahmed H. / Kajarekar, Sachin / Marchi, Erik / Adya, Saurabh et al. | 2023
- 1
-
MRML: Multimodal Rumor Detection by Deep Metric LearningPeng, Liwen / Jian, Songlei / Li, Dongsheng / Shen, Siqi et al. | 2023
- 1
-
Face Recognition on Point Cloud with cGAN-TOP for DenoisingLiu, Junyu / Ren, Jianfeng / Sun, Hongliang / Jiang, Xudong et al. | 2023
- 1
-
Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre ConditioningKovela, Sudheer / Valle, Rafael / Dantrey, Ambrish / Catanzaro, Bryan et al. | 2023
- 1
-
Inverse Reinforcement Learning with Graph Neural Networks for IoT Resource AllocationWang, Guangchen / Cheng, Peng / Chen, Zhuo / Xiang, Wei / Vucetic, Branka / Li, Yonghui et al. | 2023
- 1
-
NNSVS: A Neural Network-Based Singing Voice Synthesis ToolkitYamamoto, Ryuichi / Yoneyama, Reo / Toda, Tomoki et al. | 2023
- 1
-
Overview of the L3DAS23 Challenge on Audio-Visual Extended RealityMarinoni, Christian / Gramaccioni, Riccardo F. / Chen, Changan / Uncini, Aurelio / Comminiello, Danilo et al. | 2023
- 1
-
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)Zhang, Qinglin / Deng, Chong / Liu, Jiaqing / Yu, Hai / Chen, Qian / Wang, Wen / Yan, Zhijie / Liu, Jinglin / Ren, Yi / Zhao, Zhou et al. | 2023
- 1
-
Multilingual Alzheimer’s Dementia Recognition through Spontaneous Speech: A Signal Processing Grand ChallengeLuz, Saturnino / Haider, Fasih / Fromm, Davida / Lazarou, Ioulietta / Kompatsiaris, Ioannis / MacWhinney, Brian et al. | 2023
- 1
-
Divcon: Learning Concept Sequences for Semantically Diverse Image CaptioningZheng, Yue / Li, Ya-Li / Wang, Shengjin et al. | 2023
- 1
-
Exploiting Virtual Array Diversity for Accurate Radar DetectionGuan, Junfeng / Madani, Sohrab / Ahmed, Waleed / Hussein, Samah / Gupta, Saurabh / Hassanieh, Haitham et al. | 2023
- 1
-
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed NetworksChen, Yiyue / Hashemi, Abolfazl / Vikalo, Haris et al. | 2023
- 1
-
SAN: A Robust End-to-End ASR Model ArchitectureMin, Zeping / Ge, Qian / Huang, Guanhua et al. | 2023
- 1
-
Resource Allocation for UAV-Enabled Integrated Sensing and Communication (ISAC) via Multi-Objective OptimizationRezaei, Omid / Naghsh, Mohammad Mahdi / Karbasi, Seyed Mohammad / Nayebi, Mohammad Mahdi et al. | 2023
- 1
-
Removing Radio Frequency Interference From Auroral Kilometric Radiation With Stacked AutoencodersChang, Allen / Knapp, Mary / LaBelle, James / Swoboda, John / Volz, Ryan / Erickson, Philip J. et al. | 2023
- 1
-
Soft Label Coding for end-to-end Sound Source Localization with ad-hoc Microphone ArraysFeng, Linfeng / Gong, Yijun / Zhang, Xiao-Lei et al. | 2023
- 1
-
Study And Design Of Robust Personal Sound Zones With VAST Using Low Rank RIRsBhattacharjee, Sankha Subhra / Shi, Liming / Ping, Guoli / Shen, Xiaoxiang / Christensen, Mads Grasboll et al. | 2023
- 1
-
ROI-Based Deep Image Compression with Swin TransformersLi, Binglin / Liang, Jie / Fu, Haisheng / Han, Jingning et al. | 2023
- 1
-
Event-Based Visual MicrophoneHoward, Matthew / Hirakawa, Keigo et al. | 2023
- 1
-
Named Entity Detection and Injection for Direct Speech TranslationGaido, Marco / Tang, Yun / Kulikov, Ilia / Huang, Rongqing / Gong, Hongyu / Inaguma, Hirofumi et al. | 2023
- 1
-
Efficient Stuttering Event Detection Using Siamese NetworksMohapatra, Payal / Islam, Bashima / Islam, Md Tamzeed / Jiao, Ruochen / Zhu, Qi et al. | 2023
- 1
-
BadRes: Reveal the Backdoors Through Residual ConnectionHe, Mingrui / Chen, Tianyu / Zhou, Haoyi / Zhang, Shanghang / Li, Jianxin et al. | 2023
- 1
-
End-to-End Unsupervised Sketch to Image GenerationLv, Xingming / Wu, Lei / Cheng, Zhenwei / Meng, Xiangxu et al. | 2023
- 1
-
Trinet: Stabilizing Self-Supervised Learning From Complete or Slow CollapseCao, Lixin / Wang, Jun / Yang, Ben / Su, Dan / Yu, Dong et al. | 2023
- 1
-
ERBNet: An Effective Representation Based Network for Unbiased Scene Graph GenerationMa, Wenxi / Hou, Tianxiang / Di, Qianji / Qi, Zhongang / Shan, Ying / Wang, Hanzi et al. | 2023
- 1
-
Deformable Cross Attention for Learning Optical FlowAbdein, Rokia / Xiang, Xuezhi / Lv, Ning / Saddik, Abdulmotaleb El et al. | 2023
- 1
-
Optimal Kernel for Real-Time Arbitrary-Shaped Text DetectionMa, Haozhao / Yang, Chuang / Yuan, Yuan / Wang, Qi et al. | 2023
- 1
-
SVMV: Spatiotemporal Variance-Supervised Motion Volume for Video Frame InterpolationLuo, Yao / Pan, Jinshan / Tang, Jinhui et al. | 2023
- 1
-
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and RescoringLi, Mohan / Do, Cong-Thanh / Doddipatla, Rama et al. | 2023
- 1
-
Two-Stage Neural Network for ICASSP 2023 Speech Signal Improvement ChallengeLiu, Mingshuai / Lv, Shubo / Zhang, Zihan / Han, Runduo / Hao, Xiang / Xia, Xianjun / Chen, Li / Xiao, Yijian / Xie, Lei et al. | 2023
- 1
-
The Multimodal Information Based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization And RecognitionWang, Zhe / Wu, Shilong / Chen, Hang / He, Mao-Kui / Du, Jun / Lee, Chin-Hui / Chen, Jingdong / Watanabe, Shinji / Siniscalchi, Sabato / Scharenborg, Odette et al. | 2023
- 1
-
Implicit Vehicle Positioning with Cooperative Lidar SensingBarbieri, Luca / Tedeschini, Bernardo Camajori / Brambilla, Mattia / Nicoli, Monica et al. | 2023
- 1
-
Self-Supervised Guided Hypergraph Feature Propagation for Semi-Supervised Classification with Missing Node FeaturesLei, Chengxiang / Fu, Sichao / Wang, Yuetian / Qiu, Wenhao / Hu, Yachen / Peng, Qinmu / You, Xinge et al. | 2023
- 1
-
Differential Analysis for Networks Obeying Conservation LawsRayas, Anirudh / Anguluri, Rajasekhar / Cheng, Jiajun / Dasarathy, Gautam et al. | 2023
- 1
-
Hardware-Limited Non-Uniform Task-Based QuantizersBernardo, Neil Irwin / Zhu, Jingge / Eldar, Yonina C. / Evans, Jamie et al. | 2023
- 1
-
Adaptive Noise Canceller Algorithm with SNR-Based Stepsize and Data-Dependent AveragingSugiyama, Akihiko et al. | 2023
- 1
-
Signal Processing And Quantum State Tomography on Noisy DevicesShi, Wenbo / Malaney, Robert et al. | 2023
- 1
-
In-Sensor & Neuromorphic Computing Are all You Need for Energy Efficient Computer VisionDatta, Gourav / Liu, Zeyu / Kaiser, Md Abdullah-Al / Kundu, Souvik / Mathai, Joe / Yin, Zihan / Jacob, Ajey P. / Jaiswal, Akhilesh R. / Beerel, Peter A. et al. | 2023
- 1
-
Adversarial Contrastive Distillation with Adaptive DenoisingWang, Yuzheng / Chen, Zhaoyu / Yang, Dingkang / Liu, Yang / Liu, Siao / Zhang, Wenqiang / Qi, Lizhe et al. | 2023
- 1
-
On Designing Light-Weight Object Trackers Through Network Pruning: Use CNNs or Transformers?Aggarwal, Saksham / Gupta, Taneesh / Sahu, Pawan K. / Chavan, Arnav / Tiwari, Rishabh / Prasad, Dilip K. / Gupta, Deepak K. et al. | 2023
- 1
-
Variational Inference Aided Estimation of Time Varying ChannelsBock, Benedikt / Baur, Michael / Rizzello, Valentina / Utschick, Wolfgang et al. | 2023
- 1
-
Class-Incremental Learning on Multivariate Time Series Via Shape-Aligned Temporal DistillationQiao, Zhongzheng / Hu, Minghui / Jiang, Xudong / Suganthan, Ponnuthurai Nagaratnam / Savitha, Ramasamy et al. | 2023
- 1
-
Inv-Senet: Invariant Self Expression Network for Clustering Under Biased DataSingh, Ashutosh / Singh, Ashish / Masoomi, Aria / Imbiriba, Tales / Learned-Miller, Erik / Erdogmus, Deniz et al. | 2023
- 1
-
Fine-Grained Textual Knowledge Transfer to Improve RNN Transducers for Speech Recognition and UnderstandingSunder, Vishal / Thomas, Samuel / Kuo, Hong-Kwang J. / Kingsbury, Brian / Fosler-Lussier, Eric et al. | 2023
- 1
-
Training Neural Networks for Sequential Change-Point DetectionLee, Junghwan / Xie, Yao / Cheng, Xiuyuan et al. | 2023
- 1
-
High-Resolution Neural Network Processing of LFM Radar PulsesAkhtar, Jabran et al. | 2023
- 1
-
MLCGAN: Multi-Lead ECG Synthesis with Multi Label Conditional Generative Adversarial NetworkWu, Jian / Wang, Liping / Pan, Hailin / Wang, Binyu et al. | 2023
- 1
-
NRTSI: Non-Recurrent Time Series ImputationShan, Siyuan / Li, Yang / Oliva, Junier B. et al. | 2023
- 1
-
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASRSanabria, Ramon / Bogoychev, Nikolay / Markl, Nina / Carmantini, Andrea / Klejch, Ondrej / Bell, Peter et al. | 2023
- 1
-
Centralized Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation in a Wireless Acoustic Sensor And Actuator NetworkRuiz, Santiago / van Waterschoot, Toon / Moonen, Marc et al. | 2023
- 1
-
Intent Does Matter! Propagating High-Order Relations for Exploring Interest PreferencesZheng, Xiangping / Liang, Xun / Wu, Bo / Feng, Junlan / Guo, Yuhui / Zhang, Sensen et al. | 2023
- 1
-
Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage ApproachWu, Shih-Lun / Yang, Yi-Hsuan et al. | 2023
- 1
-
Input-Dependent Dynamical Channel Association For Knowledge DistillationTang, Qiankun / Zhang, Yuan / Xu, Xiaogang / Wang, Jun / Guo, Yimin et al. | 2023
- 1
-
Robust Adaptive Beamforming with Proximal MethodLi, Ruifu / Cabric, Danijela et al. | 2023
- 1
-
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel AudioZhang, Yang / Puvvada, Krishna C. / Lavrukhin, Vitaly / Ginsburg, Boris et al. | 2023