Efficient Feature Extraction for Non-Maximum Suppression in Visual Person Detection (English)
- New search for: Symeonidis, Charalampos
- New search for: Mademlis, Ioannis
- New search for: Pitas, Ioannis
- New search for: Nikolaidis, Nikos
- New search for: Symeonidis, Charalampos
- New search for: Mademlis, Ioannis
- New search for: Pitas, Ioannis
- New search for: Nikolaidis, Nikos
In:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
;
1-5
;
2023
-
ISBN:
-
ISSN:
- Conference paper / Electronic Resource
-
Title:Efficient Feature Extraction for Non-Maximum Suppression in Visual Person Detection
-
Contributors:Symeonidis, Charalampos ( author ) / Mademlis, Ioannis ( author ) / Pitas, Ioannis ( author ) / Nikolaidis, Nikos ( author )
-
Published in:
-
Publisher:
- New search for: IEEE
-
Publication date:2023-06-04
-
Size:1076591 byte
-
ISBN:
-
ISSN:
-
DOI:
-
Type of media:Conference paper
-
Type of material:Electronic Resource
-
Language:English
-
Source:
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
Learning ASR Pathways: A Sparse Multilingual ASR ModelYang, Mu / Tjandra, Andros / Liu, Chunxi / Zhang, David / Le, Duc / Kalinli, Ozlem et al. | 2023
- 1
-
Real-Time Target Sound ExtractionVeluri, Bandhav / Chan, Justin / Itani, Malek / Chen, Tuochao / Yoshioka, Takuya / Gollakota, Shyamnath et al. | 2023
- 1
-
Multi-Scale Receptive Field Graph Model for Emotion Recognition in ConversationsWei, Jie / Hu, Guanyu / Tuan, Luu Anh / Yang, Xinyu / Zhu, Wenjing et al. | 2023
- 1
-
Twitter Stance Detection via Neural Production SystemsZhang, Bowen / Ding, Daijun / Xu, Guangning / Guo, Jinjin / Huang, Zhichao / Huang, Xu et al. | 2023
- 1
-
Lost In Translation: Generating Adversarial Examples Robust to Round-Trip TranslationBhandari, Neel / Chen, Pin-Yu et al. | 2023
- 1
-
LDTSF: A Label-Decoupling Teacher-Student Framework for Semi-Supervised Echocardiography SegmentationZhang, Jiapeng / Wang, Yongxiong / Pan, Zhiqun / Tang, Zhenhui / Chen, Lijun / Liu, Jinlong et al. | 2023
- 1
-
SLBERT: A Novel Pre-Training Framework for Joint Speech and Language ModelingSusladkar, Onkar / Gatti, Prajwal / Kumar Yadav, Santosh et al. | 2023
- 1
-
Iterative Shallow Fusion of Backward Language Model for End-To-End Speech RecognitionOgawa, Atsunori / Moriya, Takafumi / Kamo, Naoyuki / Tawara, Naohiro / Delcroix, Marc et al. | 2023
- 1
-
Seri: Sketching-Reasoning-Integrating Progressive Workflow for Empathetic Response GenerationBi, Guanqun / Cao, Yanan / Li, Piji / Xie, Yuqiang / Fang, Fang / Lin, Zheng et al. | 2023
- 1
-
Vitasd: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial DiagnosisCao, Xu / Ye, Wenqian / Sizikova, Elena / Bai, Xue / Coffee, Megan / Zeng, Hongwu / Cao, Jianguo et al. | 2023
- 1
-
The Role of Initial Entanglement in Adaptive Gibbs State Preparation on Quantum ComputersEconomou, Sophia E. / Warren, Ada / Barnes, Edwin et al. | 2023
- 1
-
Multilevel FISTA for Image RestorationLauga, Guillaume / Riccietti, Elisa / Pustelnik, Nelly / Goncalves, Paulo et al. | 2023
- 1
-
JPEG Pleno Call for Proposals Responses Quality AssessmentPrazeres, Joao / Luo, Zhe / Pinheiro, Antonio M. G. / da Silva Cruz, Luis A. / Perry, Stuart et al. | 2023
- 1
-
Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention MechanismLi, Dichucheng / Che, Mingjin / Meng, Wenwu / Wu, Yulun / Yu, Yi / Xia, Fan / Li, Wei et al. | 2023
- 1
-
WITT: A Wireless Image Transmission Transformer for Semantic CommunicationsYang, Ke / Wang, Sixian / Dai, Jincheng / Tan, Kailin / Niu, Kai / Zhang, Ping et al. | 2023
- 1
-
Kernel Estimation and Deconvolution for Blind Image Super-ResolutionGong, Jiali / Gao, Hongfan / Chao, Jiahao / Zhou, Zhou / Yang, Zhengfeng / Zeng, Zhenbing et al. | 2023
- 1
-
Learned Video Coding with Motion Compensation Mixture ModelDinh, Khanh Quoc / Pyo Choi, Kwang et al. | 2023
- 1
-
Improving Few-Shot Learning for Talking Face System with TTS Data AugmentationChen, Qi / Ma, Ziyang / Liu, Tao / Tan, Xu / Lu, Qu / Yu, Kai / Chen, Xie et al. | 2023
- 1
-
A Synthetic Corpus Generation Method for Neural Vocoder TrainingWang, Zilin / Liu, Peng / Chen, Jun / Li, Sipan / Bai, Jinfeng / He, Gang / Wu, Zhiyong / Meng, Helen et al. | 2023
- 1
-
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource HeadphonesShashaank, N / Banar, Berker / Izadi, Mohammad Rasool / Kemmerer, Jeremy / Zhang, Shuo / Huang, Chuan-Che Jeff et al. | 2023
- 1
-
Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech RecognitionFu, Xuandi / Sathyendra, Kanthashree Mysore / Gandhe, Ankur / Liu, Jing / Strimel, Grant P. / McGowan, Ross / Mouchtaris, Athanasios et al. | 2023
- 1
-
Multi-Task Bias-Variance Trade-Off Through Functional ConstraintsCervino, Juan / Bazerque, Juan Andres / Calvo-Fullana, Miguel / Ribeiro, Alejandro et al. | 2023
- 1
-
Towards a More Stable and General Subgraph Information BottleneckLiu, Hongzhi / Zheng, Kaizhong / Yu, Shujian / Chen, Badong et al. | 2023
- 1
-
Unsupervised Domain Adaptation via Subspace Interpolating Deep Dictionary Learning: A Case Study in Machine InspectionKumar, Kriti / Majumdar, Angshul / Kumar, A Anil / Girish Chandra, M et al. | 2023
- 1
-
Adaptive Filtering Algorithms For Set-Valued Observations-Symmetric Measurement Approach To Unlabeled And Anonymized DataKrishnamurthy, Vikram et al. | 2023
- 1
-
Classification of Synthetic Facial Attributes by Means of Hybrid Classification/Localization Patch-Based AnalysisWang, Jun / Tondi, Benedetta / Barni, Mauro et al. | 2023
- 1
-
A Point is A Wave: Point-Wave Network for Place RecognitionLi, Ge / Zhang, Ruonan et al. | 2023
- 1
-
Robust and Globally Sparse Pca via Majorization-Minimization and Variable SplittingBrehier, Hugo / Breloy, Arnaud / El Korso, Mohammed Nabil / Kumar, Sandeep et al. | 2023
- 1
-
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed PrototypesXu, Xinzhou / Deng, Jun / Zhang, Zixing / Yang, Zhen / Schuller, Bjorn W. et al. | 2023
- 1
-
Multi-Task Transformer with Relation-Attention and Type-Attention for Named Entity RecognitionMo, Ying / Tang, Hongyin / Liu, Jiahao / Wang, Qifan / Xu, Zenglin / Wang, Jingang / Wu, Wei / Li, Zhoujun et al. | 2023
- 1
-
Self-Supervised Representations in Speech-Based Depression DetectionWu, Wen / Zhang, Chao / Woodland, Philip C. et al. | 2023
- 1
-
A Simple Yet Effective Approach to Structured Knowledge DistillationLin, Wenye / Li, Yangming / Liu, Lemao / Shi, Shuming / Zheng, Hai-Tao et al. | 2023
- 1
-
Leveraging Neural Koopman Operators to Learn Continuous Representations of Dynamical Systems from Scarce DataFrion, Anthony / Drumetz, Lucas / Mura, Mauro Dalla / Tochon, Guillaume / Aissa-El-Bey, Abdeldjalil et al. | 2023
- 1
-
WUDA: Unsupervised Domain Adaptation Based on Weak Source Domain LabelsLiu, Shengjie / Zhu, Chuang / Li, Yuan / Tang, Wenqi et al. | 2023
- 1
-
A Memory-Free Evolving Bipolar Neural Network for Efficient Multi-Label Stream LearningMishra, Sourav / Sundaram, Suresh et al. | 2023
- 1
-
Prototype Knowledge Distillation for Medical Segmentation with Missing ModalityWang, Shuai / Yan, Zipei / Zhang, Daoan / Wei, Haining / Li, Zhongsen / Li, Rui et al. | 2023
- 1
-
A Novel Efficient Multi-View Traffic-Related Object Detection FrameworkYang, Kun / Liu, Jing / Yang, Dingkang / Wang, Hanqi / Sun, Peng / Zhang, Yanni / Liu, Yan / Song, Liang et al. | 2023
- 1
-
Learning with Multigraph Convolutional FiltersButler, Landon / Parada-Mayorga, Alejandro / Ribeiro, Alejandro et al. | 2023
- 1
-
Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-DistillationZhang, Jing-Xuan / Wan, Genshun / Ling, Zhen-Hua / Pan, Jia / Gao, Jianqing / Liu, Cong et al. | 2023
- 1
-
Exploring Wav2vec 2.0 Fine Tuning for Improved Speech Emotion RecognitionChen, Li-Wei / Rudnicky, Alexander et al. | 2023
- 1
-
Reducing the GAP Between Streaming and Non-Streaming Transducer-Based ASR by Adaptive Two-Stage Knowledge DistillationTang, Haitao / Fu, Yu / Sun, Lei / Xue, Jiabin / Liu, Dan / Li, Yongchao / Ma, Zhiqiang / Wu, Minghui / Pan, Jia / Wan, Genshun et al. | 2023
- 1
-
Generalized Invariant Matching Property Via LassoDu, Kang / Xiang, Yu et al. | 2023
- 1
-
Efficient Feature Extraction for Non-Maximum Suppression in Visual Person DetectionSymeonidis, Charalampos / Mademlis, Ioannis / Pitas, Ioannis / Nikolaidis, Nikos et al. | 2023
- 1
-
Visual-Aware Text-to-Speech*Zhou, Mohan / Bai, Yalong / Zhang, Wei / Yao, Ting / Zhao, Tiejun / Mei, Tao et al. | 2023
- 1
-
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar SamplesRyu, Hyeonggon / Senocak, Arda / So Kweon, In / Son Chung, Joon et al. | 2023
- 1
-
Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech RecognitionChen, Xie / Ma, Ziyang / Tang, Changli / Wang, Yujin / Zheng, Zhisheng et al. | 2023
- 1
-
Do Prosody Transfer Models Transfer ProsodyƒSigurgeirsson, Atli Thor / King, Simon et al. | 2023
- 1
-
Rate Splitting and Precoding Strategies for Multi-User MIMO Broadcast Channels with Common and Private StreamsKhamidullina, Liana / de Almeida, Andre L. F. / Haardt, Martin et al. | 2023
- 1
-
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command RecognitionYang, Chao-Han Huck / Li, Bo / Zhang, Yu / Chen, Nanxin / Sainath, Tara N. / Marco Siniscalchi, Sabato / Lee, Chin-Hui et al. | 2023
- 1
-
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech RecognitionVander Eeckt, Steven / Van Hamme, Hugo et al. | 2023
- 1
-
VPPT: Visual Pre-Trained Prompt Tuning Framework for Few-Shot Image ClassificationSong, Zhao / Yang, Ke / Guan, Naiyang / Zhu, Junjie / Qiao, Peng / Hu, Qingyong et al. | 2023
- 1
-
Test Your Samples Jointly: Pseudo-Reference for Image Quality EvaluationTworski, Marcelin / Lathuiliere, Stephane et al. | 2023
- 1
-
Waveform Design to Improve the Estimation of Target Parameters Using the Fourier Transform Method in a MIMO OFDM DFRC SystemBhogavalli, Satwika / Grivel, Eric / Hari, K.V.S. / Corretja, Vincent et al. | 2023
- 1
-
Modify: Model-Driven Face Stylization Without Style ImagesDing, Yuhe / Liang, Jian / Cao, Jie / Zheng, Aihua / He, Ran et al. | 2023
- 1
-
TINYCOD: Tiny and Effective Model for Camouflaged Object DetectionXing, Haozhe / Gao, Shuyong / Tang, Hao / Mok, Tsui Qin / Kang, Yanlan / Zhang, Wenqiang et al. | 2023
- 1
-
Automatic Segmentation of Nasopharyngeal Carcinoma in CT Images Using Dual Attention and Edge DetectionWang, Qizhi / Huang, Wei / Zhang, Yuan / Li, Xuanya / Ye, Xiongjun / Hu, Kai et al. | 2023
- 1
-
Fast and Efficient Speech Enhancement with Variational AutoencodersSadeghi, Mostafa / Serizel, Romain et al. | 2023
- 1
-
Representation of Vocal Tract Length Transformation Based on Group TheoryMiyashita, Atsushi / Toda, Tomoki et al. | 2023
- 1
-
Sandformer: CNN and Transformer under Gated Fusion for Sand Dust Image RestorationShi, Jun / Wei, Bingcai / Zhou, Gang / Zhang, Liye et al. | 2023
- 1
-
Utility Polelocalization by Learning from Ambient Traces on Distributed Acoustic SensingJiang, Zhuocheng / Tian, Yue / Ding, Yangmin / Ozharar, Sarper / Wang, Ting et al. | 2023
- 1
-
Multi-User Methods for Vibrational Radar Backscatter CommunicationsCenters, Jessica / Krolik, Jeffrey et al. | 2023
- 1
-
Target Sound Extraction with Variable Cross-Modality CluesLi, Chenda / Qian, Yao / Chen, Zhuo / Wang, Dongmei / Yoshioka, Takuya / Liu, Shujie / Qian, Yanmin / Zeng, Michael et al. | 2023
- 1
-
Model-Free Learning of Optimal Beamformers for Passive IRS-Assisted Sumrate MaximizationHashmi, Hassaan / Pougkakiotis, Spyridon / Kalogerias, Dionysios S. et al. | 2023
- 1
-
Strategies for Enhanced Signal Modulation Classifications Under Unknown Symbol Rates and Noise ConditionsWang, Ruixuan / Qi, Yue / Vaezi, Mojtaba / Jiao, Xun / Amin, Moeness et al. | 2023
- 1
-
Target Velocity Estimation for Quantization-Based Cooperative MIMO Radar and Communications SystemWang, Zhen / Yan, Xuedan / He, Qian / Blum, Rick S. et al. | 2023
- 1
-
Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker AudioThienpondt, Jenthe / Madhu, Nilesh / Demuynck, Kris et al. | 2023
- 1
-
Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure PriorsZhang, Yaqi / Lu, Yan / Liu, Bin / Zhao, Zhiwei / Chu, Qi / Yu, Nenghai et al. | 2023
- 1
-
Subspace-Based Detector For Distributed Mmwave Mimo Radar SensorsAhmadi, Moein / Alaee-Kerahroodi, Mohammad / M. R., Bhavani Shankar / Ottersten, Bjorn et al. | 2023
- 1
-
A Unitary Transform Based Generalized Approximate Message PassingZhu, Jiang / Meng, Xiangming / Lei, Xupeng / Guo, Qinghua et al. | 2023
- 1
-
Adaptive Data Augmentation for Contrastive LearningZhang, Yuhan / Zhu, He / Yu, Shan et al. | 2023
- 1
-
E2E Segmentation in a Two-Pass Cascaded Encoder ASR ModelHuang, W. Ronny / Chang, Shuo-Yiin / Sainath, Tara N. / He, Yanzhang / Rybach, David / David, Robert / Prabhavalkar, Rohit / Allauzen, Cyril / Peyser, Cal / Strohman, Trevor D. et al. | 2023
- 1
-
Binary Sequence Set Optimization for CDMA Applications via Mixed-Integer Quadratic ProgrammingYang, Alan / Mina, Tara / Gao, Grace et al. | 2023
- 1
-
Blind Polynomial RegressionNatali, Alberto / Leus, Geert et al. | 2023
- 1
-
ERSAM: Neural Architecture Search for Energy-Efficient and Real-Time Social Ambiance MeasurementLi, Chaojian / Chen, Wenwan / Yuan, Jiayi / Lin, Yingyan Celine / Sabharwal, Ashutosh et al. | 2023
- 1
-
Statistical Analysis of Speech Disorder Specific Features to Characterise Dysarthria Severity LevelJoshy, Amlu Anna / Parameswaran, P. N. / Nair, Siddharth R. / Rajan, Rajeev et al. | 2023
- 1
-
Generalized Relative Harmonic CoefficientsHu, Yonggang / Gannot, Sharon / Abhayapala, Thushara D. et al. | 2023
- 1
-
Perceptual–Neural–Physical Sound MatchingHan, Han / Lostanlen, Vincent / Lagrange, Mathieu et al. | 2023
- 1
-
Improved Training Of Mixture-Of-Experts Language GANsChai, Yekun / Yin, Qiyue / Zhang, Junge et al. | 2023
- 1
-
Spatial-Domain Object Detection Under Mimo-Fmcw Automotive Radar InterferenceJin, Sian / Wang, Pu / Boufounos, Petros / Takahashi, Ryuhei / Roy, Sumit et al. | 2023
- 1
-
I See What You Hear: A Vision-Inspired Method to Localize WordsSamragh, Mohammad / Kundu, Arnav / Hu, Ting-Yao / Chadha, Aman / Srivastava, Ashish / Cho, Minsik / Tuzel, Oncel / Naik, Devang et al. | 2023
- 1
-
Lightweight Fisher Vector Transfer Learning for Video DeduplicationHenry, Chris / Liao, Rijun / Lin, Ruiyuan / Zhang, Zhebin / Sun, Hongyu / Li, Zhu et al. | 2023
- 1
-
Difference Coarrays of Rational ArraysKulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- 1
-
SIGVIC: Spatial Importance Guided Variable-Rate Image CompressionLiang, Jiaming / Liu, Meiqin / Yao, Chao / Lin, Chunyu / Zhao, Yao et al. | 2023
- 1
-
UCONV-Conformer: High Reduction of Input Sequence Length for End-to-End Speech RecognitionAndrusenko, Andrei / Nasretdinov, Rauf / Romanenko, Aleksei et al. | 2023
- 1
-
Unsupervised Noise Adaptation Using Data SimulationChen, Chen / Hu, Yuchen / Zou, Heqing / Sun, Linhui / Chng, Eng Siong et al. | 2023
- 1
-
Logo-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression RecognitionMa, Fuyan / Sun, Bin / Li, Shutao et al. | 2023
- 1
-
Adaptive Time-Scale Modification for Improving Speech Intelligibility Based On Phoneme Clustering For Streaming ServicesJang, Sohee / Kim, Jiye / Kim, Yeon-Ju / Chang, Joon-Hyuk et al. | 2023
- 1
-
Learning to Reconnect Interrupted Trajectories for Weakly Supervised Multi-Object TrackingLi, Yu-Lei / Lu, Yang / Li, Jie / Wang, Hanzi et al. | 2023
- 1
-
Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASRBotros, Rami / Prabhavalkar, Rohit / Schalkwyk, Johan / Chelba, Ciprian / Sainath, Tara N. / Beaufays, Francoise et al. | 2023
- 1
-
Deepspace: Dynamic Spatial and Source CUE Based Source Separation for Dialog EnhancementMaster, Aaron / Lu, Lie / Samuelsson, Jonas / Lehtonen, Heidi-Maria / Norcross, Scott / Swedlow, Nathan / Howard, Audrey et al. | 2023
- 1
-
Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution DetectionChen, Xiongjie / Li, Yunpeng / Yang, Yongxin et al. | 2023
- 1
-
Cross-Lingual Alzheimer’s Disease Detection Based on Paralinguistic and Pre-Trained FeaturesChen, Xuchu / Pu, Yu / Li, Jinpeng / Zhang, Wei-Qiang et al. | 2023
- 1
-
Multi-Carrier Wideband OCDM-Based THZ Automotive RadarBhattacharjee, Sangeeta / Mishra, Kumar Vijay / Annavajjala, Ramesh / Murthy, Chandra R. et al. | 2023
- 1
-
Low Precision Representations for High Dimensional ModelsSaha, Rajarshi / Pilanci, Mert / Goldsmith, Andrea J. et al. | 2023
- 1
-
Hypernetwork-Based Adaptive Image RestorationAharon, Shai / Ben-Artzi, Gil et al. | 2023
- 1
-
Your Camera Improves Your Point Cloud CompressionLin, Yuhuan / Xu, Tongda / Zhu, Ziyu / Li, Yanghao / Wang, Zhe / Wang, Yan et al. | 2023
- 1
-
Pseudo-Query Generation For Semi-Supervised Visual Grounding With Knowledge DistillationJin, Jianglin / Ye, Jiabo / Lin, Xin / He, Liang et al. | 2023
- 1
-
2DSBG: A 2d Semi Bi-Gaussian Filter Adapted for Adjacent and Multi-Scale Line Feature DetectionMagnier, Baptiste / Shokouh, Ghulam Sakhi / Berthier, Louis / Pie, Marcel / Ruggiero, Adrien et al. | 2023
- 1
-
Estimation of High-Dimensional Differential Graphs from Multi-Attribute DataTugnait, Jitendra K. et al. | 2023
- 1
-
Joint Unsupervised and Supervised Learning for Context-Aware Language IdentificationPark, Jinseok / Kim, Hyung Yong / Park, Jihwan / Kim, Byeong-Yeol / Choi, Shukjae / Lim, Yunkyu et al. | 2023
- 1
-
Improving Transformer-Based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention HeadsJeoung, Ye-Rin / Yang, Joon-Young / Choi, Jeong-Hwan / Chang, Joon-Hyuk et al. | 2023
- 1
-
On the Value of Stochastic Side Information in Online LearningJia, Junzhang / Wu, Xuetong / Evans, Jamie / Zhu, Jingge et al. | 2023
- 1
-
Learning Task-Aligned Mask Query for Instance SegmentationFu, Bin / He, Hongliang / Wei, Pengxu / Chen, Jie et al. | 2023
- 1
-
On The Primal and Dual Formulations Of The Discrete Mumford-Shah FunctionalPustelnik, Nelly et al. | 2023
- 1
-
Robust Angle Estimation for Hybrid mmWave SystemsLin, Yuan-Pei / Yang, Ting-Ming et al. | 2023
- 1
-
On The Fairness of Multitask Representation LearningLi, Yingcong / Oymak, Samet et al. | 2023
- 1
-
VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature DistillationLiu, Yuhao / Gong, Cheng / Wang, Longbiao / Wu, Xixin / Liu, Qiuyu / Dang, Jianwu et al. | 2023
- 1
-
Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource LanguageSoky, Kak / Li, Sheng / Chu, Chenhui / Kawahara, Tatsuya et al. | 2023
- 1
-
Pop2Piano : Pop Audio-Based Piano Cover GenerationChoi, Jongho / Lee, Kyogu et al. | 2023
- 1
-
Multi-Lingual Pronunciation Assessment with Unified Phoneme Set and Language-Specific EmbeddingsLin, Binghuai / Wang, Liyuan et al. | 2023
- 1
-
Interpolation Filter Model For Ramanujan Subspace SignalsKulkarni, Pranav / Vaidyanathan, P. P. et al. | 2023
- 1
-
Online Binaural Speech Separation Of Moving Speakers With A Wavesplit NetworkHan, Cong / Mesgarani, Nima et al. | 2023
- 1
-
A Hybrid Deep Neural Network for Nonlinear Causality Analysis in Complex Industrial Control SystemFeng, Tian / Chen, Qiming / Shi, Yao / Lang, Xun / Xie, Lei / Su, Hongye et al. | 2023
- 1
-
Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal ProcessingWebber, Jacob J / Valentini-Botinhao, Cassia / Williams, Evelyn / Henter, Gustav Eje / King, Simon et al. | 2023
- 1
-
Self-Sufficient Framework for Continuous Sign Language RecognitionJang, Youngjoon / Oh, Youngtaek / Cho, Jae Won / Kim, Myungchul / Kim, Dong-Jin / Kweon, In So / Son Chung, Joon et al. | 2023
- 1
-
Signal Processing On Product SpacesRoddenberry, T. Mitchell / Grande, Vincent P. / Frantzen, Florian / Schaub, Michael T. / Segarra, Santiago et al. | 2023
- 1
-
On the Effectiveness of Monoaural Target Source Extraction for Distant end-to-end Automatic Speech RecognitionZorila, Catalin / Doddipatla, Rama et al. | 2023
- 1
-
MAID: A Conditional Diffusion Model for Long Music Audio InpaintingLiu, Kaiyang / Gan, Wendong / Yuan, Chenchen et al. | 2023
- 1
-
Semi-Federated Learning for Edge Intelligence with Imperfect SICNi, Wanli / Zheng, Jingheng / Eldar, Yonina C. / You, Changsheng / Huang, Kaibin et al. | 2023
- 1
-
Dual Collaborative Visual-Semantic Mapping for Multi-Label Zero-Shot Image RecognitionHu, Yunqing / Jin, Xuan / Chen, Xi / Zhang, Yin et al. | 2023
- 1
-
Topological Slepians: Maximally Localized Representations of Signals Over Simplicial ComplexesBattiloro, Claudio / Di Lorenzo, Paolo / Barbarossa, Sergio et al. | 2023
- 1
-
Efficient Feature Fusion for Learning-Based Photometric StereoJu, Yakun / Lam, Kin-Man / Xiao, Jun / Zhang, Cong / Yang, Cuixin / Dong, Junyu et al. | 2023
- 1
-
Improving Scheduled Sampling for Neural Transducer-Based ASRMoriya, Takafumi / Ashihara, Takanori / Sato, Hiroshi / Matsuura, Kohei / Tanaka, Tomohiro / Masumura, Ryo et al. | 2023
- 1
-
Unobtrusive Respiratory Monitoring System for Intensive CareTan, Xudong / Hu, Menghan / Zhai, Guangtao / Zhu, Yan / Li, Wenfang / Zhang, XiaoPing et al. | 2023
- 1
-
Integrating the Sensing and Radio Communications Channel Modelling From Radar Mutual InterferenceCardona, Narcis / Romero, J. Samuel / Yang, Wenfei / Li, Jian et al. | 2023
- 1
-
TDMA-Based Multi-User Binary Computation Offloading in the Finite-Block-Length RegimeManouchehrpour, M. Amin / Lehal, Harvinder / Salmani, Mahsa / Davidson, Timothy N. et al. | 2023
- 1
-
Multispectral Image Fusion based on Super Pixel SegmentationOfir, Nati et al. | 2023
- 1
-
Optimal Transport with a Diversified Memory Bank for Cross-Domain Speaker VerificationZhang, Ruiteng / Wei, Jianguo / Lu, Xugang / Lu, Wenhuan / Jin, Di / Zhang, Lin / Xu, Junhai et al. | 2023
- 1
-
Fast Low-Latency Convolution by Low-Rank Tensor ApproximationJalmby, Martin / Elvander, Filip / van Waterschoot, Toon et al. | 2023
- 1
-
A Controllable Lifestyle Simulator for Use in Deep Reinforcement Learning AlgorithmsBraz, Libio Goncalves / Susaiyah, Allmin et al. | 2023
- 1
-
BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence EncoderDoan, Thien-Phuc / Nguyen-Vu, Long / Jung, Souhwan / Hong, Kihun et al. | 2023
- 1
-
Study of Manifold Geometry Using Multiscale Non-Negative Kernel GraphsHurtado, Carlos / Shekkizhar, Sarath / Ruiz-Hidalgo, Javier / Ortega, Antonio et al. | 2023
- 1
-
Learning Silhouettes with Group Sparse AutoencodersTheodosis, Emmanouil / Ba, Demba et al. | 2023
- 1
-
ScaleMix: Intra- And Inter-Layer Multiscale Feature Combination for Change DetectionHuang, Rui / Zhao, Qingyi / Wang, Ruofei / Liu, Caihua / Gao, Sihua / Zhang, Yuxiang / Fan, Wei et al. | 2023
- 1
-
Is Multi-Task Learning an Upper Bound for Continual Learning?Wu, Zihao / Tran, Huy / Pirsiavash, Hamed / Kolouri, Soheil et al. | 2023
- 1
-
Local Graph-Homomorphic Processing for Privatized Distributed SystemsRizk, Elsa / Vlaski, Stefan / Sayed, Ali H. et al. | 2023
- 1
-
MASKED-AP: Attention Pyramid Convolutional Neural Network with Mask for Cervical Cell ClassificationJin, Yu / Liu, Juan / Chen, Hua / Duan, Wensi / Cao, Dehua / Pang, Baochuan et al. | 2023
- 1
-
Pondering About Task Spatial Misalignment: Classification-Localization Equilibrated Object DetectionZhang, Yudong / Lu, Wei / Wang, Xu / Wang, Pengkun / Wang, Yang et al. | 2023
- 1
-
Multiple Access Computation Offloading for the K-User CaseLiu, Xiaomeng / Schaible, Christian / Davidson, Timothy N. et al. | 2023
- 1
-
Movienet-PS: A Large-Scale Person Search Dataset in the WildQin, Jie / Zheng, Peng / Yan, Yichao / Quan, Rong / Cheng, Xiaogang / Ni, Bingbing et al. | 2023
- 1
-
Spatial Similarity Guidance for Few-Shot SegmentationLuo, Xiaoliu / Duan, Zhao / Zhang, Taiping et al. | 2023
- 1
-
Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNNYu, Jianwei / Luo, Yi et al. | 2023
- 1
-
Code-Switching Speech Synthesis Based on Self-Supervised Learning and Domain Adaptive Speaker EncoderLin, Yi-Xing / Pai, Cheng-Hsun / Le, Phuong Thi / Prihasto, Bima / Huang, Chien-Ling / Wang, Jia Ching et al. | 2023
- 1
-
Mixed Sample Augmentation for Online DistillationShen, Yiqing / Xu, Liwu / Yang, Yuzhe / Li, Yaqian / Guo, Yandong et al. | 2023
- 1
-
Meeting Action Item Detection with Regularized Context ModelingLiu, Jiaqing / Deng, Chong / Zhang, Qinglin / Chen, Qian / Wang, Wen et al. | 2023
- 1
-
CLMAE: A Liter and Faster Masked AutoencodersSong, Yiran / Ma, Lizhuang et al. | 2023
- 1
-
Graph Signal Processing for Narrowband Direction of Arrival EstimationLi, Disheng / Liu, Wei / Zakharov, Yuriy / Mitchell, Paul D et al. | 2023
- 1
-
Privacy-Preserving Automatic Speaker DiarizationTeixeira, Francisco / Abad, Alberto / Raj, Bhiksha / Trancoso, Isabel et al. | 2023
- 1
-
An End-to-End Neural Network for Image-to-Audio TransformationChen, Liu / Deisher, Michael / Georges, Munir et al. | 2023
- 1
-
Joint Multi-Level Feature Network for Lightweight Person Re-IdentificationZhang, Yunzuo / Kang, Weili / Liu, Yameng / Zhu, Pengfei et al. | 2023
- 1
-
Learning Cross-Modal Audiovisual Representations with Ladder Networks for Emotion RecognitionGoncalves, Lucas / Busso, Carlos et al. | 2023
- 1
-
Quantized Precoding and RIS-Assisted Modulation for Integrated Sensing and Communications SystemsPrasobh Sankar, R. S. / Prabhakar Chepuri, Sundeep et al. | 2023
- 1
-
Towards Adversarially Robust Continual LearningBai, Tao / Chen, Chen / Lyu, Lingjuan / Zhao, Jun / Wen, Bihan et al. | 2023
- 1
-
Ultimate Negative Sampling for Contrastive LearningGuo, Huijie / Shi, Lei et al. | 2023
- 1
-
A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech TranslationHuang, Wen-Chin / Peloquin, Benjamin / Kao, Justine / Wang, Changhan / Gong, Hongyu / Salesky, Elizabeth / Adi, Yossi / Lee, Ann / Chen, Peng-Jen et al. | 2023
- 1
-
T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5Hsu, Chan-Jan / Chung, Ho-Lam / Lee, Hung-Yi / Tsao, Yu et al. | 2023
- 1
-
CD-FSOD: A Benchmark For Cross-Domain Few-Shot Object DetectionXiong, Wuti et al. | 2023
- 1
-
Elliptical Wishart Distribution: Maximum Likelihood Estimator from Information GeometryAyadi, Imen / Bouchard, Florent / Pascal, Frederic et al. | 2023
- 1
-
Distributed Bayesian Tracking on the Special Euclidean Group Using Lie Algebra Parametric ApproximationsBordin, Claudio J. / de Figueredo, Caio G. / Bruno, Marcelo G. S. et al. | 2023
- 1
-
Asynchronous Social LearningCemri, Mert / Bordignon, Virginia / Kayaalp, Mert / Shumovskaia, Valentina / Sayed, Ali H. et al. | 2023
- 1
-
Cramér-Rao Bound on Lie Groups with Observations on Lie Groups: Application to SE(2)Labsir, Samy / Renaux, Alexandre / Vila-Valls, Jordi / Chaumette, Eric et al. | 2023
- 1
-
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech EnhancementZhao, Shengkui / Ma, Bin et al. | 2023
- 1
-
Extended Kalman Filter for Graph Signals in Nonlinear Dynamic SystemsSagi, Guy / Shlezinger, Nir / Routtenberg, Tirza et al. | 2023
- 1
-
Perspective Projection-Based 3d CT Reconstruction from Biplanar X-RaysKyung, Daeun / Jo, Kyungmin / Choo, Jaegul / Lee, Joonseok / Choi, Edward et al. | 2023
- 1
-
Tg-Critic: A Timbre-Guided Model For Reference-Independent Singing EvaluationSun, Xiaoheng / Gao, Yuejie / Lin, Hanyao / Liu, Huaping et al. | 2023
- 1
-
Exploration of Language Dependency for Japanese Self-Supervised Speech Representation ModelsAshihara, Takanori / Moriya, Takafumi / Matsuura, Kohei / Tanaka, Tomohiro et al. | 2023
- 1
-
Frequency Bin-Wise Single Channel Speech Presence Probability Estimation Using Multiple DNNSTao, Shuai / Reddy, Himavanth / Jensen, Jesper Rindom / Christensen, Mads Grasboll et al. | 2023
- 1
-
Structural Optimization of Factor Graphs for Symbol Detection via Continuous Clustering and Machine LearningRapp, Lukas / Schmid, Luca / Rode, Andrej / Schmalen, Laurent et al. | 2023
- 1
-
Selective Film Conditioning with CTC-Based ASR Probability for Speech EnhancementYang, Da-Hee / Chang, Joon-Hyuk et al. | 2023
- 1
-
Egocentric Action Anticipation for Personal HealthRodin, Ivan / Furnari, Antonino / Mavroeidis, Dimitrios / Farinella, Giovanni Maria et al. | 2023
- 1
-
Enhanced Low-Resolution LiDAR-Camera Calibration via Depth Interpolation and Supervised Contrastive LearningZhang, Zhikang / Yu, Zifan / You, Suya / Rao, Raghuveer / Agarwal, Sanjeev / Ren, Fengbo et al. | 2023
- 1
-
SCSGNet: Spatial-Correlated and Shape-Guided Network for Breast Mass SegmentationLi, Qingqiu / Xu, Jilan / Yuan, Runtian / Zhang, Yuejie / Feng, Rui et al. | 2023
- 1
-
A Progressive Neural Network for Acoustic Echo CancellationChen, Zhuangqi / Xia, Xianjun / Sun, Siyu / Wang, Ziqian / Chen, Cheng / Xie, Guoliang / Zhang, Pingjian / Xiao, Yijian et al. | 2023
- 1
-
Ensemble Knowledge Distillation of Self-Supervised Speech ModelsHuang, Kuan -Po / Feng, Tzu-Hsun / Fu, Yu-Kuan / Hsu, Tsu-Yuan / Yen, Po-Chieh / Tseng, Wei-Cheng / Chang, Kai-Wei / Lee, Hung-Yi et al. | 2023
- 1
-
On Crowdsourcing-Design with Comparison Category Rating for Evaluating Speech Enhancement AlgorithmsSuarez, Angelica S. Z. / Laroche, Clement / Clemmensen, Line H. / Das, Sneha et al. | 2023
- 1
-
Rate-Distortion Optimization with Alternative References for UGC Video CompressionXiong, Xin / Pavez, Eduardo / Ortega, Antonio / Adsumilli, Balu et al. | 2023
- 1
-
Audiodec: An Open-Source Streaming High-Fidelity Neural Audio CodecWu, Yi-Chiao / Gebru, Israel D. / Markovic, Dejan / Richard, Alexander et al. | 2023
- 1
-
Image Reconstruction without Explicit PriorsGao, Angela F. / Leong, Oscar / Sun, He / Bouman, Katherine L. et al. | 2023
- 1
-
Classification via Subspace Learning Machine (SLM): Methodology and Performance EvaluationFu, Hongyu / Yang, Yijing / Mishra, Vinod K. / Jay Kuo, C.-C. et al. | 2023
- 1
-
A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech EnhancementXu, Haitao / Wei, Liangfa / Zhang, Jie / Yang, Jianming / Wang, Yannan / Gao, Tian / Fang, Xin / Dai, Lirong et al. | 2023
- 1
-
Multi-Scale Compositional Constraints for Representation Learning on VideosParaskevopoulos, Georgios / Lavania, Chandrashekhar / Chum, Lovish / Sundaram, Shiva et al. | 2023
- 1
-
Enhanced GM-PHD Filter for Real Time Satellite Multi-Target TrackingAguilar, Camilo / Ortner, Mathias / Zerubia, Josiane et al. | 2023
- 1
-
De’hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech RecognitionNg, Dianwen / Zhang, Ruixi / Yip, Jia Qi / Yang, Zhao / Ni, Jinjie / Zhang, Chong / Ma, Yukun / Ni, Chongjia / Chng, Eng Siong / Ma, Bin et al. | 2023
- 1
-
Weakly- and Semi-Supervised Object LocalizationHuang, Zhen-Tang / Chen, Yan-He / Yeh, Mei-Chen et al. | 2023
- 1
-
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in TorchaudioKumar, Anurag / Tan, Ke / Ni, Zhaoheng / Manocha, Pranay / Zhang, Xiaohui / Henderson, Ethan / Xu, Buye et al. | 2023
- 1
-
Coarse-to-Fine Covid-19 Segmentation via Vision-Language AlignmentShan, Dandan / Li, Zihan / Chen, Wentao / Li, Qingde / Tian, Jie / Hong, Qingqi et al. | 2023
- 1
-
EMC2-Net: Joint Equalization and Modulation Classification Based on Constellation NetworkRyu, Hyun / Choi, Junil et al. | 2023
- 1
-
Ripple Sparse Self-Attention for Monaural Speech EnhancementZhang, Qiquan / Zhu, Hongxu / Song, Qi / Qian, Xinyuan / Ni, Zhaoheng / Li, Haizhou et al. | 2023
- 1
-
A Physically Explainable Framework for Human-Related Anomaly DetectionJiang, Yalong / Li, Huining / Li, Changkang et al. | 2023
- 1
-
Noncoherent Multiuser Grassmannian Constellations for the Mimo Multiple Access ChannelAlvarez-Vizoso, Javier / Cuevas, Diego / Beltran, Carlos / Santamaria, Ignacio / Tucek, Vit / Peters, Gunnar et al. | 2023
- 1
-
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification SystemsCai, Danwei / Cai, Zexin / Li, Ming et al. | 2023
- 1
-
A Compensated Shrinkage Affine Projection Algorithm for Debiased Sparse Adaptive FilteringZhang, Yi / Yamada, Isao et al. | 2023
- 1
-
Cross-Domain Object Classification Via Successive Subspace AlignmentChen, Kecheng / Li, Haoliang / Yan, Hong et al. | 2023
- 1
-
Textless Direct Speech-to-Speech Translation with Discrete Speech RepresentationLi, Xinjian / Jia, Ye / Chiu, Chung-Cheng et al. | 2023
- 1
-
Speaker-Independent Acoustic-to-Articulatory Speech InversionWu, Peter / Chen, Li-Wei / Cho, Cheol Jun / Watanabe, Shinji / Goldstein, Louis / Black, Alan W / Anumanchipalli, Gopala K. et al. | 2023
- 1
-
Single-Photon Image Super-Resolution via Self-Supervised LearningChen, Yiwei / Jiang, Chen / Pan, Yu et al. | 2023
- 1
-
TSPTQ-ViT: Two-Scaled Post-Training Quantization for Vision TransformerTai, Yu-Shan / Lin, Ming-Guang / Wu, An-Yeu Andy et al. | 2023
- 1
-
Sparse Error Correction for Power Network ParametersSenaratne, Dilan / Kim, Jinsub et al. | 2023
- 1
-
An Evaluation Platform to Scope Performance of Synthetic Environments in Autonomous Ground Vehicles SimulationBai, Xiangyu / Jiang, Le / Luo, Yedi / Gupta, Aniket / Kaveti, Pushyami / Singh, Hanumant / Ostadabbas, Sarah et al. | 2023
- 1
-
Quaternion Orthogonal Transformer for Facial Expression Recognition in the WildZhou, Yu / Guo, Liyuan / Jin, Lianghai et al. | 2023
- 1
-
HQP-MVS:High-Quality Plane Priors Assisted Multi-View Stereo for Low-Textured AreasTian, Zefan / Wang, Rongjie / Wang, Zhenyu / Wang, Ronggang et al. | 2023
- 1
-
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning AnalysisSong, Meishu / Triantafyllopoulos, Andreas / Yang, Zijiang / Takeuchi, Hiroki / Nakamura, Toru / Kishi, Akifumi / Ishizawa, Tetsuro / Yoshiuchi, Kazuhiro / Jing, Xin / Karas, Vincent et al. | 2023
- 1
-
ICCRN: Inplace Cepstral Convolutional Recurrent Neural Network for Monaural Speech EnhancementLiu, Jinjiang / Zhang, Xueliang et al. | 2023
- 1
-
CROSSSPEECH: Speaker-Independent Acoustic Representation for Cross-Lingual Speech SynthesisKim, Ji-Hoon / Yang, Hong-Sun / Ju, Yoon-Cheol / Kim, Il-Hwan / Kim, Byeong-Yeol et al. | 2023
- 1
-
Ensemble Prosody Prediction For Expressive Speech SynthesisTeh, Tian Huey / Hu, Vivian / Ram Mohan, Devang S / Hodari, Zack / Wallis, Christopher G. R. / Gomez Ibarrondo, Tomas / Torresquintero, Alexandra / Leoni, James / Gales, Mark / King, Simon et al. | 2023
- 1
-
Progressive Meta-Pooling Learning for Lightweight Image Classification ModelDong, Peijie / Niu, Xin / Tian, Zhiliang / Li, Lujun / Wang, Xiaodong / Wei, Zimian / Pan, Hengyue / Li, Dongsheng et al. | 2023
- 1
-
Euro: Espnet Unsupervised ASR Open-Source ToolkitGao, Dongji / Shi, Jiatong / Chuang, Shun-Po / Garcia, Leibny Paola / Lee, Hung-Yi / Watanabe, Shinji / Khudanpur, Sanjeev et al. | 2023
- 1
-
Learning Generalizable Light Field Networks from Few ImagesLi, Qian / Multon, Franck / Boukhayma, Adnane et al. | 2023
- 1
-
Cross-Domain Diffusion Based Speech Enhancement for Very Noisy SpeechWang, Heming / Wang, DeLiang et al. | 2023
- 1
-
A Few Shot Learning of Singing Technique Conversion Based on Cycle Consistency Generative Adversarial NetworksChen, Po-Wei / Soo, Von-Wun et al. | 2023
- 1
-
Compressed Distributed Regression over Adaptive NetworksCarpentiero, Marco / Matta, Vincenzo / Sayed, Ali H. et al. | 2023
- 1
-
An Approach to Ontological Learning from Weak LabelsShah, Ankit / Tang, Larry / Chou, Po Hao / Zheng, Yi Yu / Ge, Ziqian / Raj, Bhiksha et al. | 2023
- 1
-
Sequential Datum–Wise Joint Feature Selection and Classification in the Presence of External ClassifierEkanayake, Sachini Piyoni / Zois, DaphneynStavroula / Chelmis, Charalampos et al. | 2023
- 1
-
Learning From Label Proportion with Online Pseudo-Label Decision by Regret MinimizationMatsuo, Shinnosuke / Bise, Ryoma / Uchida, Seiichi / Suehiro, Daiki et al. | 2023
- 1
-
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech SeparationLi, Chenda / Wu, Yifei / Qian, Yanmin et al. | 2023
- 1
-
Fine-Grained Emotional Control of Text-to-Speech: Learning to Rank Inter- and Intra-Class Emotion IntensitiesWang, Shijun / Guenason, Jon / Borth, Damian et al. | 2023
- 1
-
Role of Bias Terms in Dot-Product AttentionNamazifar, Mahdi / Hazarika, Devamanyu / Hakkani-Tur, Dilek et al. | 2023
- 1
-
Learning Interpretable Filters In Wav-UNet For Speech EnhancementMathieu, Felix / Courtat, Thomas / Richard, Gael / Peeters, Geoffroy et al. | 2023
- 1
-
Cochlear Decomposition: A Novel Bio-Inspired Multiscale Analysis FrameworkAlfalahi, Hessa / Khandoker, Ahsan / Alhussein, Ghada / Hadjileontiadis, Leontios et al. | 2023
- 1
-
Contrastive Learning of Sentence Embeddings in Product SearchZhang, Bo-Wen / Yan, Yan / Yu, Jiapei et al. | 2023
- 1
-
Leveraging Sparsity with Spiking Recurrent Neural Networks for Energy-Efficient Keyword SpottingDampfhoffer, Manon / Mesquida, Thomas / Hardy, Emmanuel / Valentian, Alexandre / Anghel, Lorena et al. | 2023
- 1
-
A Quantum Approach for Stochastic Constrained Binary OptimizationGupta, Sarthak / Kekatos, Vassilis et al. | 2023
- 1
-
Joint Antenna Selection and Beamforming in Integrated Automotive Radar Sensing-Communications with Quantized Double Phase ShiftersXu, Lifan / Sun, Shunqiao / Zhang, Yimin D. / Petropulu, Athina et al. | 2023
- 1
-
MODEFORMER: Modality-Preserving Embedding For Audio-Video Synchronization Using TransformersGupta, Akash / Tripathi, Rohun / Jang, Wondong et al. | 2023
- 1
-
Semi-Supervised Learning with Per-Class Adaptive Confidence Scores for Acoustic Environment Classification with Imbalanced DataFiorio, Luan Vinicius / Karanov, Boris / David, Johan / Houtum, Wim van / Widdershoven, Frans / Aarts, Ronald M. et al. | 2023
- 1
-
Database-Aware ASR Error Correction for Speech-to-SQL ParsingShao, Yutong / Kumar, Arun / Nakashole, Ndapa et al. | 2023
- 1
-
Convolutional Filtering on Sampled ManifoldsWang, Zhiyang / Ruiz, Luana / Ribeiro, Alejandro et al. | 2023
- 1
-
A Database for Multi-Modal Short Video Quality AssessmentZhang, Yukun / Wang, Chuan / Zhang, Sanyi / Cao, Xiaochun et al. | 2023
- 1
-
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)Zhang, Qinglin / Deng, Chong / Liu, Jiaqing / Yu, Hai / Chen, Qian / Wang, Wen / Yan, Zhijie / Liu, Jinglin / Ren, Yi / Zhao, Zhou et al. | 2023
- 1
-
Multilingual Alzheimer’s Dementia Recognition through Spontaneous Speech: A Signal Processing Grand ChallengeLuz, Saturnino / Haider, Fasih / Fromm, Davida / Lazarou, Ioulietta / Kompatsiaris, Ioannis / MacWhinney, Brian et al. | 2023
- 1
-
Divcon: Learning Concept Sequences for Semantically Diverse Image CaptioningZheng, Yue / Li, Ya-Li / Wang, Shengjin et al. | 2023
- 1
-
Exploiting Virtual Array Diversity for Accurate Radar DetectionGuan, Junfeng / Madani, Sohrab / Ahmed, Waleed / Hussein, Samah / Gupta, Saurabh / Hassanieh, Haitham et al. | 2023
- 1
-
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed NetworksChen, Yiyue / Hashemi, Abolfazl / Vikalo, Haris et al. | 2023
- 1
-
SAN: A Robust End-to-End ASR Model ArchitectureMin, Zeping / Ge, Qian / Huang, Guanhua et al. | 2023
- 1
-
Resource Allocation for UAV-Enabled Integrated Sensing and Communication (ISAC) via Multi-Objective OptimizationRezaei, Omid / Naghsh, Mohammad Mahdi / Karbasi, Seyed Mohammad / Nayebi, Mohammad Mahdi et al. | 2023
- 1
-
Removing Radio Frequency Interference From Auroral Kilometric Radiation With Stacked AutoencodersChang, Allen / Knapp, Mary / LaBelle, James / Swoboda, John / Volz, Ryan / Erickson, Philip J. et al. | 2023
- 1
-
Soft Label Coding for end-to-end Sound Source Localization with ad-hoc Microphone ArraysFeng, Linfeng / Gong, Yijun / Zhang, Xiao-Lei et al. | 2023
- 1
-
Study And Design Of Robust Personal Sound Zones With Vast Using Low Rank RirsBhattacharjee, Sankha Subhra / Shi, Liming / Ping, Guoli / Shen, Xiaoxiang / Christensen, Mads Grasboll et al. | 2023
- 1
-
ROI-Based Deep Image Compression with Swin TransformersLi, Binglin / Liang, Jie / Fu, Haisheng / Han, Jingning et al. | 2023
- 1
-
Event-Based Visual MicrophoneHoward, Matthew / Hirakawa, Keigo et al. | 2023
- 1
-
Named Entity Detection and Injection for Direct Speech TranslationGaido, Marco / Tang, Yun / Kulikov, Ilia / Huang, Rongqing / Gong, Hongyu / Inaguma, Hirofumi et al. | 2023
- 1
-
Efficient Stuttering Event Detection Using Siamese NetworksMohapatra, Payal / Islam, Bashima / Islam, Md Tamzeed / Jiao, Ruochen / Zhu, Qi et al. | 2023
- 1
-
BadRes: Reveal the Backdoors Through Residual ConnectionHe, Mingrui / Chen, Tianyu / Zhou, Haoyi / Zhang, Shanghang / Li, Jianxin et al. | 2023
- 1
-
End-to-End Unsupervised Sketch to Image GenerationLv, Xingming / Wu, Lei / Cheng, Zhenwei / Meng, Xiangxu et al. | 2023
- 1
-
Trinet: Stabilizing Self-Supervised Learning From Complete or Slow CollapseCao, Lixin / Wang, Jun / Yang, Ben / Su, Dan / Yu, Dong et al. | 2023
- 1
-
ERBNet: An Effective Representation Based Network for Unbiased Scene Graph GenerationMa, Wenxi / Hou, Tianxiang / Di, Qianji / Qi, Zhongang / Shan, Ying / Wang, Hanzi et al. | 2023
- 1
-
Deformable Cross Attention for Learning Optical FlowAbdein, Rokia / Xiang, Xuezhi / Lv, Ning / Saddik, Abdulmotaleb El et al. | 2023
- 1
-
Optimal Kernel for Real-Time Arbitrary-Shaped Text DetectionMa, Haozhao / Yang, Chuang / Yuan, Yuan / Wang, Qi et al. | 2023
- 1
-
SVMV: Spatiotemporal Variance-Supervised Motion Volume for Video Frame InterpolationLuo, Yao / Pan, Jinshan / Tang, Jinhui et al. | 2023
- 1
-
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and RescoringLi, Mohan / Do, Cong-Thanh / Doddipatla, Rama et al. | 2023
- 1
-
Two-Stage Neural Network for ICASSP 2023 Speech Signal Improvement ChallengeLiu, Mingshuai / Lv, Shubo / Zhang, Zihan / Han, Runduo / Hao, Xiang / Xia, Xianjun / Chen, Li / Xiao, Yijian / Xie, Lei et al. | 2023
- 1
-
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And RecognitionWang, Zhe / Wu, Shilong / Chen, Hang / He, Mao-Kui / Du, Jun / Lee, Chin-Hui / Chen, Jingdong / Watanabe, Shinji / Siniscalchi, Sabato / Scharenborg, Odette et al. | 2023
- 1
-
Implicit Vehicle Positioning with Cooperative Lidar SensingBarbieri, Luca / Tedeschini, Bernardo Camajori / Brambilla, Mattia / Nicoli, Monica et al. | 2023
- 1
-
Self-Supervised Guided Hypergraph Feature Propagation for Semi-Supervised Classification with Missing Node FeaturesLei, Chengxiang / Fu, Sichao / Wang, Yuetian / Qiu, Wenhao / Hu, Yachen / Peng, Qinmu / You, Xinge et al. | 2023
- 1
-
Differential Analysis for Networks Obeying Conservation LawsRayas, Anirudh / Anguluri, Rajasekhar / Cheng, Jiajun / Dasarathy, Gautam et al. | 2023
- 1
-
Hardware-Limited Non-Uniform Task-Based QuantizersBernardo, Neil Irwin / Zhu, Jingge / Eldar, Yonina C. / Evans, Jamie et al. | 2023
- 1
-
Adaptive Noise Canceller Algorithm with SNR-Based Stepsize and Data-Dependent AveragingSugiyama, Akihiko et al. | 2023
- 1
-
Signal Processing And Quantum State Tomography on Noisy DevicesShi, Wenbo / Malaney, Robert et al. | 2023
- 1
-
In-Sensor & Neuromorphic Computing Are all You Need for Energy Efficient Computer VisionDatta, Gourav / Liu, Zeyu / Kaiser, Md Abdullah-Al / Kundu, Souvik / Mathai, Joe / Yin, Zihan / Jacob, Ajey P. / Jaiswal, Akhilesh R. / Beerel, Peter A. et al. | 2023
- 1
-
Adversarial Contrastive Distillation with Adaptive DenoisingWang, Yuzheng / Chen, Zhaoyu / Yang, Dingkang / Liu, Yang / Liu, Siao / Zhang, Wenqiang / Qi, Lizhe et al. | 2023
- 1
-
On Designing Light-Weight Object Trackers Through Network Pruning: Use CNNS or Transformers?Aggarwal, Saksham / Gupta, Taneesh / Sahu, Pawan K. / Chavan, Arnav / Tiwari, Rishabh / Prasad, Dilip K. / Gupta, Deepak K. et al. | 2023
- 1
-
Variational Inference Aided Estimation of Time Varying ChannelsBock, Benedikt / Baur, Michael / Rizzello, Valentina / Utschick, Wolfgang et al. | 2023
- 1
-
Class-Incremental Learning on Multivariate Time Series Via Shape-Aligned Temporal DistillationQiao, Zhongzheng / Hu, Minghui / Jiang, Xudong / Suganthan, Ponnuthurai Nagaratnam / Savitha, Ramasamy et al. | 2023
- 1
-
Inv-Senet: Invariant Self Expression Network for Clustering Under Biased DataSingh, Ashutosh / Singh, Ashish / Masoomi, Aria / Imbiriba, Tales / Learned-Miller, Erik / Erdogmus, Deniz et al. | 2023
- 1
-
Fine-Grained Textual Knowledge Transfer to Improve RNN Transducers for Speech Recognition and UnderstandingSunder, Vishal / Thomas, Samuel / Kuo, Hong-Kwang J. / Kingsbury, Brian / Fosler-Lussier, Eric et al. | 2023
- 1
-
Training Neural Networks for Sequential Change-Point DetectionLee, Junghwan / Xie, Yao / Cheng, Xiuyuan et al. | 2023
- 1
-
High-Resolution Neural Network Processing of LFM Radar PulsesAkhtar, Jabran et al. | 2023
- 1
-
MLCGAN: Multi-Lead ECG Synthesis with Multi Label Conditional Generative Adversarial NetworkWu, Jian / Wang, Liping / Pan, Hailin / Wang, Binyu et al. | 2023
- 1
-
NRTSI: Non-Recurrent Time Series ImputationShan, Siyuan / Li, Yang / Oliva, Junier B. et al. | 2023
- 1
-
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASRSanabria, Ramon / Bogoychev, Nikolay / Markl, Nina / Carmantini, Andrea / Klejch, Ondrej / Bell, Peter et al. | 2023
- 1
-
Centralized Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation in a Wireless Acoustic Sensor And Actuator NetworkRuiz, Santiago / van Waterschoot, Toon / Moonen, Marc et al. | 2023
- 1
-
Intent Does Matter! Propagating High-Order Relations for Exploring Interest PreferencesZheng, Xiangping / Liang, Xun / Wu, Bo / Feng, Junlan / Guo, Yuhui / Zhang, Sensen et al. | 2023
- 1
-
Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage ApproachWu, Shih-Lun / Yang, Yi-Hsuan et al. | 2023
- 1
-
Input-Dependent Dynamical Channel Association For Knowledge DistillationTang, Qiankun / Zhang, Yuan / Xu, Xiaogang / Wang, Jun / Guo, Yimin et al. | 2023
- 1
-
Robust Adaptive Beamforming with Proximal MethodLi, Ruifu / Cabric, Danijela et al. | 2023
- 1
-
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel AudioZhang, Yang / Puvvada, Krishna C. / Lavrukhin, Vitaly / Ginsburg, Boris et al. | 2023
- 1
-
An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 FrameworkChen, Jianan / Sakti, Sakriani et al. | 2023
- 1
-
Bimodal Fusion Network for Basic Taste Sensation Recognition from Electroencephalography and ElectromyographyGao, Han / Zhao, Shuo / Li, Huiyan / Liu, Li / Wang, You / Hu, Ruifen / Zhang, Jin / Li, Guang et al. | 2023
- 1
-
Papez: Resource-Efficient Speech Separation with Auditory Working MemoryOh, Hyunseok / Yi, Juheon / Lee, Youngki et al. | 2023
- 1
-
Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding TasksVillatoro-Tello, Esau / Madikeri, Srikanth / Zuluaga-Gomez, Juan / Sharma, Bidisha / Saeed Sarfjoo, Seyyed / Nigmatulina, Iuliia / Motlicek, Petr / Ivanov, Alexei V. / Ganapathiraju, Aravind et al. | 2023
- 1
-
Search for Efficient Deep Visual-Inertial Odometry Through Neural Architecture SearchChen, Yu / Yang, Mingyu / Kim, Hun-Seok et al. | 2023
- 1
-
Prune Then Distill: Dataset Distillation with Importance SamplingSundar, Anirudh S / Keskin, Gokce / Chandak, Chander / Chen, I-Fan / Ghahremani, Pegah / Ghosh, Shalini et al. | 2023
- 1
-
CF-VTON: Multi-Pose Virtual Try-on with Cross-Domain FusionDu, Chenghu / Xiong, Shengwu et al. | 2023
- 1
-
LQGNET: Hybrid Model-Based and Data-Driven Linear Quadratic Stochastic ControlCasspi, Solomon Goldgraber / Husser, Oliver / Revach, Guy / Shlezinger, Nir et al. | 2023
- 1
-
Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-Trained RepresentationsShen, Siyuan / Liu, Feng / Zhou, Aimin et al. | 2023
- 1
-
GTN-Bailando: Genre Consistent long-Term 3D Dance Generation Based on Pre-Trained Genre Token NetworkZhuang, Haolin / Lei, Shun / Xiao, Long / Li, Weiqin / Chen, Liyang / Yang, Sicheng / Wu, Zhiyong / Kang, Shiyin / Meng, Helen et al. | 2023
- 1
-
Streaming Multi-Channel Speech Separation with Online Time-Domain Generalized Wiener FilterLuo, Yi et al. | 2023
- 1
-
String-Based Molecule Generation Via Multi-Decoder VAEKwon, Kisoo / Jeong, Kuhwan / Park, Junghyun / Na, Hwidong / Shin, Jinwoo et al. | 2023
- 1
-
Robust Spatiotemporal Fusion of Satellite Images via Convex OptimizationIsono, Ryosuke / Naganuma, Kazuki / Ono, Shunsuke et al. | 2023
- 1
-
A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker OneMeng, Lingwei / Kang, Jiawen / Cui, Mingyu / Wang, Yuejiao / Wu, Xixin / Meng, Helen et al. | 2023
- 1
-
N2MVSNet: Non-Local Neighbors Aware Multi-View Stereo NetworkZhang, Zhe / Gao, Huachen / Hu, Yuxi / Wang, Ronggang et al. | 2023
- 1
-
Windowed Fourier Analysis for Signal Processing on Graph BundlesRoddenberry, T. Mitchell / Segarra, Santiago et al. | 2023
- 1
-
Diffusion-Based Generative Speech Source SeparationScheibler, Robin / Ji, Youna / Chung, Soo-Whan / Byun, Jaeuk / Choe, Soyeon / Choi, Min-Seok et al. | 2023
- 1
-
Shuffled Autoregression for Motion InterpolationHuang, Shuo / Jia, Jia / Yang, Zongxin / Wang, Wei / Wu, Haozhe / Yang, Yi / Xing, Junliang et al. | 2023
- 1
-
Joint Estimation of DOA and Distance in Noisy Reverberant ConditionsBu, Suliang / Zhao, Tuo / Zhao, Yunxin et al. | 2023
- 1
-
Change Point Detection with Neural Online Density-Ratio EstimatorWang, Xiuheng / Borsoi, Ricardo Augusto / Richard, Cedric / Chen, Jie et al. | 2023
- 1
-
Towards Low-Power Heart Rate Estimation Based on User’s Demographics and Activity Level For WearablesPacheco, Andre G. C. / Cabello, Frank A. C. / Fonoff, Adriana M. O. / Rodrigues, Paula G. / Penatti, Otavio A. B. / Pinto, Paula R. et al. | 2023
- 1
-
ifUNet++: Iterative Feedback UNet++ for Infrared Small Target DetectionWeng, Zhangying / Li, Peng / Zhuang, Xin / Yan, Xuefeng / Gong, Lina / Xie, Haoran / Wei, Mingqiang et al. | 2023
- 1
-
Vararray Meets T-Sot: Advancing the State of the Art of Streaming Distant Conversational Speech RecognitionKanda, Naoyuki / Wu, Jian / Wang, Xiaofei / Chen, Zhuo / Li, Jinyu / Yoshioka, Takuya et al. | 2023
- 1
-
Binary Image Fast Perfect Recovery from Sparse 2D-DFT CoefficientsPei, Soo-Chang / Chang, Kuo-Wei et al. | 2023
- 1
-
Time-Aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question AnsweringLiu, Yonghao / Liang, Di / Fang, Fang / Wang, Sirui / Wu, Wei / Jiang, Rui et al. | 2023
- 1
-
Exploiting Interactivity and Heterogeneity for Sleep Stage Classification Via Heterogeneous Graph Neural NetworkJia, Ziyu / Lin, Youfang / Zhou, Yuhan / Cai, Xiyang / Zheng, Peng / Li, Qiang / Wang, Jing et al. | 2023
- 1
-
When is Mimo Massive in Radar?Shah, Jaimin / Cardone, Martina / Dytso, Alex / Rush, Cynthia et al. | 2023
- 1
-
Detecting Malicious Migration on Edge to Prevent Running Data LeakageWong, Yuchen / Shen, Qingni / Li, Cong / Liu, Cunzhan / Ai, Tianxiang et al. | 2023
- 1
-
PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image TranslationRen, Bin / Tang, Hao / Wang, Yiming / Li, Xia / Wang, Wei / Sebe, Mcu et al. | 2023
- 1
-
Interpolation of Spatial Room Impulse Responses Using Partial Optimal TransportGeldert, Aaron / Meyer-Kahlen, Nils / Schlecht, Sebastian J. et al. | 2023
- 1
-
Knowledge-Augmented Frame Semantic Parsing with Hybrid Prompt-TuningZhang, Rui / Sun, Yajing / Yang, Jingyuan / Peng, Wei et al. | 2023
- 1
-
HappyQuokka System for ICASSP 2023 Auditory EEG ChallengePiao, Zhenyu / Kim, Miseul / Yoon, Hyungchan / Kang, Hong-Goo et al. | 2023
- 1
-
Deep Unfolded Tensor Robust PCA With Self-Supervised LearningDong, Harry / Shah, Megna / Donegan, Sean / Chi, Yuejie et al. | 2023
- 1
-
Continual Learning for On-Device Speech Recognition Using Disentangled ConformersDiwan, Anuj / Yeh, Ching-Feng / Hsu, Wei-Ning / Tomasello, Paden / Choi, Eunsol / Harwath, David / Mohamed, Abdelrahman et al. | 2023
- 1
-
Robust Online Multiband Drift Estimation in Electrophysiology DataWindolf, Charlie / Paulk, Angelique C. / Kfir, Yoav / Trautmann, Eric / Meszena, Domokos / Munoz, William / Caprara, Irene / Jamali, Mohsen / Boussard, Julien / Williams, Ziv M. et al. | 2023
- 1
-
Progressive Refinement Learning Based on Feature Cross Perception for Residential Areas Semantic SegmentationLyu, Xinran / Zhang, Libao et al. | 2023
- 1
-
Improving Adversarial Robustness with Hypersphere Embedding and Angular-Based RegularizationsFakorede, Olukorede / Nirala, Ashutosh / Atsague, Modeste / Tian, Jin et al. | 2023
- 1
-
Graph Contrastive Learning with Learnable Graph AugmentationPu, Xinyan / Zhang, Ke / Shu, Huazhong / Coatrieux, Jean Louis / Kong, Youyong et al. | 2023
- 1
-
To Regularize or Not to Regularize: The Role of Positivity in Sparse Array Interpolation with a Single SnapshotHucumenoglu, Mehmet Can / Sarangi, Pulak / Rajamaki, Robin / Pal, Piya et al. | 2023
- 1
-
TeAw: Text-Aware Few-Shot Remote Sensing Image Scene ClassificationCheng, Kaihui / Yang, Chule / Fan, Zunlin / Wu, Dayan / Guan, Naiyang et al. | 2023
- 1
-
RIS Reflection and Placement Optimisation for Underlay D2D Communications in Cognitive Cellular NetworksGhose, Sarbani / Mishra, Deepak / Maity, Santi P. / Alexandropoulos, George C. et al. | 2023
- 1
-
Not All Classes are Equal: Adaptively Focus-Aware Confidence for Semi-Supervised Object DetectionZhu, Hui / Lu, Yongchun / Zhao, Hongyu / Zhao, Guoqing / Zhao, Xiaofang et al. | 2023
- 1
-
Adversarial Data Augmentation Using VAE-GAN for Disordered Speech RecognitionJin, Zengrui / Xie, Xurong / Geng, Mengzhe / Wang, Tianzi / Hu, Shujie / Deng, Jiajun / Li, Guinan / Liu, Xunying et al. | 2023
- 1
-
Multi-Blank Transducers for Speech RecognitionXu, Hainan / Jia, Fei / Majumdar, Somshubra / Watanabe, Shinji / Ginsburg, Boris et al. | 2023
- 1
-
End-to-End Word-Level Disfluency Detection and Classification in Children’s Reading AssessmentVenkatasubramaniam, Lavanya / Sunder, Vishal / Fosler-Lussier, Eric et al. | 2023
- 1
-
Speech Emotion Recognition via Heterogeneous Feature LearningLiu, Ke / Wu, DongYa / Wang, Dekui / Feng, Jun et al. | 2023
- 1
-
A Study on Bias and Fairness in Deep Speaker RecognitionHajavi, Amirhossein / Etemad, Ali et al. | 2023
- 1
-
Retinal Biomarkers for Detecting Diabetic Retinopaty Using Smartphone-Based Deep Learning FrameworksKarakaya, Mahmut / Aygun, Ramazan S. et al. | 2023
- 1
-
Hierarchical Interactive Reconstruction Network for Video Compressive SensingZhang, Tong / Cui, Wenxue / Hui, Chen / Jiang, Feng et al. | 2023
- 1
-
A Unified Uncertainty-Aware Exploration: Combining Epistemic and Aleatory UncertaintyMalekzadeh, Parvin / Hou, Ming / Plataniotis, Konstantinos N. et al. | 2023
- 1
-
FedSD: A New Federated Learning Structure Used in Non-iid DataYi, Minmin / Ning, Houchun / Liu, Peng et al. | 2023
- 1
-
Towards Dialogue Modeling Beyond TextWu, Tongzi / Zhou, Yuhao / Ling, Wang / Yang, Hojin / Veloso, Joana / Sun, Lin / Huang, Ruixin / Guimaraes, Norberto / Sanner, Scott et al. | 2023
- 1
-
DPP-Based Client Selection for Federated Learning with NON-IID DATAZhang, Yuxuan / Xu, Chao / Yang, Howard H. / Wang, Xijun / Quek, Tony Q. S. et al. | 2023
- 1
-
Learning Robust Self-Attention Features for Speech Emotion Recognition with Label-Adaptive MixupKang, Lei / Zhang, Lichao / Jiang, Dazhi et al. | 2023
- 1
-
Adaptive Eccm for Mitigating Smart JammersJain, Shashwat / Pattanayak, Kunal / Krishnamurthy, Vikram / Berry, Christopher et al. | 2023
- 1
-
IAST: Instance Association Relying on Spatio-Temporal Features for Video Instance SegmentationChen, Junhao / Liu, Sheng / Chen, Ruixiang / Guo, Bingnan / Zhang, Feng et al. | 2023
- 1
-
Exploring the Role of Fricatives in Classifying Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis and Parkinson’s DiseaseBhattacharjee, Tanuka / Belur, Yamini / Nalini, Atchayaram / Yadav, Ravi / Ghosh, Prasanta Kumar et al. | 2023
- 1
-
Stay In The Middle: A Semi-Supervised Model for CT Metal Artifact ReductionWang, Tao / Yu, Hui / Lu, Zexin / Zhang, Zhongzhou / Zhou, Jiliu / Zhang, Yi et al. | 2023
- 1
-
Neural Fourier Shift for Binaural Speech RenderingWoo Lee, Jin / Lee, Kyogu et al. | 2023
- 1
-
Semi-Supervised Contrastive Learning with Soft Mask Attention for Facial Action Unit DetectionLiu, Zhongling / Liu, Rujie / Shi, Ziqiang / Liu, Liu / Mi, Xiaoyu / Murase, Kentaro et al. | 2023
- 1
-
Recursive Estimation of User Intent From Noninvasive Electroencephalography Using Discriminative ModelsSmedemark-Margulies, Niklas / Celik, Basak / Imbiriba, Tales / Kocanaogullari, Aziz / Erdogmus, Deniz et al. | 2023
- 1
-
Diabetic Retinopathy Grading with Weakly-Supervised Lesion PriorsHou, Junlin / Xiao, Fan / Xu, Jilan / Feng, Rui / Zhang, Yuejie / Zou, Haidong / Lu, Lina / Xue, Wenwen et al. | 2023
- 1
-
Prompt-Distiller: Few-Shot Knowledge Distillation for Prompt-Based Language Learners with Dual Contrastive LearningHou, Boyu / Wang, Chengyu / Chen, Xiaoqing / Qiu, Minghui / Feng, Liang / Huang, Jun et al. | 2023
- 1
-
Contextually-Rich Human Affect Perception Using Multimodal Scene InformationBose, Digbalay / Hebbar, Rajat / Somandepalli, Krishna / Narayanan, Shrikanth et al. | 2023
- 1
-
Stabilising and Accelerating Light Gated Recurrent Units for Automatic Speech RecognitionMoumen, Adel / Parcollet, Titouan et al. | 2023
- 1
-
Sampling Order-Limited Signals on the SphereKhan, Muhammad Salaar Arif / Nadeem, Salman / Khalid, Zubair et al. | 2023
- 1
-
Sequence-Based Device-Free Gesture Recognition Framework for Multi-Channel Acoustic SignalsYang, Zhizheng / Wang, Xun / Xia, Dongyu / Wang, Wei / Dai, Haipeng et al. | 2023
- 1
-
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech RecognitionEeckt, Steven Vander / Van Hamme, Hugo et al. | 2023
- 1
-
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems?Shi, Xuan / Cooper, Erica / Wang, Xin / Yamagishi, Junichi / Narayanan, Shrikanth et al. | 2023
- 1
-
MGAT: Multi-Granularity Attention Based Transformers for Multi-Modal Emotion RecognitionFan, Weiquan / Xing, Xiaofen / Cai, Bolun / Xu, Xiangmin et al. | 2023
- 1
-
HPFTN: Hierarchical Progressive Fusion Transformer Network for Video DenoisingZhang, Shuaitao / Zhang, Yuan / Zhao, Zheng / Xie, Di / Pu, Shiliang et al. | 2023
- 1
-
Soft 2D-to-3D Delivery Using Deep Graph Neural Networks for Holographic-Type CommunicationFujihashi, Takuya / Koike-Akino, Toshiaki / Watanabe, Takashi et al. | 2023
- 1
-
CLAP Learning Audio Concepts from Natural Language SupervisionElizalde, Benjamin / Deshmukh, Soham / Ismail, Mahmoud Al / Wang, Huaming et al. | 2023
- 1
-
Soft Dynamic Time Warping for Multi-Pitch Estimation and BeyondKrause, Michael / Weis, Christof / Muller, Meinard et al. | 2023
- 1
-
SPECTRANET-SO(3): Learning Satellite Orientation from Optical Spectra by Implicitly Modeling Mutually Exclusive Probability Distributions on The Rotation ManifoldPhelps, Matthew / Swindle, Thomas / Gazak, J. Zachary / Vandenberg, Andrew / Fletcher, Justin et al. | 2023
- 1
-
Channel Estimation in Massive MIMO with Heavy-Tailed Noise: Gaussian-Mixture Versus Cauchy ModelsGulgun, Ziya / Larsson, Erik G. et al. | 2023
- 1
-
Speech Intelligibility Classifiers from 550k Disordered Speech SamplesVenugopalan, Subhashini / Tobin, Jimmy / Yang, Samuel J. / Seaver, Katie / Cave, Richard J.N. / Jiang, Pan-Pan / Zeghidour, Neil / Heywood, Rus / Green, Jordan / Brenner, Michael P. et al. | 2023
- 1
-
Filler Word Detection with Hard Category Mining and Inter-Category Focal LossZhao, Zhiyuan / Wu, Lijun / Tang, Chuanxin / Yin, Dacheng / Zhao, Yucheng / Luo, Chong et al. | 2023
- 1
-
Modular Conformer Training for Flexible End-to-End ASRAudhkhasi, Kartik / Farris, Brian / Ramabhadran, Bhuvana / Moreno, Pedro J. et al. | 2023
- 1
-
Untargeted Backdoor Attack Against Object DetectionLuo, Chengxiao / Li, Yiming / Jiang, Yong / Xia, Shu-Tao et al. | 2023
- 1
-
Cross-Modality depth Estimation via Unsupervised Stereo RGB-to-infrared TranslationTang, Shi / Ye, Xinchen / Xue, Fei / Xu, Rui et al. | 2023
- 1
-
A Dynamic Cross-Scale Transformer with Dual-Compound Representation for 3D Medical Image SegmentationZhang, Ruixia / Wang, Zhiqiong / Wang, Zhongyang / Xin, Junchang et al. | 2023
- 1
-
Generic Dependency Modeling for Multi-Party ConversationShen, Weizhou / Quan, Xiaojun / Yang, Ke et al. | 2023
- 1
-
WL-MSR: Watch and Listen for Multimodal Subtitle RecognitionLiu, Jiawei / Wang, Hao / Wang, Weining / He, Xingjian / Liu, Jing et al. | 2023
- 1
-
Residual Hybrid Attention Network for Compression Artifact ReductionLuo, Bingchun / Yu, Wei et al. | 2023
- 1
-
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech RecognitionSahai, Saumya Y. / Liu, Jing / Muniyappa, Thejaswi / Sathyendra, Kanthashree M. / Alexandridis, Anastasios / Strimel, Grant P. / McGowan, Ross / Rastrow, Ariya / Chang, Feng-Ju / Mouchtaris, Athanasios et al. | 2023
- 1
-
Look and Think: Intrinsic Unification of Self-Attention and Convolution for Spatial-Channel SpecificityGao, Xiang / Lin, Honghui / Li, Yu / Fang, Ruiyan / Zhang, Xin et al. | 2023
- 1
-
Higher-Order Link Prediction Via Learnable Maximum Mean DiscrepancyKaranikolas, Georgios V. / Pages-Zamora, Alba / Giannakis, Georgios B. et al. | 2023
- 1
-
EI2SR: Learning an Enhanced Intra-Instance Semantic Relationship for Arbitrary-Shaped Scene Text DetectionShu, Yan / Liu, Shaohui / Zhou, Yu / Xu, Honglei / Jiang, Feng et al. | 2023
- 1
-
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant EnvironmentsNeri, Julian / Braun, Sebastian et al. | 2023
- 1
-
Comparative Layer-Wise Analysis of Self-Supervised Speech ModelsPasad, Ankita / Shi, Bowen / Livescu, Karen et al. | 2023
- 1
-
Maximum Likelihood Distillation for Robust Modulation ClassificationMaroto, Javier / Bovet, Gerome / Frossard, Pascal et al. | 2023
- 1
-
Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image ProcessingVali, Mohammad Hassan / Backstrom, Tom et al. | 2023
- 1
-
Deep Fusion of Multi-Object Densities Using TransformerLi, Lechi / Dai, Chen / Xia, Yuxuan / Svensson, Lennart et al. | 2023
- 1
-
Core: Transferable Long-Range Time Series Forecasting Enhanced by Covariates-Guided RepresentationLi, Xin-Yi / Zhong, Pei-Nan / Chen, Di / Yang, Yu-Bin et al. | 2023
- 1
-
Toward Privacy-Enhancing Ambulatory-Based Well-Being Monitoring: Investigating User Re-Identification Risk in Multimodal DataPranjal, Ravi / Seshadri, Ranjana / Kumar Sanath Kumar Kadaba, Rakesh / Feng, Tiantian / Narayanan, Shrikanth S. / Chaspari, Theodora et al. | 2023
- 1
-
Mutually Guided Few-Shot Learning For Relational Triple ExtractionYang, Chengmei / Jiang, Shuai / He, Bowei / Ma, Chen / He, Lianghua et al. | 2023
- 1
-
Guide and Select: A Transformer-Based Multimodal Fusion Method for Points of Interest Description GenerationLiu, Hanqing / Wang, Wei / Hu, Niu / Zheng, Hai-Tao / Xie, Rui / Wu, Wei / Bai, Yang et al. | 2023
- 1
-
Interpretation of Neural Networks is Susceptible to Universal Adversarial PerturbationsOskouie, Haniyeh Ehsani / Farnia, Farzan et al. | 2023
- 1
-
High-Resolution Embedding Extractor for Speaker DiarisationHeo, Hee-Soo / Kwon, Youngki / Lee, Bong-Jin / Kim, You Jin / Jung, Jee-Weon et al. | 2023
- 1
-
Prosody-Controllable Spontaneous TTS with Neural HMMSLameris, Harm / Mehta, Shivam / Henter, Gustav Eje / Gustafson, Joakim / Szekely, Eva et al. | 2023
- 1
-
Faster Than Fast: Accelerating the Griffin-Lim AlgorithmNenov, Rossen / Nguyen, Dang-Khoa / Balazs, Peter et al. | 2023
- 1
-
Scalable and Secure Federated XGBoostNguyen, Quang Minh / Khanh Le, Nhan / Nguyen, Lam M. et al. | 2023
- 1
-
A Generalized Subspace Distribution Adaptation Framework for Cross-Corpus Speech Emotion RecognitionLi, Shaokai / Song, Peng / Ji, Liang / Jin, Yun / Zheng, Wenming et al. | 2023
- 1
-
ClassA Entropy for the Analysis of Structural Complexity of Physiological SignalsXiao, Hongjian / Li, Ling / Mandic, Danilo P. et al. | 2023
- 1
-
Improving Disfluency Detection with Multi-Scale Self Attention and Contrastive LearningWang, Peiying / Duan, Chaoqun / Chen, Meng / He, Xiaodong et al. | 2023
- 1
-
Time-Resolved FMRI Shared Response Model Using Gaussian Process Factor AnalysisEbrahimi, MohammadReza / Calarco, Navona / Hawco, Colin / Voineskos, Aristotle / Khisti, Ashish et al. | 2023
- 1
-
Dynamic TF-TDNN: Dynamic Time Delay Neural Network Based on Temporal-Frequency Attention for Dialect RecognitionLiao, Chao / Huang, Jinwen / Yuan, Huan / Yao, Peng / Tan, Jianchao / Zhang, Dawei / Deng, Feng / Wang, Xiaorui / Song, Chengru et al. | 2023
- 1
-
Contrastive Learning of Functionality-Aware Code EmbeddingsLi, Yiyang / Wu, Hongqiu / Zhao, Hai et al. | 2023
- 1
-
Ultrasound Image Quality Control Using Speech-Assisted Switchable CycleGANHuh, Jaeyoung / Khan, Shujaat / Sun Lee, Eun / Chul Ye, Jong et al. | 2023
- 1
-
Super Dilated Nested Arrays with Ideal Critical Weights and Increased Degrees of FreedomShaalan, Ahmed M. A. / Du, Jun et al. | 2023
- 1
-
Transient Dictionary Learning for Compressed Time-of-Flight ImagingConde, Miguel Heredia et al. | 2023
- 1
-
Does Your Model Think Like an Engineer? Explainable AI for Bearing Fault Detection with Deep LearningDecker, Thomas / Lebacher, Michael / Tresp, Volker et al. | 2023
- 1
-
FAPM: Fast Adaptive Patch Memory for Real-Time Industrial Anomaly DetectionKim, Donghyeong / Park, Chaewon / Cho, Suhwan / Lee, Sangyoun et al. | 2023
- 1
-
A Distributed Adaptive Algorithm for Non-Smooth Spatial Filtering ProblemsHovine, Charles / Bertrand, Alexander et al. | 2023
- 1
-
Graph Learning from Gaussian and Stationary Graph SignalsBuciulea, Andrei / Marques, Antonio G. et al. | 2023
- 1
-
Spatio-Temporal Attention in Multi-Granular Brain Chronnectomes For Detection of Autism Spectrum DisorderOrme-Rogers, James / Srivastava, Ajitesh et al. | 2023
- 1
-
Priv-Aug-Shap-ECGResNet: Privacy Preserving Shapley-Value Attributed Augmented Resnet for Practical Single-Lead Electrocardiogram ClassificationUkil, Arijit / Marin, Leandro / Jara, Antonio J. et al. | 2023
- 1
-
Efficient Online Convolutional Dictionary Learning Using Approximate Sparse ComponentsVeshki, Farshad G. / Vorobyov, Sergiy A. et al. | 2023
- 1
-
Low-Latency Electrolaryngeal Speech Enhancement Based on Fastspeech2-Based Voice Conversion and Self-Supervised Speech RepresentationKobayashi, Kazuhiro / Hayashi, Tomoki / Toda, Tomoki et al. | 2023
- 1
-
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Image Based Voice ControlSheng, Zheng-Yan / Ai, Yang / Ling, Zhen-Hua et al. | 2023
- 1
-
mmWave Wi-Fi Trajectory Estimation with Continuous-Time Neural Dynamic LearningVaca-Rubio, Cristian J. / Wang, Pu / Koike-Akino, Toshiaki / Wang, Ye / Boufounos, Petros / Popovski, Petar et al. | 2023
- 1
-
Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech EnhancementValentini-Botinhao, Cassia / Aldana Blanco, Andrea Lorena / Klejch, Ondrej / Bell, Peter et al. | 2023
- 1
-
D-3DLD: Depth-Aware Voxel Space Mapping for Monocular 3D Lane Detection with UncertaintyKim, Nayeon / Byeon, Moonsub / Ji, Daehyun / Oh, Dokwan et al. | 2023
- 1
-
Finer-Grained Decomposition for Parallel Quantum Mimo ProcessingKim, Minsung / Jamieson, Kyle et al. | 2023
- 1
-
Deep Root Music Algorithm for Data-Driven Doa EstimationShmuel, Dor H. / Merkofer, Julian P. / Revach, Guy / van Sloun, Ruud J. G. / Shlezinger, Nir et al. | 2023
- 1
-
Police: Provably Optimal Linear Constraint Enforcement For Deep Neural NetworksBalestriero, Randall / LeCun, Yann et al. | 2023
- 1
-
A Novel Metric For Evaluating Audio Caption SimilarityBhosale, Swapnil / Chakraborty, Rupayan / Kopparapu, Sunil Kumar et al. | 2023
- 1
-
Generalized Two-Stage Particle Filter for High DimensionsIloska, Marija / Bugallo, Monica F. et al. | 2023
- 1
-
Mitigating Unintended Memorization in Language Models Via Alternating TeachingLiu, Zhe / Zhang, Xuedong / Peng, Fuchun et al. | 2023
- 1
-
Adaptive Multi-Corpora Language Model Training for Speech RecognitionMa, Yingyi / Liu, Zhe / Zhang, Xuedong et al. | 2023
- 1
-
Domain Adaptation without Catastrophic Forgetting on a Small-Scale Partially-Labeled Corpus for Speech Emotion RecognitionZhu, Zhi / Sato, Yoshinao et al. | 2023
- 1
-
SingNet: a real-time Singing Voice beat and Downbeat Tracking SystemHeydari, Mojtaba / Wang, Ju-Chiang / Duan, Zhiyao et al. | 2023
- 1
-
PCQA-Graphpoint: Efficient Deep-Based Graph Metric for Point Cloud Quality AssessmentTliba, Marouane / Chetouani, Aladine / Valenzise, Giuseppe / Dufaux, Frederic et al. | 2023
- 1
-
Adaptive Step-Size Methods for Compressed SGDSubramaniam, Adarsh M. / Magesh, Akshayaa / Veeravalli, Venugopal V. et al. | 2023
- 1
-
Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and ChildrenJohnson, Alexander / Shetty, Vishwas M. / Ostendorf, Mari / Alwan, Abeer et al. | 2023
- 1
-
Adaptive Simulated Annealing Through Alternating Rényi Divergence MinimizationGuilmeau, Thomas / Chouzenoux, Emilie / Elvira, Victor et al. | 2023
- 1
-
NAS-DYMC: NAS-Based Dynamic Multi-Scale Convolutional Neural Network for Sound Event DetectionWang, Jun / Yao, Peng / Deng, Feng / Tan, Jianchao / Song, Chengru / Wang, Xiaorui et al. | 2023
- 1
-
Wespeaker: A Research and Production Oriented Speaker Embedding Learning ToolkitWang, Hongji / Liang, Chengdong / Wang, Shuai / Chen, Zhengyang / Zhang, Binbin / Xiang, Xu / Deng, Yanlei / Qian, Yanmin et al. | 2023
- 1
-
Privacy Preserving Face Recognition with Lensless CameraHenry, Chris / Asif, M. Salman / Li, Zhu et al. | 2023
- 1
-
Exploiting CCTV Cameras for Hand Hygiene Recognition in ICUHuang, Weijun / Huang, Jia / Wang, Guowei / Lu, Hongzhou / He, Min / Wang, Wenjin et al. | 2023
- 1
-
Learning Sparse auto-Encoders for Green AI image codingGille, Cyprien / Guyard, Frederic / Antonini, Marc / Barlaud, Michel et al. | 2023
- 1
-
3D Audio Signal Processing Systems for Speech Enhancement and Sound Localization and DetectionBai, Jisheng / Huang, Siwei / Yin, Han / Jia, Yafei / Wang, Mou / Chen, Jianfeng et al. | 2023
- 1
-
Quantum Variational Bayes on ManifoldsLopatnikova, Anna / Tran, Minh-Ngoc et al. | 2023
- 1
-
Exploring Complementary Features in Multi-Modal Speech Emotion RecognitionWang, Suzhen / Ma, Yifeng / Ding, Yu et al. | 2023
- 1
-
Deep Spatio-Temporal Multiplex Graph Learning for Cardiac Imaging ClassificationBanus, Jaume / Ogier, Augustin / Hullin, Roger / Meyer, Philippe / van Heeswijk, Ruud B. / Richiardi, Jonas et al. | 2023
- 1
-
Sign Language Recognition via Deformable 3D Convolutions and Modulated Graph Convolutional NetworksPapadimitriou, Katerina / Potamianos, Gerasimos et al. | 2023
- 1
-
Unsupervised word Segmentation Based on Word InfluenceYan, Ruohao / Zhang, Huaping / Silamu, Wushour / Hamdulla, Askar et al. | 2023
- 1
-
TAPE: An End-to-End Timbre-Aware Pitch EstimatorTamer, Nazif Can / Ozer, Yigitcan / Muller, Meinard / Serra, Xavier et al. | 2023
- 1
-
Text Classification In The Wild: A Large-Scale Long-Tailed Name Normalization DatasetQi, Jiexing / Li, Shuhao / Guo, Zhixin / Huang, Yusheng / Zhou, Chenghu / Zhang, Weinan / Wang, Xinbing / Lin, Zhouhan et al. | 2023
- 1
-
Designing and Evaluating Speech Emotion Recognition Systems: A Reality Check Case Study with IEMOCAPAntoniou, Nikolaos / Katsamanis, Athanasios / Giannakopoulos, Theodoros / Narayanan, Shrikanth et al. | 2023
- 1
-
TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 Dns-ChallengeJu, Yukai / Chen, Jun / Zhang, Shimin / He, Shulin / Rao, Wei / Zhu, Weixin / Wang, Yannan / Yu, Tao / Shang, Shidong et al. | 2023
- 1
-
General or Specific? Investigating Effective Privacy Protection in Federated Learning for Speech Emotion RecognitionTan, Chao / Cao, Yang / Li, Sheng / Yoshikawa, Masatoshi et al. | 2023
- 1
-
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram TransformerLi, Kang / Song, Yan / Dai, Li-Rong / McLoughlin, Ian / Fang, Xin / Liu, Lin et al. | 2023
- 1
-
Nested Attention Network with Graph Filtering for Visual Question and AnsweringLu, Jing / Wu, Chunlei / Wang, Leiquan / Yuan, Shaozu / Wu, Jie et al. | 2023
- 1
-
Defending Against Universal Patch Attacks by Restricting Token Attention in Vision TransformersYu, Hongwei / Chen, Jiansheng / Ma, Huimin / Yu, Cheng / Ding, Xinlong et al. | 2023
- 1
-
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech SynthesisXue, Jinlong / Deng, Yayue / Wang, Fengping / Li, Ya / Gao, Yingming / Tao, Jianhua / Sun, Jianqing / Liang, Jiaen et al. | 2023
- 1
-
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource LanguagesBhogale, Kaushal / Raman, Abhigyan / Javed, Tahir / Doddapaneni, Sumanth / Kunchukuttan, Anoop / Kumar, Pratyush / Khapra, Mitesh M. et al. | 2023
- 1
-
Effectiveness of Inter- and Intra-Subarray Spatial Features for Acoustic Scene ClassificationKawamura, Takao / Kinoshita, Yuma / Ono, Nobutaka / Scheibler, Robin et al. | 2023
- 1
-
Bayesian Network Modeling and Prediction of Transitions Within the Homelessness SystemRahman, Khandker Sadia / Zois, Daphney-Stavroula / Chelmis, Charalampos et al. | 2023
- 1
-
Adaptive Knowledge Distillation Between Text and Speech Pre-Trained ModelsNi, Jinjie / Ma, Yukun / Wang, Wen / Chen, Qian / Ng, Dianwen / Lei, Han / Nguyen, Trung Hieu / Zhang, Chong / Ma, Bin / Cambria, Erik et al. | 2023
- 1
-
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation AnnotationsCheng, Zhenxiao / Zhou, Jie / Wu, Wen / Chen, Qin / He, Liang et al. | 2023
- 1
-
Comparative Study of IRS Assisted Opportunistic Communications Over i.i.d. and los channelsYashvanth, L. / Murthy, Chandra R. et al. | 2023
- 1
-
Multi-Head Attention and GRU for Improved Match-Mismatch Classification of Speech Stimulus and EEG ResponseBorsdorf, Marvin / Pahuja, Saurav / Ivucic, Gabriel / Cai, Siqi / Li, Haizhou / Schultz, Tanja et al. | 2023
- 1
-
DTTR: Detecting Text with TransformersYang, Jing / You, Zhiqiang / Zhong, Zhiwei / Liu, Peng / Mei, Langqi / Huang, Shenguang et al. | 2023
- 1
-
DST: Deformable Speech Transformer for Emotion RecognitionChen, Weidong / Xing, Xiaofen / Xu, Xiangmin / Pang, Jianxin / Du, Lan et al. | 2023
- 1
-
Cross-Training: A Semi-Supervised Training Scheme for Speech RecognitionKhorram, Soheil / Tripathi, Anshuman / Kim, Jaeyoung / Lu, Han / Zhang, Qian / Prabhavalkar, Rohit / Sak, Hasim et al. | 2023
- 1
-
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo LanguagesWu, Felix / Kim, Kwangyoun / Watanabe, Shinji / Han, Kyu J. / McDonald, Ryan / Weinberger, Kilian Q. / Artzi, Yoav et al. | 2023
- 1
-
MLP-GAN for Brain Vessel Image SegmentationXie, Bin / Tang, Hao / Duan, Bin / Cai, Dawen / Yan, Yan et al. | 2023
- 1
-
Stacking-Based Attention Temporal Convolutional Network for Action SegmentationYang, Liu / Jiang, Yu / Hong, Junkun / Wu, Zhenjie / Yang, Zhan / Long, Jun et al. | 2023
- 1
-
Probabilistic Back-ends for Online Speaker Recognition and ClusteringSholokhov, Alexey / Kuzmin, Nikita / Lee, Kong Aik / Chng, Eng Siong et al. | 2023
- 1
-
Information Extraction from Pill Bottle Images via Text StitchingGupta, Rahul Kumar / Roy, Shilka / Jos, Sujit / S., Unni V. / Lavoie, Lauren / Medous, Frederic / Smith, Walter et al. | 2023
- 1
-
Semi-Supervised Remote Sensing Image Change Detection Using Mean Teacher Model for Constructing Pseudo-LabelsMao, Zan / Tong, Xinyu / Luo, Ze et al. | 2023
- 1
-
Analysing Discrete Self Supervised Speech Representation For Spoken Language ModelingSicherman, Amitay / Adi, Yossi et al. | 2023
- 1
-
Flowpose: Conditional Normalizing Flows for 3D Human Pose and Shape Estimation from Monocular VideosDu, Yaoyao / Zhang, Zixiao / Li, Zhihao / Wei, Peng / Liao, Qingmin / Yang, Wenming et al. | 2023
- 1
-
Glacier: Glass-Box Transformer for Interpretable Dynamic NeuroimagingMahmood, Usman / Fu, Zening / Calhoun, Vince / Plis, Sergey et al. | 2023
- 1
-
NBA-OMP: Near-Field Beam-Split-Aware Orthogonal Matching Pursuit for Wideband THz Channel EstimationElbir, Ahmet M. / Vijay Mishra, Kumar / Chatzinotas, Symeon et al. | 2023
- 1
-
MUG: A General Meeting Understanding and Generation BenchmarkZhang, Qinglin / Deng, Chong / Liu, Jiaqing / Yu, Hai / Chen, Qian / Wang, Wen / Yan, Zhijie / Liu, Jinglin / Ren, Yi / Zhao, Zhou et al. | 2023
- 1
-
Automatic Classification of Vocal Intensity Category from SpeechKodali, Manila / Kadiri, Sudarsana Reddy / Laaksonen, Laura / Alku, Paavo et al. | 2023
- 1
-
A Template Matching Approach for Reference Picture Padding in Video CodingHorst, Nicolas / Das, Priyanka / Wien, Mathias et al. | 2023
- 1
-
An Efficient Relay Selection Scheme for Relay-assisted HARQDing, Weihang / Shikh-Bahaei, Mohammad et al. | 2023
- 1
-
Sora: Scalable Black-Box Reachability Analyser on Neural NetworksXu, Peipei / Wang, Fu / Ruan, Wenjie / Zhang, Chi / Huang, Xiaowei et al. | 2023
- 1
-
The First Pathloss Radio Map Prediction ChallengeYapar, Cagkan / Jaensch, Fabian / Levie, Ron / Kutyniok, Gitta / Caire, Giuseppe et al. | 2023
- 1
-
U-Shiftformer: Brain Tumor Segmentation Using A Shifted Attention MechanismLin, Chih-Wei / Chen, Zhongsheng et al. | 2023
- 1
-
Does Human Speech Follow Benford’s Law?Hsu, Leo / Berisha, Visar et al. | 2023
- 1
-
Conversation-Oriented ASR with Multi-Look-Ahead CBS ArchitectureZhao, Huaibo / Fujie, Shinya / Ogawa, Tetsuji / Sakuma, Jin / Kida, Yusuke / Kobayashi, Tetsunori et al. | 2023
- 1
-
Towards a Unified Training for Levenshtein TransformerZheng, Kangjie / Wang, Longyue / Wang, Zhihao / Chen, Binqi / Zhang, Ming / Tu, Zhaopeng et al. | 2023
- 1
-
A Principled Approach to Model Validation in Domain GeneralizationLyu, Boyang / Nguyen, Thuan / Scheutz, Matthias / Ishwar, Prakash / Aeron, Shuchin et al. | 2023
- 1
-
Neural Networks with Quantization ConstraintsHounie, Ignacio / Elenter, Juan / Ribeiro, Alejandro et al. | 2023
- 1
-
Direct Position Determination with One-Bit Signal for Multiple TargetsNi, Lihua / Zhang, Di / Xing, Tianyi / Ran, Maoyan / Liu, Ning / Wan, Qun et al. | 2023
- 1
-
Learning to Balance the Global Coherence and Informativeness in Knowledge-Grounded Dialogue GenerationNiu, Chenxu / Hu, Yue / Peng, Wei / Xie, Yuqiang et al. | 2023
- 1
-
Backdoor Attack Against Automatic Speaker Verification Models in Federated LearningMeng, Dan / Wang, Xue / Wang, Jun et al. | 2023
- 1
-
Wireless Deep Speech Semantic TransmissionXiao, Zixuan / Yao, Shengshi / Dai, Jincheng / Wang, Sixian / Niu, Kai / Zhang, Ping et al. | 2023
- 1
-
Context-Aware Fine-Tuning of Self-Supervised Speech ModelsShon, Suwon / Wu, Felix / Kim, Kwangyoun / Sridhar, Prashant / Livescu, Karen / Watanabe, Shinji et al. | 2023
- 1
-
Improved Acoustic-to-Articulatory Inversion Using Representations from Pretrained Self-Supervised Learning ModelsUdupa, Sathvik / C, Siddarth / Ghosh, Prasanta Kumar et al. | 2023
- 1
-
Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in NoiseEffa, Francois / Serizel, Romain / Arz, Jean-Pierre / Grimault, Nicolas et al. | 2023
- 1
-
Disentangled Training with Adversarial Examples for Robust Small-Footprint Keyword SpottingWang, Zhenyu / Wan, Li / Zhang, Biqiao / Huang, Yiteng / Li, Shang-Wen / Sun, Ming / Lei, Xin / Yang, Zhaojun et al. | 2023
- 1
-
Numerical Semantic Modeling for Implicit Discourse Relation RecognitionWang, Chenxu / Jian, Ping / Wang, Hai et al. | 2023
- 1
-
Stereoscopic Video Retargeting Based on Camera Motion ClassificationCai, Linghui / Tang, Zhenhua et al. | 2023
- 1
-
Spoofed Training Data for Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural VocodersWang, Xin / Yamagishi, Junichi et al. | 2023
- 1
-
Massively Multilingual Shallow Fusion with Large Language ModelsHu, Ke / Sainath, Tara N. / Li, Bo / Du, Nan / Huang, Yanping / Dai, Andrew M. / Zhang, Yu / Cabrera, Rodrigo / Chen, Zhifeng / Strohman, Trevor et al. | 2023
- 1
-
SDTN: Speaker Dynamics Tracking Network for Emotion Recognition in ConversationChen, Jiawei / Huang, Peijie / Huang, Guotai / Li, Qianer / Xu, Yuhong et al. | 2023
- 1
-
Improving CTC-Based ASR Models With Gated Interlayer CollaborationYang, Yuting / Li, Yuke / Du, Binbin et al. | 2023
- 1
-
Restoration of Time-Varying Graph Signals using Deep Algorithm UnrollingKojima, Hayate / Noguchi, Hikari / Yamada, Koki / Tanaka, Yuichi et al. | 2023
- 1
-
A Dual-Path Transformer Network for Scene Text DetectionLin, Jingyu / Yan, Yan / Wang, Hanzi et al. | 2023
- 1
-
Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative ModelGolmakani, Ali / Sadeghi, Mostafa / Serizel, Romain et al. | 2023
- 1
-
Ideal: Improved Dense Local Contrastive Learning For Semi-Supervised Medical Image SegmentationBasak, Hritam / Chattopadhyay, Soumitri / Kundu, Rohit / Nag, Sayan / Mallipeddi, Rammohan et al. | 2023
- 1
-
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis SystemYoshimura, Takenori / Takaki, Shinji / Nakamura, Kazuhiro / Oura, Keiichiro / Hono, Yukiya / Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi et al. | 2023
- 1
-
Symbol Level Precoding in the RF Domain for Low Hardware Complexity RIS-Assisted MU-MISO SystemsTsinos, Christos G. / Tsiftsis, Theodoros A. / Schober, Robert et al. | 2023
- 1
-
CTCBERT: Advancing Hidden-Unit Bert with CTC ObjectivesFan, Ruchao / Wang, Yiming / Gaur, Yashesh / Li, Jinyu et al. | 2023
- 1
-
Sine: Similarity-Regularized Intra-Class Exploitation for Cross-Granularity Few-Shot LearningYang, Jinhai / Yang, Hua et al. | 2023
- 1
-
Topological Signal Processing Over Weighted Simplicial ComplexesBattiloro, Claudio / Sardellitti, Stefania / Barbarossa, Sergio / Lorenzo, Paolo Di et al. | 2023
- 1
-
Neural Mode EstimationSun, Peng / Wen, Zhenyu / Zhou, Yejian / Hong, Zhen / Lin, Tao et al. | 2023
- 1
-
Meta Learning with Adaptive Loss Weight for Low-Resource Speech RecognitionWang, Qiulin / Hu, Wenxuan / Li, Lin / Hong, Qingyang et al. | 2023
- 1
-
An Auto-Encoder Based Method for Camera Fingerprint CompressionZhang, Kaixuan / Liu, Zihan / Hu, Jiashang / Wang, Shilin et al. | 2023
- 1
-
A Transformer-Based E2E SLU Model for Improved Semantic ParsingIstaiteh, Othman / Kussad, Yasmeen / Daqour, Yahya / Habib, Maria / Habash, Mohammad / Gowda, Dhananjaya et al. | 2023
- 1
-
Procontext: Exploring Progressive Context Transformer for TrackingLan, Jin-Peng / Cheng, Zhi-Qi / He, Jun-Yan / Li, Chenyang / Luo, Bin / Bao, Xu / Xiang, Wangmeng / Geng, Yifeng / Xie, Xuansong et al. | 2023
- 1
-
Achieving Fair Speech Emotion Recognition via Perceptual FairnessChien, Woan-Shiuan / Lee, Chi-Chun et al. | 2023
- 1
-
Unsupervised Pre-Training for Data-Efficient Text-to-Speech on Low Resource LanguagesPark, Seongyeon / Song, Myungseo / Kim, Bohyung / Oh, Tae-Hyun et al. | 2023
- 1
-
Image Sharing Chain Detection VIA Sequence-To-Sequence ModelYou, Jiaxiang / Li, Yuanman / Liang, Rongqin / Tan, Yuxuan / Zhou, Jiantao / Li, Xia et al. | 2023
- 1
-
NCL: Textual Backdoor Defense Using Noise-Augmented Contrastive LearningZhai, Shengfang / Shen, Qingni / Chen, Xiaoyi / Wang, Weilong / Li, Cong / Fang, Yuejian / Wu, Zhonghai et al. | 2023
- 1
-
Higher-Order Spatio-Temporal Neural Networks for Covid-19 ForecastingChen, Yuzhou / Batsakis, Sotiris / Poor, H. Vincent et al. | 2023
- 1
-
Regression to Classification: Waveform Encoding for Neural Field-Based Audio Signal RepresentationKim, TaeSoo / Rho, Daniel / Lee, Gahui / Park, JaeHan / Ko, Jong Hwan et al. | 2023
- 1
-
Visual Answer Localization with Cross-Modal Mutual Knowledge TransferWeng, Yixuan / Li, Bin et al. | 2023