Hyperbolic Audio Source Separation (English)
- Petermann, Darius
- Wichern, Gordon
- Subramanian, Aswin
- Roux, Jonathan Le
In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 1-5; 2023
- Conference paper / Electronic Resource
- Title: Hyperbolic Audio Source Separation
- Contributors: Petermann, Darius (author) / Wichern, Gordon (author) / Subramanian, Aswin (author) / Roux, Jonathan Le (author)
- Publisher: IEEE
- Publication date: 2023-06-04
- Size: 5787149 bytes
- Type of media: Conference paper
- Type of material: Electronic Resource
- Language: English
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- Sequential Invariant Information Bottleneck | Zhang, Yichen / Yu, Shujian / Chen, Badong et al. | 2023
- Projected Hierarchical ALS for Generalized Boolean Matrix Factorization | Farias, Rodrigo Cabral / Miron, Sebastian et al. | 2023
- Tensorized LSSVMS For Multitask Regression | Liu, Jiani / Tao, Qinghua / Zhu, Ce / Liu, Yipeng / Suykens, Johan A.K. et al. | 2023
- Robust GMM Parameter Estimation via the K-BM Algorithm | Kenig, Ori / Todros, Koby / Adali, Tulay et al. | 2023
- Nord: Non-Matching Reference Based Relative Depth Estimation from Binaural Speech | Manocha, Pranay / Gebru, Israel D. / Kumar, Anurag / Markovic, Dejan / Richard, Alexander et al. | 2023
- Deep Reinforcement Learning for Green UAV-Assisted Data Collection | Mondal, Abhishek / Mishra, Deepak / Prasad, Ganesh / Hossain, Ashraf et al. | 2023
- Multitrack Music Transformer | Dong, Hao-Wen / Chen, Ke / Dubnov, Shlomo / McAuley, Julian / Berg-Kirkpatrick, Taylor et al. | 2023
- Rate Region Characterization for Semantics and Bits based Multiuser Communications | Mu, Xidong / Liu, Yuanwei et al. | 2023
- Multiple Target Measurements: Bayesian Framework for Moving Object Detection in Mimo Radar | Eisele, Bastian / Bereyhi, Ali / Muller, Ralf et al. | 2023
- Frequency-Aware Attentional Feature Fusion for Deepfake Detection | Tian, Cheng / Luo, Zhiming / Shi, Guimin / Li, Shaozi et al. | 2023
- Parallel 2D Seismic Ray Tracing Using Cuda on a Jetson Nano | Shin, Ban-Sok / Wientgens, Luis / Shutin, Dmitriy et al. | 2023
- Central Nodes Detection from Partially Observed Graph Signals | He, Yiran / Wai, Hoi-To et al. | 2023
- Disentangled Feature Learning for Real-Time Neural Speech Coding | Jiang, Xue / Peng, Xiulian / Zhang, Yuan / Lu, Yan et al. | 2023
- Noise-Aware Target Extension with Self-Distillation for Robust Speech Recognition | Seong, Ju-Seok / Choi, Jeong-Hwan / Kyung, Jehyun / Jeoung, Ye-Rin / Chang, Joon-Hyuk et al. | 2023
- Imaginary Voice: Face-Styled Diffusion Model for Text-to-Speech | Lee, Jiyoung / Son Chung, Joon / Chung, Soo-Whan et al. | 2023
- Target-Speaker Voice Activity Detection Via Sequence-to-Sequence Prediction | Cheng, Ming / Wang, Weiqing / Zhang, Yucong / Qin, Xiaoyi / Li, Ming et al. | 2023
- Detail-Aware Uncalibrated Photometric Stereo | Agudo, Antonio et al. | 2023
- CAN2V: Can-Bus Data-Based Seq2seq Model for Vehicle Velocity Prediction | Cho, Jae-Heung / Chang, Joon-Hyuk et al. | 2023
- SemGeo: Semantic Keywords for Cross-View Image Geo-Localization | Rodrigues, Royston / Tani, Masahiro et al. | 2023
- Direction-of-Arrival Estimation Using Gaussian Process Interpolation | Khurjekar, Ishan D. / Gerstoft, Peter / Mecklenbrauker, Christoph F. / Michalopoulou, Zoi-Heleni et al. | 2023
- Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models | Yeh, Sung-Lin / Tang, Hao et al. | 2023
- Waveform Boundary Detection for Partially Spoofed Audio | Cai, Zexin / Wang, Weiqing / Li, Ming et al. | 2023
- Fast 3D Human Pose Estimation Using RF Signals | Yu, Cong / Zhang, Dongheng / Wu, Zhi / Xie, Chunyang / Lu, Zhi / Hu, Yang / Chen, Yan et al. | 2023
- LABANet: Lead-Assisting Backbone Attention Network for Oral Multi-Pathology Segmentation | Chen, Huabao / Huang, Xiaolong / Li, Qiankun / Wang, Jianqing / Fang, Bo / Chen, Junxin et al. | 2023
- Boosting Bert Subnets with Neural Grafting | Hu, Ting / Meinel, Christoph / Yang, Haojin et al. | 2023
- Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters | Peng, Junyi / Stafylakis, Themos / Gu, Rongzhi / Plchot, Oldrich / Mosner, Ladislav / Burget, Lukas / Cernocky, Jan et al. | 2023
- Wavsyncswap: End-To-End Portrait-Customized Audio-Driven Talking Face Generation | Bao, Weihong / Chen, Liyang / Zhou, Chaoyong / Yang, Sicheng / Wu, Zhiyong et al. | 2023
- Fine-Grained Private Knowledge Distillation | Li, Yuntong / Wang, Shaowei / Wang, Yingying / Li, Jin / Qian, Yuqiu / Xin, Bangzhou / Yang, Wei et al. | 2023
- Neural Diarization with Non-Autoregressive Intermediate Attractors | Fujita, Yusuke / Komatsu, Tatsuya / Scheibler, Robin / Kida, Yusuke / Ogawa, Tetsuji et al. | 2023
- Cross-Site Generalization for Imbalanced Epileptic Classification | Abdallah, Tala / Jrad, Nisrine / Abdallah, Fahed / Humeau-Heurtier, Anne / Van Bogaert, Patrick et al. | 2023
- PMMSD: Development of the Matrix Sentence Intelligibility Dataset for Mandarin with Lombard Effect | Pei, Hanchen / Yang, Yuhong / Chen, Xufeng / Liu, Qingmu / Chen, Hongyang / Tu, Weiping / Lin, Song et al. | 2023
- Hardware Friendly Spline Sketched Lidar | Sheehan, Michael P. / Tachella, Julian / Davies, Mike E. et al. | 2023
- Ontology-Aware Network for Zero-Shot Sketch-Based Image Retrieval | Zhang, Haoxiang / Jiang, He / Wang, Ziqiang / Cheng, Deqiang et al. | 2023
- Post-Trained Language Model Adaptive to Extractive Summarization of Long Spoken Documents | Ok, Hyunjong / Park, Seong-Bae et al. | 2023
- APGP: Accuracy-Preserving Generative Perturbation for Defending Against Model Cloning Attacks | Cheng, Anda / Cheng, Jian et al. | 2023
- Robust Log-Based Anomaly Detection with Hierarchical Contrastive Learning | Zhao, Yuhui / Yang, Ruichun / Yang, Ning / Lin, Tao / Fu, Qiuai / Ma, Yuchi et al. | 2023
- Learning Speech Representations with Flexible Hidden Feature Dimensions | Tang, Huaizhen / Zhang, Xulong / Wang, Jianzong / Cheng, Ning / Xiao, Jing et al. | 2023
- Expectation Propagation on Factor Graphs Based on Matrix Decomposition | Mekhiche, Adam / Cipriano, Antonio Maria / Poulliat, Charly et al. | 2023
- PRRD: Pixel-Region Relation Distillation For Efficient Semantic Segmentation | Wang, Chen / Zhong, Jiang / Dai, Qizhu / Qi, Yafei / Li, Rongzhen / Lei, Qin / Fang, Bin / Li, Xue et al. | 2023
- Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement | Li, Haoyu / Liu, Yun / Yamagishi, Junichi et al. | 2023
- Towards Robust Data-Driven Underwater Acoustic Localization: A Deep CNN Solution with Performance Guarantees for Model Mismatch | Weiss, Amir / Singer, Andrew C. / Wornell, Gregory W. et al. | 2023
- TF-GRIDNET: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation | Wang, Zhong-Qiu / Cornell, Samuele / Choi, Shukjae / Lee, Younglo / Kim, Byeong-Yeol / Watanabe, Shinji et al. | 2023
- Fast Cross-Correlation for TDoA Estimation on Small Aperture Microphone Arrays | Grondin, Francois / Maheux, Marc-Antoine / Lauzon, Jean-Samuel / Vincent, Jonathan / Michaud, Francois et al. | 2023
- Certified Robustness of Quantum Classifiers Against Adversarial Examples Through Quantum Noise | Huang, Jhih-Cing / Tsai, Yu-Lin / Yang, Chao-Han Huck / Su, Cheng-Fang / Yu, Chia-Mu / Chen, Pin-Yu / Kuo, Sy-Yen et al. | 2023
- Deep AHS: A Deep Learning Approach to Acoustic Howling Suppression | Zhang, Hao / Yu, Meng / Yu, Dong et al. | 2023
- PCF: ECAPA-TDNN with Progressive Channel Fusion for Speaker Verification | Zhao, Zhenduo / Li, Zhuo / Wang, Wenchao / Zhang, Pengyuan et al. | 2023
- DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification | Wang, Yuanyuan / Zhang, Yang / Wu, Zhiyong / Yang, Zhihan / Wei, Tao / Zou, Kun / Meng, Helen et al. | 2023
- Relevance Propagation through Deep Conditional Random Fields | Yang, Xiangyu / Joukovsky, Boris / Deligiannis, Nikos et al. | 2023
- Benchmark of Physiological Model Based and Deep Learning Based Remote Photoplethysmography in Automotive Applications | Wang, Zhiyu / Yang, Xuezhi / Lu, Hongzhou / Shan, Caifeng / Wang, Wenjin et al. | 2023
- WeSinger 2: Fully Parallel Singing Voice Synthesis via Multi-Singer Conditional Adversarial Training | Zhang, Zewang / Zheng, Yibin / Li, Xinhui / Lu, Li et al. | 2023
- Managing Information Updating with Edge Computing: A Distributed and Learning Approach | He, Junyi / Zhang, Di / Liu, Shumeng / Zhou, Yuezhi / Zhang, Yaoxue et al. | 2023
- Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data | Wang, Qing / Du, Jun / Nian, Zhaoxu / Niu, Shutong / Chai, Li / Wu, Huaxin / Pan, Jia / Lee, Chin-Hui et al. | 2023
- BECTRA: Transducer-Based End-To-End ASR with Bert-Enhanced Encoder | Higuchi, Yosuke / Ogawa, Tetsuji / Kobayashi, Tetsunori / Watanabe, Shinji et al. | 2023
- Vision Transformer-Based Feature Extraction for Generalized Zero-Shot Learning | Kim, Jiseob / Shim, Kyuhong / Kim, Junhan / Shim, Byonghyo et al. | 2023
- On Designing A 3d Imaging Summer Project For Ontario’s High School Students During Covid-19 Pandemic | Lan, Fengbo / Cheung, Gene / Arora, Prabhkirat / Richard-Koko, Deinabo / Cole, Lisa et al. | 2023
- Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy | Zhang, Xulong / Tang, Haobin / Wang, Jianzong / Cheng, Ning / Luo, Jian / Xiao, Jing et al. | 2023
- Global-Context Aware Generative Protein Design | Tan, Cheng / Gao, Zhangyang / Xia, Jun / Hu, Bozhen / Li, Stan Z. et al. | 2023
- MvCo-DoT: Multi-View Contrastive Domain Transfer Network for Medical Report Generation | Wang, Ruizhi / Wang, Xiangtao / Xu, Zhenghua / Xu, Wenting / Chen, Junyang / Lukasiewicz, Thomas et al. | 2023
- Active Selection of Source Patients in Transfer Learning for Epileptic Seizure Detection Using Riemannian Manifold | Orihara, Toshiki / Hassan, Kazi Mahmudul / Tanaka, Toshihisa et al. | 2023
- Exploring Subgroup Performance in End-to-End Speech Models | Koudounas, Alkis / Pastor, Eliana / Attanasio, Giuseppe / Mazzia, Vittorio / Giollo, Manuel / Gueudre, Thomas / Cagliero, Luca / de Alfaro, Luca / Baralis, Elena / Amberti, Daniele et al. | 2023
- Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder | Yoneyama, Reo / Wu, Yi-Chiao / Toda, Tomoki et al. | 2023
- CyFi-TTS: Cyclic Normalizing Flow with Fine-Grained Representation for End-to-End Text-to-Speech | Hwang, In-Sun / Han, Young-Sub / Jeon, Byoung-Ki et al. | 2023
- GaPP: Multi-Target Tracking with Gaussian Processes | Goodyer, Fred / Ahmad, Bashar I. / Godsill, Simon et al. | 2023
- AMPose: Alternately Mixed Global-Local Attention Model for 3D Human Pose Estimation | Lin, Hongxin / Chiu, Yunwei / Wu, Peiyuan et al. | 2023
- Modulo EEG Signal Recovery Using Transformer | Geng, Tianyu / Ji, Feng / Pratibha / Tay, Wee Peng et al. | 2023
- Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement | O'Malley, Tom / Ding, Shaojin / Narayanan, Arun / Wang, Quan / Rikhye, Rajeev / Liang, Qiao / He, Yanzhang / McGraw, Ian et al. | 2023
- Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning | Zhu, Qiu-Shi / Zhou, Long / Zhang, Jie / Liu, Shu-Jie / Hu, Yu-Chen / Dai, Li-Rong et al. | 2023
- Semantics-Aware Gamma Correction for Unsupervised Low-Light Image Enhancement | Chen, Yu-Hsuan / Pan, Fu-Cheng / Liao, Yu-Chien / Kao, Jao-Hong / Wang, Yu-Chiang Frank et al. | 2023
- Time-Frequency Awareness Network For Human Mesh Recovery From Videos | Zhang, Boyang / Wu, Suping / Jia, Meining et al. | 2023
- Could the BubbleView Metaphor be used to Infer Visual Attention on 3D Graphical Content? | Bruckert, Alexandre / Abid, Mona / Da Silva, Matthieu Perreira / Le Callet, Patrick et al. | 2023
- Enhancement of Text-Predicting Style Token With Generative Adversarial Network for Expressive Speech Synthesis | Kanagawa, Hiroki / Ijima, Yusuke et al. | 2023
- Modeling Global Latent Semantic in Multi-Turn Conversations with Random Context Reconstruction | Zhang, Chengwen / Wu, Danqin et al. | 2023
- Extended Expectation Maximization for Under-Fitted Models | Rekavandi, Aref Miri / Seghouane, Abd-Krim / Boussaid, Farid / Bennamoun, Mohammed et al. | 2023
- Visual Information Matters for ASR Error Correction | Kumar, Vanya Bannihatti / Cheng, Shanbo / Peng, Ningxin / Zhang, Yuchen et al. | 2023
- Gesper: A Unified Framework for General Speech Restoration | Chen, Jun / Shi, Yupeng / Liu, Wenzhe / Rao, Wei / He, Shulin / Li, Andong / Wang, Yannan / Wu, Zhiyong / Shang, Shidong / Zheng, Chengshi et al. | 2023
- Select The Best: Enhancing Graph Representation with Adaptive Negative Sample Selection | Zheng, Xiangping / Liang, Xun / Wu, Bo et al. | 2023
- AV-TAD: Audio-Visual Temporal Action Detection With Transformer | Li, Yangcheng / Yu, Zefang / Xiang, Suncheng / Liu, Ting / Fu, Yuzhuo et al. | 2023
- An Interpretable Model Using Evidence Information for Multi-Hop Question Answering Over Long Texts | Chen, Yanyi / Liu, Ruifang / Liu, Xiyan / Shi, Yidong / Bai, Ge et al. | 2023
- Toward A Multimodal Approach for Disfluency Detection and Categorization | Romana, Amrit / Koishida, Kazuhito et al. | 2023
- One-Shot Action Detection via Attention Zooming In | Hsieh, He-Yen / Chen, Ding-Jie / Chang, Cheng-Wei / Liu, Tyng-Luh et al. | 2023
- On Tracking a Stochastically Time-Varying Subspace | Solo, Victor et al. | 2023
- Enhancing Multimodal Alignment with Momentum Augmentation for Dense Video Captioning | Wei, Yiwei / Yuan, Shaozu / Chen, Meng / Wang, Longbiao et al. | 2023
- Towards Real-Time Person Search with Invariant Feature Learning | Jia, Chengyou / Luo, Minnan / Dang, Zhuohang / Chang, Xiaojun / Zheng, Qinghua et al. | 2023
- Semi-Supervised Sound Event Detection with Pre-Trained Model | Xu, Liang / Wang, Lizhong / Bi, Sijun / Liu, Hanyue / Wang, Jing et al. | 2023
- Fast and Exact Enumeration of Deep Networks Partitions Regions | Balestriero, Randall / LeCun, Yann et al. | 2023
- Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech | Saeki, Takaaki / Zen, Heiga / Chen, Zhehuai / Morioka, Nobuyuki / Wang, Gary / Zhang, Yu / Bapna, Ankur / Rosenberg, Andrew / Ramabhadran, Bhuvana et al. | 2023
- Improving Noisy Student Training on Non-Target Domain Data for Automatic Speech Recognition | Chen, Yu / Ding, Wen / Lai, Junjie et al. | 2023
- A Lightweight Fourier Convolutional Attention Encoder for Multi-Channel Speech Enhancement | Sun, Siyu / Jin, Jian / Han, Zhe / Xia, Xianjun / Chen, Li / Xiao, Yijian / Ding, Piao / Song, Shenyi / Togneri, Roberto / Zhang, Haijian et al. | 2023
- ICStega: Image Captioning-based Semantically Controllable Linguistic Steganography | Wang, Xilong / Wang, Yaofei / Chen, Kejiang / Ding, Jinyang / Zhang, Weiming / Yu, Nenghai et al. | 2023
- Multi-Agent Reinforcement Learning for Covert Semantic Communications over Wireless Networks | Wang, Yining / Hu, Ye / Du, Hongyang / Luo, Tao / Niyato, Dusit et al. | 2023
- Precognition in Contextual Spoken Language Understanding via Knowledge Distillation | Su, Nan / Du, Bingzhu / Zhang, Yuchi / Liu, Chao / Wang, Yongliang / Chen, Hong / Lu, Xin et al. | 2023
- Improving Spoken Language Identification with Map-Mix | Rajaa, Shangeth / Anandan, Kriti / Dalmia, Swaraj / Gupta, Tarun / Chng, Eng Siong et al. | 2023
- Last: Scalable Lattice-Based Speech Modelling in Jax | Wu, Ke / Variani, Ehsan / Bagby, Tom / Riley, Michael et al. | 2023
- Multiple Acoustic Features Speech Emotion Recognition Using Cross-Attention Transformer | He, Yurun / Minematsu, Nobuaki / Saito, Daisuke et al. | 2023
- Output-Dependent Gaussian Process State-Space Model | Lin, Zhidi / Cheng, Lei / Yin, Feng / Xu, Lexi / Cui, Shuguang et al. | 2023
- Graph Based Semantic Ensemble of Riemannian Neural Structured Learning for BCI-EEG Signal Classification | Gupta, Vinay / Behera, Laxmidhar / Sandhan, Tushar et al. | 2023
- On the Importance of Different Cough Phases for COVID-19 Detection | Zhu, Yi / Shaik, Mahil Hussain / Falk, Tiago H. et al. | 2023
- SRTNET: Time Domain Speech Enhancement via Stochastic Refinement | Qiu, Zhibin / Fu, Mengfan / Yu, Yinfeng / Yin, Lili / Sun, Fuchun / Huang, Hao et al. | 2023
- TRICL: Triplet Continual Learning | Zhang, Xianchao / Wang, Guanglu / Zhang, Xiaotong / Liu, Han / Yin, Zhengxi / Yang, Wentao et al. | 2023
- Modulation-Based Center Alignment and Motion Mining for Spatial Temporal Action Detection | Zhao, Weiji / Huang, Kefeng / Zhang, Chongyang et al. | 2023
- Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis | Lei, Shun / Zhou, Yixuan / Chen, Liyang / Wu, Zhiyong / Kang, Shiyin / Meng, Helen et al. | 2023
- Electric Network Frequency Detection Using Least Absolute Deviations | Korgialas, Christos / Kotropoulos, Constantine et al. | 2023
- An Empirical Study on Speech Restoration Guided by Self-Supervised Speech Representation | Byun, Jaeuk / Ji, Youna / Chung, Soo-Whan / Choe, Soyeon / Choi, Min-Seok et al. | 2023
- Source Localization for Extremely Large-Scale Antenna Arrays with Spatial Non-Stationarity | Wu, Xiaohuan / Sun, Ji / Jia, Xiaoyuan / Wang, Shuxin et al. | 2023
- Cross-Modal Optical Flow Estimation via Modality Compensation and Alignment | Zhai, Mingliang / Ni, Kang / Xie, Jiucheng / Gao, Hao et al. | 2023
- Detecting Out-of-Distribution Examples Via Class-Conditional Impressions Reappearing | Chen, Jinggang / Qu, Xiaoyang / Li, Junjie / Wang, Jianzong / Wan, Jiguang / Xiao, Jing et al. | 2023
- OTW: Optimal Transport Warping for Time Series | Latorre, Fabian / Liu, Chenghao / Sahoo, Doyen / Hoi, Steven C.H. et al. | 2023
- Enhancing Unsupervised Speech Recognition with Diffusion GANS | Wu, Xianchao et al. | 2023
- Cross-Modal Matching and Adaptive Graph Attention Network for RGB-D Scene Recognition | Guo, Yuhui / Liang, Xun / Kwok, James T. / Zheng, Xiangping / Wu, Bo / Ma, Yuefeng et al. | 2023
- Column-Based Matrix Approximation with Quasi-Polynomial Structure | Chae, Jeongmin / Narayanamurthy, Praneeth / Bac, Selin / Sharada, Shaama Mallikarjun / Mitra, Urbashi et al. | 2023
- Row Conditional-TGAN for Generating Synthetic Relational Databases | Gueye, Mohamed / Attabi, Yazid / Dumas, Maxime et al. | 2023
- Product Graph Learning From Multi-Attribute Graph Signals with Inter-Layer Coupling | Zhang, Chenyue / He, Yiran / Wai, Hoi-To et al. | 2023
- Unlimited Sampling Radar: Life Below the Quantization Noise | Feuillen, Thomas / Shankar MRR, Bhavani / Bhandari, Ayush et al. | 2023
- SQA: Strong Guidance Query with Self-Selected Attention for Human-Object Interaction Detection | Zhang, Feng / Sheng, Liu / Guo, Bingnan / Chen, Ruixiang / Chen, Junhao et al. | 2023
- Optimal Compression for Minimizing Classification Error Probability: An Information-Theoretic Approach | Gao, Jingchao / Tang, Ao / Xu, Weiyu et al. | 2023
- Whether Contribution of Features Differ Between Video-Mediated and In-Person Meetings in Important Utterance Estimation | Nihei, Fumio / Ishii, Ryo / Nakano, Yukiko I. / Fukayama, Atsushi / Nakamura, Takao et al. | 2023
- Hybrid Neural Network with Cross- and Self-Module Attention Pooling for Text-Independent Speaker Verification | Alam, Jahangir / Kang, Woo Hyun / Fathan, Abderrahim et al. | 2023
- Light-Weight Sequential SBL Algorithm: An Alternative to OMP | Pote, Rohan R. / Rao, Bhaskar D. et al. | 2023
- CyPMLI: WISL-Minimized Unimodular Sequence Design via Power Method-Like Iterations | Eamaz, Arian / Yeganegi, Farhang / Soltanalian, Mojtaba et al. | 2023
- Invariant Adversarial Imitation Learning From Visual Inputs | Zhang, Haoran / Tian, Yinghong / Yuan, Liang / Lu, Yue et al. | 2023
- Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification | Kang, Zuheng / He, Yayun / Wang, Jianzong / Peng, Junqing / Qu, Xiaoyang / Xiao, Jing et al. | 2023
- Relating EEG Recordings to Speech Using Envelope Tracking and The Speech-FFR | Thornton, Mike / Mandic, Danilo / Reichenbach, Tobias et al. | 2023
- A New Probabilistic Distance Metric with Application in Gaussian Mixture Reduction | Sajedi, Ahmad / Lawryshyn, Yuri A. / Plataniotis, Konstantinos N. et al. | 2023
- Nonnegative Block-Term Decomposition with the β-Divergence: Joint Data Fusion and Blind Spectral Unmixing | Prevost, C. / Leplat, V. et al. | 2023
- Simultaneously Learning Robust Audio Embeddings and Balanced Hash Codes for Query-by-Example | Singh, Anup / Demuynck, Kris / Arora, Vipul et al. | 2023
- Robust and Parallelizable Tensor Completion Based on Tensor Factorization and Maximum Correntropy Criterion | He, Yicong / Atia, George K. et al. | 2023
- Possibilistic Bernoulli Filter for Extended Target Tracking | Chen, Zhijin / Ristic, Branko / Kim, Du Yong et al. | 2023
- Augmentation Robust Self-Supervised Learning for Human Activity Recognition | Xu, Cong / Li, Yuhang / Lee, Dae / Hoon Park, Dae / Mao, Hongda / Do, Huyen / Chung, Jonathan / Nair, Dinesh et al. | 2023
- RIS-Aided Wideband DFRC with Reconfigurable Holographic Surface | Wei, Tong / Wu, Linlong / Mishra, Kumar Vijay / Bhavani Shankar, M. R. et al. | 2023
- Time-Varying Signals Recovery Via Graph Neural Networks | Castro-Correa, Jhon A. / Giraldo, Jhony H. / Mondal, Anindya / Badiey, Mohsen / Bouwmans, Thierry / Malliaros, Fragkiskos D. et al. | 2023
- Low-Rank Plus Sparse Trajectory Decomposition for Direct Exoplanet Imaging | Vary, Simon / Daglayan, Hazan / Jacques, Laurent / Absil, P.-A. et al. | 2023
- Healthcall Corpus and Transformer Embeddings from Healthcare Customer-Agent Conversations | Lackovic, Nikola / Montacie, Claude / Lequilliec, Cedric / Caraty, Marie-Jose et al. | 2023
- Tracking Objects and Activities with Attention for Temporal Sentence Grounding | Xiong, Zeyu / Liu, Daizong / Zhou, Pan / Zhu, Jiahao et al. | 2023
- A Computationally Efficient Algorithm for Distributed Adaptive Signal Fusion Based on Fractional Programs | Musluoglu, Cem Ates / Bertrand, Alexander et al. | 2023
- Improving Weakly Supervised Sound Event Detection with Causal Intervention | Xin, Yifei / Yang, Dongchao / Cui, Fan / Wang, Yujun / Zou, Yuexian et al. | 2023
- Phase Retrieval for Rydberg Quantum Arrays | Vouras, Peter / Vijay Mishra, Kumar / Artusio-Glimpse, Alexandra et al. | 2023
- Federated Semi-Supervised Learning for Object Detection in Autonomous Driving | Chi, Fangyuan / Wang, Yixiao / Nasiopoulos, Panos / Leung, Victor C. M. / Pourazad, Mahsa T. et al. | 2023
- SIAST: A Slot Imbalance-Aware Self-Training Scheme for Semi-Supervised Slot Filling | Liu, Jiachi / Xiong, Sishi / He, Yuehuan / Zhou, Tong / Wang, Liwen / Li, Xuefeng / Xiao, Bo et al. | 2023
- Continual Cell Instance Segmentation of Microscopy Images | Chuang, Tzu-Ting / Wei, Ting-Yun / Hsieh, Yu-Hsing / Chen, Chu-Song / Yang, Huei-Fang et al. | 2023
- Code-Switching Text Generation and Injection in Mandarin-English ASR | Yu, Haibin / Hu, Yuxuan / Qian, Yao / Jin, Ma / Liu, Linquan / Liu, Shujie / Shi, Yu / Qian, Yanmin / Lin, Edward / Zeng, Michael et al. | 2023
- Multi-Speaker End-to-End Multi-Modal Speaker Diarization System for the MISP 2022 Challenge | Liu, Tao / Chen, Zhengyang / Qian, Yanmin / Yu, Kai et al. | 2023
- How to Push the Fastest Model 50x Faster: Streaming Non-Autoregressive Speech Synthesis on Resouce-Limited Devices | Nguyen, Van-Thinh / Pham, Hung-Cuong / Mac, Dang-Khoa et al. | 2023
- Improving Transformer-Based Networks with Locality for Automatic Speaker Verification | Sang, Mufan / Zhao, Yong / Liu, Gang / Hansen, John H.L. / Wu, Jian et al. | 2023
- Quickest Change Detection with Leave-one-out Density Estimation | Liang, Yuchen / Veeravalli, Venugopal V. et al. | 2023
- CPA: Compressed Private Aggregation for Scalable Federated Learning Over Massive Networks | Lang, Natalie / Sofer, Elad / Shlezinger, Nir / D'Oliveira, Rafael G. L. / El Rouayheb, Salim et al. | 2023
- Optimizing Vision Transformers for Medical Image Segmentation | Liu, Qianying / Kaul, Chaitanya / Wang, Jun / Anagnostopoulos, Christos / Murray-Smith, Roderick / Deligianni, Fani et al. | 2023
- Jointly Visual- and Semantic-Aware Graph Memory Networks for Temporal Sentence Localization in Videos | Liu, Daizong / Zhou, Pan et al. | 2023
- Rate-Distortion Optimized Variable-Node-size Trisoup for Point Cloud Coding | Unno, Kyohei / Matsuzaki, Kohei / Komorita, Satoshi / Kawamura, Kei et al. | 2023
- Cross-Device Federated Learning for Mobile Health Diagnostics: A First Study on COVID-19 Detection | Xia, Tong / Han, Jing / Ghosh, Abhirup / Mascolo, Cecilia et al. | 2023
- Disentangling Speech from Surroundings with Neural Embeddings | Omran, Ahmed / Zeghidour, Neil / Borsos, Zalan / de Chaumont Quitry, Felix / Slaney, Malcolm / Tagliasacchi, Marco et al. | 2023
- Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model | Ren, Zhiyuan / Pan, Zhihong / Zhou, Xin / Kang, Le et al. | 2023
- Distributed Admm with Limited Communications Via Deep Unfolding | Noah, Yoav / Shlezinger, Nir et al. | 2023
- Semantic Preprocessor for Image Compression for Machines | Yang, Mingyi / Herranz, Luis / Yang, Fei / Murn, Luka / Blanch, Marc Gorriz / Wan, Shuai / Yang, Fuzheng / Mrak, Marta et al. | 2023
- New Interpretable Patterns and Discriminative Features from Brain Functional Network Connectivity using Dictionary Learning | Ghayem, F. / Yang, H. / Kantar, F. / Kim, S.-J. / Calhoun, V. D. / Adali, T. et al. | 2023
- Multi-User Data Detection in Massive MIMO with 1-Bit ADCS | Radbord, Amin / Atzeni, Italo / Tolli, Antti et al. | 2023
- Estimating and Analyzing Neural Information flow using Signal Processing on Graphs | Schwock, Felix / Bloch, Julien / Atlas, Les / Abadi, Shima / Yazdan-Shahmorad, Azadeh et al. | 2023
- Oct Image Blind Despeckling Based on Gradient Guided Filter with Speckle Statistical Prior | Li, Sanqian / Xiong, Muxing / Yang, Bing / Zhang, Xiaoqing / Higashita, Risa / Liu, Jiang et al. | 2023
- SSVMR: Saliency-Based Self-Training for Video-Music Retrieval | Cheng, Xuxin / Zhu, Zhihong / Li, Hongxiang / Li, Yaowei / Zou, Yuexian et al. | 2023
- Efficient Multi-Scale Attention Module with Cross-Spatial Learning | Ouyang, Daliang / He, Su / Zhang, Guozhong / Luo, Mingzhu / Guo, Huaiyong / Zhan, Jian / Huang, Zhijie et al. | 2023
- Local to global prior Learning for blind Unsupervised Image super Resolution | Yamawaki, Kazuhiro / Han, Xian-Hua et al. | 2023
- Multi-Observation Hidden Semi-Markov Model for Photoplethysmogram Signal Semantic Segmentation | Hasanzadeh, Navid / Valaee, Shahrokh / Salehinejad, Hojjat et al. | 2023
- Relational Representation Learning for Zero-Shot Relation Extraction with Instance Prompting and Prototype Rectification | Duan, Bin / Liu, Xingxian / Wang, Shusen / Xu, Yajing / Xiao, Bo et al. | 2023
- Customized Automatic Face Beautification | Chen, Wang / Chen, Peizhen / Chen, Weijie / Lin, Luojun et al. | 2023
- Towards Trustworthy Multi-Label Sewer Defect Classification via Evidential Deep Learning | Zhao, Chenyang / Hu, Chuanfei / Shao, Hang / Wang, Zhe / Wang, Yongxiong et al. | 2023
- Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia | Chatzianastasis, Michail / Ilias, Loukas / Askounis, Dimitris / Vazirgiannis, Michalis et al. | 2023
- Infrared and Visible Image Fusion by Using Multi-Scale Transformation and Fractional-Order Gradient Information | Wu, Shiwei / Zhang, Kang / Yuan, Xia / Zhao, Chunxia et al. | 2023
- I3D: Transformer Architectures with Input-Dependent Dynamic Depth for Speech Recognition | Peng, Yifan / Lee, Jaesong / Watanabe, Shinji et al. | 2023
- Semi-Supervised Graph Ultra-Sparsifier Using Reweighted ℓ1 Optimization | Li, Jiayu / Zhang, Tianyun / Jin, Shengmin / Zafarani, Reza et al. | 2023
- Retrieval-Based Natural 3D Human Motion Generation | Tan, Zehan / Yang, Weidong / Wu, Shuai et al. | 2023
- Deep Learning-Based Compressive Sampling Optimization in Massive MIMO Systems | Pavel, Saidur R. / Zhang, Yimin D. / Greco, Maria S. / Gini, Fulvio et al. | 2023
- Multi-Resolution Location-Based Training for Multi-Channel Continuous Speech Separation | Taherian, Hassan / Wang, DeLiang et al. | 2023
- Multitrack Music Transcription with a Time-Frequency Perceiver | Lu, Wei-Tsung / Wang, Ju-Chiang / Hung, Yun-Ning et al. | 2023
- On the Relevance of the Differences Between HRTF Measurement Setups for Machine Learning | Pauwels, Johan / Picinali, Lorenzo et al. | 2023
- On Neural Architectures for Deep Learning-Based Source Separation of Co-Channel OFDM Signals | Lee, Gary C.F. / Weiss, Amir / Lancho, Alejandro / Polyanskiy, Yury / Wornell, Gregory W. et al. | 2023
- LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech | Chen, Jie / Song, Xingchen / Peng, Zhendong / Zhang, Binbin / Pan, Fuping / Wu, Zhiyong et al. | 2023
- Wekws: A Production First Small-Footprint End-to-End Keyword Spotting Toolkit | Wang, Jie / Xu, Menglong / Hou, Jingyong / Zhang, Binbin / Zhang, Xiao-Lei / Xie, Lei / Pan, Fuping et al. | 2023
- Gated Enhanced RPN and Hybrid-View for Few-Shot Object Detection | Wei, Xujun / Zhou, Zechu / Guo, Pinxue / Zhang, Wenqiang et al. | 2023
- SW-WAVENET: Learning Representation from Spectrogram and Wavegram Using Wavenet for Anomalous Sound Detection | Chen, Haihui / Ran, Likai / Sun, Xixia / Cai, Chao et al. | 2023
- Towards Trustworthy Phoneme Boundary Detection with Autoregressive Model and Improved Evaluation Metric | Kim, Hyeongju / Choi, Hyeong-Seok et al. | 2023
- Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning | Chang, Yi / Ren, Zhao / Nguyen, Thanh Tam / Qian, Kun / Schuller, Bjorn W. et al. | 2023
- Asymptotic Bias and Variance of Kernel Ridge Regression | Solo, Victor et al. | 2023
- Modeling Turn-Taking in Human-To-Human Spoken Dialogue Datasets Using Self-Supervised Features | Morais, Edmilson / Damasceno, Matheus / Aronowitz, Hagai / Satt, Aharon / Hoory, Ron et al. | 2023
- Do Coarser Units Benefit Cluster Prediction-Based Speech Pre-Training? | Elkahky, Ali / Hsu, Wei-Ning / Tomasello, Paden / Nguyen, Tu-Anh / Algayres, Robin / Adi, Yossi / Copet, Jade / Dupoux, Emmanuel / Mohamed, Abdelrahman et al. | 2023
- Unsupervised Anomaly Detection and Localization of Machine Audio: A Gan-Based Approach | Jiang, Anbai / Zhang, Wei-Qiang / Deng, Yufeng / Fan, Pingyi / Liu, Jia et al. | 2023
- TRUSTERA: A Live Conversation Redaction System | Gouvea, Evandro / Dadgar, Ali / Jalalvand, Shahab / Chengalvarayan, Rathi / Jayakumar, Badrinath / Price, Ryan / Ruiz, Nicholas / McGovern, Jennifer / Bangalore, Srinivas / Stern, Ben et al. | 2023
- A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution | Pi, Huicheng / Tian, Senmao / Lu, Ming / Liu, Jiaming / Guo, Yandong / Zhang, Shunli et al. | 2023
- Compressive Estimation of Near Field Channels for Ultra Massive-Mimo Wideband THz Systems | Tarboush, Simon / Ali, Anum / Al-Naffouri, Tareq Y. et al. | 2023
- W2KPE: Keyphrase Extraction with Word-Word Relation | Cheng, Wen / Dong, Shichen / Wang, Wei et al. | 2023
- CPD-GAN: Cascaded Pyramid Deformation GAN for Pose Transfer | Huang, Yuan / Tang, Yuting / Zheng, Xiu / Tang, Jie et al. | 2023
- Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | Ma, Pingchuan / Haliassos, Alexandros / Fernandez-Lopez, Adriana / Chen, Honglie / Petridis, Stavros / Pantic, Maja et al. | 2023
- 1
-
SDG-L: A Semiparametric Deep Gaussian Process based Framework for Battery Capacity PredictionLiu, Hanbing / Wu, Yanru / Li, Yang / Kuruoglu, Ercan E. / Zhang, Xuan et al. | 2023
- 1
-
Eigen-Decomposition-Free Directed Graph Sampling via Gershgorin Disc AlignmentLi, Yuejiang / Vicky Zhao, H. / Cheung, Gene et al. | 2023
- 1
-
Self-Supervised Audio-Visual Speaker Representation with Co-Meta LearningChen, Hui / Zhang, Hanyi / Wang, Longbiao / Lee, Kong Aik / Liu, Meng / Dang, Jianwu et al. | 2023
- 1
-
An Asynchronous Updating Reinforcement Learning Framework for Task-Oriented Dialog SystemZhang, Sai / Hu, Yuwei / Wang, Xiaojie / Yuan, Caixia et al. | 2023
- 1
-
Gridless Target Localization for FDA-MIMO Radar with Sparse ArraysWu, Xiaohuan / Liu, Yaxin / Jia, Xiaoyuan et al. | 2023
- 1
-
Anchored Speech Recognition with Neural TransducersRaj, Desh / Jia, Junteng / Mahadeokar, Jay / Wu, Chunyang / Moritz, Niko / Zhang, Xiaohui / Kalinli, Ozlem et al. | 2023
- 1
-
A Novel Approach Based on Voronoï Cells to Classify Spectrogram Zeros of Multicomponent SignalsLaurent, N. / Meignen, S. / Colominas, M. A. / Miramont, J. M. / Auger, F. et al. | 2023
- 1
-
SLICER: Learning Universal Audio Representations Using Low-Resource Self-Supervised Pre-TrainingSeth, Ashish / Ghosh, Sreyan / Umesh, S. / Manocha, Dinesh et al. | 2023
- 1
-
Adapter Tuning With Task-Aware Attention MechanismLu, Jinliang / Jin, Feihu / Zhang, Jiajun et al. | 2023
- 1
-
Spice+: Evaluation of Automatic Audio Captioning Systems with Pre-Trained Language ModelsGontier, Felix / Serizel, Romain / Cerisara, Christophe et al. | 2023
- 1
-
HITSZ TMG at ICASSP 2023 SPGC Shared Task: Leveraging Pre-Training and Distillation Method for Title Generation with Limited ResourceXu, Tianxiao / Zheng, Zihao / Hu, Xinshuo / Sun, Zetian / Zhao, Yu / Hu, Baotian et al. | 2023
- 1
-
Prior-Enhanced Temporal Action Localization Using Subject-Aware Spatial AttentionLiu, Yifan / Tang, Youbao / Zhang, Ning / Lin, Ruei-Sung / Wang, Haoqian et al. | 2023
- 1
-
Phoneme-Level BERT for Enhanced Prosody of Text-To-Speech with Grapheme PredictionsLi, Yinghao Aaron / Han, Cong / Jiang, Xilin / Mesgarani, Nima et al. | 2023
- 1
-
CO-NET: Classification-Oriented Point Cloud Sampling via Informative Feature Learning and Non-Overlapped Local AdjustmentLin, Yanan / Chen, Keyu / Zhou, Shihao / Huang, Yunan / Lei, Yunqi et al. | 2023
- 1
-
An Automotive Radar Dataset For Object ClassificationShyam, Akshad / Komalavally, Kusum / Gautam, Monika / Kancharla, Vamshikrishna / Gudisa, Vennela / Patil, Virendra / Balasubramanian, Aanandh / Channappayya, Sumohana et al. | 2023
- 1
-
Personalized Federated Learning on Long-Tailed Data via Adversarial Feature AugmentationLu, Yang / Qian, Pinxin / Huang, Gang / Wang, Hanzi et al. | 2023
- 1
-
Robust Content-Variant Reference Image Quality Assessment Via Similar Patch MatchingShi, Wenbo / Yang, Wenming / Liao, Qingmin et al. | 2023
- 1
-
Selinet: A Lightweight Model for Single Channel Speech SeparationTan, Ha Minh / Vu, Duc-Quang / Wang, Jia-Ching et al. | 2023
- 1
-
Multi-Speaker Speech Synthesis from Electromyographic Signals by Soft Speech Unit PredictionScheck, Kevin / Schultz, Tanja et al. | 2023
- 1
-
Meta++ Network for Few-Shot Aerospace Crack SegmentationXu, Chengyuan / Liu, Kang / Li, Xuelong et al. | 2023
- 1
-
On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural NetworksNguyen, Dang / Nguyen, Trang / Nguyen, Khai / Phung, Dinh / Bui, Hung / Ho, Nhat et al. | 2023
- 1
-
Gaussian Prior Reinforcement Learning for Nested Named Entity RecognitionYang, Yawen / Hu, Xuming / Ma, Fukun / Li, Shu'Ang / Liu, Aiwei / Wen, Lijie / Yu, Philip S. et al. | 2023
- 1
-
Near-field Localization with Dynamic Metasurface AntennasYang, Qianyu / Guerra, Anna / Guidi, Francesco / Shlezinger, Nir / Zhang, Haiyang / Dardari, Davide / Wang, Baoyun / Eldar, Yonina C. et al. | 2023
- 1
-
Performance of Social Machine Learning Under Limited DataHu, Ping / Bordignon, Virginia / Kayaalp, Mert / Sayed, Ali H. et al. | 2023
- 1
-
Efficient Quantized Constant Envelope Precoding for Multiuser Downlink Massive MIMO SystemsWu, Zheyu / Liu, Ya-Feng / Jiang, Bo / Dai, Yu-Hong et al. | 2023
- 1
-
Switching Kronecker Product Linear Filtering for Multispeaker Adaptive Speech DereverberationHuang, Gongping / Benesty, Jacob / Cohen, Israel / Winebrand, Emil / Chen, Jingdong / Kellermann, Walter et al. | 2023
- 1
-
SADE: A Self-Adaptive Expert for Multi-Dataset Question AnsweringPeng, Yixing / Wang, Quan / Mao, Zhendong / Zhang, Yongdong et al. | 2023
- 1
-
Multi-View Millimeter-Wave Imaging Over Wireless Cellular NetworkTong, Xin / Zhang, Zhaoyang / Yang, Zhaohui et al. | 2023
- 1
-
Effective Graph-Based Modeling of Articulation Traits for Mispronunciation Detection and DiagnosisYan, Bi-Cheng / Wang, Hsin-Wei / Wang, Yi-Cheng / Chen, Berlin et al. | 2023
- 1
-
Visual Prompting for Adversarial RobustnessChen, Aochuan / Lorenz, Peter / Yao, Yuguang / Chen, Pin-Yu / Liu, Sijia et al. | 2023
- 1
-
Inter-Scale Sure-Let Denoise with Structured Deep Image Prior: Interpretable Self-Supervised LearningLi, Jikai / Muramatsu, Shogo et al. | 2023
- 1
-
Personalized Speech Enhancement Combining Band-Split RNN and Speaker Attentive ModuleLe, Xiaohuai / Chen, Li / He, Chao / Guo, Yiqing / Chen, Cheng / Xia, Xianjun / Lu, Jing et al. | 2023
- 1
-
Grad-CAM-Inspired Interpretation of Nearfield Acoustic Holography using Physics-Informed Explainable Neural NetworkKafri, Hagar / Olivieri, Marco / Antonacci, Fabio / Moradi, Mordehay / Sarti, Augusto / Gannot, Sharon et al. | 2023
- 1
-
Epilepsy Detection Grand ChallengeChatzichristos, Christos / Bhagubai, Miguel / Van Paesschen, Wim / De Vos, Maarten et al. | 2023
- 1
-
Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-SpeechSingh, Abhayjeet / Nagireddi, Amala / G, Deekshitha / Bandekar, Jesuraja / R, Roopa / Badiger, Sandhya / Udupa, Sathvik / Ghosh, Prasanta Kumar / Murthy, Hema A / Zen, Heiga et al. | 2023
- 1
-
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)Zhang, Qinglin / Deng, Chong / Liu, Jiaqing / Yu, Hai / Chen, Qian / Wang, Wen / Yan, Zhijie / Liu, Jinglin / Ren, Yi / Zhao, Zhou et al. | 2023
- 1
-
Multilingual Alzheimer’s Dementia Recognition through Spontaneous Speech: A Signal Processing Grand ChallengeLuz, Saturnino / Haider, Fasih / Fromm, Davida / Lazarou, Ioulietta / Kompatsiaris, Ioannis / MacWhinney, Brian et al. | 2023
- 1
-
Divcon: Learning Concept Sequences for Semantically Diverse Image CaptioningZheng, Yue / Li, Ya-Li / Wang, Shengjin et al. | 2023
- 1
-
Exploiting Virtual Array Diversity for Accurate Radar DetectionGuan, Junfeng / Madani, Sohrab / Ahmed, Waleed / Hussein, Samah / Gupta, Saurabh / Hassanieh, Haitham et al. | 2023
- 1
-
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed NetworksChen, Yiyue / Hashemi, Abolfazl / Vikalo, Haris et al. | 2023
- 1
-
SAN: A Robust End-to-End ASR Model ArchitectureMin, Zeping / Ge, Qian / Huang, Guanhua et al. | 2023
- 1
-
Resource Allocation for UAV-Enabled Integrated Sensing and Communication (ISAC) via Multi-Objective OptimizationRezaei, Omid / Naghsh, Mohammad Mahdi / Karbasi, Seyed Mohammad / Nayebi, Mohammad Mahdi et al. | 2023
- 1
-
Removing Radio Frequency Interference From Auroral Kilometric Radiation With Stacked AutoencodersChang, Allen / Knapp, Mary / LaBelle, James / Swoboda, John / Volz, Ryan / Erickson, Philip J. et al. | 2023
- 1
-
Soft Label Coding for End-to-End Sound Source Localization with Ad-Hoc Microphone ArraysFeng, Linfeng / Gong, Yijun / Zhang, Xiao-Lei et al. | 2023
- 1
-
Study and Design of Robust Personal Sound Zones with VAST Using Low Rank RIRsBhattacharjee, Sankha Subhra / Shi, Liming / Ping, Guoli / Shen, Xiaoxiang / Christensen, Mads Grasboll et al. | 2023
- 1
-
ROI-Based Deep Image Compression with Swin TransformersLi, Binglin / Liang, Jie / Fu, Haisheng / Han, Jingning et al. | 2023
- 1
-
Event-Based Visual MicrophoneHoward, Matthew / Hirakawa, Keigo et al. | 2023
- 1
-
Named Entity Detection and Injection for Direct Speech TranslationGaido, Marco / Tang, Yun / Kulikov, Ilia / Huang, Rongqing / Gong, Hongyu / Inaguma, Hirofumi et al. | 2023
- 1
-
Efficient Stuttering Event Detection Using Siamese NetworksMohapatra, Payal / Islam, Bashima / Islam, Md Tamzeed / Jiao, Ruochen / Zhu, Qi et al. | 2023
- 1
-
BadRes: Reveal the Backdoors Through Residual ConnectionHe, Mingrui / Chen, Tianyu / Zhou, Haoyi / Zhang, Shanghang / Li, Jianxin et al. | 2023
- 1
-
End-to-End Unsupervised Sketch to Image GenerationLv, Xingming / Wu, Lei / Cheng, Zhenwei / Meng, Xiangxu et al. | 2023
- 1
-
Trinet: Stabilizing Self-Supervised Learning From Complete or Slow CollapseCao, Lixin / Wang, Jun / Yang, Ben / Su, Dan / Yu, Dong et al. | 2023
- 1
-
ERBNet: An Effective Representation Based Network for Unbiased Scene Graph GenerationMa, Wenxi / Hou, Tianxiang / Di, Qianji / Qi, Zhongang / Shan, Ying / Wang, Hanzi et al. | 2023
- 1
-
Deformable Cross Attention for Learning Optical FlowAbdein, Rokia / Xiang, Xuezhi / Lv, Ning / Saddik, Abdulmotaleb El et al. | 2023
- 1
-
Optimal Kernel for Real-Time Arbitrary-Shaped Text DetectionMa, Haozhao / Yang, Chuang / Yuan, Yuan / Wang, Qi et al. | 2023
- 1
-
SVMV: Spatiotemporal Variance-Supervised Motion Volume for Video Frame InterpolationLuo, Yao / Pan, Jinshan / Tang, Jinhui et al. | 2023
- 1
-
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and RescoringLi, Mohan / Do, Cong-Thanh / Doddipatla, Rama et al. | 2023
- 1
-
Two-Stage Neural Network for ICASSP 2023 Speech Signal Improvement ChallengeLiu, Mingshuai / Lv, Shubo / Zhang, Zihan / Han, Runduo / Hao, Xiang / Xia, Xianjun / Chen, Li / Xiao, Yijian / Xie, Lei et al. | 2023
- 1
-
The Multimodal Information Based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and RecognitionWang, Zhe / Wu, Shilong / Chen, Hang / He, Mao-Kui / Du, Jun / Lee, Chin-Hui / Chen, Jingdong / Watanabe, Shinji / Siniscalchi, Sabato / Scharenborg, Odette et al. | 2023
- 1
-
Implicit Vehicle Positioning with Cooperative Lidar SensingBarbieri, Luca / Tedeschini, Bernardo Camajori / Brambilla, Mattia / Nicoli, Monica et al. | 2023
- 1
-
Self-Supervised Guided Hypergraph Feature Propagation for Semi-Supervised Classification with Missing Node FeaturesLei, Chengxiang / Fu, Sichao / Wang, Yuetian / Qiu, Wenhao / Hu, Yachen / Peng, Qinmu / You, Xinge et al. | 2023
- 1
-
Differential Analysis for Networks Obeying Conservation LawsRayas, Anirudh / Anguluri, Rajasekhar / Cheng, Jiajun / Dasarathy, Gautam et al. | 2023
- 1
-
Hardware-Limited Non-Uniform Task-Based QuantizersBernardo, Neil Irwin / Zhu, Jingge / Eldar, Yonina C. / Evans, Jamie et al. | 2023
- 1
-
Adaptive Noise Canceller Algorithm with SNR-Based Stepsize and Data-Dependent AveragingSugiyama, Akihiko et al. | 2023
- 1
-
Signal Processing And Quantum State Tomography on Noisy DevicesShi, Wenbo / Malaney, Robert et al. | 2023
- 1
-
In-Sensor & Neuromorphic Computing Are all You Need for Energy Efficient Computer VisionDatta, Gourav / Liu, Zeyu / Kaiser, Md Abdullah-Al / Kundu, Souvik / Mathai, Joe / Yin, Zihan / Jacob, Ajey P. / Jaiswal, Akhilesh R. / Beerel, Peter A. et al. | 2023
- 1
-
Adversarial Contrastive Distillation with Adaptive DenoisingWang, Yuzheng / Chen, Zhaoyu / Yang, Dingkang / Liu, Yang / Liu, Siao / Zhang, Wenqiang / Qi, Lizhe et al. | 2023
- 1
-
On Designing Light-Weight Object Trackers Through Network Pruning: Use CNNs or Transformers?Aggarwal, Saksham / Gupta, Taneesh / Sahu, Pawan K. / Chavan, Arnav / Tiwari, Rishabh / Prasad, Dilip K. / Gupta, Deepak K. et al. | 2023
- 1
-
Variational Inference Aided Estimation of Time Varying ChannelsBock, Benedikt / Baur, Michael / Rizzello, Valentina / Utschick, Wolfgang et al. | 2023
- 1
-
Class-Incremental Learning on Multivariate Time Series Via Shape-Aligned Temporal DistillationQiao, Zhongzheng / Hu, Minghui / Jiang, Xudong / Suganthan, Ponnuthurai Nagaratnam / Savitha, Ramasamy et al. | 2023
- 1
-
Inv-Senet: Invariant Self Expression Network for Clustering Under Biased DataSingh, Ashutosh / Singh, Ashish / Masoomi, Aria / Imbiriba, Tales / Learned-Miller, Erik / Erdogmus, Deniz et al. | 2023
- 1
-
Fine-Grained Textual Knowledge Transfer to Improve RNN Transducers for Speech Recognition and UnderstandingSunder, Vishal / Thomas, Samuel / Kuo, Hong-Kwang J. / Kingsbury, Brian / Fosler-Lussier, Eric et al. | 2023
- 1
-
Training Neural Networks for Sequential Change-Point DetectionLee, Junghwan / Xie, Yao / Cheng, Xiuyuan et al. | 2023
- 1
-
High-Resolution Neural Network Processing of LFM Radar PulsesAkhtar, Jabran et al. | 2023
- 1
-
MLCGAN: Multi-Lead ECG Synthesis with Multi Label Conditional Generative Adversarial NetworkWu, Jian / Wang, Liping / Pan, Hailin / Wang, Binyu et al. | 2023
- 1
-
NRTSI: Non-Recurrent Time Series ImputationShan, Siyuan / Li, Yang / Oliva, Junier B. et al. | 2023
- 1
-
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASRSanabria, Ramon / Bogoychev, Nikolay / Markl, Nina / Carmantini, Andrea / Klejch, Ondrej / Bell, Peter et al. | 2023
- 1
-
Centralized Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation in a Wireless Acoustic Sensor And Actuator NetworkRuiz, Santiago / van Waterschoot, Toon / Moonen, Marc et al. | 2023
- 1
-
Intent Does Matter! Propagating High-Order Relations for Exploring Interest PreferencesZheng, Xiangping / Liang, Xun / Wu, Bo / Feng, Junlan / Guo, Yuhui / Zhang, Sensen et al. | 2023
- 1
-
Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage ApproachWu, Shih-Lun / Yang, Yi-Hsuan et al. | 2023
- 1
-
Input-Dependent Dynamical Channel Association For Knowledge DistillationTang, Qiankun / Zhang, Yuan / Xu, Xiaogang / Wang, Jun / Guo, Yimin et al. | 2023
- 1
-
Robust Adaptive Beamforming with Proximal MethodLi, Ruifu / Cabric, Danijela et al. | 2023
- 1
-
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel AudioZhang, Yang / Puvvada, Krishna C. / Lavrukhin, Vitaly / Ginsburg, Boris et al. | 2023
- 1
-
An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 FrameworkChen, Jianan / Sakti, Sakriani et al. | 2023
- 1
-
Bimodal Fusion Network for Basic Taste Sensation Recognition from Electroencephalography and ElectromyographyGao, Han / Zhao, Shuo / Li, Huiyan / Liu, Li / Wang, You / Hu, Ruifen / Zhang, Jin / Li, Guang et al. | 2023
- 1
-
Papez: Resource-Efficient Speech Separation with Auditory Working MemoryOh, Hyunseok / Yi, Juheon / Lee, Youngki et al. | 2023
- 1
-
Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding TasksVillatoro-Tello, Esau / Madikeri, Srikanth / Zuluaga-Gomez, Juan / Sharma, Bidisha / Saeed Sarfjoo, Seyyed / Nigmatulina, Iuliia / Motlicek, Petr / Ivanov, Alexei V. / Ganapathiraju, Aravind et al. | 2023
- 1
-
Search for Efficient Deep Visual-Inertial Odometry Through Neural Architecture SearchChen, Yu / Yang, Mingyu / Kim, Hun-Seok et al. | 2023
- 1
-
Prune Then Distill: Dataset Distillation with Importance SamplingSundar, Anirudh S / Keskin, Gokce / Chandak, Chander / Chen, I-Fan / Ghahremani, Pegah / Ghosh, Shalini et al. | 2023
- 1
-
CF-VTON: Multi-Pose Virtual Try-on with Cross-Domain FusionDu, Chenghu / Xiong, Shengwu et al. | 2023
- 1
-
LQGNET: Hybrid Model-Based and Data-Driven Linear Quadratic Stochastic ControlCasspi, Solomon Goldgraber / Husser, Oliver / Revach, Guy / Shlezinger, Nir et al. | 2023
- 1
-
Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-Trained RepresentationsShen, Siyuan / Liu, Feng / Zhou, Aimin et al. | 2023
- 1
-
GTN-Bailando: Genre Consistent Long-Term 3D Dance Generation Based on Pre-Trained Genre Token NetworkZhuang, Haolin / Lei, Shun / Xiao, Long / Li, Weiqin / Chen, Liyang / Yang, Sicheng / Wu, Zhiyong / Kang, Shiyin / Meng, Helen et al. | 2023
- 1
-
Streaming Multi-Channel Speech Separation with Online Time-Domain Generalized Wiener FilterLuo, Yi et al. | 2023
- 1
-
String-Based Molecule Generation Via Multi-Decoder VAEKwon, Kisoo / Jeong, Kuhwan / Park, Junghyun / Na, Hwidong / Shin, Jinwoo et al. | 2023
- 1
-
Robust Spatiotemporal Fusion of Satellite Images via Convex OptimizationIsono, Ryosuke / Naganuma, Kazuki / Ono, Shunsuke et al. | 2023
- 1
-
A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker OneMeng, Lingwei / Kang, Jiawen / Cui, Mingyu / Wang, Yuejiao / Wu, Xixin / Meng, Helen et al. | 2023
- 1
-
N2MVSNet: Non-Local Neighbors Aware Multi-View Stereo NetworkZhang, Zhe / Gao, Huachen / Hu, Yuxi / Wang, Ronggang et al. | 2023
- 1
-
Windowed Fourier Analysis for Signal Processing on Graph BundlesRoddenberry, T. Mitchell / Segarra, Santiago et al. | 2023
- 1
-
Diffusion-Based Generative Speech Source SeparationScheibler, Robin / Ji, Youna / Chung, Soo-Whan / Byun, Jaeuk / Choe, Soyeon / Choi, Min-Seok et al. | 2023
- 1
-
Shuffled Autoregression for Motion InterpolationHuang, Shuo / Jia, Jia / Yang, Zongxin / Wang, Wei / Wu, Haozhe / Yang, Yi / Xing, Junliang et al. | 2023
- 1
-
Joint Estimation of DOA and Distance in Noisy Reverberant ConditionsBu, Suliang / Zhao, Tuo / Zhao, Yunxin et al. | 2023
- 1
-
Change Point Detection with Neural Online Density-Ratio EstimatorWang, Xiuheng / Borsoi, Ricardo Augusto / Richard, Cedric / Chen, Jie et al. | 2023
- 1
-
Towards Low-Power Heart Rate Estimation Based on User’s Demographics and Activity Level For WearablesPacheco, Andre G. C. / Cabello, Frank A. C. / Fonoff, Adriana M. O. / Rodrigues, Paula G. / Penatti, Otavio A. B. / Pinto, Paula R. et al. | 2023
- 1
-
ifUNet++: Iterative Feedback UNet++ for Infrared Small Target DetectionWeng, Zhangying / Li, Peng / Zhuang, Xin / Yan, Xuefeng / Gong, Lina / Xie, Haoran / Wei, Mingqiang et al. | 2023
- 1
-
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech RecognitionKanda, Naoyuki / Wu, Jian / Wang, Xiaofei / Chen, Zhuo / Li, Jinyu / Yoshioka, Takuya et al. | 2023
- 1
-
Binary Image Fast Perfect Recovery from Sparse 2D-DFT CoefficientsPei, Soo-Chang / Chang, Kuo-Wei et al. | 2023
- 1
-
Time-Aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question AnsweringLiu, Yonghao / Liang, Di / Fang, Fang / Wang, Sirui / Wu, Wei / Jiang, Rui et al. | 2023
- 1
-
Exploiting Interactivity and Heterogeneity for Sleep Stage Classification Via Heterogeneous Graph Neural NetworkJia, Ziyu / Lin, Youfang / Zhou, Yuhan / Cai, Xiyang / Zheng, Peng / Li, Qiang / Wang, Jing et al. | 2023
- 1
-
When is MIMO Massive in Radar?Shah, Jaimin / Cardone, Martina / Dytso, Alex / Rush, Cynthia et al. | 2023
- 1
-
Detecting Malicious Migration on Edge to Prevent Running Data LeakageWong, Yuchen / Shen, Qingni / Li, Cong / Liu, Cunzhan / Ai, Tianxiang et al. | 2023
- 1
-
PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image TranslationRen, Bin / Tang, Hao / Wang, Yiming / Li, Xia / Wang, Wei / Sebe, Nicu et al. | 2023
- 1
-
Interpolation of Spatial Room Impulse Responses Using Partial Optimal TransportGeldert, Aaron / Meyer-Kahlen, Nils / Schlecht, Sebastian J. et al. | 2023
- 1
-
Knowledge-Augmented Frame Semantic Parsing with Hybrid Prompt-TuningZhang, Rui / Sun, Yajing / Yang, Jingyuan / Peng, Wei et al. | 2023
- 1
-
HappyQuokka System for ICASSP 2023 Auditory EEG ChallengePiao, Zhenyu / Kim, Miseul / Yoon, Hyungchan / Kang, Hong-Goo et al. | 2023
- 1
-
Deep Unfolded Tensor Robust PCA With Self-Supervised LearningDong, Harry / Shah, Megna / Donegan, Sean / Chi, Yuejie et al. | 2023
- 1
-
Continual Learning for On-Device Speech Recognition Using Disentangled ConformersDiwan, Anuj / Yeh, Ching-Feng / Hsu, Wei-Ning / Tomasello, Paden / Choi, Eunsol / Harwath, David / Mohamed, Abdelrahman et al. | 2023
- 1
-
Robust Online Multiband Drift Estimation in Electrophysiology DataWindolf, Charlie / Paulk, Angelique C. / Kfir, Yoav / Trautmann, Eric / Meszena, Domokos / Munoz, William / Caprara, Irene / Jamali, Mohsen / Boussard, Julien / Williams, Ziv M. et al. | 2023
- 1
-
Progressive Refinement Learning Based on Feature Cross Perception for Residential Areas Semantic SegmentationLyu, Xinran / Zhang, Libao et al. | 2023
- 1
-
Improving Adversarial Robustness with Hypersphere Embedding and Angular-Based RegularizationsFakorede, Olukorede / Nirala, Ashutosh / Atsague, Modeste / Tian, Jin et al. | 2023
- 1
-
Graph Contrastive Learning with Learnable Graph AugmentationPu, Xinyan / Zhang, Ke / Shu, Huazhong / Coatrieux, Jean Louis / Kong, Youyong et al. | 2023
- 1
-
To Regularize or Not to Regularize: The Role of Positivity in Sparse Array Interpolation with a Single SnapshotHucumenoglu, Mehmet Can / Sarangi, Pulak / Rajamaki, Robin / Pal, Piya et al. | 2023
- 1
-
TeAw: Text-Aware Few-Shot Remote Sensing Image Scene ClassificationCheng, Kaihui / Yang, Chule / Fan, Zunlin / Wu, Dayan / Guan, Naiyang et al. | 2023
- 1
-
RIS Reflection and Placement Optimisation for Underlay D2D Communications in Cognitive Cellular NetworksGhose, Sarbani / Mishra, Deepak / Maity, Santi P. / Alexandropoulos, George C. et al. | 2023
- 1
-
Not All Classes are Equal: Adaptively Focus-Aware Confidence for Semi-Supervised Object DetectionZhu, Hui / Lu, Yongchun / Zhao, Hongyu / Zhao, Guoqing / Zhao, Xiaofang et al. | 2023
- 1
-
Adversarial Data Augmentation Using VAE-GAN for Disordered Speech RecognitionJin, Zengrui / Xie, Xurong / Geng, Mengzhe / Wang, Tianzi / Hu, Shujie / Deng, Jiajun / Li, Guinan / Liu, Xunying et al. | 2023
- 1
-
Multi-Blank Transducers for Speech RecognitionXu, Hainan / Jia, Fei / Majumdar, Somshubra / Watanabe, Shinji / Ginsburg, Boris et al. | 2023
- 1
-
End-to-End Word-Level Disfluency Detection and Classification in Children’s Reading AssessmentVenkatasubramaniam, Lavanya / Sunder, Vishal / Fosler-Lussier, Eric et al. | 2023
- 1
-
Speech Emotion Recognition via Heterogeneous Feature LearningLiu, Ke / Wu, DongYa / Wang, Dekui / Feng, Jun et al. | 2023
- 1
-
A Study on Bias and Fairness in Deep Speaker RecognitionHajavi, Amirhossein / Etemad, Ali et al. | 2023
- 1
-
Retinal Biomarkers for Detecting Diabetic Retinopathy Using Smartphone-Based Deep Learning FrameworksKarakaya, Mahmut / Aygun, Ramazan S. et al. | 2023
- 1
-
Hierarchical Interactive Reconstruction Network for Video Compressive SensingZhang, Tong / Cui, Wenxue / Hui, Chen / Jiang, Feng et al. | 2023
- 1
-
A Unified Uncertainty-Aware Exploration: Combining Epistemic and Aleatory UncertaintyMalekzadeh, Parvin / Hou, Ming / Plataniotis, Konstantinos N. et al. | 2023
- 1
-
FedSD: A New Federated Learning Structure Used in Non-IID DataYi, Minmin / Ning, Houchun / Liu, Peng et al. | 2023
- 1
-
Towards Dialogue Modeling Beyond TextWu, Tongzi / Zhou, Yuhao / Ling, Wang / Yang, Hojin / Veloso, Joana / Sun, Lin / Huang, Ruixin / Guimaraes, Norberto / Sanner, Scott et al. | 2023
- 1
-
DPP-Based Client Selection for Federated Learning with Non-IID DataZhang, Yuxuan / Xu, Chao / Yang, Howard H. / Wang, Xijun / Quek, Tony Q. S. et al. | 2023
- 1
-
Learning Robust Self-Attention Features for Speech Emotion Recognition with Label-Adaptive MixupKang, Lei / Zhang, Lichao / Jiang, Dazhi et al. | 2023
- 1
-
Adaptive ECCM for Mitigating Smart JammersJain, Shashwat / Pattanayak, Kunal / Krishnamurthy, Vikram / Berry, Christopher et al. | 2023
- 1
-
IAST: Instance Association Relying on Spatio-Temporal Features for Video Instance SegmentationChen, Junhao / Liu, Sheng / Chen, Ruixiang / Guo, Bingnan / Zhang, Feng et al. | 2023
- 1
-
Exploring the Role of Fricatives in Classifying Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis and Parkinson’s DiseaseBhattacharjee, Tanuka / Belur, Yamini / Nalini, Atchayaram / Yadav, Ravi / Ghosh, Prasanta Kumar et al. | 2023
- 1
-
Stay In The Middle: A Semi-Supervised Model for CT Metal Artifact ReductionWang, Tao / Yu, Hui / Lu, Zexin / Zhang, Zhongzhou / Zhou, Jiliu / Zhang, Yi et al. | 2023
- 1
-
Neural Fourier Shift for Binaural Speech RenderingWoo Lee, Jin / Lee, Kyogu et al. | 2023
- 1
-
Semi-Supervised Contrastive Learning with Soft Mask Attention for Facial Action Unit DetectionLiu, Zhongling / Liu, Rujie / Shi, Ziqiang / Liu, Liu / Mi, Xiaoyu / Murase, Kentaro et al. | 2023
- 1
-
Recursive Estimation of User Intent From Noninvasive Electroencephalography Using Discriminative ModelsSmedemark-Margulies, Niklas / Celik, Basak / Imbiriba, Tales / Kocanaogullari, Aziz / Erdogmus, Deniz et al. | 2023
- 1
-
Diabetic Retinopathy Grading with Weakly-Supervised Lesion PriorsHou, Junlin / Xiao, Fan / Xu, Jilan / Feng, Rui / Zhang, Yuejie / Zou, Haidong / Lu, Lina / Xue, Wenwen et al. | 2023
- 1
-
Prompt-Distiller: Few-Shot Knowledge Distillation for Prompt-Based Language Learners with Dual Contrastive LearningHou, Boyu / Wang, Chengyu / Chen, Xiaoqing / Qiu, Minghui / Feng, Liang / Huang, Jun et al. | 2023
- 1
-
Contextually-Rich Human Affect Perception Using Multimodal Scene InformationBose, Digbalay / Hebbar, Rajat / Somandepalli, Krishna / Narayanan, Shrikanth et al. | 2023
- 1
-
Stabilising and Accelerating Light Gated Recurrent Units for Automatic Speech RecognitionMoumen, Adel / Parcollet, Titouan et al. | 2023
- 1
-
Sampling Order-Limited Signals on the SphereKhan, Muhammad Salaar Arif / Nadeem, Salman / Khalid, Zubair et al. | 2023
- 1
-
Sequence-Based Device-Free Gesture Recognition Framework for Multi-Channel Acoustic SignalsYang, Zhizheng / Wang, Xun / Xia, Dongyu / Wang, Wei / Dai, Haipeng et al. | 2023
- 1
-
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech RecognitionEeckt, Steven Vander / Van Hamme, Hugo et al. | 2023
- 1
-
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?Shi, Xuan / Cooper, Erica / Wang, Xin / Yamagishi, Junichi / Narayanan, Shrikanth et al. | 2023
- 1
-
MGAT: Multi-Granularity Attention Based Transformers for Multi-Modal Emotion RecognitionFan, Weiquan / Xing, Xiaofen / Cai, Bolun / Xu, Xiangmin et al. | 2023
- 1
-
HPFTN: Hierarchical Progressive Fusion Transformer Network for Video DenoisingZhang, Shuaitao / Zhang, Yuan / Zhao, Zheng / Xie, Di / Pu, Shiliang et al. | 2023
- 1
-
Soft 2D-to-3D Delivery Using Deep Graph Neural Networks for Holographic-Type CommunicationFujihashi, Takuya / Koike-Akino, Toshiaki / Watanabe, Takashi et al. | 2023
- 1
-
CLAP Learning Audio Concepts from Natural Language SupervisionElizalde, Benjamin / Deshmukh, Soham / Ismail, Mahmoud Al / Wang, Huaming et al. | 2023
- 1
-
Soft Dynamic Time Warping for Multi-Pitch Estimation and BeyondKrause, Michael / Weis, Christof / Muller, Meinard et al. | 2023
- 1
-
SPECTRANET-SO(3): Learning Satellite Orientation from Optical Spectra by Implicitly Modeling Mutually Exclusive Probability Distributions on The Rotation ManifoldPhelps, Matthew / Swindle, Thomas / Gazak, J. Zachary / Vandenberg, Andrew / Fletcher, Justin et al. | 2023
- 1
-
Channel Estimation in Massive MIMO with Heavy-Tailed Noise: Gaussian-Mixture Versus Cauchy ModelsGulgun, Ziya / Larsson, Erik G. et al. | 2023
- 1
-
Speech Intelligibility Classifiers from 550k Disordered Speech SamplesVenugopalan, Subhashini / Tobin, Jimmy / Yang, Samuel J. / Seaver, Katie / Cave, Richard J.N. / Jiang, Pan-Pan / Zeghidour, Neil / Heywood, Rus / Green, Jordan / Brenner, Michael P. et al. | 2023
- 1
-
Filler Word Detection with Hard Category Mining and Inter-Category Focal LossZhao, Zhiyuan / Wu, Lijun / Tang, Chuanxin / Yin, Dacheng / Zhao, Yucheng / Luo, Chong et al. | 2023
- 1
-
Modular Conformer Training for Flexible End-to-End ASRAudhkhasi, Kartik / Farris, Brian / Ramabhadran, Bhuvana / Moreno, Pedro J. et al. | 2023
- 1
-
Untargeted Backdoor Attack Against Object DetectionLuo, Chengxiao / Li, Yiming / Jiang, Yong / Xia, Shu-Tao et al. | 2023
- 1
-
Cross-Modality Depth Estimation via Unsupervised Stereo RGB-to-Infrared TranslationTang, Shi / Ye, Xinchen / Xue, Fei / Xu, Rui et al. | 2023
- 1
-
A Dynamic Cross-Scale Transformer with Dual-Compound Representation for 3D Medical Image SegmentationZhang, Ruixia / Wang, Zhiqiong / Wang, Zhongyang / Xin, Junchang et al. | 2023
- 1
-
Generic Dependency Modeling for Multi-Party ConversationShen, Weizhou / Quan, Xiaojun / Yang, Ke et al. | 2023
- 1
-
WL-MSR: Watch and Listen for Multimodal Subtitle RecognitionLiu, Jiawei / Wang, Hao / Wang, Weining / He, Xingjian / Liu, Jing et al. | 2023
- 1
-
Residual Hybrid Attention Network for Compression Artifact ReductionLuo, Bingchun / Yu, Wei et al. | 2023
- 1
-
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech RecognitionSahai, Saumya Y. / Liu, Jing / Muniyappa, Thejaswi / Sathyendra, Kanthashree M. / Alexandridis, Anastasios / Strimel, Grant P. / McGowan, Ross / Rastrow, Ariya / Chang, Feng-Ju / Mouchtaris, Athanasios et al. | 2023
- 1
-
Look and Think: Intrinsic Unification of Self-Attention and Convolution for Spatial-Channel SpecificityGao, Xiang / Lin, Honghui / Li, Yu / Fang, Ruiyan / Zhang, Xin et al. | 2023
- 1
-
Higher-Order Link Prediction Via Learnable Maximum Mean DiscrepancyKaranikolas, Georgios V. / Pages-Zamora, Alba / Giannakis, Georgios B. et al. | 2023
- 1
-
EI2SR: Learning an Enhanced Intra-Instance Semantic Relationship for Arbitrary-Shaped Scene Text DetectionShu, Yan / Liu, Shaohui / Zhou, Yu / Xu, Honglei / Jiang, Feng et al. | 2023
- 1
-
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant EnvironmentsNeri, Julian / Braun, Sebastian et al. | 2023
- 1
-
Comparative Layer-Wise Analysis of Self-Supervised Speech ModelsPasad, Ankita / Shi, Bowen / Livescu, Karen et al. | 2023
- 1
-
Maximum Likelihood Distillation for Robust Modulation ClassificationMaroto, Javier / Bovet, Gerome / Frossard, Pascal et al. | 2023
- 1
-
Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image ProcessingVali, Mohammad Hassan / Backstrom, Tom et al. | 2023
- 1
-
Deep Fusion of Multi-Object Densities Using TransformerLi, Lechi / Dai, Chen / Xia, Yuxuan / Svensson, Lennart et al. | 2023
- 1
-
Core: Transferable Long-Range Time Series Forecasting Enhanced by Covariates-Guided RepresentationLi, Xin-Yi / Zhong, Pei-Nan / Chen, Di / Yang, Yu-Bin et al. | 2023
- 1
-
Toward Privacy-Enhancing Ambulatory-Based Well-Being Monitoring: Investigating User Re-Identification Risk in Multimodal DataPranjal, Ravi / Seshadri, Ranjana / Kumar Sanath Kumar Kadaba, Rakesh / Feng, Tiantian / Narayanan, Shrikanth S. / Chaspari, Theodora et al. | 2023
- 1
-
Mutually Guided Few-Shot Learning For Relational Triple ExtractionYang, Chengmei / Jiang, Shuai / He, Bowei / Ma, Chen / He, Lianghua et al. | 2023
- 1
-
Guide and Select: A Transformer-Based Multimodal Fusion Method for Points of Interest Description GenerationLiu, Hanqing / Wang, Wei / Hu, Niu / Zheng, Hai-Tao / Xie, Rui / Wu, Wei / Bai, Yang et al. | 2023
- 1
-
Interpretation of Neural Networks is Susceptible to Universal Adversarial PerturbationsOskouie, Haniyeh Ehsani / Farnia, Farzan et al. | 2023
- 1
-
High-Resolution Embedding Extractor for Speaker DiarisationHeo, Hee-Soo / Kwon, Youngki / Lee, Bong-Jin / Kim, You Jin / Jung, Jee-Weon et al. | 2023
- 1
-
Prosody-Controllable Spontaneous TTS with Neural HMMsLameris, Harm / Mehta, Shivam / Henter, Gustav Eje / Gustafson, Joakim / Szekely, Eva et al. | 2023
- 1
-
Faster Than Fast: Accelerating the Griffin-Lim AlgorithmNenov, Rossen / Nguyen, Dang-Khoa / Balazs, Peter et al. | 2023
- 1
-
Scalable and Secure Federated XGBoostNguyen, Quang Minh / Khanh Le, Nhan / Nguyen, Lam M. et al. | 2023
- 1
-
A Generalized Subspace Distribution Adaptation Framework for Cross-Corpus Speech Emotion RecognitionLi, Shaokai / Song, Peng / Ji, Liang / Jin, Yun / Zheng, Wenming et al. | 2023
- 1
-
ClassA Entropy for the Analysis of Structural Complexity of Physiological SignalsXiao, Hongjian / Li, Ling / Mandic, Danilo P. et al. | 2023
- 1
-
Improving Disfluency Detection with Multi-Scale Self Attention and Contrastive LearningWang, Peiying / Duan, Chaoqun / Chen, Meng / He, Xiaodong et al. | 2023
- 1
-
Time-Resolved fMRI Shared Response Model Using Gaussian Process Factor AnalysisEbrahimi, MohammadReza / Calarco, Navona / Hawco, Colin / Voineskos, Aristotle / Khisti, Ashish et al. | 2023
- 1
-
Dynamic TF-TDNN: Dynamic Time Delay Neural Network Based on Temporal-Frequency Attention for Dialect RecognitionLiao, Chao / Huang, Jinwen / Yuan, Huan / Yao, Peng / Tan, Jianchao / Zhang, Dawei / Deng, Feng / Wang, Xiaorui / Song, Chengru et al. | 2023
- 1
-
Contrastive Learning of Functionality-Aware Code EmbeddingsLi, Yiyang / Wu, Hongqiu / Zhao, Hai et al. | 2023
- 1
-
Ultrasound Image Quality Control Using Speech-Assisted Switchable CycleGANHuh, Jaeyoung / Khan, Shujaat / Sun Lee, Eun / Chul Ye, Jong et al. | 2023
- 1
-
Super Dilated Nested Arrays with Ideal Critical Weights and Increased Degrees of FreedomShaalan, Ahmed M. A. / Du, Jun et al. | 2023
- 1
-
Transient Dictionary Learning for Compressed Time-of-Flight ImagingConde, Miguel Heredia et al. | 2023
- 1
-
Does Your Model Think Like an Engineer? Explainable AI for Bearing Fault Detection with Deep LearningDecker, Thomas / Lebacher, Michael / Tresp, Volker et al. | 2023
- 1
-
FAPM: Fast Adaptive Patch Memory for Real-Time Industrial Anomaly DetectionKim, Donghyeong / Park, Chaewon / Cho, Suhwan / Lee, Sangyoun et al. | 2023
- 1
-
A Distributed Adaptive Algorithm for Non-Smooth Spatial Filtering ProblemsHovine, Charles / Bertrand, Alexander et al. | 2023
- 1
-
Graph Learning from Gaussian and Stationary Graph SignalsBuciulea, Andrei / Marques, Antonio G. et al. | 2023
- 1
-
Spatio-Temporal Attention in Multi-Granular Brain Chronnectomes For Detection of Autism Spectrum DisorderOrme-Rogers, James / Srivastava, Ajitesh et al. | 2023
- 1
-
Priv-Aug-Shap-ECGResNet: Privacy Preserving Shapley-Value Attributed Augmented ResNet for Practical Single-Lead Electrocardiogram ClassificationUkil, Arijit / Marin, Leandro / Jara, Antonio J. et al. | 2023
- 1
-
Efficient Online Convolutional Dictionary Learning Using Approximate Sparse ComponentsVeshki, Farshad G. / Vorobyov, Sergiy A. et al. | 2023
- 1
-
Low-Latency Electrolaryngeal Speech Enhancement Based on FastSpeech2-Based Voice Conversion and Self-Supervised Speech RepresentationKobayashi, Kazuhiro / Hayashi, Tomoki / Toda, Tomoki et al. | 2023
- 1
-
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Image Based Voice ControlSheng, Zheng-Yan / Ai, Yang / Ling, Zhen-Hua et al. | 2023
- 1
-
mmWave Wi-Fi Trajectory Estimation with Continuous-Time Neural Dynamic LearningVaca-Rubio, Cristian J. / Wang, Pu / Koike-Akino, Toshiaki / Wang, Ye / Boufounos, Petros / Popovski, Petar et al. | 2023
- 1
-
Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech EnhancementValentini-Botinhao, Cassia / Aldana Blanco, Andrea Lorena / Klejch, Ondrej / Bell, Peter et al. | 2023
- 1
-
D-3DLD: Depth-Aware Voxel Space Mapping for Monocular 3D Lane Detection with UncertaintyKim, Nayeon / Byeon, Moonsub / Ji, Daehyun / Oh, Dokwan et al. | 2023
- 1
-
Finer-Grained Decomposition for Parallel Quantum MIMO ProcessingKim, Minsung / Jamieson, Kyle et al. | 2023
- 1
-
Deep Root MUSIC Algorithm for Data-Driven DoA EstimationShmuel, Dor H. / Merkofer, Julian P. / Revach, Guy / van Sloun, Ruud J. G. / Shlezinger, Nir et al. | 2023
- 1
-
Police: Provably Optimal Linear Constraint Enforcement For Deep Neural NetworksBalestriero, Randall / LeCun, Yann et al. | 2023
- 1
-
A Novel Metric For Evaluating Audio Caption SimilarityBhosale, Swapnil / Chakraborty, Rupayan / Kopparapu, Sunil Kumar et al. | 2023
- 1
-
Generalized Two-Stage Particle Filter for High DimensionsIloska, Marija / Bugallo, Monica F. et al. | 2023
- 1
-
Mitigating Unintended Memorization in Language Models Via Alternating TeachingLiu, Zhe / Zhang, Xuedong / Peng, Fuchun et al. | 2023
- 1
-
Adaptive Multi-Corpora Language Model Training for Speech RecognitionMa, Yingyi / Liu, Zhe / Zhang, Xuedong et al. | 2023
- 1
-
Domain Adaptation without Catastrophic Forgetting on a Small-Scale Partially-Labeled Corpus for Speech Emotion RecognitionZhu, Zhi / Sato, Yoshinao et al. | 2023
- 1
-
SingNet: A Real-Time Singing Voice Beat and Downbeat Tracking SystemHeydari, Mojtaba / Wang, Ju-Chiang / Duan, Zhiyao et al. | 2023
- 1
-
PCQA-Graphpoint: Efficient Deep-Based Graph Metric for Point Cloud Quality AssessmentTliba, Marouane / Chetouani, Aladine / Valenzise, Giuseppe / Dufaux, Frederic et al. | 2023
- 1
-
Adaptive Step-Size Methods for Compressed SGDSubramaniam, Adarsh M. / Magesh, Akshayaa / Veeravalli, Venugopal V. et al. | 2023
- 1
-
Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and ChildrenJohnson, Alexander / Shetty, Vishwas M. / Ostendorf, Mari / Alwan, Abeer et al. | 2023
- 1
-
Adaptive Simulated Annealing Through Alternating Rényi Divergence MinimizationGuilmeau, Thomas / Chouzenoux, Emilie / Elvira, Victor et al. | 2023
- 1
-
NAS-DYMC: NAS-Based Dynamic Multi-Scale Convolutional Neural Network for Sound Event DetectionWang, Jun / Yao, Peng / Deng, Feng / Tan, Jianchao / Song, Chengru / Wang, Xiaorui et al. | 2023
- 1
-
Wespeaker: A Research and Production Oriented Speaker Embedding Learning ToolkitWang, Hongji / Liang, Chengdong / Wang, Shuai / Chen, Zhengyang / Zhang, Binbin / Xiang, Xu / Deng, Yanlei / Qian, Yanmin et al. | 2023
- 1
-
Privacy Preserving Face Recognition with Lensless CameraHenry, Chris / Asif, M. Salman / Li, Zhu et al. | 2023
- 1
-
Exploiting CCTV Cameras for Hand Hygiene Recognition in ICUHuang, Weijun / Huang, Jia / Wang, Guowei / Lu, Hongzhou / He, Min / Wang, Wenjin et al. | 2023
- 1
-
Learning Sparse Auto-Encoders for Green AI Image CodingGille, Cyprien / Guyard, Frederic / Antonini, Marc / Barlaud, Michel et al. | 2023
- 1
-
3D Audio Signal Processing Systems for Speech Enhancement and Sound Localization and DetectionBai, Jisheng / Huang, Siwei / Yin, Han / Jia, Yafei / Wang, Mou / Chen, Jianfeng et al. | 2023
- 1
-
Quantum Variational Bayes on ManifoldsLopatnikova, Anna / Tran, Minh-Ngoc et al. | 2023
- 1
-
Exploring Complementary Features in Multi-Modal Speech Emotion RecognitionWang, Suzhen / Ma, Yifeng / Ding, Yu et al. | 2023
- 1
-
Deep Spatio-Temporal Multiplex Graph Learning for Cardiac Imaging ClassificationBanus, Jaume / Ogier, Augustin / Hullin, Roger / Meyer, Philippe / van Heeswijk, Ruud B. / Richiardi, Jonas et al. | 2023
- 1
-
Sign Language Recognition via Deformable 3D Convolutions and Modulated Graph Convolutional NetworksPapadimitriou, Katerina / Potamianos, Gerasimos et al. | 2023
- 1
-
Unsupervised Word Segmentation Based on Word InfluenceYan, Ruohao / Zhang, Huaping / Silamu, Wushour / Hamdulla, Askar et al. | 2023
- 1
-
TAPE: An End-to-End Timbre-Aware Pitch EstimatorTamer, Nazif Can / Ozer, Yigitcan / Muller, Meinard / Serra, Xavier et al. | 2023
- 1
-
Text Classification In The Wild: A Large-Scale Long-Tailed Name Normalization DatasetQi, Jiexing / Li, Shuhao / Guo, Zhixin / Huang, Yusheng / Zhou, Chenghu / Zhang, Weinan / Wang, Xinbing / Lin, Zhouhan et al. | 2023
- 1
-
Designing and Evaluating Speech Emotion Recognition Systems: A Reality Check Case Study with IEMOCAPAntoniou, Nikolaos / Katsamanis, Athanasios / Giannakopoulos, Theodoros / Narayanan, Shrikanth et al. | 2023
- 1
-
TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS-ChallengeJu, Yukai / Chen, Jun / Zhang, Shimin / He, Shulin / Rao, Wei / Zhu, Weixin / Wang, Yannan / Yu, Tao / Shang, Shidong et al. | 2023
- 1
-
General or Specific? Investigating Effective Privacy Protection in Federated Learning for Speech Emotion RecognitionTan, Chao / Cao, Yang / Li, Sheng / Yoshikawa, Masatoshi et al. | 2023
- 1
-
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram TransformerLi, Kang / Song, Yan / Dai, Li-Rong / McLoughlin, Ian / Fang, Xin / Liu, Lin et al. | 2023
- 1
-
Nested Attention Network with Graph Filtering for Visual Question and AnsweringLu, Jing / Wu, Chunlei / Wang, Leiquan / Yuan, Shaozu / Wu, Jie et al. | 2023
- 1
-
Defending Against Universal Patch Attacks by Restricting Token Attention in Vision TransformersYu, Hongwei / Chen, Jiansheng / Ma, Huimin / Yu, Cheng / Ding, Xinlong et al. | 2023
- 1
-
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech SynthesisXue, Jinlong / Deng, Yayue / Wang, Fengping / Li, Ya / Gao, Yingming / Tao, Jianhua / Sun, Jianqing / Liang, Jiaen et al. | 2023
- 1
-
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource LanguagesBhogale, Kaushal / Raman, Abhigyan / Javed, Tahir / Doddapaneni, Sumanth / Kunchukuttan, Anoop / Kumar, Pratyush / Khapra, Mitesh M. et al. | 2023
- 1
-
Effectiveness of Inter- and Intra-Subarray Spatial Features for Acoustic Scene ClassificationKawamura, Takao / Kinoshita, Yuma / Ono, Nobutaka / Scheibler, Robin et al. | 2023
- 1
-
Bayesian Network Modeling and Prediction of Transitions Within the Homelessness SystemRahman, Khandker Sadia / Zois, Daphney-Stavroula / Chelmis, Charalampos et al. | 2023
- 1
-
Adaptive Knowledge Distillation Between Text and Speech Pre-Trained ModelsNi, Jinjie / Ma, Yukun / Wang, Wen / Chen, Qian / Ng, Dianwen / Lei, Han / Nguyen, Trung Hieu / Zhang, Chong / Ma, Bin / Cambria, Erik et al. | 2023
- 1
-
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation AnnotationsCheng, Zhenxiao / Zhou, Jie / Wu, Wen / Chen, Qin / He, Liang et al. | 2023
- 1
-
Comparative Study of IRS Assisted Opportunistic Communications Over i.i.d. and LoS ChannelsYashvanth, L. / Murthy, Chandra R. et al. | 2023
- 1
-
Multi-Head Attention and GRU for Improved Match-Mismatch Classification of Speech Stimulus and EEG ResponseBorsdorf, Marvin / Pahuja, Saurav / Ivucic, Gabriel / Cai, Siqi / Li, Haizhou / Schultz, Tanja et al. | 2023
- 1
-
DTTR: Detecting Text with TransformersYang, Jing / You, Zhiqiang / Zhong, Zhiwei / Liu, Peng / Mei, Langqi / Huang, Shenguang et al. | 2023
- 1
-
DST: Deformable Speech Transformer for Emotion RecognitionChen, Weidong / Xing, Xiaofen / Xu, Xiangmin / Pang, Jianxin / Du, Lan et al. | 2023
- 1
-
Cross-Training: A Semi-Supervised Training Scheme for Speech RecognitionKhorram, Soheil / Tripathi, Anshuman / Kim, Jaeyoung / Lu, Han / Zhang, Qian / Prabhavalkar, Rohit / Sak, Hasim et al. | 2023
- 1
-
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo LanguagesWu, Felix / Kim, Kwangyoun / Watanabe, Shinji / Han, Kyu J. / McDonald, Ryan / Weinberger, Kilian Q. / Artzi, Yoav et al. | 2023
- 1
-
MLP-GAN for Brain Vessel Image SegmentationXie, Bin / Tang, Hao / Duan, Bin / Cai, Dawen / Yan, Yan et al. | 2023
- 1
-
Stacking-Based Attention Temporal Convolutional Network for Action SegmentationYang, Liu / Jiang, Yu / Hong, Junkun / Wu, Zhenjie / Yang, Zhan / Long, Jun et al. | 2023
- 1
-
Probabilistic Back-ends for Online Speaker Recognition and ClusteringSholokhov, Alexey / Kuzmin, Nikita / Lee, Kong Aik / Chng, Eng Siong et al. | 2023
- 1
-
Information Extraction from Pill Bottle Images via Text StitchingGupta, Rahul Kumar / Roy, Shilka / Jos, Sujit / S., Unni V. / Lavoie, Lauren / Medous, Frederic / Smith, Walter et al. | 2023
- 1
-
Semi-Supervised Remote Sensing Image Change Detection Using Mean Teacher Model for Constructing Pseudo-LabelsMao, Zan / Tong, Xinyu / Luo, Ze et al. | 2023
- 1
-
Analysing Discrete Self Supervised Speech Representation For Spoken Language ModelingSicherman, Amitay / Adi, Yossi et al. | 2023
- 1
-
Flowpose: Conditional Normalizing Flows for 3D Human Pose and Shape Estimation from Monocular VideosDu, Yaoyao / Zhang, Zixiao / Li, Zhihao / Wei, Peng / Liao, Qingmin / Yang, Wenming et al. | 2023
- 1
-
Glacier: Glass-Box Transformer for Interpretable Dynamic NeuroimagingMahmood, Usman / Fu, Zening / Calhoun, Vince / Plis, Sergey et al. | 2023
- 1
-
NBA-OMP: Near-Field Beam-Split-Aware Orthogonal Matching Pursuit for Wideband THz Channel EstimationElbir, Ahmet M. / Vijay Mishra, Kumar / Chatzinotas, Symeon et al. | 2023
- 1
-
MUG: A General Meeting Understanding and Generation BenchmarkZhang, Qinglin / Deng, Chong / Liu, Jiaqing / Yu, Hai / Chen, Qian / Wang, Wen / Yan, Zhijie / Liu, Jinglin / Ren, Yi / Zhao, Zhou et al. | 2023
- 1
-
Automatic Classification of Vocal Intensity Category from SpeechKodali, Manila / Kadiri, Sudarsana Reddy / Laaksonen, Laura / Alku, Paavo et al. | 2023
- 1
-
A Template Matching Approach for Reference Picture Padding in Video CodingHorst, Nicolas / Das, Priyanka / Wien, Mathias et al. | 2023
- 1
-
An Efficient Relay Selection Scheme for Relay-Assisted HARQDing, Weihang / Shikh-Bahaei, Mohammad et al. | 2023
- 1
-
Sora: Scalable Black-Box Reachability Analyser on Neural NetworksXu, Peipei / Wang, Fu / Ruan, Wenjie / Zhang, Chi / Huang, Xiaowei et al. | 2023
- 1
-
The First Pathloss Radio Map Prediction ChallengeYapar, Cagkan / Jaensch, Fabian / Levie, Ron / Kutyniok, Gitta / Caire, Giuseppe et al. | 2023
- 1
-
U-Shiftformer: Brain Tumor Segmentation Using A Shifted Attention MechanismLin, Chih-Wei / Chen, Zhongsheng et al. | 2023
- 1
-
Does Human Speech Follow Benford’s Law?Hsu, Leo / Berisha, Visar et al. | 2023
- 1
-
Conversation-Oriented ASR with Multi-Look-Ahead CBS ArchitectureZhao, Huaibo / Fujie, Shinya / Ogawa, Tetsuji / Sakuma, Jin / Kida, Yusuke / Kobayashi, Tetsunori et al. | 2023
- 1
-
Towards a Unified Training for Levenshtein TransformerZheng, Kangjie / Wang, Longyue / Wang, Zhihao / Chen, Binqi / Zhang, Ming / Tu, Zhaopeng et al. | 2023
- 1
-
A Principled Approach to Model Validation in Domain GeneralizationLyu, Boyang / Nguyen, Thuan / Scheutz, Matthias / Ishwar, Prakash / Aeron, Shuchin et al. | 2023
- 1
-
Neural Networks with Quantization ConstraintsHounie, Ignacio / Elenter, Juan / Ribeiro, Alejandro et al. | 2023
- 1
-
Direct Position Determination with One-Bit Signal for Multiple TargetsNi, Lihua / Zhang, Di / Xing, Tianyi / Ran, Maoyan / Liu, Ning / Wan, Qun et al. | 2023
- 1
-
Learning to Balance the Global Coherence and Informativeness in Knowledge-Grounded Dialogue GenerationNiu, Chenxu / Hu, Yue / Peng, Wei / Xie, Yuqiang et al. | 2023
- 1
-
Backdoor Attack Against Automatic Speaker Verification Models in Federated LearningMeng, Dan / Wang, Xue / Wang, Jun et al. | 2023
- 1
-
Wireless Deep Speech Semantic TransmissionXiao, Zixuan / Yao, Shengshi / Dai, Jincheng / Wang, Sixian / Niu, Kai / Zhang, Ping et al. | 2023
- 1
-
Context-Aware Fine-Tuning of Self-Supervised Speech ModelsShon, Suwon / Wu, Felix / Kim, Kwangyoun / Sridhar, Prashant / Livescu, Karen / Watanabe, Shinji et al. | 2023
- 1
-
Improved Acoustic-to-Articulatory Inversion Using Representations from Pretrained Self-Supervised Learning ModelsUdupa, Sathvik / C, Siddarth / Ghosh, Prasanta Kumar et al. | 2023
- 1
-
Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in NoiseEffa, Francois / Serizel, Romain / Arz, Jean-Pierre / Grimault, Nicolas et al. | 2023
- 1
-
Disentangled Training with Adversarial Examples for Robust Small-Footprint Keyword SpottingWang, Zhenyu / Wan, Li / Zhang, Biqiao / Huang, Yiteng / Li, Shang-Wen / Sun, Ming / Lei, Xin / Yang, Zhaojun et al. | 2023
- 1
-
Numerical Semantic Modeling for Implicit Discourse Relation RecognitionWang, Chenxu / Jian, Ping / Wang, Hai et al. | 2023
- 1
-
Stereoscopic Video Retargeting Based on Camera Motion ClassificationCai, Linghui / Tang, Zhenhua et al. | 2023
- 1
-
Spoofed Training Data for Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural VocodersWang, Xin / Yamagishi, Junichi et al. | 2023
- 1
-
Massively Multilingual Shallow Fusion with Large Language ModelsHu, Ke / Sainath, Tara N. / Li, Bo / Du, Nan / Huang, Yanping / Dai, Andrew M. / Zhang, Yu / Cabrera, Rodrigo / Chen, Zhifeng / Strohman, Trevor et al. | 2023
- 1
-
SDTN: Speaker Dynamics Tracking Network for Emotion Recognition in ConversationChen, Jiawei / Huang, Peijie / Huang, Guotai / Li, Qianer / Xu, Yuhong et al. | 2023
- 1
-
Improving CTC-Based ASR Models With Gated Interlayer CollaborationYang, Yuting / Li, Yuke / Du, Binbin et al. | 2023
- 1
-
Restoration of Time-Varying Graph Signals using Deep Algorithm UnrollingKojima, Hayate / Noguchi, Hikari / Yamada, Koki / Tanaka, Yuichi et al. | 2023
- 1
-
A Dual-Path Transformer Network for Scene Text DetectionLin, Jingyu / Yan, Yan / Wang, Hanzi et al. | 2023
- 1
-
Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative ModelGolmakani, Ali / Sadeghi, Mostafa / Serizel, Romain et al. | 2023
- 1
-
Ideal: Improved Dense Local Contrastive Learning For Semi-Supervised Medical Image SegmentationBasak, Hritam / Chattopadhyay, Soumitri / Kundu, Rohit / Nag, Sayan / Mallipeddi, Rammohan et al. | 2023
- 1
-
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis SystemYoshimura, Takenori / Takaki, Shinji / Nakamura, Kazuhiro / Oura, Keiichiro / Hono, Yukiya / Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi et al. | 2023
- 1
-
Symbol Level Precoding in the RF Domain for Low Hardware Complexity RIS-Assisted MU-MISO SystemsTsinos, Christos G. / Tsiftsis, Theodoros A. / Schober, Robert et al. | 2023
- 1
-
CTCBERT: Advancing Hidden-Unit BERT with CTC ObjectivesFan, Ruchao / Wang, Yiming / Gaur, Yashesh / Li, Jinyu et al. | 2023
- 1
-
Sine: Similarity-Regularized Intra-Class Exploitation for Cross-Granularity Few-Shot LearningYang, Jinhai / Yang, Hua et al. | 2023
- 1
-
Topological Signal Processing Over Weighted Simplicial ComplexesBattiloro, Claudio / Sardellitti, Stefania / Barbarossa, Sergio / Lorenzo, Paolo Di et al. | 2023
- 1
-
Neural Mode EstimationSun, Peng / Wen, Zhenyu / Zhou, Yejian / Hong, Zhen / Lin, Tao et al. | 2023
- 1
-
Meta Learning with Adaptive Loss Weight for Low-Resource Speech RecognitionWang, Qiulin / Hu, Wenxuan / Li, Lin / Hong, Qingyang et al. | 2023
- 1
-
An Auto-Encoder Based Method for Camera Fingerprint CompressionZhang, Kaixuan / Liu, Zihan / Hu, Jiashang / Wang, Shilin et al. | 2023
- 1
-
A Transformer-Based E2E SLU Model for Improved Semantic ParsingIstaiteh, Othman / Kussad, Yasmeen / Daqour, Yahya / Habib, Maria / Habash, Mohammad / Gowda, Dhananjaya et al. | 2023
- 1
-
Procontext: Exploring Progressive Context Transformer for TrackingLan, Jin-Peng / Cheng, Zhi-Qi / He, Jun-Yan / Li, Chenyang / Luo, Bin / Bao, Xu / Xiang, Wangmeng / Geng, Yifeng / Xie, Xuansong et al. | 2023
- 1
-
Achieving Fair Speech Emotion Recognition via Perceptual FairnessChien, Woan-Shiuan / Lee, Chi-Chun et al. | 2023
- 1
-
Unsupervised Pre-Training for Data-Efficient Text-to-Speech on Low Resource LanguagesPark, Seongyeon / Song, Myungseo / Kim, Bohyung / Oh, Tae-Hyun et al. | 2023
- 1
-
Image Sharing Chain Detection via Sequence-To-Sequence ModelYou, Jiaxiang / Li, Yuanman / Liang, Rongqin / Tan, Yuxuan / Zhou, Jiantao / Li, Xia et al. | 2023
- 1
-
NCL: Textual Backdoor Defense Using Noise-Augmented Contrastive LearningZhai, Shengfang / Shen, Qingni / Chen, Xiaoyi / Wang, Weilong / Li, Cong / Fang, Yuejian / Wu, Zhonghai et al. | 2023
- 1
-
Higher-Order Spatio-Temporal Neural Networks for Covid-19 ForecastingChen, Yuzhou / Batsakis, Sotiris / Poor, H. Vincent et al. | 2023
- 1
-
Regression to Classification: Waveform Encoding for Neural Field-Based Audio Signal RepresentationKim, TaeSoo / Rho, Daniel / Lee, Gahui / Park, JaeHan / Ko, Jong Hwan et al. | 2023
- 1
-
Visual Answer Localization with Cross-Modal Mutual Knowledge TransferWeng, Yixuan / Li, Bin et al. | 2023
- 1
-
An Empirical Study and Improvement for Speech Emotion RecognitionWu, Zhen / Lu, Yizhe / Dai, Xinyu et al. | 2023
- 1
-
A Study of Audio Mixing Methods for Piano Transcription in Violin-Piano EnsemblesKim, Hyemi / Park, Jiyun / Kwon, Taegyun / Jeong, Dasaem / Nam, Juhan et al. | 2023
- 1
-
Interaction-Assisted Multi-Modal Representation Learning for RecommendationWu, Hao / Wang, Jiajie / Zu, Zhonglin et al. | 2023