Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation (English)
- Xu, Yuanyou
- Yang, Zongxin
- Yang, Yi
In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV); 9704-9717; 2023
- Conference paper / Electronic resource
- Title: Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
- Publisher: IEEE
- Publication date: 2023-10-01
- Size: 3504318 bytes
- Media type: Conference paper
- Format: Electronic resource
- Language: English
Table of Contents: Conference Proceedings
The tables of contents are generated automatically and are based on the individual records of the included contributions that are available in the TIB portal index. The tables of contents shown may therefore be incomplete or contain gaps.
- 1
Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor | Liu, Xinyang / Li, Yijin / Teng, Yanbin / Bao, Hujun / Zhang, Guofeng / Zhang, Yinda / Cui, Zhaopeng et al. | 2023
- 1
Title Page I | 2023
- 3
Title Page III | 2023
- 4
Copyright | 2023
- 5
Table of Contents | 2023
- 12
ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes | Yeshwanth, Chandan / Liu, Yueh-Cheng / Niesner, Matthias / Dai, Angela et al. | 2023
- 23
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach | Lu, Jiachen / Li, Hongyang / Peng, Renyuan / Wen, Feng / Cai, Xinyue / Zhang, Wei / Xu, Hang / Zhang, Li et al. | 2023
- 34
Doppelgangers: Learning to Disambiguate Images of Similar Structures | Cai, Ruojin / Tung, Joseph / Wang, Qianqian / Averbuch-Elor, Hadar / Hariharan, Bharath / Snavely, Noah et al. | 2023
- 45
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries | Mai, Jinjie / Hamdi, Abdullah / Giancola, Silvio / Zhao, Chen / Ghanem, Bernard et al. | 2023
- 58
ClothPose: A Real-world Benchmark for Visual Analysis of Garment Pose via An Indirect Recording Solution | Xu, Wenqiang / Du, Wenxin / Xue, Han / Li, Yutong / Ye, Ruolin / Wang, Yan-Feng / Lu, Cewu et al. | 2023
- 69
EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity | Jiang, Zijie / Okutomi, Masatoshi et al. | 2023
- 79
ENVIDR: Implicit Differentiable Renderer with Neural Environment Lighting | Liang, Ruofan / Chen, Huiting / Li, Chunlin / Chen, Fan / Panneer, Selvakumar / Vijaykumar, Nandita et al. | 2023
- 90
Robust Mixture-of-Expert Training for Convolutional Neural Networks | Zhang, Yihua / Cai, Ruisi / Chen, Tianlong / Zhang, Guanhua / Zhang, Huan / Chen, Pin-Yu / Chang, Shiyu / Wang, Zhangyang / Liu, Sijia et al. | 2023
- 102
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models | Lu, Dong / Wang, Zhiqiang / Wang, Teng / Guan, Weili / Gao, Hongchang / Zheng, Feng et al. | 2023
- 112
CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning | Bansal, Hritik / Yin, Fan / Singhi, Nishad / Grover, Aditya / Yang, Yu / Chang, Kai-Wei et al. | 2023
- 124
CGBA: Curvature-aware Geometric Black-box Attack | Reza, Md Farhamdur / Rahmati, Ali / Wu, Tianfu / Dai, Huaiyu et al. | 2023
- 134
Robust Evaluation of Diffusion-Based Adversarial Purification | Lee, Minjong / Kim, Dongwoo et al. | 2023
- 145
Advancing Example Exploitation Can Alleviate Critical Challenges in Adversarial Training | Ge, Yao / Li, Yun / Han, Keji / Zhu, Junyi / Long, Xianzhong et al. | 2023
- 155
The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data | Zhu, Zixuan / Wang, Rui / Zou, Cong / Jing, Lihua et al. | 2023
- 165
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models | Sur, Indranil / Sikka, Karan / Walmer, Matthew / Koneripalli, Kaushik / Roy, Anirban / Lin, Xiao / Divakaran, Ajay / Jha, Susmit et al. | 2023
- 176
Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning | Huang, Yangru / Peng, Peixi / Zhao, Yifan / Zhai, Yunpeng / Xu, Haoran / Tian, Yonghong et al. | 2023
- 186
Among Us: Adversarially Robust Collaborative Perception by Consensus | Li, Yiming / Fang, Qi / Bai, Jiamu / Chen, Siheng / Juefei-Xu, Felix / Feng, Chen et al. | 2023
- 196
Walking Your LiDOG: A Journey Through Multiple Domains for LiDAR Semantic Segmentation | Saltori, Cristiano / Osep, Aljosa / Ricci, Elisa / Leal-Taixe, Laura et al. | 2023
- 207
Stabilizing Visual Reinforcement Learning via Asymmetric Interactive Cooperation | Zhai, Yunpeng / Peng, Peixi / Zhao, Yifan / Huang, Yangru / Tian, Yonghong et al. | 2023
- 217
MAAL: Multimodality-Aware Autoencoder-based Affordance Learning for 3D Articulated Objects | Liang, Yuanzhi / Wang, Xiaohan / Zhu, Linchao / Yang, Yi et al. | 2023
- 228
Rethinking Range View Representation for LiDAR Segmentation | Kong, Lingdong / Liu, Youquan / Chen, Runnan / Ma, Yuexin / Zhu, Xinge / Li, Yikang / Hou, Yuenan / Qiao, Yu / Liu, Ziwei et al. | 2023
- 238
Organizers | 2023
- 240
Keynotes | 2023
- 241
PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring | Lin, Haitao / Fu, Yanwei / Xue, Xiangyang et al. | 2023
- 252
CROSSFIRE: Camera Relocalization On Self-Supervised Features from an Implicit Representation | Moreau, Arthur / Piasco, Nathan / Bennehar, Moussab / Tsishkou, Dzmitry / Stanciulescu, Bogdan / de La Fortelle, Arnaud et al. | 2023
- 263
Environment Agnostic Representation for Visual Reinforcement learning | Choi, Hyesong / Lee, Hunsang / Jeong, Seongwon / Min, Dongbo et al. | 2023
- 274
Test-time Personalizable Forecasting of 3D Human Poses | Cui, Qiongjie / Sun, Huaijiang / Lu, Jianfeng / Li, Weiqing / Li, Bin / Yi, Hongwei / Wang, Haofan et al. | 2023
- 284
HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer | Xiang, Hao / Xu, Runsheng / Ma, Jiaqi et al. | 2023
- 296
Efficient neural supersampling on a novel gaming dataset | Mercier, Antoine / Erasmus, Ruan / Savani, Yashesh / Dhingra, Manik / Porikli, Fatih / Berger, Guillaume et al. | 2023
- 307
Locally Stylized Neural Radiance Fields | Pang, Hong-Wing / Hua, Binh-Son / Yeung, Sai-Kit et al. | 2023
- 317
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects | Wang, Dongqing / Zhang, Tong / Susstrunk, Sabine et al. | 2023
- 328
DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders | Kang, Xiaoyang / Yang, Tao / Ouyang, Wenqi / Ren, Peiran / Li, Lingzhi / Xie, Xuansong et al. | 2023
- 339
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis | Ye, Weicai / Chen, Shuo / Bao, Chong / Bao, Hujun / Pollefeys, Marc / Cui, Zhaopeng / Zhang, Guofeng et al. | 2023
- 352
PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects | Liu, Jiayi / Mahdavi-Amiri, Ali / Savva, Manolis et al. | 2023
- 364
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model | Zhang, Mingyuan / Guo, Xinying / Pan, Liang / Cai, Zhongang / Hong, Fangzhou / Li, Huirong / Yang, Lei / Liu, Ziwei et al. | 2023
- 374
DS-Fusion: Artistic Typography via Discriminated and Stylized Diffusion | Tanveer, Maham / Wang, Yizhi / Mahdavi-Amiri, Ali / Zhang, Hao et al. | 2023
- 385
Dynamic Mesh-Aware Radiance Fields | Qiao, Yi-Ling / Gao, Alexander / Xu, Yiran / Feng, Yue / Huang, Jia-Bin / Lin, Ming C. et al. | 2023
- 397
Neural Reconstruction of Relightable Human Model from Monocular Video | Sun, Wenzhang / Che, Yunlong / Guo, Yandong / Huang, Han et al. | 2023
- 408
Neural Microfacet Fields for Inverse Rendering | Mai, Alexander / Verbin, Dor / Kuester, Falko / Fridovich-Keil, Sara et al. | 2023
- 419
A Theory of Topological Derivatives for Inverse Rendering of Geometry | Mehta, Ishit / Chandraker, Manmohan / Ramamoorthi, Ravi et al. | 2023
- 430
Vox-E: Text-guided Voxel Editing of 3D Objects | Sella, Etai / Fiebelman, Gal / Hedman, Peter / Averbuch-Elor, Hadar et al. | 2023
- 441
StegaNeRF: Embedding Invisible Information within Neural Radiance Fields | Li, Chenxin / Feng, Brandon Y. / Fan, Zhiwen / Pan, Panwang / Wang, Zhangyang et al. | 2023
- 454
GlobalMapper: Arbitrary-Shaped Urban Layout Generation | He, Liu / Aliaga, Daniel et al. | 2023
- 465
Urban Radiance Field Representation with Deformable Neural Mesh Primitives | Lu, Fan / Xu, Yan / Chen, Guang / Li, Hongsheng / Lin, Kwan-Yee / Jiang, Changjun et al. | 2023
- 477
End2End Multi-View Feature Matching with Differentiable Pose Optimization | Roessle, Barbara / Niesner, Matthias et al. | 2023
- 488
Tree-Structured Shading Decomposition | Geng, Chen / Yu, Hong-Xing / Zhang, Sharon / Agrawala, Maneesh / Wu, Jiajun et al. | 2023
- 499
Lens Parameter Estimation for Realistic Depth of Field Modeling | Piche-Meunier, Dominique / Hold-Geoffroy, Yannick / Zhang, Jianming / Lalonde, Jean-Francois et al. | 2023
- 509
AttT2M: Text-Driven Human Motion Generation with Multi-Perspective Attention Mechanism | Zhong, Chongyang / Hu, Lei / Zhang, Zihao / Xia, Shihong et al. | 2023
- 520
Cross-modal Latent Space Alignment for Image to Avatar Translation | De Guevara, Manuel Ladron / Hold-Geoffroy, Yannick / Echevarria, Jose / Smith, Cameron / Li, Yijun / Ito, Daichi et al. | 2023
- 530
Computationally-Efficient Neural Image Compression with Shallow Decoders | Yang, Yibo / Mandt, Stephan et al. | 2023
- 541
3D Instance Segmentation via Enhanced Spatial and Semantic Supervision | Al Khatib, Salwa / El Amine Boudjoghra, Mohamed / Lahoud, Jean / Khan, Fahad Shahbaz et al. | 2023
- 551
Learning Neural Eigenfunctions for Unsupervised Semantic Segmentation | Deng, Zhijie / Luo, Yucen et al. | 2023
- 562
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization | Zhao, Weiguang / Yan, Yuyao / Yang, Chaolong / Ye, Jianan / Yang, Xi / Huang, Kaizhu et al. | 2023
- 572
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport | Li, Wentong / Yuan, Yuqian / Wang, Song / Zhu, Jianke / Li, Jianshu / Liu, Jian / Zhang, Lei et al. | 2023
- 582
Handwritten and Printed Text Segmentation: A Signature Case Study | Gholamian, Sina / Vahdat, Ali et al. | 2023
- 593
Semantic-Aware Implicit Template Learning via Part Deformation Consistency | Kim, Sihyeon / Ko, Juyeon / Joo, Minseok / Cha, Juhan / Lee, Jaewon / Kim, Hyunwoo J. et al. | 2023
- 604
LeaF: Learning Frames for 4D Point Cloud Sequence Understanding | Liu, Yunze / Chen, Junyu / Zhang, Zekai / Huang, Jingwei / Yi, Li et al. | 2023
- 614
MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation | Jo, Sanghyun / Yu, In-Jae / Kim, Kyungsu et al. | 2023
- 624
USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation | Peng, Zelin / Wang, Guanchun / Xie, Lingxi / Jiang, Dongsheng / Shen, Wei / Tian, Qi et al. | 2023
- 635
XMem++: Production-level Video Segmentation From Few Annotated Frames | Bekuzarov, Maksym / Bermudez, Ariana / Lee, Joon-Young / Li, Hao et al. | 2023
- 645
ΣIGMA: Scale-Invariant Global Sparse Shape Matching | Gao, Maolin / Roetzer, Paul / Eisenberger, Marvin / Lahner, Zorah / Moeller, Michael / Cremers, Daniel / Bernard, Florian et al. | 2023
- 655
Self-Calibrated Cross Attention Network for Few-Shot Segmentation | Xu, Qianxiong / Zhao, Wenting / Lin, Guosheng / Long, Cheng et al. | 2023
- 666
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation | Li, Kehan / Zhao, Yian / Wang, Zhennan / Cheng, Zesen / Jin, Peng / Ji, Xiangyang / Yuan, Li / Liu, Chang / Chen, Jie et al. | 2023
- 677
Texture Learning Domain Randomization for Domain Generalized Segmentation | Kim, Sunghwan / Kim, Dae-Hwan / Kim, Hoseong et al. | 2023
- 688
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning | Su, Tiankang / Song, Huihui / Liu, Dong / Liu, Bo / Liu, Qingshan et al. | 2023
- 699
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only | Chen, Jun / Zhu, Deyao / Qian, Guocheng / Ghanem, Bernard / Yan, Zhicheng / Zhu, Chenchen / Xiao, Fanyi / Culatana, Sean Chang / Elhoseiny, Mohamed et al. | 2023
- 711
RbA: Segmenting Unknown Regions Rejected by All | Nayal, Nazir / Yavuz, Misra / Henriques, Joao F. / Guney, Fatma et al. | 2023
- 723
Sempart: Self-supervised Multi-resolution Partitioning of Image Semantics | Ravindran, Sriram / Basu, Debraj et al. | 2023
- 734
Multi-Object Discovery by Low-Dimensional Object Motion | Safadoust, Sadra / Guney, Fatma et al. | 2023
- 745
MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory | Li, Enxu / Casas, Sergio / Urtasun, Raquel et al. | 2023
- 755
Treating Pseudo-labels Generation as Image Matting for Weakly Supervised Semantic Segmentation | Wang, Changwei / Xu, Rongtao / Xu, Shibiao / Meng, Weiliang / Zhang, Xiaopeng et al. | 2023
- 766
BoxSnake: Polygonal Instance Segmentation with Box Supervision | Yang, Rui / Song, Lin / Ge, Yixiao / Li, Xiu et al. | 2023
- 777
Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation | Tang, Quan / Zhang, Bowen / Liu, Jiajun / Liu, Fagui / Liu, Yifan et al. | 2023
- 787
Instance Neural Radiance Field | Liu, Yichen / Hu, Benran / Huang, Junkai / Tai, Yu-Wing / Tang, Chi-Keung et al. | 2023
- 797
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Han, Kunyang / Liu, Yong / Liew, Jun Hao / Ding, Henghui / Liu, Jiajun / Wang, Yitong / Tang, Yansong / Yang, Yujiu / Feng, Jiashi / Zhao, Yao et al. | 2023
- 808
Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation | Peng, Duo / Hu, Ping / Ke, Qiuhong / Liu, Jun et al. | 2023
- 821
Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings | Liu, Yuhe / Liu, Chuanjian / Han, Kai / Tang, Quan / Qin, Zengchang et al. | 2023
- 832
The Making and Breaking of Camouflage | Lamdouar, Hala / Xie, Weidi / Zisserman, Andrew et al. | 2023
- 843
CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation | Zhang, Zekang / Gao, Guangyu / Jiao, Jianbo / Liu, Chi Harold / Wei, Yunchao et al. | 2023
- 854
Few-Shot Physically-Aware Articulated Mesh Generation via Hierarchical Deformation | Liu, Xueyi / Wang, Bin / Wang, He / Yi, Li et al. | 2023
- 865
HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part Labeling | Yu, Fenggen / Qian, Yiming / Gil-Ureta, Francisca / Jackson, Brian / Bennett, Eric / Zhang, Hao et al. | 2023
- 876
FreeCOS: Self-Supervised Learning from Fractals and Unlabeled Images for Curvilinear Object Segmentation | Shi, Tianyi / Ding, Xiaohuan / Zhang, Liang / Yang, Xin et al. | 2023
- 887
MasQCLIP for Open-Vocabulary Universal Image Segmentation | Xu, Xin / Xiong, Tianyi / Ding, Zheng / Tu, Zhuowen et al. | 2023
- 899
CTVIS: Consistent Training for Online Video Instance Segmentation | Ying, Kaining / Zhong, Qing / Mao, Weian / Wang, Zhenhua / Chen, Hao / Wu, Lin Yuanbo / Liu, Yifan / Fan, Chengxiang / Zhuge, Yunzhi / Shen, Chunhua et al. | 2023
- 909
A Generalist Framework for Panoptic Segmentation of Images and Videos | Chen, Ting / Li, Lala / Saxena, Saurabh / Hinton, Geoffrey / Fleet, David J. et al. | 2023
- 920
Spectrum-guided Multi-granularity Referring Video Object Segmentation | Miao, Bo / Bennamoun, Mohammed / Gao, Yongsheng / Mian, Ajmal et al. | 2023
- 931
Space Engage: Collaborative Space Supervision for Contrastive-based Semi-Supervised Semantic Segmentation | Wang, Changqi / Xie, Haoyu / Yuan, Yuhui / Fu, Chong / Yue, Xiangyu et al. | 2023
- 943
Adaptive Superpixel for Active Learning in Semantic Segmentation | Kim, Hoyoung / Oh, Minhyeon / Hwang, Sehyun / Kwak, Suha / Ok, Jungseul et al. | 2023
- 954
Multimodal Variational Auto-encoder based Audio-Visual Segmentation | Mao, Yuxin / Zhang, Jing / Xiang, Mochu / Zhong, Yiran / Dai, Yuchao et al. | 2023
- 966
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation | Yuan, Yichen / Wang, Yifan / Wang, Lijun / Zhao, Xiaoqi / Lu, Huchuan / Wang, Yu / Su, Weibo / Zhang, Lei et al. | 2023
- 977
2D-3D Interlaced Transformer for Point Cloud Segmentation with Scene-Level Supervision | Yang, Cheng-Kun / Chen, Min-Hung / Chuang, Yung-Yu / Lin, Yen-Yu et al. | 2023
- 988
Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models | Dombrowski, Mischa / Reynaud, Hadrien / Baugh, Matthew / Kainz, Bernhard et al. | 2023
- 999
SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning | Zhu, Muzhi / Li, Hengtao / Chen, Hao / Fan, Chengxiang / Mao, Weian / Jing, Chenchen / Liu, Yifan / Shen, Chunhua et al. | 2023
- 1009
Monte Carlo Linear Clustering with Single-Point Supervision is Enough for Infrared Small Target Detection | Li, Boyang / Wang, Yingqian / Wang, Longguang / Zhang, Fei / Liu, Ting / Lin, Zaiping / An, Wei / Guo, Yulan et al. | 2023
- 1020
A Simple Framework for Open-Vocabulary Segmentation and Detection | Zhang, Hao / Li, Feng / Zou, Xueyan / Liu, Shilong / Li, Chunyuan / Yang, Jianwei / Zhang, Lei et al. | 2023
- 1032
Source-free Depth for Object Pop-out | Wu, Zongwei / Paudel, Danda Pani / Fan, Deng-Ping / Wang, Jingjing / Wang, Shuo / Demonceaux, Cedric / Timofte, Radu / Van Gool, Luc et al. | 2023
- 1043
DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer | Rana, Amit Kumar / Mahadevan, Sabarinath / Hermans, Alexander / Leibe, Bastian et al. | 2023
- 1053
Atmospheric Transmission and Thermal Inertia Induced Blind Road Segmentation with a Large-Scale Dataset TBRSD | Chen, Junzhang / Bai, Xiangzhi et al. | 2023
- 1064
Informative Data Mining for One-shot Cross-Domain Semantic Segmentation | Wang, Yuxi / Liang, Jian / Xiao, Jun / Mei, Shuqi / Yang, Yuran / Zhang, Zhaoxiang et al. | 2023
- 1075
Homography Guided Temporal Fusion for Road Line and Marking Segmentation | Wang, Shan / Nguyen, Chuong / Liu, Jiawei / Zhang, Kaihao / Luo, Wenhan / Zhang, Yanhao / Muthu, Sundaram / Maken, Fahira Afzal / Li, Hongdong et al. | 2023
- 1086
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network | Han, Cong / Zhong, Yujie / Li, Dengjie / Han, Kai / Ma, Lin et al. | 2023
- 1097
TCOVIS: Temporally Consistent Online Video Instance Segmentation | Li, Junlong / Yu, Bingyao / Rao, Yongming / Zhou, Jie / Lu, Jiwen et al. | 2023
- 1108
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation | Chen, Liyi / Lei, Chenyang / Li, Ruihuang / Li, Shuai / Zhang, Zhaoxiang / Zhang, Lei et al. | 2023
- 1119
Stochastic Segmentation with Conditional Categorical Diffusion Models | Zbinden, Lukas / Doorenbos, Lars / Pissas, Theodoros / Huber, Adrian Thomas / Sznitman, Raphael / Marquez-Neila, Pablo et al. | 2023
- 1130
SegGPT: Towards Segmenting Everything In Context | Wang, Xinlong / Zhang, Xiaosong / Cao, Yue / Wang, Wen / Shen, Chunhua / Huang, Tiejun et al. | 2023
- 1141
Open-vocabulary Panoptic Segmentation with Embedding Modulation | Chen, Xi / Li, Shuang / Lim, Ser-Nam / Torralba, Antonio / Zhao, Hengshuang et al. | 2023
- 1151
Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation | Liu, Yuyuan / Ding, Choubo / Tian, Yu / Pang, Guansong / Belagiannis, Vasileios / Reid, Ian / Carneiro, Gustavo et al. | 2023
- 1162
Zero-guidance Segmentation Using Zero Segment Labels | Rewatbowornwong, Pitchaporn / Chatthee, Nattanat / Chuangsuwanich, Ekapol / Suwajanakorn, Supasorn et al. | 2023
- 1173
Model Calibration in Dense Classification with Adaptive Label Perturbation | Liu, Jiawei / Ye, Changkun / Wang, Shan / Cui, Ruikai / Zhang, Jing / Zhang, Kaihao / Barnes, Nick et al. | 2023
- 1185
Enhanced Soft Label for Semi-Supervised Semantic Segmentation | Ma, Jie / Wang, Chuan / Liu, Yang / Lin, Liang / Li, Guanbin et al. | 2023
- 1196
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation | Cai, Kaixin / Ren, Pengzhen / Zhu, Yi / Xu, Hang / Liu, Jianzhuang / Li, Changlin / Wang, Guangrun / Liang, Xiaodan et al. | 2023
- 1206
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models | Wu, Weijia / Zhao, Yuzhong / Shou, Mike Zheng / Zhou, Hong / Shen, Chunhua et al. | 2023
- 1218
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation | Sun, Rui / Wang, Yuan / Mai, Huayu / Zhang, Tianzhu / Wu, Feng et al. | 2023
- 1229
Semi-Supervised Semantic Segmentation under Label Noise via Diverse Learning Groups | Li, Peixia / Purkait, Pulak / Ajanthan, Thalaiyasingam / Abdolshah, Majid / Garg, Ravi / Husain, Hisham / Xu, Chenchen / Gould, Stephen / Ouyang, Wanli / Van Den Hengel, Anton et al. | 2023
- 1239
SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets | Simons, Cody / Raychaudhuri, Dripta S. / Miraj Ahmed, Sk / You, Suya / Karydis, Konstantinos / Roy-Chowdhury, Amit K. et al. | 2023
- 1250
Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision | Hsieh, Yu-Hsing / Chen, Guan-Sheng / Cai, Shun-Xian / Wei, Ting-Yun / Yang, Huei-Fang / Chen, Chu-Song et al. | 2023
- 1262
Coarse-to-Fine Amodal Segmentation with Shape Prior | Gao, Jianxiong / Qian, Xuelin / Wang, Yikai / Xiao, Tianjun / He, Tong / Zhang, Zheng / Fu, Yanwei et al. | 2023
- 1272
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation | Fan, Ke / Lei, Jingshi / Qian, Xuelin / Yu, Miaopeng / Xiao, Tianjun / He, Tong / Zhang, Zheng / Fu, Yanwei et al. | 2023
- 1282
DVIS: Decoupled Video Instance Segmentation Framework | Zhang, Tao / Tian, Xingye / Wu, Yu / Ji, Shunping / Wang, Xuebo / Zhang, Yuan / Wan, Pengfei et al. | 2023
- 1292
3D Segmentation of Humans in Point Clouds with Synthetic Data | Takmaz, Ayca / Schult, Jonas / Kaftan, Irem / Akcay, Mertcan / Leibe, Bastian / Sumner, Robert / Engelmann, Francis / Tang, Siyu et al. | 2023
- 1305
WaterMask: Instance Segmentation for Underwater Imagery | Lian, Shijie / Li, Hua / Cong, Runmin / Li, Suqi / Zhang, Wei / Kwong, Sam et al. | 2023
- 1316
Tracking Anything with Decoupled Video Segmentation | Cheng, Ho Kei / Wug Oh, Seoung / Price, Brian / Schwing, Alexander / Lee, Joon-Young et al. | 2023
- 1327
Cross Contrasting Feature Perturbation for Domain Generalization | Li, Chenming / Zhang, Daoan / Huang, Wenjian / Zhang, Jianguo et al. | 2023
- 1338
Flexible Visual Recognition by Evidential Modeling of Confusion and Ignorance | Fan, Lei / Liu, Bo / Li, Haoxiang / Wu, Ying / Hua, Gang et al. | 2023
- 1348
CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Abdelfattah, Rabab / Guo, Qing / Li, Xiaoguang / Wang, Xiaofeng / Wang, Song et al. | 2023
- 1358
RankMixup: Ranking-Based Mixup Training for Network Calibration | Noh, Jongyoun / Park, Hyekang / Lee, Junghyup / Ham, Bumsub et al. | 2023
- 1369
Label-Noise Learning with Intrinsically Long-Tailed Data | Lu, Yang / Zhang, Yiliang / Han, Bo / Cheung, Yiu-Ming / Wang, Hanzi et al. | 2023
- 1379
Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition | Liu, Xingyu / Zhou, Sanping / Wang, Le / Hua, Gang et al. | 2023
- 1389
Rethinking Mobile Block for Efficient Attention-based Models | Zhang, Jiangning / Li, Xiangtai / Li, Jian / Liu, Liang / Xue, Zhucun / Zhang, Boshen / Jiang, Zhengkai / Huang, Tianxin / Wang, Yabiao / Wang, Chengjie et al. | 2023
- 1401
Read-only Prompt Optimization for Vision-Language Few-shot Learning | Lee, Dongjun / Song, Seokwon / Suh, Jihee / Choi, Joonmyeong / Lee, Sanghyeok / Kim, Hyunwoo J. et al. | 2023
- 1412
Understanding Self-attention Mechanism via Dynamical System Perspective | Huang, Zhongzhan / Liang, Mingfu / Qin, Jinghui / Zhong, Shanshan / Lin, Liang et al. | 2023
- 1423
Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels | Zhang, Wenqiao / Liu, Changshuo / Zeng, Lingze / Ooi, Bengchin / Tang, Siliang / Zhuang, Yueting et al. | 2023
- 1433
What do neural networks learn in image classification? A frequency shortcut perspective | Wang, Shunxin / Veldhuis, Raymond / Brune, Christoph / Strisciuglio, Nicola et al. | 2023
- 1443
Inducing Neural Collapse to a Fixed Hierarchy-Aware Frame for Reducing Mistake Severity | Liang, Tong / Davis, Jim et al. | 2023
- 1453
Unified Out-Of-Distribution Detection: A Model-Specific Perspective | Averly, Reza / Chao, Wei-Lun et al. | 2023
- 1464
A Unified Framework for Robustness on Diverse Sampling Errors | Jeon, Myeongho / Kang, Myungjoo / Lee, Joonseok et al. | 2023
- 1473
Scene-Aware Label Graph Learning for Multi-Label Image Classification | Zhu, Xuelin / Liu, Jian / Liu, Weijia / Ge, Jiawei / Liu, Bo / Cao, Jiuxin et al. | 2023
- 1483
Holistic Label Correction for Noisy Multi-Label Classification | Xia, Xiaobo / Deng, Jiankang / Bao, Wei / Du, Yuxuan / Han, Bo / Shan, Shiguang / Liu, Tongliang et al. | 2023
- 1494
Strip-MLP: Efficient Token Interaction for Vision MLP | Cao, Guiping / Luo, Shengda / Huang, Wenjian / Lan, Xiangyuan / Jiang, Dongmei / Wang, Yaowei / Zhang, Jianguo et al. | 2023
- 1505
EQ-Net: Elastic Quantization Neural Networks | Xu, Ke / Han, Lei / Tian, Ye / Yang, Shangshang / Zhang, Xingyi et al. | 2023
- 1515
Data-free Knowledge Distillation for Fine-grained Visual Categorization | Shao, Renrong / Zhang, Wei / Yin, Jianhua / Wang, Jun et al. | 2023
- 1526
Shift from Texture-bias to Shape-bias: Edge Deformation-based Augmentation for Robust Object Recognition | He, Xilin / Lin, Qinliang / Luo, Cheng / Xie, Weicheng / Song, Siyang / Liu, Feng / Shen, Linlin et al. | 2023
- 1536
Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition | Lee, Isack / Lee, Eungi / Yoo, Seok Bong et al. | 2023
- 1547
DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration | Zhou, Nan / Chen, Jiaxin / Huang, Di et al. | 2023
- 1557
Understanding the Feature Norm for Out-of-Distribution Detection | Park, Jaewoo / Long Chai, Jacky Chen / Yoon, Jaeho / Jin Teoh, Andrew Beng et al. | 2023
- 1568
Multi-View Active Fine-Grained Visual Recognition | Du, Ruoyi / Yu, Wenqing / Wang, Heqing / Lin, Ting-En / Chang, Dongliang / Ma, Zhanyu et al. | 2023
- 1579
DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models | Gao, Ruiyuan / Zhao, Chenchen / Hong, Lanqing / Xu, Qiang et al. | 2023
- 1590
Task-aware Adaptive Learning for Cross-domain Few-shot Learning | Guo, Yurong / Du, Ruoyi / Dong, Yuan / Hospedales, Timothy / Song, Yi-Zhe / Ma, Zhanyu et al. | 2023
- 1600
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting | Huang, Qidong / Dong, Xiaoyi / Chen, Dongdong / Chen, Yinpeng / Yuan, Lu / Hua, Gang / Zhang, Weiming / Yu, Nenghai et al. | 2023
- 1611
Saliency Regularization for Self-Training with Partial Annotations | Wang, Shouwen / Wan, Qian / Xiang, Xiang / Zeng, Zhigang et al. | 2023
- 1621
Learning Gabor Texture Features for Fine-Grained Recognition | Zhu, Lanyun / Chen, Tianrun / Yin, Jianxiong / See, Simon / Liu, Jun et al. | 2023
- 1632
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding | Li, Kunchang / Wang, Yali / He, Yinan / Li, Yizhuo / Wang, Yi / Wang, Limin / Qiao, Yu et al. | 2023
- 1644
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels | Zhang, Ziyi / Chen, Weikai / Fang, Chaowei / Li, Zhen / Chen, Lechao / Lin, Liang / Li, Guanbin et al. | 2023
- 1655
MetaGCD: Learning to Continually Learn in Generalized Category Discovery | Wu, Yanan / Chi, Zhixiang / Wang, Yang / Feng, Songhe et al. | 2023
- 1666
FerKD: Surgical Label Adaptation for Efficient Distillation | Shen, Zhiqiang et al. | 2023
- 1676
Point-Query Quadtree for Crowd Counting, Localization, and More | Liu, Chengxin / Lu, Hao / Cao, Zhiguo / Liu, Tongliang et al. | 2023
- 1686
Nearest Neighbor Guidance for Out-of-Distribution Detection | Park, Jaewoo / Jung, Yoon Gyo / Beng Jin Teoh, Andrew et al. | 2023
- 1696
Bayesian Optimization Meets Self-Distillation | Lee, HyunJae / Song, Heon / Lee, Hyeonsoo / Lee, Gi-Hyeon / Park, Suyeong / Yoo, Donggeun et al. | 2023
- 1706
When Prompt-based Incremental Learning Does Not Meet Strong Pretraining | Tang, Yu-Ming / Peng, Yi-Xing / Zheng, Wei-Shi et al. | 2023
- 1717
When to Learn What: Model-Adaptive Data Augmentation Curriculum | Hou, Chengkai / Zhang, Jieyu / Zhou, Tianyi et al. | 2023
- 1729
Parametric Information Maximization for Generalized Category Discovery | Chiaroni, Florent / Dolz, Jose / Masud, Ziko Imtiaz / Mitiche, Amar / Ayed, Ismail Ben et al. | 2023
- 1740
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching | Xing, Jiazheng / Wang, Mengmeng / Ruan, Yudi / Chen, Bofan / Guo, Yaowei / Mu, Boyu / Dai, Guang / Wang, Jingdong / Liu, Yong et al. | 2023
- 1751
Domain Generalization via Rationale Invariance | Chen, Liang / Zhang, Yong / Song, Yibing / Van Den Hengel, Anton / Liu, Lingqiao et al. | 2023
- 1761
Masked Spiking Transformer | Wang, Ziqing / Fang, Yuetong / Cao, Jiahang / Zhang, Qiang / Wang, Zhongrui / Xu, Renjing et al. | 2023
- 1772
Prototype Reminiscence and Augmented Asymmetric Knowledge Aggregation for Non-Exemplar Class-Incremental Learning | Shi, Wuxuan / Ye, Mang et al. | 2023
- 1782
Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning | Li, Yun / Liu, Zhe / Jha, Saurav / Yao, Lina et al. | 2023
- 1792
Candidate-aware Selective Disambiguation Based On Normalized Entropy for Instance-dependent Partial-label Learning | He, Shuo / Yang, Guowu / Feng, Lei et al. | 2023
- 1802
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No | Wang, Hualiang / Li, Yi / Yao, Huifeng / Li, Xiaomeng et al. | 2023
- 1813
Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search | Wang, Benzhi / Yang, Yang / Wu, Jinlin / Qi, Guo-Jun / Lei, Zhen et al. | 2023
- 1823
Sample-wise Label Confidence Incorporation for Learning with Noisy Labels | Ahn, Chanho / Kim, Kikyung / Baek, Ji-Won / Lim, Jongin / Han, Seungju et al. | 2023
- 1833
Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples | Xia, Xiaobo / Han, Bo / Zhan, Yibing / Yu, Jun / Gong, Mingming / Gong, Chen / Liu, Tongliang et al. | 2023
- 1844
Spatial-Aware Token for Weakly Supervised Object Localization | Wu, Pingyu / Zhai, Wei / Cao, Yang / Luo, Jiebo / Zha, Zheng-Jun et al. | 2023
- 1855
Towards Improved Input Masking for Convolutional Neural Networks | Balasubramanian, Sriram / Feizi, Soheil et al. | 2023
- 1866
PDiscoNet: Semantically consistent part discovery for fine-grained recognition | van der Klis, Robert / Alaniz, Stephan / Mancini, Massimiliano / Dantas, Cassio F. / Ienco, Dino / Akata, Zeynep / Marcos, Diego et al. | 2023
- 1877
Corrupting Neuron Explanations of Deep Visual Features | Srivastava, Divyansh / Oikarinen, Tuomas / Weng, Tsui-Wei et al. | 2023
- 1887
ICICLE: Interpretable Class Incremental Continual Learning | Rymarczyk, Dawid / van de Weijer, Joost / Zielinski, Bartosz / Twardowski, Bartlomiej et al. | 2023
- 1899
ProbVLM: Probabilistic Adapter for Frozen Vison-Language Models | Upadhyay, Uddeshya / Karthik, Shyamgopal / Mancini, Massimiliano / Akata, Zeynep et al. | 2023
- 1911
Out-of-Distribution Detection for Monocular Depth Estimation | Hornauer, Julia / Holzbock, Adrian / Belagiannis, Vasileios et al. | 2023
- 1922
Studying How to Efficiently and Effectively Guide Models with Explanations | Rao, Sukrut / Bohle, Moritz / Parchami-Araghi, Amin / Schiele, Bernt et al. | 2023
- 1934
Rosetta Neurons: Mining the Common Units in a Model Zoo | Dravid, Amil / Gandelsman, Yossi / Efros, Alexei A. / Shocher, Assaf et al. | 2023
- 1944
Prototype-based Dataset Comparison | Van Noord, Nanne et al. | 2023
- 1955
Learning to Identify Critical States for Reinforcement Learning from Videos | Liu, Haozhe / Zhuge, Mingchen / Li, Bing / Wang, Yuhui / Faccio, Francesco / Ghanem, Bernard / Schmidhuber, Jurgen et al. | 2023
- 1966
Leaping Into Memories: Space-Time Deep Feature Synthesis | Stergiou, Alexandros / Deligiannis, Nikos et al. | 2023
- 1977
MAGI: Multi-Annotated Explanation-Guided Learning | Zhang, Yifei / Gu, Siyi / Gao, Yuyang / Pan, Bo / Yang, Xiaofeng / Zhao, Liang et al. | 2023
- 1988
SAFARI: Versatile and Efficient Evaluations for Robustness of Interpretability | Huang, Wei / Zhao, Xingyu / Jin, Gaojie / Huang, Xiaowei et al. | 2023
- 1999
-
Do DALL-E and Flamingo Understand Each Other?Li, Hang / Gu, Jindong / Koner, Rajat / Sharifzadeh, Sahand / Tresp, Volker et al. | 2023
- 2011
-
Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype NetworksHuang, Qihan / Xue, Mengqi / Huang, Wenqi / Zhang, Haofei / Song, Jie / Jing, Yongcheng / Song, Mingli et al. | 2023
- 2021
-
MoreauGrad: Sparse and Robust Interpretation of Neural Networks via Moreau EnvelopeZhang, Jingwei / Farnia, Farzan et al. | 2023
- 2031
-
Towards Understanding the Generalization of Deepfake Detectors from a Game-Theoretical ViewYao, Kelu / Wang, Jin / Diao, Boyu / Li, Chao et al. | 2023
- 2042
-
Counterfactual-based Saliency Map: Towards Visual Contrastive Explanations for Neural NetworksWang, Xue / Wang, Zhibo / Weng, Haiqin / Guo, Hengchang / Zhang, Zhifei / Jin, Lu / Wei, Tao / Ren, Kui et al. | 2023
- 2052
-
Beyond Single Path Integrated Gradients for Reliable Input Attribution via Randomized Path SamplingJeon, Giyoung / Jeong, Haedong / Choi, Jaesik et al. | 2023
- 2062
-
Learning Support and Trivial Prototypes for Interpretable Image ClassificationWang, Chong / Liu, Yuyuan / Chen, Yuanhong / Liu, Fengbei / Tian, Yu / McCarthy, Davis / Frazer, Helen / Carneiro, Gustavo et al. | 2023
- 2073
-
Visual Explanations via Iterated Integrated AttributionsBarkan, Oren / Elisha, Yehonatan / Asher, Yuval / Eshel, Amit / Koenigstein, Noam et al. | 2023
- 2085
-
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative ModelsLiu, Nan / Du, Yilun / Li, Shuang / Tenenbaum, Joshua B. / Torralba, Antonio et al. | 2023
- 2096
-
Human Preference Score: Better Aligning Text-to-image Models with Human PreferenceWu, Xiaoshi / Sun, Keqiang / Zhu, Feng / Zhao, Rui / Li, Hongsheng et al. | 2023
- 2106
-
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout TransformerLevi, Elad / Brosh, Eli / Mykhailych, Mykola / Perez, Meir et al. | 2023
- 2116
-
Anti-DreamBooth: Protecting users from personalized text-to-image synthesisVan Le, Thanh / Phung, Hao / Nguyen, Thuan Hoang / Dao, Quan / Tran, Ngoc N. / Tran, Anh et al. | 2023
- 2128
-
GECCO: Geometrically-Conditioned Point Diffusion ModelsTyszkiewicz, Michal J. / Fua, Pascal / Trulls, Eduard et al. | 2023
- 2139
-
DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion ModelsCai, Shengqu / Chan, Eric Ryan / Peng, Songyou / Shahbazi, Mohamad / Obukhov, Anton / Van Gool, Luc / Wetzstein, Gordon et al. | 2023
- 2151
-
Guided Motion Diffusion for Controllable Human Motion SynthesisKarunratanakul, Korrawe / Preechakul, Konpat / Suwajanakorn, Supasorn / Tang, Siyu et al. | 2023
- 2163
-
COOP: Decoupling and Coupling of Whole-Body Grasping Pose GenerationZheng, Yanzhao / Shi, Yunzhou / Cui, Yuhao / Zhao, Zhongzhou / Luo, Zhiling / Zhou, Wei et al. | 2023
- 2174
-
Zero-shot spatial layout conditioning for text-to-image diffusion modelsCouairon, Guillaume / Careil, Marlene / Cord, Matthieu / Lathuiliere, Stephane / Verbeek, Jakob et al. | 2023
- 2184
-
StyleDomain: Efficient and Lightweight Parameterizations of StyleGAN for One-shot and Few-shot Domain AdaptationAlanov, Aibek / Titov, Vadim / Nakhodnov, Maksim / Vetrov, Dmitry et al. | 2023
- 2195
-
GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance ManifoldsXiang, Jianfeng / Yang, Jiaolong / Deng, Yu / Tong, Xin et al. | 2023
- 2206
-
Your Diffusion Model is Secretly a Zero-Shot ClassifierLi, Alexander C. / Prabhudesai, Mihir / Duggal, Shivam / Brown, Ellis / Pathak, Deepak et al. | 2023
- 2218
-
Learning Hierarchical Features with Joint Latent Space Energy-Based PriorCui, Jiali / Wu, Ying Nian / Han, Tian et al. | 2023
- 2228
-
ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion GenerationXu, Liang / Song, Ziyang / Wang, Dongliang / Su, Jing / Fang, Zhicheng / Ding, Chenjing / Gan, Weihao / Yan, Yichao / Jin, Xin / Yang, Xiaokang et al. | 2023
- 2239
-
Landscape Learning for Neural Network InversionLiu, Ruoshi / Mao, Chengzhi / Tendulkar, Purva / Wang, Hao / Vondrick, Carl et al. | 2023
- 2251
-
Diffusion in StyleEveraert, Martin Nicolas / Bocchio, Marco / Arpa, Sami / Susstrunk, Sabine / Achanta, Radhakrishna et al. | 2023
- 2262
-
Diffusion-SDF: Conditional Generative Modeling of Signed Distance FunctionsChou, Gene / Bahat, Yuval / Heide, Felix et al. | 2023
- 2273
-
GETAvatar: Generative Textured Meshes for Animatable Human AvatarsZhang, Xuanmeng / Zhang, Jianfeng / Chacko, Rohan / Xu, Hongyi / Song, Guoxian / Yang, Yi / Feng, Jiashi et al. | 2023
- 2283
-
A-STAR: Test-time Attention Segregation and Retention for Text-to-image SynthesisAgarwal, Aishwarya / Karanam, Srikrishna / Joseph, K J / Saxena, Apoorv / Goswami, Koustava / Srinivasan, Balaji Vasan et al. | 2023
- 2294
-
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image CompositionLu, Shilin / Liu, Yanzhu / Kong, Adams Wai-Kin et al. | 2023
- 2306
-
Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative DescriptionsQian, Yijun / Urbanek, Jack / Hauptmann, Alexander G. / Won, Jungdam et al. | 2023
- 2317
-
BeLFusion: Latent Diffusion for Behavior-Driven Human Motion PredictionBarquero, German / Escalera, Sergio / Palmero, Cristina et al. | 2023
- 2328
-
Delta Denoising ScoreHertz, Amir / Aberman, Kfir / Cohen-Or, Daniel et al. | 2023
- 2338
-
Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D ImitationChen, Xingyu / Deng, Yu / Wang, Baoyuan et al. | 2023
- 2349
-
DreamBooth3D: Subject-Driven Text-to-3D GenerationRaj, Amit / Kaza, Srinivas / Poole, Ben / Niemeyer, Michael / Ruiz, Nataniel / Mildenhall, Ben / Zada, Shiran / Aberman, Kfir / Rubinstein, Michael / Barron, Jonathan et al. | 2023
- 2360
-
Feature Proliferation — the "Cancer" in StyleGAN and its TreatmentsSong, Shuang / Liang, Yuanbang / Wu, Jing / Lai, Yu-Kun / Qin, Yipeng et al. | 2023
- 2371
-
Unsupervised Facial Performance Editing via Vector-Quantized StyleGAN RepresentationsKicanaoglu, Berkay / Garrido, Pablo / Bharaj, Gaurav et al. | 2023
- 2383
-
3D-aware Image Generation using 2D Diffusion ModelsXiang, Jianfeng / Yang, Jiaolong / Huang, Binbin / Tong, Xin et al. | 2023
- 2394
-
Neural Collage Transfer: Artistic Reconstruction via Material ManipulationLee, Ganghun / Kim, Minji / Lee, Yunsu / Lee, Minsu / Zhang, Byoung-Tak et al. | 2023
- 2406
-
Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model AdaptionHu, Teng / Zhang, Jiangning / Liu, Liang / Yi, Ran / Kou, Siqi / Zhu, Haokun / Chen, Xu / Wang, Yabiao / Wang, Chengjie / Ma, Lizhuang et al. | 2023
- 2416
-
Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and ReconstructionChen, Hansheng / Gu, Jiatao / Chen, Anpei / Tian, Wei / Tu, Zhuowen / Liu, Lingjie / Su, Hao et al. | 2023
- 2426
-
Erasing Concepts from Diffusion ModelsGandikota, Rohit / Materzynska, Joanna / Fiotto-Kaufman, Jaden / Bau, David et al. | 2023
- 2437
-
Make Encoder Great Again in 3D GAN Inversion through Geometry and Occlusion-Aware EncodingYuan, Ziyang / Zhu, Yiming / Li, Yu / Liu, Hongyu / Yuan, Chun et al. | 2023
- 2448
-
HairNeRF: Geometry-Aware Image Synthesis for Hairstyle TransferChang, Seunggyu / Kim, Gihoon / Kim, Hayeon et al. | 2023
- 2459
-
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-trainingLin, Yuanze / Wei, Chen / Wang, Huiyu / Yuille, Alan / Xie, Cihang et al. | 2023
- 2470
-
DiffusionRet: Generative Text-Video Retrieval with Diffusion ModelJin, Peng / Li, Hao / Cheng, Zesen / Li, Kehan / Ji, Xiangyang / Liu, Chang / Yuan, Li / Chen, Jie et al. | 2023
- 2482
-
Explore and Tell: Embodied Visual Captioning in 3D EnvironmentsHu, Anwen / Chen, Shizhe / Zhang, Liang / Jin, Qin et al. | 2023
- 2492
-
Distilling Large Vision-Language Model with Out-of-Distribution GeneralizabilityLi, Xuanlin / Fang, Yunhao / Liu, Minghua / Ling, Zhan / Tu, Zhuowen / Su, Hao et al. | 2023
- 2504
-
Learning Trajectory-Word Alignments for Video-Language TasksYang, Xu / Li, Zhangzikang / Xu, Haiyang / Zhang, Hanwang / Ye, Qinghao / Li, Chenliang / Yan, Ming / Zhang, Yu / Huang, Fei / Huang, Songfang et al. | 2023
- 2515
-
Variational Causal Inference Network for Explanatory Visual Question AnsweringXue, Dizhan / Qian, Shengsheng / Xu, Changsheng et al. | 2023
- 2526
-
TextManiA: Enriching Visual Feature by Text-driven Manifold AugmentationYe-Bin, Moon / Kim, Jisoo / Kim, Hongyeob / Son, Kilho / Oh, Tae-Hyun et al. | 2023
- 2538
-
Segment Every Reference Object in Spatial and Temporal SpacesWu, Jiannan / Jiang, Yi / Yan, Bin / Lu, Huchuan / Yuan, Zehuan / Luo, Ping et al. | 2023
- 2551
-
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language ModelsLi, Juncheng / Gao, Minghe / Wei, Longhui / Tang, Siliang / Zhang, Wenqiao / Li, Mengze / Ji, Wei / Tian, Qi / Chua, Tat-Seng / Zhuang, Yueting et al. | 2023
- 2563
-
Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image PretrainingKim, Bumsoo / Jo, Yeonsik / Kim, Jinhyung / Kim, Seunghwan et al. | 2023
- 2573
-
Toward Multi-Granularity Decision-Making: Explicit Visual Reasoning with Hierarchical KnowledgeZhang, Yifeng / Chen, Shi / Zhao, Qi et al. | 2023
- 2584
-
VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level MatchingBi, Junyu / Cheng, Daixuan / Yao, Ping / Pang, Bochen / Zhan, Yuefeng / Yang, Chuanguang / Wang, Yujing / Sun, Hao / Deng, Weiwei / Zhang, Qi et al. | 2023
- 2594
-
Moment Detection in Long Tutorial VideosCroitoru, Ioana / Bogolin, Simion-Vlad / Albanie, Samuel / Liu, Yang / Wang, Zhaowen / Yoon, Seunghyun / Dernoncourt, Franck / Jin, Hailin / Bui, Trung et al. | 2023
- 2605
-
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior RefinementZhu, Xiangyang / Zhang, Renrui / He, Bowei / Zhou, Aojun / Wang, Dong / Zhao, Bin / Gao, Peng et al. | 2023
- 2616
-
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional ImagesBitton-Guetta, Nitzan / Bitton, Yonatan / Hessel, Jack / Schmidt, Ludwig / Elovici, Yuval / Stanovsky, Gabriel / Schwartz, Roy et al. | 2023
- 2628
-
Advancing Referring Expression Segmentation Beyond Single ImageWu, Yixuan / Zhang, Zhao / Xie, Chi / Zhu, Feng / Zhao, Rui et al. | 2023
- 2639
-
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world LearningZhu, Xiangyang / Zhang, Renrui / He, Bowei / Guo, Ziyu / Zeng, Ziyao / Qin, Zipeng / Zhang, Shanghang / Gao, Peng et al. | 2023
- 2651
-
Unsupervised Prompt Tuning for Text-Driven Object DetectionHe, Weizhen / Chen, Weijie / Chen, Binbin / Yang, Shicai / Xie, Di / Lin, Luojun / Qi, Donglian / Zhuang, Yueting et al. | 2023
- 2662
-
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual GroundingWang, Zehan / Huang, Haifeng / Zhao, Yang / Li, Linjun / Cheng, Xize / Zhu, Yichen / Yin, Aoxiong / Zhao, Zhou et al. | 2023
- 2672
-
I can’t believe there’s no images! : Learning Visual Tasks Using Only Language SupervisionGu, Sophia / Clark, Christopher / Kembhavi, Aniruddha et al. | 2023
- 2684
-
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited SamplesLi, Guanghui / Gao, Mingqi / Liu, Heng / Zhen, Xiantong / Zheng, Feng et al. | 2023
- 2694
-
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsDing, Henghui / Liu, Chang / He, Shuting / Jiang, Xudong / Loy, Chen Change et al. | 2023
- 2704
-
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt TuningFeng, Chun-Mei / Yu, Kai / Liu, Yong / Khan, Salman / Zuo, Wangmeng et al. | 2023
- 2715
-
ShapeScaffolder: Structure-Aware 3D Shape Generation from TextTian, Xi / Yang, Yong-Liang / Wu, Qi et al. | 2023
- 2725
-
SuS-X: Training-Free Name-Only Transfer of Vision-Language ModelsUdandarao, Vishaal / Gupta, Ankush / Albanie, Samuel et al. | 2023
- 2737
-
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual GuidanceMa, Yiwei / Wang, Haowei / Zhang, Xiaoqing / Jiang, Guannan / Sun, Xiaoshuai / Zhuang, Weilin / Ji, Jiayi / Ji, Rongrong et al. | 2023
- 2749
-
OnlineRefer: A Simple Online Baseline for Referring Video Object SegmentationWu, Dongming / Wang, Tiancai / Zhang, Yuang / Zhang, Xiangyu / Shen, Jianbing et al. | 2023
- 2759
-
Attentive Mask CLIPYang, Yifan / Huang, Weiquan / Wei, Yixuan / Peng, Houwen / Jiang, Xinyang / Jiang, Huiqiang / Wei, Fangyun / Wang, Yin / Hu, Han / Qiu, Lili et al. | 2023
- 2770
-
Knowledge Proxy Intervention for Deconfounded Video Question AnsweringLi, Jiangtong / Niu, Li / Zhang, Liqing et al. | 2023
- 2782
-
UniVTG: Towards Unified Video-Language Temporal GroundingLin, Kevin Qinghong / Zhang, Pengchuan / Chen, Joya / Pramanick, Shraman / Gao, Difei / Wang, Alex Jinpeng / Yan, Rui / Shou, Mike Zheng et al. | 2023
- 2793
-
Self-supervised Cross-view Representation Reconstruction for Change CaptioningTu, Yunbin / Li, Liang / Su, Li / Zha, Zheng-Jun / Yan, Chenggang / Huang, Qingming et al. | 2023
- 2804
-
Unified Coarse-to-Fine Alignment for Video-Text RetrievalWang, Ziyang / Sung, Yi-Lin / Cheng, Feng / Bertasius, Gedas / Bansal, Mohit et al. | 2023
- 2816
-
Confidence-aware Pseudo-label Learning for Weakly Supervised Visual GroundingLiu, Yang / Zhang, Jiahua / Chen, Qingchao / Peng, Yuxin et al. | 2023
- 2827
-
TextPSG: Panoptic Scene Graph Generation from Textual DescriptionsZhao, Chengyang / Shen, Yikang / Chen, Zhenfang / Ding, Mingyu / Gan, Chuang et al. | 2023
- 2839
-
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language KnowledgeLin, Wei / Karlinsky, Leonid / Shvetsova, Nina / Possegger, Horst / Kozinski, Mateusz / Panda, Rameswar / Feris, Rogerio / Kuehne, Hilde / Bischof, Horst et al. | 2023
- 2851
-
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report GenerationLi, Yaowei / Yang, Bang / Cheng, Xuxin / Zhu, Zhihong / Li, Hongxiang / Zou, Yuexian et al. | 2023
- 2863
-
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine TranslationGupta, Devaansh / Kharbanda, Siddhant / Zhou, Jiawei / Li, Wanhua / Pfister, Hanspeter / Wei, Donglai et al. | 2023
- 2875
-
Learning Human-Human Interactions in Images from Weak Textual SupervisionAlper, Morris / Averbuch-Elor, Hadar et al. | 2023
- 2888
-
BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch SummarizationJiang, Chaoya / Xu, Haiyang / Ye, Wei / Ye, Qinghao / Li, Chenliang / Yan, Ming / Bi, Bin / Zhang, Shikun / Huang, Fei / Huang, Songfang et al. | 2023
- 2899
-
3D-VisTA: Pre-trained Transformer for 3D Vision and Text AlignmentZhu, Ziyu / Ma, Xiaojian / Chen, Yixin / Deng, Zhidong / Huang, Siyuan / Li, Qing et al. | 2023
- 2910
-
ALIP: Adaptive Language-Image Pre-training with Synthetic CaptionYang, Kaicheng / Deng, Jiankang / An, Xiang / Li, Jiawei / Feng, Ziyong / Guo, Jia / Yang, Jing / Liu, Tongliang et al. | 2023
- 2920
-
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language ModelsShi, Cheng / Yang, Sibei et al. | 2023
- 2930
-
Noise-aware Learning from Web-crawled Image-Text Data for Image CaptioningKang, Wooyoung / Mun, Jonghwan / Lee, Sungjun / Roh, Byungseok et al. | 2023
- 2941
-
Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question AnsweringQian, Zi / Wang, Xin / Duan, Xuguang / Qin, Pengda / Li, Yuhong / Zhu, Wenwu et al. | 2023
- 2951
-
PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3Hu, Yushi / Hua, Hang / Yang, Zhengyuan / Shi, Weijia / Smith, Noah A. / Luo, Jiebo et al. | 2023
- 2964
-
Grounded Image Text Matching with Mismatched Relation ReasoningWu, Yu / Wei, Yana / Wang, Haozhe / Liu, Yongfei / Yang, Sibei / He, Xuming et al. | 2023
- 2976
-
GePSAn: Generative Procedure Step Anticipation in Cooking VideosAbdelslam, Mohamed A. / Rangrej, Samrudhdhi B. / Hadji, Isma / Dvornik, Nikita / Derpanis, Konstantinos G. / Fazly, Afsaneh et al. | 2023
- 2986
-
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language ModelsSong, Chan Hee / Sadler, Brian M. / Wu, Jiaman / Chao, Wei-Lun / Washington, Clayton / Su, Yu et al. | 2023
- 2998
-
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity ControlHu, Zi-Yuan / Li, Yanyang / Lyu, Michael R. / Wang, Liwei et al. | 2023
- 3009
-
With a Little Help from your own Past: Prototypical Memory Networks for Image CaptioningBarraco, Manuele / Sarto, Sara / Cornia, Marcella / Baraldi, Lorenzo / Cucchiara, Rita et al. | 2023
- 3020
-
DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation ModelsCho, Jaemin / Zala, Abhay / Bansal, Mohit et al. | 2023
- 3032
-
Learning Navigational Visual Representations with Semantic Map SupervisionHong, Yicong / Zhou, Yang / Zhang, Ruiyi / Dernoncourt, Franck / Bui, Trung / Gould, Stephen / Tan, Hao et al. | 2023
- 3045
-
CoTDet: Affordance Knowledge Prompting for Task Driven Object DetectionTang, Jiajin / Zheng, Ge / Yu, Jingyi / Yang, Sibei et al. | 2023
- 3056
-
Open Set Video HOI detection from Action-centric Chain-of-Look PromptingXi, Nan / Meng, Jingjing / Yuan, Junsong et al. | 2023
- 3067
-
Learning Concise and Descriptive Attributes for Visual RecognitionYan, An / Wang, Yu / Zhong, Yiwu / Dong, Chengyu / He, Zexue / Lu, Yujie / Wang, William Yang / Shang, Jingbo / McAuley, Julian et al. | 2023
- 3078
-
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering ModelsKo, Dohwan / Lee, Ji Soo / Choi, Miso / Chu, Jaewon / Park, Jihwan / Kim, Hyunwoo J. et al. | 2023
- 3090
-
Encyclopedic VQA: Visual questions about detailed properties of fine-grained categoriesMensink, Thomas / Uijlings, Jasper / Castrejon, Lluis / Goel, Arushi / Cadar, Felipe / Zhou, Howard / Sha, Fei / Araujo, Andre / Ferrari, Vittorio et al. | 2023
- 3102
-
Story Visualization by Online Text Augmentation with Context MemoryAhn, Daechul / Kim, Daneul / Song, Gwangmo / Kim, Seung Hwan / Lee, Honglak / Kang, Dongyeop / Choi, Jonghyun et al. | 2023
- 3113
-
Transferable Decoding with Visual Entities for Zero-Shot Image CaptioningFei, Junjie / Wang, Teng / Zhang, Jinrui / He, Zhenyu / Wang, Chengjie / Zheng, Feng et al. | 2023
- 3124
-
Too Large; Data Reduction for Vision-Language Pre-TrainingWang, Alex Jinpeng / Lin, Kevin Qinghong / Zhang, David Junhao / Lei, Stan Weixian / Shou, Mike Zheng et al. | 2023
- 3135
-
ViLTA: Enhancing Vision-Language Pre-training through Textual AugmentationWang, Weihan / Yang, Zhen / Xu, Bin / Li, Juanzi / Sun, Yankui et al. | 2023
- 3147
-
Teaching CLIP to Count to TenPaiss, Roni / Ephrat, Ariel / Tov, Omer / Zada, Shiran / Mosseri, Inbar / Irani, Michal / Dekel, Tali et al. | 2023
- 3158
-
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set ProjectionZhou, Junsheng / Ma, Baorui / Li, Shujuan / Liu, Yu-Shen / Han, Zhizhong et al. | 2023
- 3170
-
Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-ExpertsCong, Wenyan / Liang, Hanxue / Wang, Peihao / Fan, Zhiwen / Chen, Tianlong / Varma, Mukund / Wang, Yi / Wang, Zhangyang et al. | 2023
- 3182
-
MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and BeyondLi, Yixuan / Jiang, Lihan / Xu, Linning / Xiangli, Yuanbo / Wang, Zhenzhi / Lin, Dahua / Dai, Bo et al. | 2023
- 3193
-
R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple CamerasSchmied, Aron / Fischer, Tobias / Danelljan, Martin / Pollefeys, Marc / Yu, Fisher et al. | 2023
- 3204
-
ClimateNeRF: Extreme Weather Synthesis in Neural Radiance FieldLi, Yuan / Lin, Zhi-Hao / Forsyth, David / Huang, Jia-Bin / Wang, Shenlong et al. | 2023
- 3216
-
Rendering Humans from Object-Occluded Monocular VideosXiang, Tiange / Sun, Adam / Wu, Jiajun / Adeli, Ehsan / Fei-Fei, Li et al. | 2023
- 3228
-
AssetField: Assets Mining and Reconfiguration in Ground Feature Plane RepresentationXiangli, Yuanbo / Xu, Linning / Pan, Xingang / Zhao, Nanxuan / Dai, Bo / Lin, Dahua et al. | 2023
- 3239
-
PETRv2: A Unified Framework for 3D Perception from Multi-Camera ImagesLiu, Yingfei / Yan, Junjie / Jia, Fan / Li, Shuailin / Gao, Aqi / Wang, Tiancai / Zhang, Xiangyu et al. | 2023
- 3250
-
MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance FieldsKaneko, Takuhiro et al. | 2023
- 3261
-
Adaptive Positional Encoding for Bundle-Adjusting Neural Radiance FieldsGao, Zelin / Dai, Weichen / Zhang, Yu et al. | 2023
- 3272
-
NeuS2: Fast Learning of Neural Implicit Surfaces for Multi-view ReconstructionWang, Yiming / Han, Qin / Habermann, Marc / Daniilidis, Kostas / Theobalt, Christian / Liu, Lingjie et al. | 2023
- 3284
-
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video RecognitionWang, Qitong / Zhao, Long / Yuan, Liangzhe / Liu, Ting / Peng, Xi et al. | 2023
- 3295
-
Uncertainty Guided Adaptive Warping for Robust and Efficient Stereo MatchingJing, Junpeng / Li, Jiankun / Xiong, Pengfei / Liu, Jiangyu / Liu, Shuaicheng / Guo, Yichen / Deng, Xin / Xu, Mai / Jiang, Lai / Sigal, Leonid et al. | 2023
- 3305
-
Compatibility of Fundamental Matrices for Complete Viewing GraphsBratelund, Martin / Rydell, Felix et al. | 2023
- 3314
-
ProtoTransfer: Cross-Modal Prototype Transfer for Point Cloud SegmentationTang, Pin / Xu, Hai-Ming / Ma, Chao et al. | 2023
- 3325
-
SA-BEV: Generating Semantic-Aware Bird’s-Eye-View Feature for Multi-view 3D Object DetectionZhang, Jinqing / Zhang, Yanan / Liu, Qingjie / Wang, Yunhong et al. | 2023
- 3335
-
GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object DetectionSong, Ziying / Wei, Haiyue / Bai, Lin / Yang, Lei / Jia, Caiyan et al. | 2023
- 3347
-
Tangent Sampson Error: Fast Approximate Two-view Reprojection Error for Central Camera ModelsTerekhov, Mikhail / Larsson, Viktor et al. | 2023
- 3356
-
Using a Waffle Iron for Automotive Point Cloud Semantic SegmentationPuy, Gilles / Boulch, Alexandre / Marlet, Renaud et al. | 2023
- 3367
-
Fast Globally Optimal Surface Normal from an Affine CorrespondenceHajder, Levente / Loczi, Lajos / Barath, Daniel et al. | 2023
- 3379
-
Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face SynthesisBuhler, Marcel C. / Sarkar, Kripasindhu / Shah, Tanmay / Li, Gengyan / Wang, Daoye / Helminger, Leonhard / Orts-Escolano, Sergio / Lagun, Dmitry / Hilliges, Otmar / Beeler, Thabo et al. | 2023
- 3391
-
Canonical Factors for Hybrid Neural FieldsYi, Brent / Zeng, Weijia / Buchanan, Sam / Ma, Yi et al. | 2023
- 3404
-
Center-Based Decoupled Point Cloud Registration for 6D Object Pose EstimationJiang, Haobo / Dang, Zheng / Gu, Shuo / Xie, Jin / Salzmann, Mathieu / Yang, Jian et al. | 2023
- 3415
-
Deep geometry-aware camera self-calibration from videoHagemann, Annika / Knorr, Moritz / Stiller, Christoph et al. | 2023
- 3426
-
V-FUSE: Volumetric Depth Map Fusion with Long-Range ConstraintsBurgdorfer, Nathaniel / Mordohai, Philippos et al. | 2023
- 3436
-
Consistent Depth Prediction for Transparent Object Reconstruction from RGB-D CameraCai, Yuxiang / Zhu, Yifan / Zhang, Haiwei / Ren, Bo et al. | 2023
- 3446
-
FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance FieldsHwang, Sungwon / Hyung, Junha / Kim, Daejin / Kim, Min-Jung / Choo, Jaegul et al. | 2023
- 3457
-
HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision MitigationXie, Xiufeng / Gherardi, Riccardo / Pan, Zhihong / Huang, Stephen et al. | 2023
- 3468
-
ICE-NeRF: Interactive Color Editing of NeRFs via Decomposition-Aware Weight OptimizationLee, Jae-Hyeok / Kim, Dae-Shik et al. | 2023
- 3479
-
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient CalibrationHuang, Zhijian / Lin, Sihao / Liu, Guiyu / Luo, Mukun / Ye, Chaoqiang / Xu, Hang / Chang, Xiaojun / Liang, Xiaodan et al. | 2023
- 3489
-
Neural Fields for Structured LightingShandilya, Aarrushi / Attal, Benjamin / Richardt, Christian / Tompkin, James / O'Toole, Matthew et al. | 2023
- 3500
-
CO-Net: Learning Multiple Point Cloud Tasks at Once with A Cohesive NetworkXie, Tao / Wang, Ke / Lu, Siyi / Zhang, Yukun / Dai, Kun / Li, Xiaoyu / Xu, Jie / Wang, Li / Zhao, Lijun / Zhang, Xinyu et al. | 2023
- 3511
-
Pose-Free Neural Radiance Fields via Implicit Pose RegularizationZhang, Jiahui / Zhan, Fangneng / Yu, Yingchen / Liu, Kunhao / Wu, Rongliang / Zhang, Xiaoqin / Shao, Ling / Lu, Shijian et al. | 2023
- 3521
-
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human RenderingPan, Xiao / Yang, Zongxin / Ma, Jianxin / Zhou, Chang / Yang, Yi et al. | 2023
- 3533
-
S-VolSDF: Sparse Multi-View Stereo Regularization of Neural Implicit SurfacesWu, Haoyu / Graikos, Alexandros / Samaras, Dimitris et al. | 2023
- 3546
-
DPS-Net: Deep Polarimetric Stereo Depth EstimationTian, Chaoran / Pan, Weihong / Wang, Zimo / Mao, Mao / Zhang, Guofeng / Bao, Hujun / Tan, Ping / Cui, Zhaopeng et al. | 2023
- 3557
-
3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object DetectionShu, Changyong / Deng, Jiajun / Yu, Fisher / Liu, Yifan et al. | 2023
- 3567
-
Deformable Neural Radiance Fields using RGB and Event CamerasMa, Qi / Paudel, Danda Pani / Chhatkuli, Ajad / Van Gool, Luc et al. | 2023
- 3578
-
NeILF++: Inter-Reflectable Light Fields for Geometry and Material EstimationZhang, Jingyang / Yao, Yao / Li, Shiwei / Liu, Jingbo / Fang, Tian / McKinnon, David / Tsin, Yanghai / Quan, Long et al. | 2023
- 3588
-
Hierarchical Prior Mining for Non-local Multi-View StereoRen, Chunlin / Xu, Qingshan / Zhang, Shikun / Yang, Jiaqi et al. | 2023
- 3598
-
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionWang, Shihao / Liu, Yingfei / Wang, Tiancai / Li, Ying / Zhang, Xiangyu et al. | 2023
- 3609
-
Re-ReND: Real-time Rendering of NeRFs across DevicesRojas, Sara / Zarzar, Jesus / Perez, Juan C. / Sanakoyeu, Artsiom / Thabet, Ali / Pumarola, Albert / Ghanem, Bernard et al. | 2023
- 3619
-
Learning Shape Primitives via Implicit Convexity RegularizationHuang, Xiaoyang / Zhang, Yi / Chen, Kai / Li, Teng / Zhang, Wenjun / Ni, Bingbing et al. | 2023
- 3629
-
Geometry-guided Feature Learning and Fusion for Indoor Scene ReconstructionYin, Ruihong / Karaoglu, Sezer / Gevers, Theo et al. | 2023
- 3639
-
LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware AlignmentZhang, Zhiwei / Zhang, Zhizhong / Yu, Qian / Yi, Ran / Xie, Yuan / Ma, Lizhuang et al. | 2023
- 3649
-
PivotNet: Vectorized Pivot Learning for End-to-end HD Map ConstructionDing, Wenjie / Qiao, Limeng / Qiu, Xi / Zhang, Chi et al. | 2023
- 3660
-
Sat2Density: Faithful Density Learning from Satellite-Ground Image PairsQian, Ming / Xiong, Jincheng / Xia, Gui-Song / Xue, Nan et al. | 2023
- 3670
-
Mask-Attention-Free Transformer for 3D Instance SegmentationLai, Xin / Yuan, Yuhui / Chu, Ruihang / Chen, Yukang / Hu, Han / Jia, Jiaya et al. | 2023
- 3681
-
Scene-Aware Feature MatchingLu, Xiaoyong / Yan, Yaping / Wei, Tong / Du, Songlin et al. | 2023
- 3691
-
Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-LabelingChen, Zhuoxiao / Luo, Yadan / Wang, Zheng / Baktashmotlagh, Mahsa / Huang, Zi et al. | 2023
- 3704
-
GO-SLAM: Global Optimization for Consistent 3D Instant ReconstructionZhang, Youmin / Tosi, Fabio / Mattoccia, Stefano / Poggi, Matteo et al. | 2023
- 3715
-
BANSAC: A dynamic BAyesian Network for adaptive SAmple ConsensusPiedade, Valter / Miraldo, Pedro et al. | 2023
- 3725
-
Theoretical and Numerical Analysis of 3D Reconstruction Using Point and Line IncidencesRydell, Felix / Shehu, Elima / Torres, Angelica et al. | 2023
- 3735
-
RealGraph: A Multiview Dataset for 4D Real-world Context Graph GenerationLin, Haozhe / Chen, Zequn / Zhang, Jinzhi / Bai, Bing / Wang, Yu / Huang, Ruqi / Fang, Lu et al. | 2023
- 3746
-
CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive LearningXiong, Kaiqiang / Peng, Rui / Zhang, Zhe / Feng, Tianxing / Jiao, Jianbo / Gao, Feng / Wang, Ronggang et al. | 2023
- 3758
-
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object PredictionZong, Zhuofan / Jiang, Dongzhi / Song, Guanglu / Xue, Zeyue / Su, Jingyong / Li, Hongsheng / Liu, Yu et al. | 2023
- 3768
-
Object as Query: Lifting any 2D Object Detector to 3D DetectionWang, Zitian / Huang, Zehao / Fu, Jiahui / Wang, Naiyan / Liu, Si et al. | 2023
- 3778
-
PARTNER: Level up the Polar Representation for LiDAR 3D Object DetectionNie, Ming / Xue, Yujing / Wang, Chunwei / Ye, Chaoqiang / Xu, Hang / Zhu, Xinge / Huang, Qingqiu / Mi, Michael Bi / Wang, Xinchao / Zhang, Li et al. | 2023
- 3791
-
Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object DetectionWang, Chuxin / Yang, Wenfei / Zhang, Tianzhu et al. | 2023
- 3802
-
QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object DetectionZhang, Yifan / Dong, Zhen / Yang, Huanrui / Lu, Ming / Tseng, Cheng-Ching / Du, Yuan / Keutzer, Kurt / Du, Li / Zhang, Shanghang et al. | 2023
- 3813
-
Adding Conditional Control to Text-to-Image Diffusion ModelsZhang, Lvmin / Rao, Anyi / Agrawala, Maneesh et al. | 2023
- 3825
-
Factorized Inverse Path Tracing for Efficient and Accurate Material-Lighting EstimationWu, Liwen / Zhu, Rui / Yaldiz, Mustafa B. / Zhu, Yinhao / Cai, Hong / Matai, Janarbek / Porikli, Fatih / Li, Tzu-Mao / Chandraker, Manmohan / Ramamoorthi, Ravi et al. | 2023
- 3836
-
Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained RepresentationsWang, Jianren / Dasari, Sudeep / Srirama, Mohan Kumar / Tulsiani, Shubham / Gupta, Abhinav et al. | 2023
- 3846
-
3D Implicit Transporter for Temporally Consistent Keypoint DiscoveryZhong, Chengliang / Zheng, Yuhang / Zheng, Yupeng / Zhao, Hao / Yi, Li / Mu, Xiaodong / Wang, Ling / Li, Pengfei / Zhou, Guyue / Yang, Chao et al. | 2023
- 3858
-
Chordal Averaging on Flag Manifolds and Its ApplicationsMankovich, Nathan / Birdal, Tolga et al. | 2023
- 3868
-
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist LearningWan, Weikang / Geng, Haoran / Liu, Yun / Shan, Zikang / Yang, Yaodong / Yi, Li / Wang, He et al. | 2023
- 3880
-
GameFormer: Game-theoretic Modeling and Learning of Transformer-based Interactive Prediction and Planning for Autonomous DrivingHuang, Zhiyu / Liu, Haochen / Lv, Chen et al. | 2023
- 3891
-
PPR: Physically Plausible Reconstruction from Monocular VideosYang, Gengshan / Yang, Shuo / Zhang, John Z. / Manchester, Zachary / Ramanan, Deva et al. | 2023
- 3902
-
Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh ReconstructionWang, Wenjia / Ge, Yongtao / Mei, Haiyi / Cai, Zhongang / Sun, Qingping / Wang, Yanjun / Shen, Chunhua / Yang, Lei / Komura, Taku et al. | 2023
- 3913
-
ACLS: Adaptive and Conditional Label Smoothing for Network CalibrationPark, Hyekang / Noh, Jongyoun / Oh, Youngmin / Baek, Donghyeon / Ham, Bumsub et al. | 2023
- 3923
-
PGFed: Personalize Each Client’s Global Objective for Federated LearningLuo, Jun / Mendieta, Matias / Chen, Chen / Wu, Shandong et al. | 2023
- 3934
-
Overwriting Pretrained Bias with Finetuning DataWang, Angelina / Russakovsky, Olga et al. | 2023
- 3946
-
ITI-Gen: Inclusive Text-to-Image GenerationZhang, Cheng / Chen, Xuanbai / Chai, Siqi / Wu, Chen Henry / Lagun, Dmitry / Beeler, Thabo / De La Torre, Fernando et al. | 2023
- 3958
-
FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI MethodsHesse, Robin / Schaub-Meyer, Simone / Roth, Stefan et al. | 2023
- 3969
-
X-VoE: Measuring eXplanatory Violation of Expectation in Physical EventsDai, Bo / Wang, Linge / Jia, Baoxiong / Zhang, Zeyu / Zhu, Song-Chun / Zhang, Chi / Zhu, Yixin et al. | 2023
- 3980
-
Adaptive Testing of Computer Vision ModelsGao, Irena / Ilharco, Gabriel / Lundberg, Scott / Ribeiro, Marco Tulio et al. | 2023
- 3992
-
Segment AnythingKirillov, Alexander / Mintun, Eric / Ravi, Nikhila / Mao, Hanzi / Rolland, Chloe / Gustafson, Laura / Xiao, Tete / Whitehead, Spencer / Berg, Alexander C. / Lo, Wan-Yen et al. | 2023
- 4004
-
Shape Analysis of Euclidean Curves under Frenet-Serret FrameworkChassat, Perrine / Park, Juhyun / Brunel, Nicolas et al. | 2023
- 4014
-
Unmasking Anomalies in Road-Scene SegmentationRai, Shyam Nandan / Cermelli, Fabio / Fontanel, Dario / Masone, Carlo / Caputo, Barbara et al. | 2023
- 4024
-
High Quality Entity SegmentationQi, Lu / Kuen, Jason / Shen, Tiancheng / Gu, Jiuxiang / Li, Wenbo / Guo, Weidong / Jia, Jiaya / Lin, Zhe / Yang, Ming-Hsuan et al. | 2023
- 4034
-
Towards Open-Vocabulary Video Instance SegmentationWang, Haochen / Jiang, Xiaolong / Tang, Xu / Hu, Yao / Yan, Cilin / Xie, Weidi / Wang, Shuai / Gavves, Efstratios et al. | 2023
- 4044
-
Beyond One-to-One: Rethinking the Referring Image SegmentationHu, Yutao / Wang, Qixiong / Shao, Wenqi / Xie, Enze / Li, Zhenguo / Han, Jungong / Luo, Ping et al. | 2023
- 4055
-
Multiple Instance Learning Framework with Masked Hard Instance Mining for Whole Slide Image ClassificationTang, Wenhao / Huang, Sheng / Zhang, Xiaoxian / Zhou, Fengtao / Zhang, Yi / Liu, Bo et al. | 2023
- 4065
-
Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation LearningReed, Colorado J / Gupta, Ritwik / Li, Shufan / Brockman, Sarah / Funk, Christopher / Clipp, Brian / Keutzer, Kurt / Candido, Salvatore / Uyttendaele, Matt / Darrell, Trevor et al. | 2023
- 4077
-
Progressive Spatio-Temporal Prototype Matching for Text-Video RetrievalLi, Pandeng / Xie, Chen-Wei / Zhao, Liming / Xie, Hongtao / Ge, Jiannan / Zheng, Yun / Zhao, Deli / Zhang, Yongdong et al. | 2023
- 4088
-
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance LearningHe, Junwen / Wang, Yifan / Wang, Lijun / Lu, Huchuan / Luo, Bin / He, Jun-Yan / Lan, Jin-Peng / Geng, Yifeng / Xie, Xuansong et al. | 2023
- 4099
-
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and ReasoningLi, Liulei / Wang, Wenguan / Yi, Yang et al. | 2023
- 4111
-
ASIC: Aligning Sparse in-the-wild Image CollectionsGupta, Kamal / Jampani, Varun / Esteves, Carlos / Shrivastava, Abhinav / Makadia, Ameesh / Snavely, Noah / Kar, Abhishek et al. | 2023
- 4123
-
CLIPascene: Scene Sketching with Different Types and Levels of AbstractionVinker, Yael / Alaluf, Yuval / Cohen-Or, Daniel / Shamir, Ariel et al. | 2023
- 4134
-
LD-ZNet: A Latent Diffusion Approach for Text-Based Image SegmentationPNVR, Koutilya / Singh, Bharat / Ghosh, Pallabi / Siddiquie, Behjat / Jacobs, David et al. | 2023
- 4146
-
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion ModelsCao, Tianshi / Kreis, Karsten / Fidler, Sanja / Sharp, Nicholas / Yin, Kangxue et al. | 2023
- 4159
-
NeuRBF: A Neural Fields Representation with Adaptive Radial Basis FunctionsChen, Zhang / Li, Zhong / Song, Liangchen / Chen, Lele / Yu, Jingyi / Yuan, Junsong / Xu, Yi et al. | 2023
- 4172
-
Scalable Diffusion Models with TransformersPeebles, William / Xie, Saining et al. | 2023
- 4183
-
Texture Generation on 3D Meshes with Point-UV DiffusionYu, Xin / Dai, Peng / Li, Wenbo / Ma, Lan / Liu, Zhengzhe / Qi, Xiaojuan et al. | 2023
- 4194
-
Generative Novel View Synthesis with 3D-Aware Diffusion ModelsChan, Eric R. / Nagano, Koki / Chan, Matthew A. / Bergman, Alexander W. / Park, Jeong Joon / Levy, Axel / Aittala, Miika / De Mello, Shalini / Karras, Tero / Wetzstein, Gordon et al. | 2023
- 4207
-
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-TuningXie, Enze / Yao, Lewei / Shi, Han / Liu, Zhili / Zhou, Daquan / Liu, Zhaoqiang / Li, Jiawei / Li, Zhenguo et al. | 2023
- 4217
-
VQ3D: Learning a 3D-Aware Generative Model on ImageNetSargent, Kyle / Koh, Jing Yu / Zhang, Han / Chang, Huiwen / Herrmann, Charles / Srinivasan, Pratul / Wu, Jiajun / Sun, Deqing et al. | 2023
- 4228
-
Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with ReflectionGe, Wenhang / Hu, Tao / Zhao, Haoyu / Liu, Shu / Chen, Ying-Cong et al. | 2023
- 4238
-
A Complete Recipe for Diffusion Generative ModelsPandey, Kushagra / Mandt, Stephan et al. | 2023
- 4250
-
MMVP: Motion-Matrix-based Video PredictionZhong, Yiqi / Liang, Luming / Zharkov, Ilya / Neumann, Ulrich et al. | 2023
- 4261
-
SAGA: Spectral Adversarial Geometric Attack on 3D MeshesStolik, Tomer / Lang, Itai / Avidan, Shai et al. | 2023
- 4272
-
Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial ExamplesJi, Qiufan / Wang, Lin / Shi, Cong / Hu, Shengshan / Chen, Yingying / Sun, Lichao et al. | 2023
- 4282
-
ACTIVE: Towards Highly Transferable 3D Physical Camouflage for Universal and Robust Vehicle EvasionSuryanto, Naufal / Kim, Yongsu / Larasati, Harashta Tatimma / Kang, Hyoeun / Le, Thi-Thu-Huong / Hong, Yoonyoung / Yang, Hunmin / Oh, Se-Yoon / Kim, Howon et al. | 2023
- 4292
-
Frequency-aware GAN for Adversarial Manipulation GenerationZhu, Peifei / Osada, Genki / Kataoka, Hirokatsu / Takahashi, Tsubasa et al. | 2023
- 4302
-
Breaking Temporal Consistency: Generating Video Universal Adversarial Perturbations Using Image ModelsKim, Hee-Seon / Son, Minji / Kim, Minbeom / Kwon, Myung-Joon / Kim, Changick et al. | 2023
- 4312
-
Tracing the Origin of Adversarial Attack for Forensic Investigation and DeterrenceFang, Han / Zhang, Jiyi / Qiu, Yupeng / Liu, Jiayang / Xu, Ke / Fang, Chengfang / Chang, Ee-Chien et al. | 2023
- 4322
-
Downstream-agnostic Adversarial ExamplesZhou, Ziqi / Hu, Shengshan / Zhao, Ruizhi / Wang, Qian / Zhang, Leo Yu / Hou, Junhui / Jin, Hai et al. | 2023
- 4333
-
Hiding Visual Information via Obfuscating Adversarial PerturbationsSu, Zhigang / Zhou, Dawei / Wang, Nannan / Liu, Decheng / Wang, Zhen / Gao, Xinbo et al. | 2023
- 4344
-
An Embarrassingly Simple Backdoor Attack on Self-supervised LearningLi, Changjiang / Pang, Ren / Xi, Zhaohan / Du, Tianyu / Ji, Shouling / Yao, Yuan / Wang, Ting et al. | 2023
- 4356
-
Efficient Decision-based Black-box Patch Attacks on Video RecognitionJiang, Kaixun / Chen, Zhaoyu / Huang, Hao / Wang, Jiafeng / Yang, Dingkang / Li, Bo / Wang, Yan / Zhang, Wenqiang et al. | 2023
- 4367
-
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness TradeoffSuzuki, Satoshi / Yamaguchi, Shin'ya / Takeda, Shoichiro / Kanai, Sekitoshi / Makishima, Naoki / Ando, Atsushi / Masumura, Ryo et al. | 2023
- 4379
-
Towards Building More Robust Models with Frequency BiasBu, Qingwen / Huang, Dong / Cui, Heming et al. | 2023
- 4389
-
Does Physical Adversarial Example Really Matter to Autonomous Driving? Towards System-Level Effect of Adversarial Object Evasion AttackWang, Ningfei / Luo, Yunpeng / Sato, Takami / Xu, Kaidi / Chen, Qi Alfred et al. | 2023
- 4401
-
Improving Generalization of Adversarial Training via Robust Critical Fine-TuningZhu, Kaijie / Hu, Xixu / Wang, Jindong / Xie, Xing / Yang, Ge et al. | 2023
- 4412
-
Enhancing Generalization of Universal Adversarial Perturbation through Gradient AggregationLiu, Xuannan / Zhong, Yaoyao / Zhang, Yuhang / Qin, Lixiong / Deng, Weihong et al. | 2023
- 4422
-
Unified Adversarial Patch for Cross-modal Attacks in the Physical WorldWei, Xingxing / Huang, Yao / Sun, Yitong / Yu, Jie et al. | 2023
- 4432
-
RFLA: A Stealthy Reflected Light Adversarial Attack in the Physical WorldWang, Donghua / Yao, Wen / Jiang, Tingsong / Li, Chao / Chen, Xiaoqian et al. | 2023
- 4443
-
Enhancing Fine-Tuning based Backdoor Defense with Sharpness-Aware MinimizationZhu, Mingli / Wei, Shaokui / Shen, Li / Fan, Yanbo / Wu, Baoyuan et al. | 2023
- 4455
-
Conditional 360-degree Image Synthesis for Immersive Indoor Scene DecorationShum, Ka Chun / Pang, Hong-Wing / Hua, Binh-Son / Nguyen, Duc Thanh / Yeung, Sai-Kit et al. | 2023
- 4466
-
An Adaptive Model Ensemble Adversarial Attack for Boosting Adversarial TransferabilityChen, Bin / Yin, Jiali / Chen, Shukai / Chen, Bohao / Liu, Ximeng et al. | 2023
- 4476
-
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine LearningLee, Byung-Kwan / Kim, Junho / Ro, Yong Man et al. | 2023
- 4487
-
LEA2: A Lightweight Ensemble Adversarial Attack via Non-overlapping Vulnerable Frequency RegionsQian, Yaguan / He, Shuke / Zhao, Chenyu / Sha, Jiaqiang / Wang, Wei / Wang, Bin et al. | 2023
- 4499
-
Explaining Adversarial Robustness of Neural Networks from Clustering Effect PerspectiveJin, Yulin / Zhang, Xiaoyu / Lou, Jian / Ma, Xu / Wang, Zilong / Chen, Xiaofeng et al. | 2023
- 4509
-
VertexSerum: Poisoning Graph Neural Networks for Link InferenceDing, Ruyi / Duan, Shijin / Xu, Xiaolin / Fei, Yunsi et al. | 2023
- 4519
-
How to choose your best allies for a transferable attack?Maho, Thibault / Moosavi-Dezfooli, Seyed-Mohsen / Furon, Teddy et al. | 2023
- 4529
-
Enhancing Adversarial Robustness in Low-Label Regime via Adaptively Weighted Regularization and Knowledge DistillationYang, Dongyoon / Kong, Insung / Kim, Yongdai et al. | 2023
- 4539
-
AdvDiffuser: Natural Adversarial Example Synthesis with Diffusion ModelsChen, Xinquan / Gao, Xitong / Zhao, Juanjuan / Ye, Kejiang / Xu, Cheng-Zhong et al. | 2023
- 4550
-
F&F Attack: Adversarial Attack against Multiple Object Trackers by Inducing False Negatives and False PositivesZhou, Tao / Ye, Qi / Luo, Wenhan / Zhang, Kaihao / Shi, Zhiguo / Chen, Jiming et al. | 2023
- 4561
-
Rickrolling the Artist: Injecting Backdoors into Text Encoders for Text-to-Image SynthesisStruppek, Lukas / Hintersdorf, Dominik / Kersting, Kristian et al. | 2023
- 4574
-
Hard No-Box Adversarial Attack on Skeleton-Based Human Action Recognition with Skeleton-Motion-Informed GradientLu, Zhengzhi / Wang, He / Chang, Ziyi / Yang, Guoan / Shum, Hubert P. H. et al. | 2023
- 4584
-
Structure Invariant Transformation for better Adversarial TransferabilityWang, Xiaosen / Zhang, Zeliang / Zhang, Jianping et al. | 2023
- 4597
-
Beating Backdoor Attack at Its Own GameLiu, Min / Sangiovanni-Vincentelli, Alberto / Yue, Xiangyu et al. | 2023
- 4607
-
Transferable Adversarial Attack for Both Vision Transformers and Convolutional Networks via Momentum Integrated GradientsMa, Wenshuo / Li, Yidong / Jia, Xiaofeng / Xu, Wei et al. | 2023
- 4617
-
REAP: A Large-Scale Realistic Adversarial Patch BenchmarkHingun, Nabeel / Sitawarin, Chawin / Li, Jerry / Wagner, David et al. | 2023
- 4629
-
Multi-metrics adaptively identifies backdoors in Federated learningHuang, Siquan / Li, Yijiang / Chen, Chong / Shi, Leyu / Gao, Ying et al. | 2023
- 4640
-
Backpropagation Path Search On Adversarial TransferabilityXu, Zhuoer / Gu, Zhangxuan / Zhang, Jianping / Cui, Shiwen / Meng, Changhua / Wang, Weiqiang et al. | 2023
- 4651
-
Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time FeedbackYeo, Teresa / Kar, Oguzhan Fatih / Sodagar, Zahra / Zamir, Amir et al. | 2023
- 4665
-
One-bit Flip is All You Need: When Bit-flip Attack Meets Model TrainingDong, Jianshuo / Qiu, Han / Li, Yiming / Zhang, Tianwei / Li, Yuanjie / Lai, Zeqi / Zhang, Chao / Xia, Shu-Tao et al. | 2023
- 4676
-
PolicyCleanse: Backdoor Detection and Mitigation for Competitive Reinforcement LearningGuo, Junfeng / Li, Ang / Wang, Lixu / Liu, Cong et al. | 2023
- 4686
-
Towards Viewpoint-Invariant Visual Recognition via Adversarial TrainingRuan, Shouwei / Dong, Yinpeng / Su, Hang / Peng, Jianteng / Chen, Ning / Wei, Xingxing et al. | 2023
- 4697
-
Fast Adversarial Training with Smooth ConvergenceZhao, Mengnan / Zhang, Lihe / Kong, Yuqiu / Yin, Baocai et al. | 2023
- 4707
-
The Perils of Learning From Unlabeled Data: Backdoor Attacks on Semi-supervised LearningShejwalkar, Virat / Lyu, Lingjuan / Houmansadr, Amir et al. | 2023
- 4718
-
Boosting Adversarial Transferability via Gradient Relevance AttackZhu, Hegui / Ren, Yuchen / Sui, Xiaoyan / Yang, Lianping / Jiang, Wuming et al. | 2023
- 4728
-
Towards Robust Model Watermark via Reducing Parametric VulnerabilityGan, Guanhao / Li, Yiming / Wu, Dongxian / Xia, Shu-Tao et al. | 2023
- 4739
-
TRM-UAP: Enhancing the Transferability of Data-Free Universal Adversarial Perturbation via Truncated Ratio MaximizationLiu, Yiran / Feng, Xin / Wang, Yunlong / Yang, Wu / Ming, Di et al. | 2023
- 4749
-
Enhancing Privacy Preservation in Federated Learning via Learning Rate PerturbationWan, Guangnian / Du, Haitao / Yuan, Xuejing / Yang, Jun / Chen, Meiling / Xu, Jie et al. | 2023
- 4759
-
TARGET: Federated Class-Continual Learning via Exemplar-Free DistillationZhang, Jie / Chen, Chen / Zhuang, Weiming / Lyu, Lingjuan et al. | 2023
- 4771
-
FACTS: First Amplify Correlations and Then Slice to Discover BiasYenamandra, Sriram / Ramesh, Pratik / Prabhu, Viraj / Hoffman, Judy et al. | 2023
- 4782
-
Computation and Data Efficient Backdoor AttacksWu, Yutong / Han, Xingshuo / Qiu, Han / Zhang, Tianwei et al. | 2023
- 4792
-
Global Balanced Experts for Federated Long-Tailed LearningZeng, Yaopei / Liu, Lei / Liu, Li / Shen, Li / Liu, Shaoguo / Wu, Baoyuan et al. | 2023
- 4803
-
Source-free Domain Adaptive Human Pose EstimationPeng, Qucheng / Zheng, Ce / Chen, Chen et al. | 2023
- 4814
-
Gender Artifacts in Visual DatasetsMeister, Nicole / Zhao, Dora / Wang, Angelina / Ramaswamy, Vikram V. / Fong, Ruth / Russakovsky, Olga et al. | 2023
- 4826
-
FRAug: Tackling Federated Learning with Non-IID Features via Representation AugmentationChen, Haokun / Frikha, Ahmed / Krompass, Denis / Gu, Jindong / Tresp, Volker et al. | 2023
- 4837
-
zPROBE: Zero Peek Robustness Checks for Federated LearningGhodsi, Zahra / Javaheripi, Mojan / Sheybani, Nojan / Zhang, Xinqiao / Huang, Ke / Koushanfar, Farinaz et al. | 2023
- 4848
-
Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot StudyKo, Myeongseob / Jin, Ming / Wang, Chenguang / Jia, Ruoxi et al. | 2023
- 4859
-
FedPD: Federated Open Set Recognition with Parameter DisentanglementYang, Chen / Zhu, Meilu / Liu, Yifan / Yuan, Yixuan et al. | 2023
- 4869
-
MUter: Machine Unlearning on Adversarially Trained ModelsLiu, Junxu / Xue, Mingsheng / Lou, Jian / Zhang, Xiaoyu / Xiong, Li / Qin, Zhan et al. | 2023
- 4880
-
Beyond Skin Tone: A Multidimensional Measure of Apparent Skin ColorThong, William / Joniak, Przemyslaw / Xiang, Alice et al. | 2023
- 4891
-
A Multidimensional Analysis of Social Biases in Vision TransformersBrinkmann, Jannik / Swoboda, Paul / Bartelt, Christian et al. | 2023
- 4901
-
Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific ExpertsLi, Jiaxuan / Vo, Duc Minh / Nakayama, Hideki et al. | 2023
- 4912
-
Rethinking Data Distillation: Do Not Overlook CalibrationZhu, Dongyao / Fang, Yanbo / Lei, Bowen / Xie, Yiqun / Xu, Dongkuan / Zhang, Jie / Zhang, Ruqi et al. | 2023
- 4923
-
Mining bias-target Alignment from Voronoi CellsNahon, Remi / Nguyen, Van-Tam / Tartaglione, Enzo et al. | 2023
- 4933
-
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image ClassificationChiu, Ming-Chang / Chen, Pin-Yu / Ma, Xuezhe et al. | 2023
- 4944
-
GIFD: A Generative Gradient Inversion Method with Feature Domain OptimizationFang, Hao / Chen, Bin / Wang, Xuan / Wang, Zhi / Xia, Shu-Tao et al. | 2023
- 4954
-
Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human EvaluationLiang, Hao / Perona, Pietro / Balakrishnan, Guha et al. | 2023
- 4965
-
FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated LearningSun, Guangyu / Mendieta, Matias / Luo, Jun / Wu, Shandong / Chen, Chen et al. | 2023
- 4976
-
Towards Attack-tolerant Federated Learning via Critical Parameter AnalysisHan, Sungwon / Park, Sungwon / Wu, Fangzhao / Kim, Sundong / Zhu, Bin / Xie, Xing / Cha, Meeyoung et al. | 2023
- 4986
-
What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial NetworksHuang, Ziheng / Li, Boheng / Cai, Yan / Wang, Run / Guo, Shangwei / Fang, Liming / Chen, Jing / Wang, Lina et al. | 2023
- 4997
-
Robust Heterogeneous Federated Learning under Data CorruptionFang, Xiuwen / Ye, Mang / Yang, Xiyuan et al. | 2023
- 5008
-
Communication-efficient Federated Learning with Single-Step Synthetic Features Compressor for Faster ConvergenceZhou, Yuhao / Shi, Mingjia / Li, Yuanxi / Sun, Yanan / Ye, Qing / Lv, Jiancheng et al. | 2023
- 5018
-
GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated LearningZhang, Jianqing / Hua, Yang / Wang, Hao / Song, Tao / Xue, Zhengui / Ma, Ruhui / Cao, Jian / Guan, Haibing et al. | 2023
- 5029
-
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous AttentionZeng, Wenxuan / Li, Meng / Xiong, Wenjie / Tong, Tong / Lu, Wen-Jie / Tan, Jin / Wang, Runsheng / Huang, Ru et al. | 2023
- 5041
-
Identification of Systematic Errors of Image Classifiers on Rare SubgroupsMetzen, Jan Hendrik / Hutmacher, Robin / Hua, N. Grace / Boreiko, Valentyn / Zhang, Dan et al. | 2023
- 5051
-
Adaptive Image Anonymization in the Context of Image Classification with Neural NetworksShvai, Nadiya / Carmona, Arcadi Llanza / Nakib, Amir et al. | 2023
- 5061
-
When Do Curricula Work in Federated Learning?Vahidian, Saeed / Kadaveru, Sreevatsank / Baek, Woonjoon / Wang, Weijia / Kungurtsev, Vyacheslav / Chen, Chen / Shah, Mubarak / Lin, Bill et al. | 2023
- 5072
-
Domain Specified Optimization for Deployment AuthorizationWang, Haotian / Chi, Haoang / Yang, Wenjing / Lin, Zhipeng / Geng, Mingyang / Lan, Long / Zhang, Jing / Tao, Dacheng et al. | 2023
- 5083
-
STPrivacy: Spatio-Temporal Privacy-Preserving Action RecognitionLi, Ming / Xu, Xiangyu / Fan, Hehe / Zhou, Pan / Liu, Jun / Liu, Jia-Wei / Li, Jiahe / Keppo, Jussi / Shou, Mike Zheng / Yan, Shuicheng et al. | 2023
- 5093
-
SAL-ViT: Towards Latency Efficient Private Inference on ViT using Selective Attention Search with a Learnable Softmax ApproximationZhang, Yuke / Chen, Dake / Kundu, Souvik / Li, Chenghao / Beerel, Peter A. et al. | 2023
- 5103
-
Generative Gradient Inversion via Over-Parameterized Networks in Federated LearningZhang, Chi / Zhang, Xiaoman / Sotthiwat, Ekanut / Xu, Yanyu / Liu, Ping / Zhen, Liangli / Liu, Yong et al. | 2023
- 5113
-
Inspecting the Geographical Representativeness of Images from Text-to-Image ModelsBasu, Abhipsa / Babu, R. Venkatesh / Pruthi, Danish et al. | 2023
- 5125
-
Divide and Conquer: a Two-Step Method for High Quality Face De-identification with Model ExplainabilityWen, Yunqian / Liu, Bo / Cao, Jingyi / Xie, Rong / Song, Li et al. | 2023
- 5135
-
Exploring the Benefits of Visual Prompting in Differential PrivacyLi, Yizhe / Tsai, Yu-Lin / Yu, Chia-Mu / Chen, Pin-Yu / Ren, Xuebin et al. | 2023
- 5145
-
Towards Fairness-aware Adversarial Network PruningZhang, Lei / Wang, Zhibo / Dong, Xiaowei / Feng, Yunhe / Pang, Xiaoyi / Zhang, Zhifei / Ren, Kui et al. | 2023
- 5155
-
AutoReP: Automatic ReLU Replacement for Fast Private Network InferencePeng, Hongwu / Huang, Shaoyi / Zhou, Tong / Luo, Yukui / Wang, Chenghong / Wang, Zigeng / Zhao, Jiahui / Xie, Xi / Li, Ang / Geng, Tony et al. | 2023
- 5166
-
Flatness-Aware Minimization for Domain GeneralizationZhang, Xingxuan / Xu, Renzhe / Yu, Han / Dong, Yancheng / Tian, Pengfei / Cui, Peng et al. | 2023
- 5180
-
Communication-Efficient Vertical Federated Learning with Limited Overlapping SamplesSun, Jingwei / Xu, Ziyue / Yang, Dong / Nath, Vishwesh / Li, Wenqi / Zhao, Can / Xu, Daguang / Chen, Yiran / Roth, Holger R. et al. | 2023
- 5190
-
Multimodal Distillation for Egocentric Action RecognitionRadevski, Gorjan / Grujicic, Dusan / Blaschko, Matthew / Moens, Marie-Francine / Tuytelaars, Tinne et al. | 2023
- 5202
-
Self-Supervised Object Detection from Egocentric VideosAkiva, Peri / Huang, Jing / Liang, Kevin J / Kovvuri, Rama / Chen, Xingyu / Feiszli, Matt / Dana, Kristin / Hassner, Tal et al. | 2023
- 5215
-
Multi-label affordance mapping from egocentric visionMur-Labadia, Lorenzo / Guerrero, Jose J. / Martinez-Cantin, Ruben et al. | 2023
- 5227
-
Ego-Only: Egocentric Action Detection without Exocentric TransferringWang, Huiyu / Singh, Mitesh Kumar / Torresani, Lorenzo et al. | 2023
- 5239
-
COPILOT: Human-Environment Collision Prediction and Localization from Egocentric VideosPan, Boxiao / Shen, Bokui / Rempe, Davis / Paschalidou, Despoina / Mo, Kaichun / Yang, Yanchao / Guibas, Leonidas J. et al. | 2023
- 5250
-
EgoPCA: A New Framework for Egocentric Hand-Object Interaction UnderstandingXu, Yue / Li, Yong-Lu / Huang, Zhemin / Liu, Michael Xu / Lu, Cewu / Tai, Yu-Wing / Tang, Chi-Keung et al. | 2023
- 5262
-
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the BackbonePramanick, Shraman / Song, Yale / Nag, Sayan / Lin, Kevin Qinghong / Shah, Hardik / Shou, Mike Zheng / Chellappa, Rama / Zhang, Pengchuan et al. | 2023
- 5275
-
WDiscOOD: Out-of-Distribution Detection via Whitened Linear Discriminant AnalysisChen, Yiye / Lin, Yunzhi / Xu, Ruinian / Vela, Patricio A. et al. | 2023
- 5285
-
Pairwise Similarity Learning is SimPLEWen, Yandong / Liu, Weiyang / Feng, Yao / Raj, Bhiksha / Singh, Rita / Weller, Adrian / Black, Michael J. / Scholkopf, Bernhard et al. | 2023
- 5296
-
No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Synthetic and Fixed ClassifierLi, Zexi / Shang, Xinyi / He, Rui / Lin, Tao / Wu, Chao et al. | 2023
- 5307
-
Generalizable Neural Fields as Partially Observed Neural ProcessesGu, Jeffrey / Wang, Kuan-Chieh / Yeung, Serena et al. | 2023
- 5317
-
M2T: Masking Transformers Twice for Faster DecodingMentzer, Fabian / Agustson, Eirikur / Tschannen, Michael et al. | 2023
- 5327
-
Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?Psomas, Bill / Kakogeorgiou, Ioannis / Karantzalos, Konstantinos / Avrithis, Yannis et al. | 2023
- 5338
-
Improving Pixel-based MIM by Reducing Wasted Modeling CapabilityLiu, Yuan / Zhang, Songyang / Chen, Jiacheng / Yu, Zhaohui / Chen, Kai / Lin, Dahua et al. | 2023
- 5350
-
Learning Image-Adaptive Codebooks for Class-Agnostic Image RestorationLiu, Kechun / Jiang, Yitong / Choi, Inchang / Gu, Jinwei et al. | 2023
- 5361
-
Quality Diversity for Visual Pre-TrainingChavhan, Ruchika / Gouk, Henry / Li, Da / Hospedales, Timothy et al. | 2023
- 5372
-
Subclass-balancing Contrastive Learning for Long-tailed RecognitionHou, Chengkai / Zhang, Jieyu / Wang, Haonan / Zhou, Tianyi et al. | 2023
- 5385
-
Mastering Spatial Graph Prediction of Road NetworksSotiris, Anagnostidis / Lucchi, Aurelien / Hofmann, Thomas et al. | 2023
- 5396
-
Poincaré ResNetvan Spengler, Max / Berkhout, Erwin / Mettes, Pascal et al. | 2023
- 5406
-
Exploring Model Transferability through the Lens of Potential EnergyLi, Xiaotong / Hu, Zixuan / Ge, Yixiao / Shan, Ying / Duan, Ling-Yu et al. | 2023