PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations (English)
In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2978-2988; 2023
- Conference paper / Electronic Resource
- Title: PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations
- Contributors: Geng, Haoran (author) / Li, Ziming (author) / Geng, Yiran (author) / Chen, Jiayi (author) / Dong, Hao (author) / Wang, He (author)
- Publisher: IEEE
- Publication date: 2023-06-01
- Size: 1293841 bytes
- Type of media: Conference paper
- Type of material: Electronic Resource
- Language: English
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1 | Megahertz Light Steering Without Moving Parts | Pediredla, Adithya / Narasimhan, Srinivasa G. / Chamanzar, Maysamreza / Gkioulekas, Ioannis et al. | 2023
- 1 | Title Page i | 2023
- 01 | RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | Jin, Lei / Luo, Gen / Zhou, Yiyi / Sun, Xiaoshuai / Jiang, Guannan / Shu, Annan / Ji, Rongrong et al. | 2023
- 01 | Affordances from Human Videos as a Versatile Representation for Robotics | Bahl, Shikhar / Mendonca, Russell / Chen, Lili / Jain, Unnat / Pathak, Deepak et al. | 2023
- 1 | Copyright and Reprint Permissions | 2023
- 3 | Title Page iii | 2023
- 13 | Robust Dynamic Radiance Fields | Liu, Yu-Lun / Gao, Chen / Meuleman, Andreas / Tseng, Hung-Yu / Saraf, Ayush / Kim, Changil / Chuang, Yung-Yu / Kopf, Johannes / Huang, Jia-Bin et al. | 2023
- 24 | DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Chen, Yu / Lee, Gim Hee et al. | 2023
- 35 | VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Zhu, Bingfan / Yang, Yanchao / Wang, Xulong / Zheng, Youyi / Guibas, Leonidas et al. | 2023
- 46 | AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training | Jiang, Yifan / Hedman, Peter / Mildenhall, Ben / Xu, Dejia / Barron, Jonathan T. / Wang, Zhangyang / Xue, Tianfan et al. | 2023
- 56 | SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Levy, Deborah / Peleg, Amit / Pearl, Naama / Rosenbaum, Dan / Akkaynak, Derya / Korman, Simon / Treibitz, Tali et al. | 2023
- 66 | Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Isaac-Medina, Brian K. S. / Willcocks, Chris G. / Breckon, Toby P. et al. | 2023
- 76 | Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Wang, Liao / Hu, Qiang / He, Qihan / Wang, Ziyu / Yu, Jingyi / Tuytelaars, Tinne / Xu, Lan / Wu, Minye et al. | 2023
- 88 | Plen-VDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering | Yan, Han / Liu, Celong / Ma, Chao / Mei, Xing et al. | 2023
- 97 | Local Implicit Ray Function for Generalizable Radiance Field Representation | Huang, Xin / Zhang, Qi / Feng, Ying / Li, Xiaoyu / Wang, Xuan / Wang, Qing et al. | 2023
- 108 | SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Gao, Yiming / Cao, Yan-Pei / Shan, Ying et al. | 2023
- 119 | Frequency-Modulated Point Cloud Rendering with Easy Editing | Zhang, Yi / Huang, Xiaoyang / Ni, Bingbing / Zhang, Wenjun / Li, Teng et al. | 2023
- 130 | HexPlane: A Fast Representation for Dynamic Scenes | Cao, Ang / Johnson, Justin et al. | 2023
- 142 | Differentiable Shadow Mapping for Efficient Inverse Graphics | Worchel, Markus / Alexa, Marc et al. | 2023
- 154 | Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur | Dai, Peng / Zhang, Yinda / Yu, Xin / Lyu, Xiaoyang / Qi, Xiaojuan et al. | 2023
- 165 | TensoIR: Tensorial Inverse Rendering | Jin, Haian / Liu, Isabella / Xu, Peijia / Zhang, Xiaoshuai / Han, Songfang / Bi, Sai / Zhou, Xiaowei / Xu, Zexiang / Su, Hao et al. | 2023
- 175 | ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Ling, Jingwang / Wang, Zhibo / Xu, Feng et al. | 2023
- 186 | Realistic Saliency Guided Image Enhancement | Miangoleh, S. Mahdi H. / Bylinskii, Zoya / Kee, Eric / Shechtman, Eli / Aksoy, Yagiz et al. | 2023
- 195 | LightPainter: Interactive Portrait Relighting with Freehand Scribble | Mei, Yiqun / Zhang, He / Zhang, Xuaner / Zhang, Jianming / Shu, Zhixin / Wang, Yilin / Wei, Zijun / Yan, Shi / Jung, HyunJoon / Patel, Vishal M. et al. | 2023
- 206 | A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance | Xu, Xianmin / Lin, Yuxin / Zhou, Haoyang / Zeng, Chong / Yu, Yaxin / Zhou, Kun / Wu, Hongzhi et al. | 2023
- 216 | Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting | Zheng, Ruichen / Li, Peng / Wang, Haoqian / Yu, Tao et al. | 2023
- 227 | Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses | Jang, Junbong / Lee, Kwonmoo / Kim, Tae-Kyun et al. | 2023
- 237 | NeUDF: Leaning Neural Unsigned Distance Fields with Volume Rendering | Liu, Yu-Tao / Wang, Li / Yang, Jie / Chen, Weikai / Meng, Xiaoxu / Yang, Bo / Gao, Lin et al. | 2023
- 248 | NeAT: Learning Neural Implicit Surfaces with Arbitrary Topologies from Multi-View Images | Meng, Xiaoxu / Chen, Weikai / Yang, Bo et al. | 2023
- 259 | ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction | Wang, Zhen / Zhou, Shijie / Park, Jeong Joon / Paschalidou, Despoina / You, Suya / Wetzstein, Gordon / Guibas, Leonidas / Kadambi, Achuta et al. | 2023
- 271 | Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Lyu, Zhaoyang / Wang, Jinyi / An, Yuwei / Zhang, Ya / Lin, Dahua / Dai, Bo et al. | 2023
- 275 | Photo Pre-Training, But for Sketch | Li, Ke / Pang, Kaiyue / Song, Yi-Zhe et al. | 2023
- 281 | Power Bundle Adjustment for Large-Scale 3D Reconstruction | Weber, Simon / Demmel, Nikolaus / Chan, Tin Chon / Cremers, Daniel et al. | 2023
- 290 | Neural Pixel Composition for 3D-4D View Synthesis from Multi-Views | Bansal, Aayush / Zollhoefer, Michael et al. | 2023
- 300 | Magic3D: High-Resolution Text-to-3D Content Creation | Lin, Chen-Hsuan / Gao, Jun / Tang, Luming / Takikawa, Towaki / Zeng, Xiaohui / Huang, Xun / Kreis, Karsten / Fidler, Sanja / Liu, Ming-Yu / Lin, Tsung-Yi et al. | 2023
- 301 | Message from the 2023 General and Program Chairs | 2023
- 302 | 2023 Organizing Committee | 2023
- 304 | 2023 Outstanding Reviewers | 2023
- 305 | Sponsors | 2023
- 310 | 3D Video Loops from Asynchronous Input | Ma, Li / Li, Xiaoyu / Liao, Jing / Sander, Pedro V. et al. | 2023
- 321 | High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization | Xie, Jiaxin / Ouyang, Hao / Piao, Jingtan / Lei, Chenyang / Chen, Qifeng et al. | 2023
- 332 | Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field | Li, Leheng / Lian, Qing / Wang, Luozhou / Ma, Ningning / Chen, Ying-Cong et al. | 2023
- 342 | 3D GAN Inversion with Facial Symmetry Prior | Yin, Fei / Zhang, Yong / Wang, Xuan / Wang, Tengfei / Li, Xiaoyu / Gong, Yuan / Fan, Yanbo / Cun, Xiaodong / Shan, Ying / Oztireli, Cengiz et al. | 2023
- 352 | StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping | Jiang, Diqiong / Song, Dan / Tong, Ruofeng / Tang, Min et al. | 2023
- 362 | FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Bai, Haoran / Kang, Di / Zhang, Haoxian / Pan, Jinshan / Bao, Linchao et al. | 2023
- 372 | Robust Model-based Face Reconstruction through Weakly-Supervised Outlier Segmentation | Li, Chunlu / Morel-Forster, Andreas / Vetter, Thomas / Egger, Bernhard / Kortylewski, Adam et al. | 2023
- 382 | Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild | Zhang, Zhenyu / Chen, Renwang / Cao, Weijian / Tai, Ying / Wang, Chengjie et al. | 2023
- 394 | A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Lei, Biwen / Ren, Jianqiang / Feng, Mengyang / Cui, Miaomiao / Xie, Xuansong et al. | 2023
- 404 | BlendFields: Few-Shot Example-Driven Facial Modeling | Kania, Kacper / Garbin, Stephan J. / Tagliasacchi, Andrea / Estellers, Virginia / Yi, Kwang Moo / Valentin, Julien / Trzcinski, Tomasz / Kowalski, Marek et al. | 2023
- 416 | Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chen, Chuhan / O'Toole, Matthew / Bharaj, Gaurav / Garrido, Pablo et al. | 2023
- 427 | DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Pang, Youxin / Zhang, Yong / Quan, Weize / Fan, Yanbo / Cun, Xiaodong / Shan, Ying / Yan, Dong-Ming et al. | 2023
- 437 | GANHead: Towards Generative Animatable Neural Head Avatars | Wu, Sijing / Yan, Yichao / Li, Yunhao / Cheng, Yuhao / Zhu, Wenhan / Gao, Ke / Li, Xiaobo / Zhai, Guangtao et al. | 2023
- 448 | EDGE: Editable Dance Generation From Music | Tseng, Jonathan / Castellon, Rodrigo / Liu, C. Karen et al. | 2023
- 458 | Unsupervised Volumetric Animation | Siarohin, Aliaksandr / Menapace, Willi / Skorokhodov, Ivan / Olszewski, Kyle / Ren, Jian / Lee, Hsin-Ying / Chai, Menglei / Tulyakov, Sergey et al. | 2023
- 459 | Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images | Bertiche, Hugo / Mitra, Niloy J. / Kulkarni, Kuldeep / Huang, Chun-Hao Paul / Wang, Tuanfeng Y. / Madadi, Meysam / Escalera, Sergio / Ceylan, Duygu et al. | 2023
- 469 | Generating Holistic 3D Human Motion from Speech | Yi, Hongwei / Liang, Hualin / Liu, Yifei / Cao, Qiong / Wen, Yandong / Bolkart, Timo / Tao, Dacheng / Black, Michael J. et al. | 2023
- 481 | Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model | Du, Yuming / Kips, Robin / Pumarola, Albert / Starke, Sebastian / Thabet, Ali / Sanakoyeu, Artsiom et al. | 2023
- 491 | Learning Anchor Transformations for 3D Garment Animation | Zhao, Fang / Li, Zekun / Huang, Shaoli / Weng, Junwu / Zhou, Tianfei / Xie, Guo-Sen / Wang, Jue / Shan, Ying et al. | 2023
- 501 | CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition | Zhang, Hongwen / Lin, Siyou / Shao, Ruizhi / Zhang, Yuxiang / Zheng, Zerong / Huang, Han / Guo, Yandong / Liu, Yebin et al. | 2023
- 512 | ECON: Explicit Clothed humans Optimized via Normal integration | Xiu, Yuliang / Yang, Jinlong / Cao, Xu / Tzionas, Dimitrios / Black, Michael J. et al. | 2023
- 524 | PersonNeRF: Personalized Reconstruction from Photo Collections | Weng, Chung-Yi / Srinivasan, Pratul P. / Curless, Brian / Kemelmacher-Shlizerman, Ira et al. | 2023
- 534 | 3D Human Mesh Estimation from Virtual Markers | Ma, Xiaoxuan / Su, Jiajun / Wang, Chunyu / Zhu, Wentao / Wang, Yizhou et al. | 2023
- 544 | Overcoming the TradeOff between Accuracy and Plausibility in 3D Hand Shape Reconstruction | Yu, Ziwei / Li, Chen / Yang, Linlin / Zheng, Xiaoxu / Bi Mi, Michael / Lee, Gim Hee / Yao, Angela et al. | 2023
- 554 | Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding | Oh, Yeonguk / Park, JoonKyu / Kim, Jaeha / Moon, Gyeongsik / Lee, Kyoung Mu et al. | 2023
- 564 | MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction | Wang, Congyi / Zhu, Feida / Wen, Shilei et al. | 2023
- 574 | PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation | Shetty, Karthik / Birkhold, Annette / Jaganathan, Srikrishna / Strobel, Norbert / Kowarschik, Markus / Maier, Andreas / Egger, Bernhard et al. | 2023
- 585 | CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis | Zheng, Juntian / Zheng, Qingyuan / Fang, Lixing / Liu, Yun / Yi, Li et al. | 2023
- 595 | Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream | Jiang, Yuheng / Yao, Kaixin / Su, Zhuo / Shen, Zhehao / Luo, Haimin / Xu, Lan et al. | 2023
- 606 | BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Wen, Bowen / Tremblay, Jonathan / Blukis, Valts / Tyree, Stephen / Muller, Thomas / Evans, Alex / Fox, Dieter / Kautz, Jan / Birchfield, Stan et al. | 2023
- 618 | Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Ju, Xuan / Zeng, Ailing / Wang, Jianan / Xu, Qiang / Zhang, Lei et al. | 2023
- 630 | Omnimatte3D: Associating Objects and Their Effects in Unconstrained Monocular Video | Suhail, Mohammed / Lu, Erika / Li, Zhengqi / Snavely, Noah / Sigal, Leonid / Cole, Forrester et al. | 2023
- 640 | On the Benefits of 3D Pose and Tracking for Human Action Recognition | Rajasegaran, Jathushan / Pavlakos, Georgios / Kanazawa, Angjoo / Feichtenhofer, Christoph / Malik, Jitendra et al. | 2023
- 650 | Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization | Zhuo, Li'an / Cao, Jian / Wang, Qi / Zhang, Bang / Bo, Liefeng et al. | 2023
- 660 | Human Pose as Compositional Tokens | Geng, Zigang / Wang, Chunyu / Wei, Yixuan / Liu, Ze / Li, Houqiang / Hu, Han et al. | 2023
- 672 | PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation | Liu, Qihao / Kortylewski, Adam / Yuille, Alan et al. | 2023
- 682 | SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments | Dai, Yudi / Lin, Yitai / Lin, Xiping / Wen, Chenglu / Xu, Lan / Yi, Hongwei / Shen, Siqi / Ma, Yuexin / Wang, Cheng et al. | 2023
- 693 | Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module | Huang, Linzhi / Li, Yulong / Tian, Hongbo / Yang, Yue / Li, Xiangang / Deng, Weihong / Ye, Jieping et al. | 2023
- 704 | Human Pose Estimation in Extremely Low-Light Conditions | Lee, Sohyun / Rim, Jaesung / Jeong, Boseung / Kim, Geonu / Woo, Byungju / Lee, Haechan / Cho, Sunghyun / Kwak, Suha et al. | 2023
- 715 | Flexible-Cm GAN: Towards Precise 3D Dose Prediction in Radiotherapy | Gao, Riqiang / Lou, Bin / Xu, Zhoubing / Comaniciu, Dorin / Kamen, Ali et al. | 2023
- 726 | DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Bangunharcana, Antyanta / Magd, Ahmed / Kim, Kyung-Soo et al. | 2023
- 739 | A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization | He, Yijia / Xu, Bo / Ouyang, Zhanpeng / Li, Hongdong et al. | 2023
- 749 | Semidefinite Relaxations for Robust Multiview Triangulation | Harenstam-Nielsen, Linus / Zeller, Niclas / Cremers, Daniel et al. | 2023
- 758 | A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image | Jiang, Zheheng / Rahmani, Hossein / Black, Sue / Williams, Bryan M. et al. | 2023
- 768 | Instant Multi-View Head Capture through Learnable Registration | Bolkart, Timo / Li, Tianye / Black, Michael J. et al. | 2023
- 780 | On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | Jung, HyunJun / Ruhkamp, Patrick / Zhai, Guangyao / Brasch, Nikolas / Li, Yitong / Verdie, Yannick / Song, Jifei / Zhou, Yiren / Armagan, Anil / Ilic, Slobodan et al. | 2023
- 792 | Learning 3D Scene Priors with 2D Supervision | Nie, Yinyu / Dai, Angela / Han, Xiaoguang / Nießner, Matthias et al. | 2023
- 803 | OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Wu, Tong / Zhang, Jiarui / Fu, Xiao / Wang, Yuxin / Ren, Jiawei / Pan, Liang / Wu, Wayne / Yang, Lei / Wang, Jiaqi / Qian, Chen et al. | 2023
- 815 | OpenScene: 3D Scene Understanding with Open Vocabularies | Peng, Songyou / Genova, Kyle / Jiang, Chiyu / Tagliasacchi, Andrea / Pollefeys, Marc / Funkhouser, Thomas et al. | 2023
- 825 | Multi-View Azimuth Stereo via Tangent Space Consistency | Cao, Xu / Santo, Hiroaki / Okura, Fumio / Matsushita, Yasuyuki et al. | 2023
- 835 | Progressive Transformation Learning for Leveraging Virtual Images in Training | Shen, Yi-Ting / Lee, Hyungtae / Kwon, Heesung / Bhattacharyya, Shuvra S. et al. | 2023
- 845 | Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Yue, Yuanwen / Kontogianni, Theodora / Schindler, Konrad / Engelmann, Francis et al. | 2023
- 855 | NeRF-Supervised Deep Stereo | Tosi, Fabio / Tonioni, Alessio / De Gregorio, Daniele / Poggi, Matteo et al. | 2023
- 867 | Semantic Scene Completion with Cleaner Self | Wang, Fengyun / Zhang, Dong / Zhang, Hanwang / Tang, Jinhui / Sun, Qianru et al. | 2023
- 878 | PanelNet: Understanding 360 Indoor Environment via Panel Representation | Yu, Haozheng / He, Lu / Jian, Bing / Feng, Weiwei / Liu, Shan et al. | 2023
- 888 | Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates | Paliwal, Avinash / Tsarov, Andrii / Kalantari, Nima Khademi et al. | 2023
- 899 | Depth Estimation from Indoor Panoramas with Neural Scene Representation | Chang, Wenjie / Zhang, Yueyi / Xiong, Zhiwei et al. | 2023
- 909 | NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation | Zheng, Zehan / Wu, Danni / Lu, Ruisi / Lu, Fan / Chen, Guang / Jiang, Changjun et al. | 2023
- 919 | RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo | Cai, Changjiang / Ji, Pan / Yan, Qingan / Xu, Yi et al. | 2023
- 929 | NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Tang, Shitao / Tang, Sicong / Tagliasacchi, Andrea / Tan, Ping / Furukawa, Yasutaka et al. | 2023
- 940 | MACARONS: Mapping and Coverage Anticipation with RGB Online Self-Supervision | Guedon, Antoine / Monnier, Tom / Monasse, Pascal / Lepetit, Vincent et al. | 2023
- 952 | vMAP: Vectorised Object Mapping for Neural Field SLAM | Kong, Xin / Liu, Shikun / Taher, Marwan / Davison, Andrew J. et al. | 2023
- 962 | Seeing a Rose in Five Thousand Ways | Zhang, Yunzhi / Wu, Shangzhe / Snavely, Noah / Wu, Jiajun et al. | 2023
- 972 | Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking | Wang, Yihao / Wang, Zhigang / Zhao, Bin / Wang, Dong / Chen, Mulin / Li, Xuelong et al. | 2023
- 982 | Seeing With Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding | Chakravarthula, Praneeth / D'Souza, Jim Aldon / Tseng, Ethan / Bartusek, Joe / Heide, Felix et al. | 2023
- 992 | Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection | Zeng, Jia / Chen, Li / Deng, Hanming / Lu, Lewei / Yan, Junchi / Qiao, Yu / Li, Hongyang et al. | 2023
- 1012 | AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Li, Zechuan / Yu, Hongshan / Yang, Zhengeng / Chen, Tongjia / Akhtar, Naveed et al. | 2023
- 1022 | Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving | Dong, Yinpeng / Kang, Caixin / Zhang, Jinlai / Zhu, Zijian / Wang, Yikai / Yang, Xiao / Su, Hang / Wei, Xingxing / Zhu, Jun et al. | 2023
- 1033 | Gaussian Label Distribution Learning for Spherical Image Object Detection | Xu, Hang / Liu, Xinyuan / Zhao, Qiang / Ma, Yike / Yan, Chenggang / Dai, Feng et al. | 2023
- 1043 | Deep Depth Estimation from Thermal Image | Shin, Ukcheol / Park, Jinsun / Kweon, In So et al. | 2023
- 1054 | LidarGait: Benchmarking 3D Gait Recognition with Point Clouds | Shen, Chuanfu / Chao, Fan / Wu, Wei / Wang, Rui / Huang, George Q. / Yu, Shiqi et al. | 2023
- 1064 | Generalized UAV Object Detection via Frequency Domain Disentanglement | Wang, Kunyu / Fu, Xueyang / Huang, Yukun / Cao, Chengzhi / Shi, Gege / Zha, Zheng-Jun et al. | 2023
- 1074 | Learning Compact Representations for LiDAR Completion and Generation | Xiong, Yuwen / Ma, Wei-Chiu / Wang, Jingkang / Urtasun, Raquel et al. | 2023
- 1084 | CXTrack: Improving 3D Point Cloud Tracking with Contextual Information | Xu, Tian-Xing / Guo, Yuan-Chen / Lai, Yu-Kun / Zhang, Song-Hai et al. | 2023
- 1094 | Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline | Ji, Wei / Li, Jingjing / Bian, Cheng / Zhou, Zongwei / Zhao, Jiaying / Yuille, Alan / Cheng, Li et al. | 2023
- 1105 | LinK: Linear Kernel for LiDAR-based 3D Perception | Lu, Tao / Ding, Xiang / Liu, Haisong / Wu, Gangshan / Wang, Limin et al. | 2023
- 1116 | Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Khurana, Tarasha / Hu, Peiyun / Held, David / Ramanan, Deva et al. | 2023
- 1125 | Curricular Object Manipulation in LiDAR-based Object Detection | Zhu, Ziyue / Meng, Qiang / Wang, Xiao / Wang, Ke / Yan, Liujiang / Yang, Jian et al. | 2023
- 1136 | Delivering Arbitrary-Modal Semantic Segmentation | Zhang, Jiaming / Liu, Ruiping / Shi, Hao / Yang, Kailun / Reiß, Simon / Peng, Kunyu / Fu, Haodong / Wang, Kaiwei / Stiefelhagen, Rainer et al. | 2023
- 1148 | Robust Outlier Rejection for 3D Registration with Variational Bayes | Jiang, Haobo / Dang, Zheng / Wei, Zhen / Xie, Jin / Yang, Jian / Salzmann, Mathieu et al. | 2023
- 1158 | 3D Human Keypoints Estimation from Point Clouds in the Wild without Human Labels | Weng, Zhenzhen / Gorban, Alexander S. / Ji, Jingwei / Najibi, Mahyar / Zhou, Yin / Anguelov, Dragomir et al. | 2023
- 1168 | Self-Supervised Pre-Training with Masked Shape Prediction for 3D Scene Understanding | Jiang, Li / Yang, Zetong / Shi, Shaoshuai / Golyanik, Vladislav / Dai, Dengxin / Schiele, Bernt et al. | 2023
- 1179 | ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Xue, Le / Gao, Mingfei / Xing, Chen / Martin-Martin, Roberto / Wu, Jiajun / Xiong, Caiming / Xu, Ran / Niebles, Juan Carlos / Savarese, Silvio et al. | 2023
- 1190 | Open-Vocabulary Point-Cloud Object Detection without 3D Annotation | Lu, Yuheng / Xu, Chenfeng / Wei, Xiaobao / Xie, Xiaodong / Tomizuka, Masayoshi / Keutzer, Kurt / Zhang, Shanghang et al. | 2023
- 1200 | FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer | Liu, Zhijian / Yang, Xinyu / Tang, Haotian / Yang, Shang / Han, Song et al. | 2023
- 1212 | PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos | Shen, Zhiqiang / Sheng, Xiaoxiao / Wang, Longguang / Guo, Yulan / Liu, Qiong / Zhou, Xi et al. | 2023
- 1223 | E2PN: Efficient SE(3)-Equivariant Point Network | Zhu, Minghan / Ghaffari, Maani / Clark, William A / Peng, Huei et al. | 2023
- 1233 | Poly-PC: A Polyhedral Network for Multiple Point Cloud Tasks at Once | Xie, Tao / Wang, Shiguang / Wang, Ke / Yang, Linqi / Jiang, Zhiqiang / Zhang, Xingcheng / Dai, Kun / Li, Ruifeng / Cheng, Jian et al. | 2023
- 1244 | Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering | Zhang, Nan / Pan, Zhiyi / Li, Thomas H. / Gao, Wei / Li, Ge et al. | 2023
- 1255 | BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration | Ao, Sheng / Hu, Qingyong / Wang, Hanyun / Xu, Kai / Guo, Yulan et al. | 2023
- 1265 | TopDiG: Class-agnostic Topological Directional Graph Extraction from Remote Sensing Images | Yang, Bingnan / Zhang, Mi / Zhang, Zhan / Zhang, Zhili / Hu, Xiangyun et al. | 2023
- 1275 | Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants with no False Negatives and no False Positives | Widdowson, Daniel / Kurlin, Vitaliy et al. | 2023
- 1285 | Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Zheng, Xu / Zhu, Jinjing / Liu, Yexin / Cao, Zidong / Fu, Chong / Wang, Lin et al. | 2023
- 1296 | CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes | Bhatia, Harshil / Tretschk, Edith / Lahner, Zorah / Benkner, Marcel Seelbach / Moeller, Michael / Theobalt, Christian / Golyanik, Vladislav et al. | 2023
- 1306 | Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints | Potje, Guilherme / Cadar, Felipe / Araujo, Andre / Martins, Renato / Nascimento, Erickson R. et al. | 2023
- 1316 | Understanding and Improving Features Learned in Deep Functional Maps | Attaiki, Souhaib / Ovsjanikov, Maks et al. | 2023
- 1327 | High-Frequency Stereo Matching Network | Zhao, Haoliang / Zhou, Huizhou / Zhang, Yongjun / Chen, Jie / Yang, Yitong / Zhao, Yong et al. | 2023
- 1337 | Rethinking Optical Flow from Geometric Matching Consistent Perspective | Dong, Qiaole / Cao, Chenjie / Fu, Yanwei et al. | 2023
- 1348 | Efficient Robust Principal Component Analysis via Block Krylov Iteration and CUR Decomposition | Fang, Shun / Xu, Zhengqin / Wu, Shiqian / Xie, Shoulie et al. | 2023
- 1358 | VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation | Yang, Bingchen / Jiang, Haiyong / Pan, Hao / Xiao, Jun et al. | 2023
- 1368 | TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving | Fang, Shaoheng / Wang, Zi / Zhong, Yiqi / Ge, Junhao / Chen, Siheng et al. | 2023
- 1379 | Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Agro, Ben / Sykora, Quinlan / Casas, Sergio / Urtasun, Raquel et al. | 2023
- 1389 | UniSim: A Neural Closed-Loop Sensor Simulator | Yang, Ze / Chen, Yun / Wang, Jingkang / Manivasagam, Sivabalan / Ma, Wei-Chiu / Yang, Anqi Joyce / Urtasun, Raquel et al. | 2023
- 1400 | FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-Tail Trajectory Prediction | Wang, Yuning / Zhang, Pu / Bai, Lei / Xue, Jianru et al. | 2023
- 1410 | EqMotion: Equivariant Multi-Agent Motion Prediction with Invariant Interaction Reasoning | Xu, Chenxin / Tan, Robby T. / Tan, Yuhong / Chen, Siheng / Wang, Yu Guang / Wang, Xinchao / Wang, Yanfeng et al. | 2023
- 1421 | Lookahead Diffusion Probabilistic Models for Refining Mean Estimation | Zhang, Guoqiang / Niwa, Kenta / Kleijn, W. Bastiaan et al. | 2023
- 1430 | Neural Volumetric Memory for Visual Locomotion Control | Yang, Ruihan / Yang, Ge / Wang, Xiaolong et al. | 2023
- 1441 | Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention | Mondal, Sounak / Yang, Zhibo / Ahn, Seoyoung / Samaras, Dimitris / Zelinsky, Gregory / Hoai, Minh et al. | 2023
- 1451 | DrapeNet: Garment Generation and Self-Supervised Draping | De Luigi, Luca / Li, Ren / Guillard, Benoit / Salzmann, Mathieu / Fua, Pascal et al. | 2023
- 1461 | Tracking Multiple Deformable Objects in Egocentric Videos | Huang, Mingzhen / Li, Xiaoxing / Hu, Jun / Peng, Honghong / Lyu, Siwei et al. | 2023
- 1472 | Good is Bad: Causality Inspired Cloth-debiasing for Cloth-changing Person Re-identification | Yang, Zhengwei / Lin, Meng / Zhong, Xian / Wu, Yu / Wang, Zheng et al. | 2023
- 1482 | Micron-BERT: BERT-Based Facial Micro-Expression Recognition | Nguyen, Xuan-Bac / Duong, Chi Nhan / Li, Xin / Gauch, Susan / Seo, Han-Seok / Luu, Khoa et al. | 2023
- 1493 | MARLIN: Masked Autoencoder for facial video Representation LearnINg | Cai, Zhixi / Ghosh, Shreya / Stefanov, Kalin / Dhall, Abhinav / Cai, Jianfei / Rezatofighi, Hamid / Haffari, Reza / Hayat, Munawar et al. | 2023
- 1505 | StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator | Guan, Jiazhi / Zhang, Zhanwang / Zhou, Hang / Hu, Tianshu / Wang, Kaisiyuan / He, Dongliang / Feng, Haocheng / Liu, Jingtuo / Ding, Errui / Liu, Ziwei et al. | 2023
- 1516 | REALIMPACT: A Dataset of Impact Sound Fields for Real Objects | Clarke, Samuel / Gao, Ruohan / Wang, Mason / Rau, Mark / Xu, Julia / Wang, Jui-Hsien / James, Doug L. / Wu, Jiajun et al. | 2023
- 1526 | STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition | Zhu, Xiaoyu / Huang, Po-Yao / Liang, Junwei / De Melo, Celso M. / Hauptmann, Alexander et al. | 2023
- 1537 | Progressive Spatio-temporal Alignment for Efficient Event-based Motion Estimation | Huang, Xueyan / Zhang, Yueyi / Xiong, Zhiwei et al. | 2023
- 1547 | Event-Based Shape from Polarization | Muglikar, Manasi / Bauersfeld, Leonard / Moeys, Diederik Paul / Scaramuzza, Davide et al. | 2023
- 1557 | Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution | Lu, Yunfan / Wang, Zipeng / Liu, Minjie / Wang, Hongjian / Wang, Lin et al. | 2023
- 1568 | BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Park, Junheum / Kim, Jintae / Kim, Chang-Su et al. | 2023
- 1578 | A Unified Pyramid Recurrent Network for Video Frame Interpolation | Jin, Xin / Wu, Longhai / Chen, Jie / Chen, Youxin / Koo, Jayoon / Hahm, Cheul-Hee et al. | 2023
- 1588 | Event-based Blurry Frame Interpolation under Blind Exposure | Weng, Wenming / Zhang, Yueyi / Xiong, Zhiwei et al. | 2023
- 1599 | FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation | Shi, Xiaoyu / Huang, Zhaoyang / Li, Dasong / Zhang, Manyuan / Cheung, Ka Chun / See, Simon / Qin, Hongwei / Dai, Jifeng / Li, Hongsheng et al. | 2023
- 1611 | POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery | Zheng, Ce / Liu, Xianpeng / Qi, Guo-Jun / Chen, Chen et al. | 2023
- 1621 | Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo | Wang, Yuesong / Zeng, Zhaojie / Guan, Tao / Yang, Wei / Chen, Zhuo / Liu, Wenkai / Xu, Luoyuan / Luo, Yawei et al. | 2023
- 1631 | On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer | Yu, Zhenjie / Li, Shuang / Shen, Yirui / Liu, Chi Harold / Wang, Shuigen et al. | 2023
- 1641 | Thermal Spread Functions (TSF): Physics-Guided Material Classification | Dashpute, Aniket / Saragadam, Vishwanath / Alexander, Emma / Willomitzer, Florian / Katsaggelos, Aggelos / Veeraraghavan, Ashok / Cossairt, Oliver et al. | 2023
- 1651 | Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution | Chen, Xuhai / Zhang, Jiangning / Xu, Chao / Wang, Yabiao / Wang, Chengjie / Liu, Yong et al. | 2023
- 1662 | Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Wu, Yuhui / Pan, Chen / Wang, Guoqing / Yang, Yang / Wei, Jiwei / Li, Chongyi / Shen, Heng Tao et al. | 2023
- 1672 | CutMIB: Boosting Light Field Super-Resolution via Multi-View Image Blending | Xiao, Zeyu / Liu, Yutong / Gao, Ruisheng / Xiong, Zhiwei et al. | 2023
- 1683 | sRGB Real Noise Synthesizing with Neighboring Correlation-Aware Noise Model | Fu, Zixuan / Guo, Lanqing / Wen, Bihan et al. | 2023
- 1692 | Masked Image Training for Generalizable Deep Image Denoising | Chen, Haoyu / Gu, Jinjin / Liu, Yihao / Magid, Salma Abdel / Dong, Chao / Wang, Qiong / Pfister, Hanspeter / Zhu, Lei et al. | 2023
- 1704 | DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration | Wang, Zhixin / Zhang, Ziying / Zhang, Xiaoyun / Zheng, Huangjie / Zhou, Mingyuan / Zhang, Ya / Wang, Yanfeng et al. | 2023
- 1714 | Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective | Li, Xin / Li, Bingchen / Jin, Xin / Lan, Cuiling / Chen, Zhibo et al. | 2023
- 1725 | Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation | Park, Seung Ho / Moon, Young Su / Cho, Nam Ik et al. | 2023
- 1736 | Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder | Lin, Xinmiao / Li, Yikang / Hsiao, Jenhao / Ho, Chiuman / Kong, Yu et al. | 2023
- 1746 | MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos | Zhang, Zicheng / Wu, Wei / Sun, Wei / Tu, Danyang / Lu, Wei / Min, Xiongkuo / Chen, Ying / Zhai, Guangtao et al. | 2023
- 1756 | CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Tian, Senmao / Lu, Ming / Liu, Jiaming / Guo, Yandong / Chen, Yurong / Zhang, Shunli et al. | 2023
- 1766 | Initialization Noise in Image Gradients and Saliency Maps | Woerl, Ann-Christin / Disselhoff, Jan / Wand, Michael et al. | 2023
- 1776 | Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution | Yao, Jie-En / Tsao, Li-Yuan / Lo, Yi-Chen / Tseng, Roy / Chang, Chia-Che / Lee, Chun-Yi et al. | 2023
- 1786 | Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit | Wang, Xiaohang / Chen, Xuanhong / Ni, Bingbing / Wang, Hang / Tong, Zhengyan / Liu, Yutian et al. | 2023
- 1796 | CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution | Cao, Jiezhang / Wang, Qin / Xian, Yongqin / Li, Yawei / Ni, Bingbing / Pi, Zhiming / Zhang, Kai / Zhang, Yulun / Timofte, Radu / Van Gool, Luc et al. | 2023
- 1808 | Multiplicative Fourier Level of Detail | Dou, Yishun / Zheng, Zhong / Jin, Qiaoqiao / Ni, Bingbing et al. | 2023
- 1818 | Document Image Shadow Removal Guided by Color-Aware Background | Zhang, Ling / He, Yinghao / Zhang, Qing / Liu, Zheng / Zhang, Xiaolong / Xiao, Chunxia et al. | 2023
- 1828 | StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN | Pehlivan, Hamza / Dalva, Yusuf / Dundar, Aysegul et al. | 2023
- 1838 | TopNet: Transformer-Based Object Placement Network for Image Compositing | Zhu, Sijie / Lin, Zhe / Cohen, Scott / Kuen, Jason / Zhang, Zhifei / Chen, Chen et al. | 2023
- 1848 | VecFontSDF: Learning to Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions | Xia, Zeqing / Xiong, Bojun / Lian, Zhouhui et al. | 2023
- 1858 | CF-Font: Content Fusion for Few-Shot Font Generation | Wang, Chi / Zhou, Min / Ge, Tiezheng / Jiang, Yuning / Bao, Hujun / Xu, Weiwei et al. | 2023
- 1868 | SIEDOB: Semantic Image Editing by Disentangling Object and Background | Luo, Wuyang / Yang, Su / Zhang, Xinjian / Zhang, Weishan et al. | 2023
- 1879 | MaskSketch: Unpaired Structure-guided Masked Image Generation | Bashkirova, Dina / Lezama, Jose / Sohn, Kihyuk / Saenko, Kate / Essa, Irfan et al. | 2023
- 1890 | Text2Scene: Text-driven Indoor Scene Stylization with Part-Aware Details | Hwang, Inwoo / Kim, Hyeonwoo / Kim, Young Min et al. | 2023
- 1900 | Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Wu, Qiucheng / Liu, Yujian / Zhao, Handong / Kale, Ajinkya / Bui, Trung / Yu, Tong / Lin, Zhe / Zhang, Yang / Chang, Shiyu et al. | 2023
- 1911
-
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion ModelsJain, Ajay / Xie, Amber / Abbeel, Pieter et al. | 2023
- 1921
-
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image TranslationTumanyan, Narek / Geyer, Michal / Bagon, Shai / Dekel, Tali et al. | 2023
- 1931
-
Multi-Concept Customization of Text-to-Image DiffusionKumari, Nupur / Zhang, Bingliang / Zhang, Richard / Shechtman, Eli / Zhu, Jun-Yan et al. | 2023
- 1942
-
Unifying Layout Generation with a Decoupled Diffusion ModelHui, Mude / Zhang, Zhizheng / Zhang, Xiaoyi / Xie, Wenxuan / Wang, Yuwang / Lu, Yan et al. | 2023
- 1952
-
BBDM: Image-to-Image Translation with Brownian Bridge Diffusion ModelsLi, Bo / Xue, Kaitao / Liu, Bin / Lai, Yu-Kun et al. | 2023
- 1962
-
Towards Practical Plug-and-Play Diffusion ModelsGo, Hyojun / Lee, Yunsung / Kim, JinYoung / Lee, Seunghyun / Jeong, Myeongho / Lee, Hyun Seung / Choi, Seungtaek et al. | 2023
- 1972
-
Post-Training Quantization on Diffusion ModelsShang, Yuzhang / Yuan, Zhihang / Xie, Bin / Wu, Bingzhe / Yan, Yan et al. | 2023
- 1982
-
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits AnimationShen, Shuai / Zhao, Wenliang / Meng, Zibin / Li, Wanhua / Zhu, Zheng / Zhou, Jie / Lu, Jiwen et al. | 2023
- 1992
-
Mask-Guided Matting in the WildPark, Kwanyong / Woo, Sanghyun / Oh, Seoung Wug / Kweon, In So / Lee, Joon-Young et al. | 2023
- 2002
-
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image GenerationHuang, Mengqi / Mao, Zhendong / Wang, Quan / Zhang, Yongdong et al. | 2023
- 2012
-
Compression-Aware Video Super-ResolutionWang, Yingwei / Isobe, Takashi / Jia, Xu / Tao, Xin / Lu, Huchuan / Tai, Yu-Wing et al. | 2023
- 2022
-
Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN modelsAhuja, Nilesh / Datta, Parual / Kanzariya, Bhavya / Somayazulu, V. Srinivasa / Tickoo, Omesh et al. | 2023
- 2031
-
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for VideosZhao, Qi / Asif, M. Salman / Ma, Zhan et al. | 2023
- 2041
-
Polynomial Implicit Neural Representations For Large Diverse DatasetsSingh, Rajhans / Shukla, Ankita / Turaga, Pavan et al. | 2023
- 2052
-
Learning Decorrelated Representations Efficiently Using Fast Fourier TransformShigeto, Yutaro / Shimbo, Masashi / Yoshikawa, Yuya / Takeuchi, Akikazu et al. | 2023
- 2061
-
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerChen, Xuanyao / Liu, Zhijian / Tang, Haotian / Yi, Li / Zhao, Hang / Han, Song et al. | 2023
- 2071
-
N-Gram in Swin Transformers for Efficient Lightweight Image Super-ResolutionChoi, Haram / Lee, Jeongmin / Yang, Jihoon et al. | 2023
- 2082
-
Slide-Transformer: Hierarchical Vision Transformer with Local Self-AttentionPan, Xuran / Ye, Tianzhu / Xia, Zhuofan / Song, Shiji / Huang, Gao et al. | 2023
- 2092
-
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision TransformersWei, Siyuan / Ye, Tianzhu / Zhang, Shen / Tang, Yao / Liang, Jiajun et al. | 2023
- 2102
-
Top-Down Visual Attention from Analysis by SynthesisShi, Baifeng / Darrell, Trevor / Wang, Xin et al. | 2023
- 2113
-
Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural NetworksFrey, Markus / Doeller, Christian F. / Barry, Caswell et al. | 2023
- 2122
-
Masked Image Modeling with Local Multi-Scale ReconstructionWang, Haoqing / Tang, Yehui / Wang, Yunhe / Guo, Jianyuan / Deng, Zhi-Hong / Han, Kai et al. | 2023
- 2132
-
Siamese Image Modeling for Self-Supervised Vision Representation LearningTao, Chenxin / Zhu, Xizhou / Su, Weijie / Huang, Gao / Li, Bin / Zhou, Jie / Qiao, Yu / Wang, Xiaogang / Dai, Jifeng et al. | 2023
- 2142
-
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image SynthesisLi, Tianhong / Chang, Huiwen / Mishra, Shlok Kumar / Zhang, Han / Katabi, Dina / Krishnan, Dilip et al. | 2023
- 2153
-
Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identificationZhang, Yukang / Wang, Hanzi et al. | 2023
- 2163
-
DistilPose: Tokenized Pose Regression with Heatmap DistillationYe, Suhang / Zhang, Yingyi / Hu, Jie / Cao, Liujuan / Zhang, Shengchuan / Shen, Lei / Wang, Jun / Ding, Shouhong / Ji, Rongrong et al. | 2023
- 2173
-
Graph Transformer GANs for Graph-Constrained House GenerationTang, Hao / Zhang, Zhenyu / Shi, Humphrey / Li, Bo / Shao, Ling / Sebe, Nicu / Timofte, Radu / Van Gool, Luc et al. | 2023
- 2183
-
Automatic High Resolution Wire Segmentation and RemovalChiu, Mang Tik / Zhang, Xuaner / Wei, Zijun / Zhou, Yuqian / Shechtman, Eli / Barnes, Connelly / Lin, Zhe / Kainz, Florian / Amirghodsi, Sohrab / Shi, Humphrey et al. | 2023
- 2193
-
Tree Instance Segmentation with Temporal Contour GraphFiroze, Adnan / Wingren, Cameron / Yeh, Raymond A. / Benes, Bedrich / Aliaga, Daniel et al. | 2023
- 2203
-
Dual-Path Adaptation from Image to Video TransformersPark, Jungin / Lee, Jiyoung / Sohn, Kwanghoon et al. | 2023
- 2214
-
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video LearningPiergiovanni, AJ / Kuo, Weicheng / Angelova, Anelia et al. | 2023
- 2225
-
Modeling Video as Stochastic Processes for Fine-Grained Video Representation LearningZhang, Heng / Liu, Daqing / Zheng, Qi / Su, Bing et al. | 2023
- 2235
-
Masked Motion Encoding for Self-Supervised Video Representation LearningSun, Xinyu / Chen, Peihao / Chen, Liangwei / Li, Changhao / Li, Thomas H. / Tan, Mingkui / Gan, Chuang et al. | 2023
- 2246
-
Boosting Video Object Segmentation via Space-Time Correspondence LearningZhang, Yurong / Li, Liulei / Wang, Wenguan / Xie, Rong / Song, Li / Zhang, Wenjun et al. | 2023
- 2257
-
Two-shot Video Object SegmentationYan, Kun / Li, Xiao / Wei, Fangyun / Wang, Jinglu / Zhang, Chenbin / Wang, Ping / Lu, Yan et al. | 2023
- 2268
-
Look Before You Match: Instance Understanding Matters in Video Object SegmentationWang, Junke / Chen, Dongdong / Wu, Zuxuan / Luo, Chong / Tang, Chuanxin / Dai, Xiyang / Zhao, Yucheng / Xie, Yujia / Yuan, Lu / Jiang, Yu-Gang et al. | 2023
- 2279
-
Spatial-then-Temporal Self-Supervised Learning for Video CorrespondenceLi, Rui / Liu, Dong et al. | 2023
- 2289
-
Few-Shot Referring Relationships in VideosKumar, Yogesh / Mishra, Anand et al. | 2023
- 2299
-
Vision Transformers are Parameter-Efficient Audio-Visual LearnersLin, Yan-Bo / Sung, Yi-Lin / Lei, Jie / Bansal, Mohit / Bertasius, Gedas et al. | 2023
- 2310
-
Egocentric Video Task TranslationXue, Zihui / Song, Yale / Grauman, Kristen / Torresani, Lorenzo et al. | 2023
- 2321
-
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture GenerationYang, Sicheng / Wu, Zhiyong / Li, Minglei / Zhang, Zhensong / Hao, Lei / Bao, Weihong / Zhuang, Haolin et al. | 2023
- 2331
-
Co-speech Gesture Synthesis by Reinforcement Learning with Contrastive Pretrained RewardsSun, Mingyang / Zhao, Mengchen / Hou, Yaqing / Li, Minglei / Xu, Huang / Xu, Songcen / Hao, Jianye et al. | 2023
- 2341
-
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action RecognitionDave, Ishan Rajendrakumar / Rizve, Mamshad Nayeem / Chen, Chen / Shah, Mubarak et al. | 2023
- 2353
-
How can objects help action recognition?Zhou, Xingyi / Arnab, Anurag / Sun, Chen / Schmid, Cordelia et al. | 2023
- 2363
-
Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action RecognitionLin, Lilang / Zhang, Jiahang / Liu, Jiaying et al. | 2023
- 2373
-
Decomposed Cross-Modal Distillation for RGB-based Temporal Action DetectionLee, Pilhyeon / Kim, Taeoh / Shim, Minho / Wee, Dongyoon / Byun, Hyeran et al. | 2023
- 2384
-
ASPnet: Action Segmentation with Shared-Private Representation of Multiple Data Sourcesvan Amsterdam, Beatrice / Kadkhodamohammadi, Abdolrahim / Luengo, Imanol / Stoyanov, Danail et al. | 2023
- 2394
-
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action LocalizationRen, Huan / Yang, Wenfei / Zhang, Tianzhu / Zhang, Yongdong et al. | 2023
- 2405
-
LOGO: A Long-Form Video Dataset for Group Action Quality AssessmentZhang, Shiyi / Dai, Wenxun / Wang, Sujia / Shen, Xiangwei / Lu, Jiwen / Zhou, Jie / Tang, Yansong et al. | 2023
- 2415
-
Use Your Head: Improving Long-Tail Video RecognitionPerrett, Toby / Sinha, Saptarshi / Burghardt, Tilo / Mirmehdi, Majid / Damen, Dima et al. | 2023
- 2426
-
Conditional Generation of Audio from Video via Foley AnalogiesDu, Yuexi / Chen, Ziyang / Salamon, Justin / Russell, Bryan / Owens, Andrew et al. | 2023
- 2437
-
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential VideosDong, Sixun / Hu, Huazhang / Lian, Dongze / Luo, Weixin / Qian, Yicheng / Gao, Shenghua et al. | 2023
- 2448
-
You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed VideosFang, Xiang / Liu, Daizong / Zhou, Pan / Nan, Guoshun et al. | 2023
- 2461
-
Connecting Vision and Language with Video Localized NarrativesVoigtlaender, Paul / Changpinyo, Soravit / Pont-Tuset, Jordi / Soricut, Radu / Ferrari, Vittorio et al. | 2023
- 2472
-
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation LearningJin, Peng / Huang, Jinfa / Xiong, Pengfei / Tian, Shangxuan / Liu, Chang / Ji, Xiangyang / Yuan, Li / Chen, Jie et al. | 2023
- 2483
-
Aligning Step-by-Step Instructional Diagrams to Video DemonstrationsZhang, Jiahao / Cherian, Anoop / Liu, Yanbin / Ben-Shabat, Yizhak / Rodriguez, Cristian / Gould, Stephen et al. | 2023
- 2493
-
Make-A-Story: Visual Memory Conditioned Consistent Story GenerationRahman, Tanzila / Lee, Hsin-Ying / Ren, Jian / Tulyakov, Sergey / Mahajan, Shweta / Sigal, Leonid et al. | 2023
- 2503
-
Test of Time: Instilling Video-Language Models with a Sense of TimeBagad, Piyush / Tapaswi, Makarand / Snoek, Cees G.M. et al. | 2023
- 2517
-
How You Feelin’? Learning Emotions and Mental States in Movie ScenesSrivastava, Dhruv / Singh, Aditya Kumar / Tapaswi, Makarand et al. | 2023
- 2529
-
Continuous Sign Language Recognition with Correlation NetworkHu, Lianyu / Gao, Liqing / Liu, Zekang / Feng, Wei et al. | 2023
- 2540
-
DIP: Dual Incongruity Perceiving Network for Sarcasm DetectionWen, Changsong / Jia, Guoli / Yang, Jufeng et al. | 2023
- 2551
-
Gloss Attention for Gloss-free Sign Language TranslationYin, Aoxiong / Zhong, Tianyun / Tang, Li / Jin, Weike / Jin, Tao / Zhao, Zhou et al. | 2023
- 2563
-
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation StatesDu, Heming / Li, Lincheng / Huang, Zi / Yu, Xin et al. | 2023
- 2574
-
Behavioral Analysis of Vision-and-Language Navigation AgentsYang, Zijiao / Majumdar, Arjun / Lee, Stefan et al. | 2023
- 2583
-
KERM: Knowledge Enhanced Reasoning for Vision-and-Language NavigationLi, Xiangyang / Wang, Zihan / Yang, Jiahao / Wang, Yaowei / Jiang, Shuqiang et al. | 2023
- 2593
-
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query LocalizationXu, Mengmeng / Li, Yanghao / Fu, Cheng-Yang / Ghanem, Bernard / Xiang, Tao / Perez-Rua, Juan-Manuel et al. | 2023
- 2604
-
Efficient Multimodal Fusion via Interactive PromptingLi, Yaowei / Quan, Ruijie / Zhu, Linchao / Yang, Yi et al. | 2023
- 2614
-
NS3D: Neuro-Symbolic Grounding of 3D Objects and RelationsHsu, Joy / Mao, Jiayuan / Wu, Jiajun et al. | 2023
- 2624
-
Dynamic Inference with Grounding Based Vision and Language ModelsUzkent, Burak / Garg, Amanmeet / Zhu, Wentao / Doshi, Keval / Yi, Jingru / Wang, Xiaolong / Omar, Mohamed et al. | 2023
- 2634
-
Improving Commonsense in Vision-Language Models via Knowledge Graph RiddlesYe, Shuquan / Xie, Yujia / Chen, Dongdong / Xu, Yichong / Yuan, Lu / Zhu, Chenguang / Liao, Jing et al. | 2023
- 2646
-
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical LearningSuo, Wei / Sun, Mengyang / Liu, Weisong / Gao, Yiqi / Wang, Peng / Zhang, Yanning / Wu, Qi et al. | 2023
- 2657
-
Teaching Structured Vision & Language Concepts to Vision & Language ModelsDoveh, Sivan / Arbelle, Assaf / Harary, Sivan / Schwartz, Eli / Herzig, Roei / Giryes, Raja / Feris, Rogerio / Panda, Rameswar / Ullman, Shimon / Karlinsky, Leonid et al. | 2023
- 2669
-
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion TasksHan, Xiao / Zhu, Xiatian / Yu, Licheng / Zhang, Li / Song, Yi-Zhe / Xiang, Tao et al. | 2023
- 2691
-
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language TasksLi, Hao / Zhu, Jinguo / Jiang, Xiaohu / Zhu, Xizhou / Li, Hongsheng / Yuan, Chun / Wang, Xiaohua / Qiao, Yu / Wang, Xiaogang / Wang, Wenhai et al. | 2023
- 2701
-
Learning from Unique Perspectives: User-aware Saliency ModelingChen, Shi / Valliappan, Nachiappan / Shen, Shaolei / Ye, Xinyu / Kohlhoff, Kai / He, Junfeng et al. | 2023
- 2711
-
CRAFT: Concept Recursive Activation FacTorization for ExplainabilityFel, Thomas / Picard, Agustin / Bethune, Louis / Boissin, Thibaut / Vigouroux, David / Colin, Julien / Cadene, Remi / Serre, Thomas et al. | 2023
- 2722
-
Doubly Right Object Recognition: A Why Prompt for Visual RationalesMao, Chengzhi / Teotia, Revant / Sundar, Amrutha / Menon, Sachit / Yang, Junfeng / Wang, Xin / Vondrick, Carl et al. | 2023
- 2733
-
Sketch2Saliency: Learning to Detect Salient Objects from Human DrawingsBhunia, Ayan Kumar / Koley, Subhadeep / Kumar, Amandeep / Sain, Aneeshan / Chowdhury, Pinaki Nath / Xiang, Tao / Song, Yi-Zhe et al. | 2023
- 2744
-
PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image ClassificationNauta, Meike / Schlotterer, Jorg / van Keulen, Maurice / Seifert, Christin et al. | 2023
- 2765
-
CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or NotSain, Aneeshan / Bhunia, Ayan Kumar / Chowdhury, Pinaki Nath / Koley, Subhadeep / Xiang, Tao / Song, Yi-Zhe et al. | 2023
- 2776
-
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual RecognitionWei, Yixuan / Cao, Yue / Zhang, Zheng / Peng, Houwen / Yao, Zhuliang / Xie, Zhenda / Hu, Han / Guo, Baining et al. | 2023
- 2787
-
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalJiang, Ding / Ye, Mang et al. | 2023
- 2798
-
Multi-Modal Representation Learning with Text-Driven Soft MasksPark, Jaeyoo / Han, Bohyung et al. | 2023
- 2808
-
Texts as Images in Prompt Tuning for Multi-Label Image RecognitionGuo, Zixian / Dong, Bowen / Ji, Zhilong / Bai, Jinfeng / Guo, Yiwen / Zuo, Wangmeng et al. | 2023
- 2818
-
Reproducible Scaling Laws for Contrastive Language-Image LearningCherti, Mehdi / Beaumont, Romain / Wightman, Ross / Wortsman, Mitchell / Ilharco, Gabriel / Gordon, Cade / Schuhmann, Christoph / Schmidt, Ludwig / Jitsev, Jenia et al. | 2023
- 2830
-
Multilateral Semantic Relations Modeling for Image Text RetrievalWang, Zheng / Gao, Zhenwei / Guo, Kangshuai / Yang, Yang / Wang, Xiaoming / Shen, Heng Tao et al. | 2023
- 2840
-
Smallcap: Lightweight Image Captioning Prompted with Retrieval AugmentationRamos, Rita / Martins, Bruno / Elliott, Desmond / Kementchedjhieva, Yova et al. | 2023
- 2850
-
Probing Sentiment-Oriented PreTraining Inspired by Human Sentiment Perception MechanismFeng, Tinglei / Liu, Jiaxuan / Yang, Jufeng et al. | 2023
- 2861
-
Prefix Conditioning Unifies Language and Label SupervisionSaito, Kuniaki / Sohn, Kihyuk / Zhang, Xiang / Li, Chun-Liang / Lee, Chen-Yu / Saenko, Kate / Pfister, Tomas et al. | 2023
- 2871
-
Crossing the Gap: Domain Generalization for Image CaptioningRen, Yuchen / Mao, Zhendong / Fang, Shancheng / Lu, Yan / He, Tong / Du, Hao / Zhang, Yongdong / Ouyang, Wanli et al. | 2023
- 2881
-
A Bag-of-Prototypes Representation for Dataset-Level ApplicationsTu, Weijie / Deng, Weijian / Gedeon, Tom / Zheng, Liang et al. | 2023
- 2893
-
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language ModelLiang, Dingkang / Xie, Jiahao / Zou, Zhikang / Ye, Xiaoqing / Xu, Wei / Bai, Xiang et al. | 2023
- 2904
-
D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based TransformersHe, Jianfeng / Gao, Yuan / Zhang, Tianzhu / Zhang, Zhe / Wu, Feng et al. | 2023
- 2915
-
Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic SpaceZhang, Yong / Pan, Yingwei / Yao, Ting / Huang, Rui / Mei, Tao / Chen, Chang-Wen et al. | 2023
- 2925
-
Relational Context Learning for Human-Object Interaction DetectionKim, Sanghyun / Jung, Deunsol / Cho, Minsu et al. | 2023
- 2935
-
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language SupervisionXu, Jilan / Hou, Junlin / Zhang, Yuejie / Feng, Rui / Wang, Yi / Qiao, Yu / Xie, Weidi et al. | 2023
- 2945
-
Side Adapter Network for Open-Vocabulary Semantic SegmentationXu, Mengde / Zhang, Zheng / Wei, Fangyun / Hu, Han / Bai, Xiang et al. | 2023
- 2955
-
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion ModelsXu, Jiarui / Liu, Sifei / Vahdat, Arash / Byeon, Wonmin / Wang, Xiaolong / De Mello, Shalini et al. | 2023
- 2967
-
IFSeg: Image-free Semantic Segmentation via Vision-Language ModelYun, Sukmin / Park, Seong Hyeon / Seo, Paul Hongsuck / Shin, Jinwoo et al. | 2023
- 2978
-
PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud ObservationsGeng, Haoran / Li, Ziming / Geng, Yiran / Chen, Jiayi / Dong, Hao / Wang, He et al. | 2023
- 2989
-
OneFormer: One Transformer to Rule Universal Image SegmentationJain, Jitesh / Li, Jiachen / Chiu, MangTik / Hassani, Ali / Orlov, Nikita / Shi, Humphrey et al. | 2023
- 2999
-
Delving into Shape-aware Zero-shot Semantic SegmentationLiu, Xinyu / Tian, Beiwen / Wang, Zhen / Wang, Rui / Sheng, Kehua / Zhang, Bo / Zhao, Hao / Zhou, Guyue et al. | 2023
- 3010
-
CoMFormer: Continual Learning in Semantic and Panoptic SegmentationCermelli, Fabio / Cord, Matthieu / Douillard, Arthur et al. | 2023
- 3021
-
Learning to Segment Every Referring Object Point by PointQu, Mengxue / Wu, Yu / Wei, Yunchao / Liu, Wu / Liang, Xiaodan / Zhao, Yao et al. | 2023
- 3031
-
Unsupervised Continual Semantic Adaptation Through Neural RenderingLiu, Zhizheng / Milano, Francesco / Frey, Jonas / Siegwart, Roland / Blum, Hermann / Cadena, Cesar et al. | 2023
- 3041
-
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationLi, Feng / Zhang, Hao / Xu, Huaizhe / Liu, Shilong / Zhang, Lei / Ni, Lionel M. / Shum, Heung-Yeung et al. | 2023
- 3051
-
Transformer Scale Gate for Semantic SegmentationShi, Hengcan / Hayat, Munawar / Cai, Jianfei et al. | 2023
- 3061
-
Style Projected Clustering for Domain Generalized Semantic SegmentationHuang, Wei / Chen, Chang / Li, Yong / Li, Jiacheng / Li, Cheng / Song, Fenglong / Yan, Youliang / Xiong, Zhiwei et al. | 2023
- 3072
-
Rethinking Few-Shot Medical Segmentation: A Vector Quantization ViewHuang, Shiqi / Xu, Tingfa / Shen, Ning / Mu, Feng / Li, Jianan et al. | 2023
- 3082
-
Continual Semantic Segmentation with Automatic Memory Sample SelectionZhu, Lanyun / Chen, Tianrun / Yin, Jianxiong / See, Simon / Liu, Jun et al. | 2023
- 3093
-
Token Contrast for Weakly-Supervised Semantic SegmentationRu, Lixiang / Zheng, Heliang / Zhan, Yibing / Du, Bo et al. | 2023
- 3103
-
Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation GraphZhou, Rixin / Wei, Jiafu / Zhang, Qian / Qi, Ruihua / Yang, Xi / Li, Chuntao et al. | 2023
- 3114
-
Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic SegmentationWang, Xiaoyang / Zhang, Bingfeng / Yu, Limin / Xiao, Jimin et al. | 2023
- 3124
-
Cut and Learn for Unsupervised Object Detection and Instance SegmentationWang, Xudong / Girdhar, Rohit / Yu, Stella X. / Misra, Ishan et al. | 2023
- 3135
-
Extracting Class Activation Maps from Non-Discriminative Features as wellChen, Zhaozheng / Sun, Qianru et al. | 2023
- 3145
-
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance SegmentationCheng, Tianheng / Wang, Xinggang / Chen, Shaoyu / Zhang, Qian / Liu, Wenyu et al. | 2023
- 3155
-
Hierarchical Fine-Grained Image Forgery Detection and LocalizationGuo, Xiao / Liu, Xiaohong / Ren, Zhiyuan / Grosz, Steven / Masi, Iacopo / Liu, Xiaoming et al. | 2023
- 3166
-
Towards Professional Level Crowd Annotation of Expert Domain DataWang, Pei / Vasconcelos, Nuno et al. | 2023
- 3176
-
Unsupervised Object Localization: Observing the Background to Discover ObjectsSimeoni, Oriane / Sekkat, Chloe / Puy, Gilles / Vobecky, Antonin / Zablocki, Eloi / Perez, Patrick et al. | 2023
- 3187
-
Semi-supervised learning made simple with self-supervised clusteringFini, Enrico / Astolfi, Pietro / Alahari, Karteek / Alameda-Pineda, Xavier / Mairal, Julien / Nabi, Moin / Ricci, Elisa et al. | 2023
- 3198
-
Unbalanced Optimal Transport: A Unified Framework for Object DetectionDe Plaen, Henri / De Plaen, Pierre-Francois / Suykens, Johan A. K. / Proesmans, Marc / Tuytelaars, Tinne / Van Gool, Luc et al. | 2023
- 3208
-
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object DetectionMa, Jiawei / Niu, Yulei / Xu, Jincheng / Huang, Shiyuan / Han, Guangxing / Chang, Shih-Fu et al. | 2023
- 3219
-
CLIP the Gap: A Single Domain Generalization Approach for Object DetectionVidit, Vidit / Engilberge, Martin / Salzmann, Mathieu et al. | 2023
- 3230
-
Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown ObjectsLiang, Wenteng / Xue, Feng / Liu, Yihao / Zhong, Guofeng / Ming, Anlong et al. | 2023
- 3240
-
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object DetectionWang, Xinjiang / Yang, Xingyi / Zhang, Shilong / Li, Yijiang / Feng, Litong / Fang, Shijie / Lyu, Chengqi / Chen, Kai / Zhang, Wayne et al. | 2023
- 3250
-
Optimal Proposal Learning for Deployable End-to-End Pedestrian DetectionSong, Xiaolin / Chen, Binghui / Li, Pengyu / He, Jun-Yan / Wang, Biao / Geng, Yifeng / Xie, Xuansong / Zhang, Honggang et al. | 2023
- 3261
-
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object DetectionGao, Yipeng / Lin, Kun-Yu / Yan, Junkai / Wang, Yaowei / Zheng, Wei-Shi et al. | 2023
- 3272
-
Where is My Spot? Few-shot Image Generation via Latent Subspace OptimizationZheng, Chenxi / Liu, Bangzhen / Zhang, Huaidong / Xu, Xuemiao / He, Shengfeng et al. | 2023
- 3282
-
Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution DetectionLu, Fan / Zhu, Kai / Zhai, Wei / Zheng, Kecheng / Cao, Yang et al. | 2023
- 3292
-
MAESTER: Masked Autoencoder Guided Segmentation at Pixel Resolution for Accurate, Self-Supervised Subcellular Structure RecognitionXie, Ronald / Pang, Kuan / Bader, Gary D. / Wang, Bo et al. | 2023
- 3302
-
Orthogonal Annotation Benefits Barely-supervised Medical Image SegmentationCai, Heng / Li, Shumeng / Qi, Lei / Yu, Qian / Shi, Yinghuan / Gao, Yang et al. | 2023
- 3312
-
RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure PredictionZhou, Donghao / Gu, Chunbin / Xu, Junde / Liu, Furui / Wang, Qiong / Chen, Guangyong / Heng, Pheng-Ann et al. | 2023
- 3323
-
Topology-Guided Multi-Class Cell Context Generation for Digital PathologyAbousamra, Shahira / Gupta, Rajarsi / Kurc, Tahsin / Samaras, Dimitris / Saltz, Joel / Chen, Chao et al. | 2023
- 3334
-
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report GenerationLi, Mingjie / Lin, Bingqian / Chen, Zicong / Lin, Haokun / Liang, Xiaodan / Chang, Xiaojun et al. | 2023
- 3344
-
Benchmarking Self-Supervised Learning on Diverse Pathology DatasetsKang, Mingu / Song, Heon / Park, Seonwook / Yoo, Donggeun / Pereira, Sergio et al. | 2023
- 3355
-
Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive LearningLiu, Kangning / Zhu, Weicheng / Shen, Yiqiu / Liu, Sheng / Razavian, Narges / Geras, Krzysztof J. / Fernandez-Granda, Carlos et al. | 2023
- 3366
-
Learning Expressive Prompting With Residuals for Vision TransformersDas, Rajshekhar / Dukler, Yonatan / Ravichandran, Avinash / Swaminathan, Ashwin et al. | 2023
- 3378
-
Detection of Out-of-Distribution Samples Using Binary Neuron Activation PatternsOlber, Bartlomiej / Radlak, Krystian / Popowicz, Adam / Szczepankiewicz, Michal / Chachula, Krystian et al. | 2023
- 3388
-
Decoupling MaxLogit for Out-of-Distribution DetectionZhang, Zihan / Xiang, Xiang et al. | 2023
- 3398
-
Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete LabelsDing, Zixuan / Wang, Ao / Chen, Hui / Zhang, Qiang / Liu, Pengzhang / Bao, Yongjun / Yan, Weipeng / Han, Jungong et al. | 2023
- 3408
-
Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label ClassificationKim, Youngwook / Kim, Jae Myung / Jeong, Jieun / Schmid, Cordelia / Akata, Zeynep / Lee, Jungwoo et al. | 2023
- 3418
-
DivClust: Controlling Diversity in Deep ClusteringMetaxas, Ioannis Maniadis / Tzimiropoulos, Georgios / Patras, Ioannis et al. | 2023
- 3429
-
Deep Semi-Supervised Metric Learning with Mixed Label PropagationZhuang, Furen / Moulin, Pierre et al. | 2023
- 3439
-
Leveraging Inter-Rater Agreement for Classification in the Presence of Noisy LabelsBucarelli, Maria Sofia / Cassano, Lucas / Siciliano, Federico / Mantrach, Amin / Silvestri, Fabrizio et al. | 2023
- 3449
-
Modeling Inter-Class and Intra-Class Constraints in Novel Class DiscoveryLi, Wenbin / Fan, Zhichen / Huo, Jing / Gao, Yang et al. | 2023
- 3459
-
Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class DiscoveryYang, Muli / Wang, Liancheng / Deng, Cheng / Zhang, Hanwang et al. | 2023
- 3469
-
Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency is All You NeedWei, Tong / Gan, Kai et al. | 2023
- 3479
-
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category DiscoveryZhang, Sheng / Khan, Salman / Shen, Zhiqiang / Naseer, Muzammal / Chen, Guangyi / Khan, Fahad Shahbaz et al. | 2023
- 3489
-
Probabilistic Knowledge Distillation of Face EnsemblesXu, Jianqing / Li, Shen / Deng, Ailin / Xiong, Miao / Wu, Jiaying / Wu, Jiaxiang / Ding, Shouhong / Hooi, Bryan et al. | 2023
- 3499
-
Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed RecognitionZhou, Zhipeng / Li, Lanqing / Zhao, Peilin / Heng, Pheng-Ann / Gong, Wei et al. | 2023
- 3510
-
Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain GeneralizationLiu, Yuchen / Wang, Yaoming / Chen, Yabo / Dai, Wenrui / Li, Chenglin / Zou, Junni / Xiong, Hongkai et al. | 2023
- 3520
-
Instance Relation Graph Guided Source-Free Domain Adaptive Object DetectionVS, Vibashan / Oza, Poojan / Patel, Vishal M. et al. | 2023
- 3531
-
MOT: Masked Optimal Transport for Partial Domain AdaptationLuo, You-Wei / Ren, Chuan-Xian et al. | 2023
- 3541
-
TOPLight: Lightweight Neural Networks with Task-Oriented Pretraining for Visible-Infrared RecognitionYu, Hao / Cheng, Xu / Peng, Wei et al. | 2023
- 3551
-
OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain AdaptationLiu, Ye / Qiao, Lingfeng / Lu, Changchong / Yin, Di / Lin, Chen / Peng, Haoyuan / Ren, Bo et al. | 2023
- 3561
-
Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game PerspectiveZhu, Jinjing / Bai, Haotian / Wang, Lin et al. | 2023
- 3572
-
ARO-Net: Learning Implicit Fields from Anchored Radial ObservationsWang, Yizhi / Huang, Zeyu / Shamir, Ariel / Huang, Hui / Zhang, Hao / Hu, Ruizhen et al. | 2023
- 3582
-
A Probabilistic Framework for Lifelong Test-Time AdaptationBrahma, Dhanajit / Rai, Piyush et al. | 2023
- 3592
-
Distribution Shift Inversion for Out-of-Distribution PredictionYu, Runpeng / Liu, Songhua / Yang, Xingyi / Wang, Xinchao et al. | 2023
- 3603
-
Learning Joint Latent Space EBM Prior Model for Multi-layer GeneratorCui, Jiali / Wu, Ying Nian / Han, Tian et al. | 2023
- 3613
-
A Data-Based Perspective on Transfer LearningJain, Saachi / Salman, Hadi / Khaddaj, Alaa / Wong, Eric / Park, Sung Min / Madry, Aleksander et al. | 2023
- 3623
-
A Meta-Learning Approach to Predicting Performance and Data RequirementsJain, Achin / Swaminathan, Gurumurthy / Favaro, Paolo / Yang, Hao / Ravichandran, Avinash / Harutyunyan, Hrayr / Achille, Alessandro / Dabeer, Onkar / Schiele, Bernt / Swaminathan, Ashwin et al. | 2023
- 3633
-
Guided Recommendation for Model Fine-TuningLi, Hao / Fowlkes, Charless / Yang, Hao / Dabeer, Onkar / Tu, Zhuowen / Soatto, Stefano et al. | 2023
- 3643
-
EMT-NAS: Transferring architectural knowledge between tasks from different datasetsLiao, Peng / Jin, Yaochu / Du, Wenli et al. | 2023
- 3654
-
AttriCLIP: A Non-Incremental Learner for Incremental Knowledge LearningWang, Runqi / Duan, Xiaoyue / Kang, Guoliang / Liu, Jianzhuang / Lin, Shaohui / Xu, Songcen / Lv, Jinhu / Zhang, Baochang et al. | 2023
- 3664
-
Batch Model Consolidation: A Multi-Task Model Consolidation FrameworkFostiropoulos, Iordanis / Zhu, Jiaye / Itti, Laurent et al. | 2023
- 3677
-
SmartAssign: Learning A Smart Knowledge Assignment Strategy for Deraining and DesnowingWang, Yinglong / Ma, Chao / Liu, Jianzhuang et al. | 2023
- 3687
-
TinyMIM: An Empirical Study of Distilling MIM Pre-trained ModelsRen, Sucheng / Wei, Fangyun / Zhang, Zheng / Hu, Han et al. | 2023
- 3698
-
Computationally Budgeted Continual Learning: What Does Matter?Prabhu, Ameya / Al Kader Hammoud, Hasan Abed / Dokania, Puneet / Torr, Philip H.S. / Lim, Ser-Nam / Ghanem, Bernard / Bibi, Adel et al. | 2023
- 3708
-
GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic ForgettingLuo, Kangyang / Li, Xiang / Lan, Yunshi / Gao, Ming et al. | 2023
- 3718
-
Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space DecouplingZhao, Zhen / Zhang, Zhizhong / Tan, Xin / Liu, Jun / Qu, Yanyun / Xie, Yuan / Ma, Lizhuang et al. | 2023
- 3728
-
Neuro-Modulated Hebbian Learning for Fully Test-Time AdaptationTang, Yushun / Zhang, Ce / Xu, Heng / Chen, Shuoshuo / Cheng, Jie / Leng, Luziwei / Guo, Qinghai / He, Zhihai et al. | 2023
- 3739
-
Generalizing Dataset Distillation via Deep Generative PriorCazenavette, George / Wang, Tongzhou / Torralba, Antonio / Efros, Alexei A. / Zhu, Jun-Yan et al. | 2023
- 3749
-
Minimizing the Accumulated Trajectory Error to Improve Dataset DistillationDu, Jiawei / Jiang, Yidi / Tan, Vincent Y.F. / Zhou, Joey Tianyi / Li, Haizhou et al. | 2023
- 3759
-
Slimmable Dataset CondensationLiu, Songhua / Ye, Jingwen / Yu, Runpeng / Wang, Xinchao et al. | 2023
- 3769
-
Sharpness-Aware Gradient Matching for Domain GeneralizationWang, Pengfei / Zhang, Zhaoxiang / Lei, Zhen / Zhang, Lei et al. | 2023
- 3779
-
Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network TopologiesChoi, Wonhyeok / Im, Sunghoon et al. | 2023
- 3789
-
SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision BoundariesHumayun, Ahmed Imtiaz / Balestriero, Randall / Balakrishnan, Guha / Baraniuk, Richard et al. | 2023
- 3799
-
VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue DistributionKim, Jaeill / Kang, Suhyun / Hwang, Duhun / Shin, Jungwook / Rhee, Wonjong et al. | 2023
- 3811
-
Efficient On-Device Training via Gradient FilteringYang, Yuedong / Li, Guihong / Marculescu, Radu et al. | 2023
- 3821
-
Are Data-Driven Explanations Robust Against Out-of-Distribution Data?Li, Tang / Qiao, Fengchun / Ma, Mengmeng / Peng, Xi et al. | 2023
- 3832
-
BiasAdv: Bias-Adversarial Augmentation for Model DebiasingLim, Jongin / Kim, Youngdong / Kim, Byungjai / Ahn, Chanho / Shin, Jinwoo / Yang, Eunho / Han, Seungju et al. | 2023
- 3842
-
Q-DETR: An Efficient Low-Bit Quantized Detection TransformerXu, Sheng / Li, Yanjing / Lin, Mingbao / Gao, Peng / Guo, Guodong / Lu, Jinhu / Zhang, Baochang et al. | 2023
- 3852
-
NIPQ: Noise proxy-based Integrated Pseudo-QuantizationShin, Juncheol / So, Junhyuk / Park, Sein / Kang, Seungyeop / Yoo, Sungjoo / Park, Eunhyeok et al. | 2023
- 3862
-
CUDA: Convolution-Based Unlearnable DatasetsSadasivan, Vinu Sankar / Soltanolkotabi, Mahdi / Feizi, Soheil et al. | 2023
- 3872
-
KD-DLGAN: Data Limited Image Generation via Knowledge DistillationCui, Kaiwen / Yu, Yingchen / Zhan, Fangneng / Liao, Shengcai / Lu, Shijian / Xing, Eric et al. | 2023
- 3883
-
Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN TrainingAsokan, Siddarth / Seelamantula, Chandra Sekhar et al. | 2023
- 3894
-
Efficient Verification of Neural Networks Against LVM-Based SpecificationsHanspal, Harleen / Lomuscio, Alessio et al. | 2023
- 3904
-
Bi-directional Feature Fusion Generative Adversarial Network for Ultra-high Resolution Pathological Image Virtual Re-stainingSun, Kexin / Chen, Zhineng / Wang, Gongwei / Liu, Jun / Ye, Xiongjun / Jiang, Yu-Gang et al. | 2023
- 3914
-
DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly DetectionZhang, Xuan / Li, Shiyu / Li, Xi / Huang, Ping / Shan, Jiulong / Chen, Ting et al. | 2023
- 3924
-
OmniAL: A Unified CNN Framework for Unsupervised Anomaly LocalizationZhao, Ying et al. | 2023
- 3934
-
Federated Incremental Semantic SegmentationDong, Jiahua / Zhang, Duzhen / Cong, Yang / Cong, Wei / Ding, Henghui / Dai, Dengxin et al. | 2023
- 3944
-
Re-Thinking Federated Active Learning Based on Inter-Class DiversityKim, SangMook / Bae, Sangmin / Song, Hwanjun / Yun, Se-Young et al. | 2023
- 3954
-
Federated Domain Generalization with Generalization AdjustmentZhang, Ruipeng / Xu, Qinwei / Yao, Jiangchao / Zhang, Ya / Tian, Qi / Wang, Yanfeng et al. | 2023
- 3964
-
On the Effectiveness of Partial Variance Reduction in Federated Learning with Heterogeneous DataLi, Bo / Schmidt, Mikkel N. / Alstrom, Tommy S. / Stich, Sebastian U. et al. | 2023
- 3974
-
The Resource Problem of Using Linear Layer Leakage Attack in Federated LearningZhao, Joshua C. / Elkordy, Ahmed Roushdy / Sharma, Atul / Ezzeldin, Yahya H. / Avestimehr, Salman / Bagchi, Saurabh et al. | 2023
- 3984
-
Unlearnable Clusters: Towards Label-Agnostic Unlearnable ExamplesZhang, Jiaming / Ma, Xingjun / Yi, Qi / Sang, Jitao / Jiang, Yu-Gang / Wang, Yaowei / Xu, Changsheng et al. | 2023
- 3994
-
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection GeneralizationDong, Shichao / Wang, Jin / Ji, Renhe / Liang, Jiajun / Fan, Haoqiang / Ge, Zheng et al. | 2023
- 4005
-
Backdoor Defense via Adaptively Splitting Poisoned DatasetGao, Kuofeng / Bai, Yang / Gu, Jindong / Yang, Yong / Xia, Shu-Tao et al. | 2023
- 4015
-
How to Backdoor Diffusion Models?Chou, Sheng-Yen / Chen, Pin-Yu / Ho, Tsung-Yi et al. | 2023
- 4025
-
TrojViT: Trojan Insertion in Vision TransformersZheng, Mengxin / Lou, Qian / Jiang, Lei et al. | 2023
- 4035
-
TrojDiff: Trojan Attacks on Diffusion Models with Diverse TargetsChen, Weixin / Song, Dawn / Li, Bo et al. | 2023
- 4045
-
Ensemble-based Blackbox Attacks on Dense PredictionCai, Zikui / Tan, Yaoteng / Asif, M. Salman et al. | 2023
- 4056
-
Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based AttacksYu, Yunrui / Xu, Cheng-Zhong et al. | 2023
- 4067
-
The Best Defense is a Good Offense: Adversarial Augmentation Against Adversarial AttacksFrosio, Iuri / Kautz, Jan et al. | 2023
- 4077
-
Adversarial Robustness via Random Projection FiltersDong, Minjing / Xu, Chang et al. | 2023
- 4087
-
Jedi: Entropy-Based Localization and Removal of Adversarial PatchesTarchoun, Bilel / Khalifa, Anouar Ben / Mahjoub, Mohamed Ali / Abu-Ghazaleh, Nael / Alouani, Ihsen et al. | 2023
- 4096
-
Exploring the Relationship Between Architectural Design and Adversarially Robust GeneralizationLiu, Aishan / Tang, Shiyu / Liang, Siyuan / Gong, Ruihao / Wu, Boxi / Liu, Xianglong / Tao, Dacheng et al. | 2023
- 4108
-
Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch CorruptionsGuo, Yong / Stutz, David / Schiele, Bernt et al. | 2023
- 4119
-
Towards Effective Adversarial Textured 3D Meshes on Physical Face RecognitionYang, Xiao / Liu, Chang / Xu, Longlong / Wang, Yikai / Dong, Yinpeng / Chen, Ning / Su, Hang / Zhu, Jun et al. | 2023
- 4129
-
AltFreezing for More General Video Face Forgery DetectionWang, Zhendong / Bao, Jianmin / Zhou, Wengang / Wang, Weilun / Li, Houqiang et al. | 2023
- 4139
-
Passive Micron-Scale Time-of-Flight with Sunlight InterferometryKotwal, Alankar / Levin, Anat / Gkioulekas, Ioannis et al. | 2023
- 4150
-
F2-NeRF: Fast Neural Radiance Field Training with Free Camera TrajectoriesWang, Peng / Liu, Yuan / Chen, Zhaoxi / Liu, Lingjie / Liu, Ziwei / Komura, Taku / Theobalt, Christian / Wang, Wenping et al. | 2023
- 4160
-
NoPe-NeRF: Optimising Neural Radiance Field with No Pose PriorBian, Wenjing / Wang, Zirui / Li, Kejie / Bian, Jia-Wang et al. | 2023
- 4170
-
BAD-NeRF: Bundle Adjusted Deblur Neural Radiance FieldsWang, Peng / Zhao, Lingzhe / Ma, Ruijie / Liu, Peidong et al. | 2023
- 4180
-
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion ModelsWynn, Jamie / Turmukhambetov, Daniyar et al. | 2023
- 4190
-
SPARF: Neural Radiance Fields from Sparse and Noisy PosesTruong, Prune / Rakotosaona, Marie-Julie / Manhardt, Fabian / Tombari, Federico et al. | 2023
- 4201
-
Interactive Segmentation of Radiance FieldsGoel, Rahul / Sirikonda, Dhawal / Saini, Saurabh / Narayanan, P J et al. | 2023
- 4212
-
Temporal Interpolation is all You Need for Dynamic Neural Radiance FieldsPark, Sungheon / Son, Minjung / Jang, Seokhwan / Ahn, Young Chun / Kim, Ji-Yeon / Kang, Nahyup et al. | 2023
- 4222
-
Compressing Volumetric Radiance Fields to 1 MBLi, Lingzhi / Shen, Zhen / Wang, Zhongshu / Shen, Li / Bo, Liefeng et al. | 2023
- 4232
-
Multiscale Tensor Decomposition and Rendering Equation Encoding for View SynthesisHan, Kang / Xiang, Wei et al. | 2023
- 4242
-
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene StylizationZhang, Yuechen / He, Zexin / Xing, Jinbo / Yao, Xufeng / Jia, Jiaya et al. | 2023
- 4252
-
Representing Volumetric Videos as Dynamic MLP MapsPeng, Sida / Yan, Yunzhi / Shuai, Qing / Bao, Hujun / Zhou, Xiaowei et al. | 2023
- 4263
-
Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense GridsDong, Wei / Choy, Chris / Loop, Charles / Litany, Or / Zhu, Yuke / Anandkumar, Anima et al. | 2023
- 4273
-
DynIBaR: Neural Dynamic Image-Based RenderingLi, Zhengqi / Wang, Qianqian / Cole, Forrester / Tucker, Richard / Snavely, Noah et al. | 2023
- 4285
-
Plateau-Reduced Differentiable Path TracingFischer, Michael / Ritschel, Tobias et al. | 2023
- 4295
-
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect IlluminationWu, Haoqian / Hu, Zhipeng / Li, Lincheng / Zhang, Yongqiang / Fan, Changjie / Yu, Xin et al. | 2023
- 4305
-
WildLight: In-the-wild Inverse Rendering with a FlashlightCheng, Ziang / Li, Junxuan / Li, Hongdong et al. | 2023
- 4315
-
Relightable Neural Human Assets from Multi-view Gradient IlluminationsZhou, Taotao / He, Kai / Wu, Di / Xu, Teng / Zhang, Qixuan / Shao, Kuixiang / Chen, Wenzheng / Xu, Lan / Yu, Jingyi et al. | 2023
- 4328
-
DiffRF: Rendering-Guided 3D Radiance Field DiffusionMuller, Norman / Siddiqui, Yawar / Porzi, Lorenzo / Bulo, Samuel Rota / Kontschieder, Peter / Nießner, Matthias et al. | 2023
- 4339
-
Analyzing Physical Impacts Using Transient Surface Wave ImagingZhang, Tianyuan / Sheinin, Mark / Chan, Dorian / Rau, Mark / O'Toole, Matthew / Narasimhan, Srinivasa G. et al. | 2023
- 4349
-
Neural Kaleidoscopic Space SculptingAhn, Byeongjoo / De Zeeuw, Michael / Gkioulekas, Ioannis / Sankaranarayanan, Aswin C. et al. | 2023
- 4359
-
Towards Unbiased Volume Rendering of Neural Implicit Surfaces with Geometry PriorsZhang, Yongqiang / Hu, Zhipeng / Wu, Haoqian / Zhao, Minda / Li, Lincheng / Zou, Zhengxia / Fan, Changjie et al. | 2023
- 4369
-
Neural Kernel Surface ReconstructionHuang, Jiahui / Gojcic, Zan / Atzmon, Matan / Litany, Or / Fidler, Sanja / Williams, Francis et al. | 2023
- 4380
-
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled ConsistencyXu, Mingye / Xu, Mutian / He, Tong / Ouyang, Wanli / Wang, Yali / Han, Xiaoguang / Qiao, Yu et al. | 2023
- 4391
-
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field InversionPavllo, Dario / Tan, David Joseph / Rakotosaona, Marie-Julie / Tombari, Federico et al. | 2023
- 4402
-
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene SynthesisXu, Yinghao / Chai, Menglei / Shi, Zifan / Peng, Sida / Skorokhodov, Ivan / Siarohin, Aliaksandr / Yang, Ceyuan / Shen, Yujun / Lee, Hsin-Ying / Zhou, Bolei et al. | 2023
- 4413
-
Heat Diffusion Based Multi-Scale and Geometric Structure-Aware Transformer for Mesh SegmentationWong, Chi-Chong et al. | 2023
- 4423
-
Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular ImageDeng, Yu / Wang, Baoyuan / Shum, Heung-Yeung et al. | 2023
- 4434
-
3D-aware Conditional Image SynthesisDeng, Kangle / Yang, Gengshan / Ramanan, Deva / Zhu, Jun-Yan et al. | 2023
- 4446
-
VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANsFruhstuck, Anna / Sarafianos, Nikolaos / Xu, Yuanlu / Wonka, Peter / Tung, Tony et al. | 2023
- 4456
-
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and GenerationCheng, Yen-Chi / Lee, Hsin-Ying / Tulyakov, Sergey / Schwing, Alexander / Gui, Liangyan et al. | 2023
- 4466
-
Generating Part-Aware Editable 3D Shapes without 3D SupervisionTertikas, Konstantinos / Paschalidou, Despoina / Pan, Boxiao / Park, Jeong Joon / Uy, Mikaela Angelina / Emiris, Ioannis / Avrithis, Yannis / Guibas, Leonidas et al. | 2023
- 4479
-
NeuralLift-360: Lifting an in-the-Wild 2D Photo to A 3D Object with 360° ViewsXu, Dejia / Jiang, Yifan / Wang, Peihao / Fan, Zhiwen / Wang, Yi / Wang, Zhangyang et al. | 2023
- 4490
-
Implicit Identity Driven Deepfake Face Swapping DetectionHuang, Baojin / Wang, Zhongyuan / Yang, Jifan / Ai, Jiaxin / Zou, Qin / Wang, Qian / Ye, Dengpan et al. | 2023
- 4500
-
Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural FieldsAgaram, Rohith / Dewan, Shaurya / Sajnani, Rahul / Poulenard, Adrien / Krishna, Madhava / Sridhar, Srinath et al. | 2023
- 4511
-
Improving Fairness in Facial Albedo Estimation via Visual-Textual CuesRen, Xingyu / Deng, Jiankang / Ma, Chao / Yan, Yichao / Yang, Xiaokang et al. | 2023
- 4521
-
High-fidelity 3D Face Generation from Natural Language DescriptionsWu, Menghua / Zhu, Hao / Huang, Linjia / Zhuang, Yiyu / Lu, Yuanxun / Cao, Xun et al. | 2023
- 4531
-
DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face AlignmentLi, Heyuan / Wang, Bo / Cheng, Yu / Kankanhalli, Mohan / Tan, Robby T. et al. | 2023
- 4541
-
High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative PriorsBai, Yunpeng / Fan, Yanbo / Wang, Xuan / Zhang, Yong / Sun, Jingxiang / Yuan, Chun / Shan, Ying et al. | 2023
- 4552
-
3DAvatarGAN: Bridging Domains for Personalized Editable AvatarsAbdal, Rameen / Lee, Hsin-Ying / Zhu, Peihao / Chai, Menglei / Siarohin, Aliaksandr / Wonka, Peter / Tulyakov, Sergey et al. | 2023
- 4563
-
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using DiffusionWang, Tengfei / Zhang, Bo / Zhang, Ting / Gu, Shuyang / Bao, Jianmin / Baltrusaitis, Tadas / Shen, Jingjing / Chen, Dong / Wen, Fang / Chen, Qifeng et al. | 2023
- 4574
-
Instant Volumetric Head AvatarsZielonka, Wojciech / Bolkart, Timo / Thies, Justus et al. | 2023
- 4585
-
Synthesizing Photorealistic Virtual Humans Through Cross-Modal DisentanglementRavichandran, Siddarth / Texler, Ondrej / Dinev, Dimitar / Kang, Hyun Jae et al. | 2023
- 4595
-
3D Cinemagraphy from a Single ImageLi, Xingyi / Cao, Zhiguo / Sun, Huiqiang / Zhang, Jianming / Xian, Ke / Lin, Guosheng et al. | 2023
- 4606
-
TryOnDiffusion: A Tale of Two UNetsZhu, Luyang / Yang, Dawei / Zhu, Tyler / Reda, Fitsum / Chan, William / Saharia, Chitwan / Norouzi, Mohammad / Kemelmacher-Shlizerman, Ira et al. | 2023
- 4616
-
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand DisentanglementQi, Xingqun / Liu, Chen / Sun, Muyi / Li, Lincheng / Fan, Changjie / Yu, Xin et al. | 2023
- 4627
-
Normal-guided Garment UV Prediction for Human Re-texturingJafarian, Yasamin / Wang, Tuanfeng Y. / Ceylan, Duygu / Yang, Jimei / Carr, Nathan / Zhou, Yi / Park, Hyun Soo et al. | 2023
- 4637
-
REC-MV: REconstructing 3D Dynamic Cloth from Monocular VideosQiu, Lingteng / Chen, Guanying / Zhou, Jiapeng / Xu, Mutian / Wang, Junle / Han, Xiaoguang et al. | 2023
- 4647
-
SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human ReconstructionCao, Yukang / Han, Kai / Wong, Kwan-Yee K. et al. | 2023
- 4670
-
Handy: Towards a High Fidelity 3D Hand Shape and Appearance ModelPotamias, Rolandos Alexandros / Ploumpis, Stylianos / Moschoglou, Stylianos / Triantafyllou, Vasileios / Zafeiriou, Stefanos et al. | 2023
- 4681
-
Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete CounterpartsLamb, Nikolas / Palmer, Cameron / Molloy, Benjamin / Banerjee, Sean / Banerjee, Natasha Kholgade et al. | 2023
- 4692
-
Distilling Neural Fields for Real-Time Articulated Shape ReconstructionTan, Jeff / Yang, Gengshan / Ramanan, Deva et al. | 2023
- 4702
-
GANmouflage: 3D Object Nondetection with Texture FieldsGuo, Rui / Collins, Jasmine / de Lima, Oscar / Owens, Andrew et al. | 2023
- 4713
-
3D Human Pose Estimation via Intuitive PhysicsTripathi, Shashank / Muller, Lea / Huang, Chun-Hao P. / Taheri, Omid / Black, Michael J. / Tzionas, Dimitrios et al. | 2023
- 4726
-
Object pop-up: Can we infer 3D objects and their poses from human interactions alone?Petrov, Ilya A. / Marin, Riccardo / Chibane, Julian / Pons-Moll, Gerard et al. | 2023
- 4737
-
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned PolicyXu, Yinzhen / Wan, Weikang / Zhang, Jialiang / Liu, Haoran / Shan, Zikang / Shen, Hao / Wang, Ruicheng / Geng, Haoran / Weng, Yijia / Chen, Jiayi et al. | 2023
- 4747
-
Constrained Evolutionary Diffusion Filter for Monocular Endoscope TrackingLuo, Xiongbiao et al. | 2023
- 4757
-
Visibility Aware Human-Object Interaction Tracking from Single RGB CameraXie, Xianghui / Bhatnagar, Bharat Lal / Pons-Moll, Gerard et al. | 2023
- 4769
-
Transformer-based Unified Recognition of Two Hands Manipulating ObjectsCho, Hoseong / Kim, Chanwoo / Kim, Jihyeon / Lee, Seongyeong / Ismayilzada, Elkhan / Baek, Seungryul et al. | 2023
- 4779
-
HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution EstimationSengupta, Akash / Budvytis, Ignas / Cipolla, Roberto et al. | 2023
- 4790
-
3D Human Pose Estimation with Spatio-Temporal Criss-Cross AttentionTang, Zhenhua / Qiu, Zhaofan / Hao, Yanbin / Hong, Richang / Yao, Ting et al. | 2023
- 4800
-
GFPose: Learning 3D Human Pose Prior with Gradient FieldsCi, Hai / Wu, Mingdong / Zhu, Wentao / Ma, Xiaoxuan / Dong, Hao / Zhong, Fangwei / Wang, Yizhou et al. | 2023
- 4811
-
JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and TrackingVendrow, Edward / Le, Duy Tho / Cai, Jianfei / Rezatofighi, Hamid et al. | 2023
- 4821
-
Analyzing and Diagnosing Pose Estimation with AttributionsHe, Qiyuan / Yang, Linlin / Gu, Kerui / Lin, Qiuxia / Yao, Angela et al. | 2023
- 4831
-
Shape-Constraint Recurrent Flow for 6D Object Pose EstimationHai, Yang / Song, Rui / Li, Jiaojiao / Hu, Yinlin et al. | 2023
- 4841
-
TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose EstimationChen, Hanzhi / Manhardt, Fabian / Navab, Nassir / Busam, Benjamin et al. | 2023
- 4853
-
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image EnsembleYao, Chun-Han / Hung, Wei-Chih / Li, Yuanzhen / Rubinstein, Michael / Yang, Ming-Hsuan / Jampani, Varun et al. | 2023
- 4863
-
Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast SolutionLiao, Bangyan / Qu, Delin / Xue, Yifei / Zhang, Huiqing / Lao, Yizhen et al. | 2023
- 4872
-
Revisiting the P3P ProblemDing, Yaqing / Yang, Jian / Larsson, Viktor / Olsson, Carl / Astrom, Kalle et al. | 2023
- 4881
-
Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable CategoriesSinha, Samarth / Shapovalov, Roman / Reizenstein, Jeremy / Rocco, Ignacio / Neverova, Natalia / Vedaldi, Andrea / Novotny, David et al. | 2023
- 4892
-
MobileBrick: Building LEGO for 3D Reconstruction on Mobile DevicesLi, Kejie / Bian, Jia-Wang / Castle, Robert / Torr, Philip H.S. / Prisacariu, Victor Adrian et al. | 2023
- 4902
-
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene SupervisionLei, Jiahui / Deng, Congyue / Schmeckpeper, Karl / Guibas, Leonidas / Daniilidis, Kostas et al. | 2023
- 4913
-
GINA-3D: Learning to Generate Implicit Neural Assets in the WildShen, Bokui / Yan, Xinchen / Qi, Charles R. / Najibi, Mahyar / Deng, Boyang / Guibas, Leonidas / Zhou, Yin / Anguelov, Dragomir et al. | 2023
- 4927
-
Habitat-Matterport 3D Semantics DatasetYadav, Karmesh / Ramrakhya, Ram / Ramakrishnan, Santhosh Kumar / Gervet, Theo / Turner, John / Gokaslan, Aaron / Maestre, Noah / Chang, Angel Xuan / Batra, Dhruv / Savva, Manolis et al. | 2023
- 4937
-
BUOL: A Bottom-Up Framework with Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single ImageChu, Tao / Zhang, Pan / Liu, Qiong / Wang, Jiaqi et al. | 2023
- 4947
-
Panoptic Compositional Feature Field for Editable Scene Rendering with Network-Inferred Labels via Metric LearningCheng, Xinhua / Wu, Yanmin / Jia, Mengxi / Wang, Qian / Zhang, Jian et al. | 2023
- 4958
-
A Light Touch Approach to Teaching Transformers Multi-view GeometryBhalgat, Yash / Henriques, Joao F. / Zisserman, Andrew et al. | 2023
- 4970
-
Learning to Render Novel Views from Wide-Baseline Stereo PairsDu, Yilun / Smith, Cameron / Tewari, Ayush / Sitzmann, Vincent et al. | 2023
- 4981
-
Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and StereoMehl, Lukas / Schmalfuss, Jenny / Jahedi, Azin / Nalivayko, Yaroslava / Bruhn, Andres et al. | 2023
- 4992
-
EventNeRF: Neural Radiance Fields from a Single Colour Event CameraRudnev, Viktor / Elgharib, Mohamed / Theobalt, Christian / Golyanik, Vladislav et al. | 2023
- 5003
-
LightedDepth: Video Depth Estimation in Light of Limited Inference View AnglesZhu, Shengjie / Liu, Xiaoming et al. | 2023
- 5013
-
Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display CameraFeng, Ruicheng / Li, Chongyi / Chen, Huaijin / Li, Shuai / Gu, Jinwei / Loy, Chen Change et al. | 2023
- 5023
-
Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel ImageKim, Donggun / Jang, Hyeonjoong / Kim, Inchul / Kim, Min H. et al. | 2023
- 5033
-
Trap Attention: Monocular Depth Estimation with Manual TrapsNing, Chao / Gan, Hongping et al. | 2023
- 5044
-
Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and PosesBrachmann, Eric / Cavallari, Tommaso / Prisacariu, Victor Adrian et al. | 2023
- 5054
-
Energy-Efficient Adaptive 3D SensingTilmon, Brevin / Sun, Zhanghao / Koppal, Sanjeev J. / Wu, Yicheng / Evangelidis, Georgios / Zahreddine, Ramzi / Krishnan, Gurunandan / Ma, Sizhuo / Wang, Jian et al. | 2023
- 5064
-
Incremental 3D Semantic Scene Graph Prediction from RGB SequencesWu, Shun-Cheng / Tateno, Keisuke / Navab, Nassir / Tombari, Federico et al. | 2023
- 5075
-
Consistent Direct Time-of-Flight Video Depth Super-ResolutionSun, Zhanghao / Ye, Wei / Xiong, Jinhui / Choe, Gyeongmin / Wang, Jialiang / Su, Shuochen / Ranjan, Rakesh et al. | 2023
- 5086
-
Learning to Zoom and UnzoomThavamani, Chittesh / Li, Mengtian / Ferroni, Francesco / Ramanan, Deva et al. | 2023
- 5096
-
FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D DetectionWang, Yuqi / Chen, Yuntao / Zhang, Zhaoxiang et al. | 2023
- 5106
-
3D Video Object Detection with Learnable Object-Centric Global OptimizationHe, Jiawei / Chen, Yuntao / Wang, Naiyan / Zhang, Zhaoxiang et al. | 2023
- 5116
-
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye ViewZhou, Shengchao / Liu, Weizhou / Hu, Chen / Zhou, Shuchang / Ma, Chao et al. | 2023
- 5126
-
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D DataZhao, Haojie / Chen, Junsong / Wang, Lijun / Lu, Huchuan et al. | 2023
- 5136
-
Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU SupervisionMing, Qi / Miao, Lingjuan / Ma, Zhe / Zhao, Lin / Zhou, Zhiqiang / Huang, Xuhui / Chen, Yuanpei / Guo, Yufei et al. | 2023
- 5146
-
SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial ExamplesLiu, Han / Wu, Yuhao / Yu, Zhiyuan / Vorobeychik, Yevgeniy / Zhang, Ning et al. | 2023
- 5156
-
Normalizing Flow based Feature Synthesis for Outlier-Aware Object DetectionKumar, Nishant / Segvic, Sinisa / Eslami, Abouzar / Gumhold, Stefan et al. | 2023
- 5166
-
OcTr: Octree-Based Transformer for 3D Object DetectionZhou, Chao / Zhang, Yanan / Chen, Jiaxin / Huang, Di et al. | 2023
- 5176
-
HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic FusionWang, Sijie / Kang, Qiyu / She, Rui / Wang, Wei / Zhao, Kai / Song, Yang / Tay, Wee Peng et al. | 2023