VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance (English)
- New search for: Crowson, Katherine
- New search for: Biderman, Stella
- New search for: Kornis, Daniel
- New search for: Stander, Dashiell
- New search for: Hallahan, Eric
- New search for: Castricato, Louis
- New search for: Raff, Edward
- New search for: Avidan, Shai
- New search for: Brostow, Gabriel
- Further information on Brostow, Gabriel:
- https://orcid.org/https://orcid.org/0000-0001-8472-3828
- New search for: Cissé, Moustapha
- New search for: Farinella, Giovanni Maria
- Further information on Farinella, Giovanni Maria:
- https://orcid.org/https://orcid.org/0000-0002-6034-0432
- New search for: Hassner, Tal
- Further information on Hassner, Tal:
- https://orcid.org/https://orcid.org/0000-0003-2275-1406
- New search for: Crowson, Katherine
- New search for: Biderman, Stella
- New search for: Kornis, Daniel
- New search for: Stander, Dashiell
- New search for: Hallahan, Eric
- New search for: Castricato, Louis
- New search for: Raff, Edward
In:
Computer Vision – ECCV 2022
: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVII
;
Chapter: 6
;
88-105
;
2022
- Article/Chapter (Book) / Electronic Resource
-
Title:VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
-
Additional title:Lect.Notes Computer
-
Contributors:Avidan, Shai ( editor ) / Brostow, Gabriel ( editor ) / Cissé, Moustapha ( editor ) / Farinella, Giovanni Maria ( editor ) / Hassner, Tal ( editor ) / Crowson, Katherine ( author ) / Biderman, Stella ( author ) / Kornis, Daniel ( author ) / Stander, Dashiell ( author ) / Hallahan, Eric ( author )
-
Conference:European Conference on Computer Vision ; 2022 ; Tel Aviv, Israel
-
Published in:Computer Vision – ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVII ; Chapter: 6 ; 88-105Lecture Notes in Computer Science ; 13697 ; 88-105
-
Publisher:
- New search for: Springer Nature Switzerland
-
Place of publication:Cham
-
Publication date:2022-10-22
-
Size:18 pages
-
ISBN:
-
ISSN:
-
DOI:
-
Type of media:Article/Chapter (Book)
-
Type of material:Electronic Resource
-
Language:English
-
Keywords:
-
Source:
Table of contents eBook
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
Most and Least Retrievable Images in Visual-Language Query SystemsZhu, Liuwan / Ning, Rui / Li, Jiang / Xin, Chunsheng / Wu, Hongyi et al. | 2022
- 2
-
Sports Video Analysis on Large-Scale DataWu, Dekun / Zhao, He / Bao, Xingce / Wildes, Richard P. et al. | 2022
- 3
-
Grounding Visual Representations with Texts for Domain GeneralizationMin, Seonwoo / Park, Nokyung / Kim, Siwon / Park, Seunghyun / Kim, Jinkyu et al. | 2022
- 4
-
Bridging the Visual Semantic Gap in VLN via Semantically Richer InstructionsOssandón, Joaquín / Earle, Benjamín / Soto, Álvaro et al. | 2022
- 5
-
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story ContinuationMaharana, Adyasha / Hannan, Darryl / Bansal, Mohit et al. | 2022
- 6
-
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language GuidanceCrowson, Katherine / Biderman, Stella / Kornis, Daniel / Stander, Dashiell / Hallahan, Eric / Castricato, Louis / Raff, Edward et al. | 2022
- 7
-
Semantic-Aware Implicit Neural Audio-Driven Video Portrait GenerationLiu, Xian / Xu, Yinghao / Wu, Qianyi / Zhou, Hang / Wu, Wayne / Zhou, Bolei et al. | 2022
- 8
-
End-to-End Active Speaker DetectionAlcázar, Juan León / Cordes, Moritz / Zhao, Chen / Ghanem, Bernard et al. | 2022
- 9
-
Emotion Recognition for Multiple Context AwarenessYang, Dingkang / Huang, Shuai / Wang, Shunli / Liu, Yang / Zhai, Peng / Su, Liuzhen / Li, Mingcheng / Zhang, Lihua et al. | 2022
- 10
-
Adaptive Fine-Grained Sketch-Based Image RetrievalBhunia, Ayan Kumar / Sain, Aneeshan / Shah, Parth Hiren / Gupta, Animesh / Chowdhury, Pinaki Nath / Xiang, Tao / Song, Yi-Zhe et al. | 2022
- 11
-
Quantized GAN for Complex Music Generation from Dance VideosZhu, Ye / Olszewski, Kyle / Wu, Yu / Achlioptas, Panos / Chai, Menglei / Yan, Yan / Tulyakov, Sergey et al. | 2022
- 12
-
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network PredictionWang, Hu / Zhang, Jianpeng / Chen, Yuanhong / Ma, Congbo / Avery, Jodie / Hull, Louise / Carneiro, Gustavo et al. | 2022
- 13
-
Localizing Visual Sounds the Easy WayMo, Shentong / Morgado, Pedro et al. | 2022
- 14
-
Learning Visual Styles from Audio-Visual AssociationsLi, Tingle / Liu, Yichen / Owens, Andrew / Zhao, Hang et al. | 2022
- 15
-
Remote Respiration Monitoring of Moving Person Using Radio SignalsChoi, Jae-Ho / Kang, Ki-Bong / Kim, Kyung-Tae et al. | 2022
- 16
-
Camera Pose Estimation and Localization with Active Audio SensingYang, Karren / Firman, Michael / Brachmann, Eric / Godard, Clément et al. | 2022
- 17
-
PACS: A Dataset for Physical Audiovisual CommonSense ReasoningYu, Samuel / Wu, Peter / Liang, Paul Pu / Salakhutdinov, Ruslan / Morency, Louis-Philippe et al. | 2022
- 18
-
VoViT: Low Latency Graph-Based Audio-Visual Voice Separation TransformerMontesinos, Juan F. / Kadandale, Venkatesh S. / Haro, Gloria et al. | 2022
- 19
-
Telepresence Video Quality AssessmentYing, Zhenqiang / Ghadiyaram, Deepti / Bovik, Alan et al. | 2022
- 20
-
MultiMAE: Multi-modal Multi-task Masked AutoencodersBachmann, Roman / Mizrahi, David / Atanov, Andrei / Zamir, Amir et al. | 2022
- 21
-
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound SeparationTzinis, Efthymios / Wisdom, Scott / Remez, Tal / Hershey, John R. et al. | 2022
- 22
-
Audio–Visual SegmentationZhou, Jinxing / Wang, Jianyuan / Zhang, Jiayi / Sun, Weixuan / Zhang, Jing / Birchfield, Stan / Guo, Dan / Kong, Lingpeng / Wang, Meng / Zhong, Yiran et al. | 2022
- 23
-
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects SuppressionJin, Yeying / Yang, Wenhan / Tan, Robby T. et al. | 2022
- 24
-
Relationformer: A Unified Framework for Image-to-Graph GenerationShit, Suprosanna / Koner, Rajat / Wittmann, Bastian / Paetzold, Johannes / Ezhov, Ivan / Li, Hongwei / Pan, Jiazhen / Sharifzadeh, Sahand / Kaissis, Georgios / Tresp, Volker et al. | 2022
- 25
-
GAMa: Cross-View Video Geo-LocalizationVyas, Shruti / Chen, Chen / Shah, Mubarak et al. | 2022
- 26
-
Revisiting a kNN-Based Image Classification System with High-Capacity StorageNakata, Kengo / Ng, Youyang / Miyashita, Daisuke / Maki, Asuka / Lin, Yu-Chieh / Deguchi, Jun et al. | 2022
- 27
-
Geometric Representation Learning for Document Image RectificationFeng, Hao / Zhou, Wengang / Deng, Jiajun / Wang, Yuechen / Li, Houqiang et al. | 2022
- 28
-
S\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^{2}$$\end{document}-VER: Semi-supervised Visual Emotion RecognitionJia, Guoli / Yang, Jufeng et al. | 2022
- 29
-
Image Coding for Machines with Omnipotent Feature LearningFeng, Ruoyu / Jin, Xin / Guo, Zongyu / Feng, Runsen / Gao, Yixin / He, Tianyu / Zhang, Zhizheng / Sun, Simeng / Chen, Zhibo et al. | 2022
- 30
-
Feature Representation Learning for Unsupervised Cross-Domain Image RetrievalHu, Conghui / Lee, Gim Hee et al. | 2022
- 31
-
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and RecognitionXu, Shilin / Li, Xiangtai / Wang, Jingbo / Cheng, Guangliang / Tong, Yunhai / Tao, Dacheng et al. | 2022
- 32
-
Semantic-Guided Multi-mask Image HarmonizationRen, Xuqian / Liu, Yifan et al. | 2022
- 33
-
Learning an Isometric Surface Parameterization for Texture UnwrappingDas, Sagnik / Ma, Ke / Shu, Zhixin / Samaras, Dimitris et al. | 2022
- 34
-
Towards Regression-Free Neural Networks for Diverse Compute PlatformsDuggal, Rahul / Zhou, Hao / Yang, Shuo / Fang, Jun / Xiong, Yuanjun / Xia, Wei et al. | 2022
- 35
-
Relationship Spatialization for Depth EstimationXu, Xiaoyu / Qiu, Jiayan / Wang, Xinchao / Wang, Zhou et al. | 2022
- 36
-
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained ModelsXu, Chenfeng / Yang, Shijia / Galanti, Tomer / Wu, Bichen / Yue, Xiangyu / Zhai, Bohan / Zhan, Wei / Vajda, Peter / Keutzer, Kurt / Tomizuka, Masayoshi et al. | 2022
- 37
-
FAR: Fourier Aerial Video RecognitionKothandaraman, Divya / Guan, Tianrui / Wang, Xijun / Hu, Shuowen / Lin, Ming / Manocha, Dinesh et al. | 2022
- 38
-
Translating a Visual LEGO Manual to a Machine-Executable PlanWang, Ruocheng / Zhang, Yunzhi / Mao, Jiayuan / Cheng, Chin-Yi / Wu, Jiajun et al. | 2022
- 39
-
Fabric Material Recovery from Video Using Multi-scale Geometric Auto-EncoderLiang, Junbang / Lin, Ming et al. | 2022
- 40
-
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle AdjustmentRen, Jie / Liang, Wenteng / Yan, Ran / Mai, Luo / Liu, Shiwen / Liu, Xiao et al. | 2022
- 41
-
The One Where They Reconstructed 3D Humans and Environments in TV ShowsPavlakos, Georgios / Weber, Ethan / Tancik, Matthew / Kanazawa, Angjoo et al. | 2022