ECCV 2022 Papers with Georgia Tech Authors
Paper links available on select papers
ORAL
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni (Meta AI Research); Adam Polyak (Facebook); Oron Ashual (Facebook AI Research); Shelly Sheynin (Meta); Devi Parikh (Georgia Tech & Facebook AI Research); Yaniv Taigman (Facebook)
Open-Set Semi-Supervised Object Detection
Yen-Cheng Liu (Georgia Institute of Technology); Chih-Yao Ma (Facebook); Xiaoliang Dai (Facebook); Junjiao Tian (Georgia Institute of Technology); Peter Vajda (Facebook); Zijian He (Facebook); Zsolt Kira (Georgia Institute of Technology)
PressureVision: Estimating Hand Pressure from a Single RGB Image
Patrick L Grady (Georgia Institute of Technology)*; Chengcheng Tang (Facebook Reality Labs); Samarth Brahmbhatt (Intel); Christopher D Twigg (Meta); Chengde Wan (Facebook Reality Lab); James Hays (Georgia Institute of Technology, USA); Charlie Kemp (Georgia Institute of Technology)
POSTER
A Sketch Is Worth a Thousand Words:Image Retrieval with Text and Sketch
Patsorn Sangkloy (Georgia Institute of Technology); Wittawat Jitkrittum (Google Research); Diyi Yang (Georgia Institute of Technology); James Hays (Georgia Institute of Technology, USA)
BLT: Bidirectional Layout Transformer for Controllable Layout Generation
Xiang Kong (Carnegie Mellon University); Lu Jiang (Google Research); Huiwen Chang (Google); Han Zhang (Google); Yuan Hao (Google); Haifeng Gong (Google Inc.); Irfan Essa (Google & Georgia Tech)
CoGS: Controllable Generation and Search from Sketch and Style
Cusuh Ham (Georgia Institute of Technology); Gemma Canet Tarrés (CVSSP, University of Surrey); Tu Bui (University of Surrey); James Hays (Georgia Institute of Technology, USA); Zhe Lin (Adobe Research); John Collomosse (Adobe Research)
Decomposing The Tangent of Occluding Boundaries According to Curvatures and Torsions
Huizong Yang (Georgia Institute of Technology); Anthony Yezzi (Georgia Institute of Technology)
Egocentric Activity Recognition and Localization on a 3D Map
Miao Liu (Georgia Institute of Technology); Lingni Ma (Facebook Reality Labs); Kiran Somasundaram (Facebook Reality Labs); Yin Li (University of Wisconsin-Madison); Kristen Grauman (Facebook AI Research & UT Austin); James Rehg (Georgia Institute of Technology); Chao Li (Facebook Reality Labs)
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video
Wenqi Jia (Georgia Institute of Technology); Miao Liu (Georgia Institute of Technology); James Rehg (Georgia Institute of Technology)
Housekeep: Tidying Virtual Households using Commonsense Reasoning
Yash Mukund Kant (University of Toronto); Arun Ramachandran (Georgia Institute of Technology); Sriram Yenamandra (Georgia Institute of Technology); Igor Gilitschenski (University of Toronto); Dhruv Batra (Georgia Tech & Facebook AI Research); Andrew Szot (Georgia Institute of Technology); Harsh Agrawal (Georgia Institute of Technology)
Improved Masked Image Generation with Token-Critic
Jose Lezama (Google Research); Huiwen Chang (Google); Lu Jiang (Google Research); Irfan Essa (Google & Georgia Tech)
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge (University of Maryland); Thomas F Hayes (Meta); Harry Yang (Facebook); Xi Yin (Facebook); Guan Pang (Facebook); David Jacobs (University of Maryland, USA); Jia-Bin Huang (Facebook ); Devi Parikh (Georgia Tech & Facebook AI Research)
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Thomas F Hayes (Meta); Songyang Zhang (University of Rochester); Xi Yin (Facebook); Guan Pang (Facebook); Sasha Sheng (Meta Platforms); Harry Yang (Facebook); Songwei Ge (University of Maryland, College Park); Qiyuan Hu (Facebook AI Research); Devi Parikh (Georgia Tech & Facebook AI Research)
Planes vs. Chairs: Category-guided 3D shape learning without any 3D cues
Zixuan Huang (Georgia Institute of Technology); Stefan Stojanov (Georgia Institute of Technology); Anh Thai (Georgia Institute of Technology); Varun Jampani (Google); James Rehg (Georgia Institute of Technology)
PT4AL: Using Self-Supervised Pretext Tasks for Active Learning
John Seon Keun Yi (Georgia Institute of Technology); Minseok Seo (si-analytics); Jongchan Park (Lunit); Dong-Geol Choi (Hanbat National University)
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
John W Lambert (Georgia Institute of Technology); Yuguang Li (Zillow Group); Ivaylo Boyadzhiev (Zillow Group); Lambert Wixson (Zillow Group); Manjunath Narayana (Zillow group); Will A Hutchcroft (Zillow Group); James Hays (Georgia Institute of Technology, USA); Frank Dellaert (Georgia Tech); Sing Bing Kang (Zillow Group)
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization Muhammad Zubair Irshad (Georgia Institute of Technology); Sergey Zakharov (Toyota Research Institute); Rareș A Ambruș (Toyota Research Institute); Thomas Kollar (Toyota Research Institute); Zsolt Kira (Georgia Institute of Technology); Adrien Gaidon (Toyota Research Institute)
Towards Regression-Free Neural Networks for Diverse Compute Platforms
Rahul Duggal (Georgia Tech); Hao Zhou (Amazon); Shuo Yang (Amazon); Jun Fang (Amazon); Yuanjun Xiong (Amazon); Wei Xia (Amazon)
VQGAN-CLIP: Open Domain Image Generation and Manipulation Using Natural Language
Katherine B Crowson (EleutherAI); Stella R Biderman (Booz Allen Hamilton); daniel kornis (Eleuther.ai); Dashiell Stander (Eleuther AI); Eric Hallahan (EleutherAI); Louis J Castricato (Georgia Tech); Edward Raff (Booz Allen Hamilton)
WORKSHOP
SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning
Nilaksh Das, ShengYun Peng, Duen Horng Chau
Hydra Attention: Efficient Attention with Many Heads
Daniel Bolya, Cheng-Yang Fu, Xiaoliang Dai, Peizhao Zhang, Judy Hoffman