cvpr2021官网(CVPR2023)
CVPR 2023 录用论文
CVPR 2023 统计数据:
提交:9155 篇论文
接受:2360 篇论文(接受率 25.8%)
亮点:235 篇论文(接受论文的 10% ,提交论文的 2.6%)
获奖候选人:12 篇论文(接受论文的 0.51% ,提交论文的 0.13%)已接受论文列表(未决抄袭和双重提交检查):
Generating Human Motion from Textual Descriptions with High Quality Discrete Representation
Jianrong Zhang · Yangsong Zhang · Xiaodong Cun · Yong Zhang · Hongwei Zhao · Hongtao Lu · Xi SHEN · Ying Shan
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Wenxuan Zhang · Xiaodong Cun · Xuan Wang · Yong Zhang · Xi SHEN · Yu Guo · Ying Shan · Fei Wang
Explicit Visual Prompting for Low-Level Structure Segmentations
Weihuang Liu · Xi SHEN · Chi-Man Pun · Xiaodong Cun
Privacy-preserving Adversarial Facial Features
Zhibo Wang · He Wang · Shuaifan Jin · Wenwen Zhang · Jiahui Hu · Yan Wang · Peng Sun · Wei Yuan whu · Kaixin Liu · Kui Ren
NeRF-RPN: A general framework for object detection in NeRFs
Benran Hu · Junkai Huang · Yichen Liu · Yu-Wing Tai · Chi-Keung Tang
Category Query Learning for Human-Object Interaction Classification
Chi Xie · Fangao Zeng · Yue Hu · Shuang Liang · Yichen Wei
A Unified Pyramid Recurrent Network for Video Frame Interpolation
Xin Jin · LONG WU · Jie Chen · Chen Youxin · Jay Koo · Cheul-hee Hahm
SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field
Chong Bao · Yinda Zhang · Bangbang Yang · Tianxing Fan · Zesong Yang · Hujun Bao · Guofeng Zhang · Zhaopeng Cui
PATS: Patch Area Transportation with Subdivision for Local Feature Matching
Junjie Ni · Yijin Li · Zhaoyang Huang · Hongsheng Li · Zhaopeng Cui · Hujun Bao · Guofeng Zhang
DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation
Ying-Tian Liu · Zhifei Zhang · Yuan-Chen Guo · Matthew Fisher · Zhaowen Wang · Song-Hai Zhang
Towards Robust Tampered Text Detection in Document Image: New dataset and New Solution
Chenfan Qu · Chongyu Liu · Yuliang Liu · Xinhong Chen · Dezhi Peng · Fengjun Guo · Lianwen Jin
PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Zhixin Ling · Zhen Xing · Xiangdong Zhou · Man Cao · Guichun Zhou
SVFormer: Semi-supervised Video Transformer for Action Recognition
Zhen Xing · Qi Dai · Han Hu · Jingjing Chen · Zuxuan Wu · Yu-Gang Jiang
Multi-Object Manipulation via Object-Centric Neural Scattering Functions
Stephen Tian · Yancheng Cai · Hong-Xing Yu · Sergey Zakharov · Katherine Liu · Adrien Gaidon · Yunzhu Li · Jiajun Wu
RealImpact: A Dataset of Impact Sound Fields for Real Objects
Samuel Clarke · Ruohan Gao · Mason L Wang · Mark Rau · Julia Xu · Jui-Hsien Wang · Doug James · Jiajun Wu
3D Neural Field Generation using Triplane Diffusion
Jesse Shue · Eric Chan · Ryan Po · Zachary Ankner · Jiajun Wu · Gordon Wetzstein
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Sumith Kulal · Tim Brooks · Alex Aiken · Jiajun Wu · Jimei Yang · Jingwan Lu · Alexei A. Efros · Krishna Kumar Singh
Towards Effective Visual Representations for Partial-Label Learning
Shiyu Xia · Jiaqi Lyu · Ning Xu · Gang Niu · Xin Geng
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
Zhen Li · Zuo-Liang Zhu · Ling-Hao Han · Qibin Hou · Chunle Guo · Ming-Ming Cheng
DNF: Decouple and Feedback Network for Seeing in the Dark
Xin Jin · Ling-Hao Han · Zhen Li · Chunle Guo · Zhi Chai · Chongyi Li
Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising
Miaoyu Li · Ji Liu · Ying Fu · Yulun Zhang · Dejing Dou
Dynamic Aggregated Network for Gait Recognition
Kang Ma · Ying Fu · Dezhi Zheng · Chunshui Cao · Xuecai Hu · Yongzhen Huang
LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising
ZiChun Wang · Ying Fu · Ji Liu · Yulun Zhang
Real-Time Neural Light Field on Mobile Devices
Junli Cao · Huan Wang · Pavlo Chemerys · Vladislav Shakhrai · Ju Hu · Yun Fu · Denys Makoviichuk · Sergey Tulyakov · Jian Ren
ScaleDet: A Scalable Multi-Dataset Object Detector
Yanbei Chen · Manchen Wang · Abhay Mittal · Zhenlin Xu · Paolo Favaro · Joseph Tighe · Davide Modolo
All in One: Exploring Unified Video-Language Pre-training
Jinpeng Wang · Yixiao Ge · Rui Yan · Yuying Ge · Kevin Qinghong Lin · Satoshi Tsutsui · Xudong Lin · Guanyu Cai · Jianping WU · Ying Shan · Xiaohu Qie · Mike Zheng Shou
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Ziyun Zeng · Yuying Ge · Xihui Liu · Bin Chen · Ping Luo · Shu-Tao Xia · Yixiao Ge
KD-GAN: Data Limited Image Generation via Knowledge Distillation
Kaiwen Cui · Yingchen Yu · Fangneng Zhan · Shengcai Liao · Shijian Lu · Eric Xing
Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision
Xinyi Ying · Li Liu · Yingqian Wang · Ruojing Li · Nuo Chen · Zaiping Lin · Weidong Sheng · Shilin Zhou
Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning
Haiyu Wu · Grace Bezold · Aman Bhatta · Kevin Bowyer
Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding
Gyeongman Kim · Hajin Shim · Hyunsu Kim · Yunjey Choi · Junho Kim · Eunho Yang
3D Video Object Detection with Learnable Object-Centric Global Optimization
Jiawei He · Yuntao Chen · Naiyan Wang · Zhaoxiang Zhang
BEVFormer v2: Adapting Modern Image Backbones to Bird’s-Eye-View Recognition via Perspective Supervision
Chenyu Yang · Yuntao Chen · Hao Tian · Chenxin Tao · Xizhou Zhu · Zhaoxiang Zhang · Gao Huang · Hongyang Li · Yu Qiao · Lewei Lu · Jie Zhou · Jifeng Dai
MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds
Jiahui Liu · Chirui CHANG · Jianhui Liu · Xiaoyang Wu · Lan Ma · XIAOJUAN QI
Understanding Imbalanced Semantic Segmentation Through Neural Collapse
Zhisheng Zhong · Jiequan Cui · Yibo Yang · Xiaoyang Wu · XIAOJUAN QI · Xiangyu Zhang · Jiaya Jia
Hierarchical Dense Correlation Distillation for Few-Shot Segmentation
Bohao PENG · Zhuotao Tian · Xiaoyang Wu · Chengyao Wang · Shu Liu · Jingyong Su · Jiaya Jia
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu · Xin Wen · Xihui Liu · Hengshuang Zhao
Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation
Zhehan Kan · Shuoshuo Chen · Ce Zhang · Yushun Tang · Zhihai He
Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation
Yushun Tang · Ce Zhang · Heng Xu · Shuoshuo Chen · Jie Cheng · Luziwei Leng · Qinghai Guo · Zhihai He
Noisy Correspondence Learning with Meta Similarity Correction
Haochen Han · Kaiyao Miao · Qinghua Zheng · Minnan Luo
Detecting Backdoors During the Inference Stage Based on Corruption Robustness Consistency
Xiaogeng Liu · Minghui Li · Haoyu Wang · Shengshan Hu · Dengpan Ye · Hai Jin · Libing Wu · Chaowei Xiao
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu · Hui Ding · Zhaowei Cai · Yuting Zhang · Ravi Satzoda · Vijay Mahadevan · R. Manmatha
Glocal Energy-based Learning for Few-Shot Open-Set Recognition
Haoyu Wang · Guansong Pang · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang
PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection
Linfeng Zhang · Runpei Dong · Hung-Shuo Tai · Kaisheng Ma
LipFormer: High-fidelity and Generalizable Talking Face Generation with A Pre-learned Facial Codebook
Jiayu Wang · Kang Zhao · Shiwei Zhang · Yingya Zhang · Yujun Shen · Deli Zhao · Jingren Zhou
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
Chao Xu · Junwei Zhu · Jiangning Zhang · Yue Han · Wenqing Chu · Ying Tai · Chengjie Wang · Zhifeng Xie · Yong Liu
EC^2: Emergent Communication for Embodied Control
Yao Mu · Shunyu Yao · Mingyu Ding · Ping Luo · Chuang Gan
Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss
Anas Mahmoud · Jordan Sir Kwang Hu · Tianshu Kuai · Ali Harakeh · Liam Paull · Steven Waslander
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection
Vibashan Vishnukumar Sharmini · Poojan Oza · Vishal Patel
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations
Vibashan Vishnukumar Sharmini · Ning Yu · Chen Xing · Can Qin · Mingfei Gao · Juan Carlos Niebles · Vishal Patel · Ran Xu
STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition
Xiaoyu Zhu · Po-Yao Huang · Junwei Liang · Celso de Melo · Alexander Hauptmann
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks
Qiangqiang Wu · Tianyu Yang · Ziquan Liu · Baoyuan Wu · Ying Shan · Antoni Chan
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization
Ziquan Liu · Yi Xu · Xiangyang Ji · Antoni Chan
Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting
Wei Lin · Antoni Chan
Music-Driven Group Choreography
Nhat Le · Trong Thang Pham · Tuong Do · Erman Tjiputra · Quang Tran · Anh Nguyen
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Mengmeng Xu · Yanghao Li · Cheng-Yang Fu · Bernard Ghanem · Tao Xiang · Juan-Manuel Perez-Rua
Rotation-Invariant Transformer for Point Cloud Matching
Hao Yu · Zheng Qin · Ji Hou · Mahdi Saleh · Dongsheng Li · Benjamin Busam · Slobodan Ilic
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
Ji Hou · Xiaoliang Dai · Zijian He · Angela Dai · Matthias Niessner
Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data
Yuhao Chen · Xin Tan · Borui Zhao · ZhaoWei CHEN · Renjie Song · jiajun liang · Xuequan Lu
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization
Shichao Dong · Jin Wang · Renhe Ji · jiajun liang · Haoqiang Fan · Zheng Ge
EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision
Jiahui Lei · Congyue Deng · Karl Schmeckpeper · Leonidas Guibas · Kostas Daniilidis
SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation
Huimin Huang · Shiao Xie · Lanfen Lin · Tong Ruofeng · Yen-wei Chen · Yuexiang Li · Hong Wang · Yawen Huang · Yefeng Zheng
CNVid-3.5M: Build, Filter, and Pre-train the Large-scale Public Chinese Video-text Dataset
Tian Gan · Qing Wang · Xingning Dong · Xiangyuan Ren · Liqiang Nie · Qingpei Guo
Disentangling Writer and Character Styles for Handwriting Generation
Gang Dai · Yifan Zhang · Qingfeng Wang · Qing Du · Zhuliang Yu · Zhuoman Liu · Shuangping Huang
A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image
Changlong Jiang · Yang Xiao · Cunlin Wu · Mingyang Zhang · Jinghong Zheng · Zhiguo Cao · Joey Zhou
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li · Jinguo Zhu · Xiaohu Jiang · Xizhou Zhu · Hongsheng Li · Chun Yuan · Xiaohua Wang · Yu Qiao · Xiaogang Wang · Wenhai Wang · Jifeng Dai
ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations
Panos Achlioptas · Ian Huang · Minhyuk Sung · Sergey Tulyakov · Leonidas Guibas
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li · Ailing Zeng · Shilong Liu · Hao Zhang · Hongyang Li · Lionel Ni · Lei Zhang
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li · Hao Zhang · Huaizhe Xu · Shilong Liu · Lei Zhang · Lionel Ni · Heung-Yeung Shum
MP-Former: Mask-Piloted Transfomer for Image Segmentation
Hao Zhang · Feng Li · Huaizhe Xu · Shijia Huang · Shilong Liu · Lionel Ni · Lei Zhang
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
Jun Cen · Shiwei Zhang · Xiang Wang · Yixuan Pei · Zhiwu Qing · Yingya Zhang · Qifeng Chen
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
Xiang Wang · Shiwei Zhang · Zhiwu Qing · Changxin Gao · Yingya Zhang · Deli Zhao · Nong Sang
PCR: Proxy-based Contrastive Replay for Online Class-Incremental Continual Learning
Huiwei Lin · Baoquan Zhang · Shanshan Feng · Xutao Li · Yunming Ye
Building Rearticulable Models for Arbitrary 3D Objects from 4D Point Clouds
Shaowei Liu · Saurabh Gupta · Shenlong Wang
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Xuran Pan · Tianzhu Ye · Zhuofan Xia · Shiji Song · Gao Huang
Compressing Volumetric Radiance Fields to 1 MB
Lingzhi Li · Zhen Shen · Zhongshu Wang · Li Shen · Liefeng Bo
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Ziniu Hu · Ahmet Iscen · Chen Sun · Zirui Wang · Kai-Wei Chang · Yizhou Sun · Cordelia Schmid · David Ross · Alireza Fathi
Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Ahmet Iscen · Alireza Fathi · Cordelia Schmid
Learning to Name Classes for Vision and Language Models
Sarah Parisot · Yongxin Yang · Steven McDonagh
SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory
Sicheng Li · Hao Li · Yue Wang · Yiyi Liao · Lu Yu
Semi-Supervised Video Inpainting with Cycle Consistency Constraints
Zhiliang Wu · Han Xuan · Changchang Sun · Weili Guan · Kang Zhang · Yan Yan
Deep Stereo Video Inpainting
Zhiliang Wu · Changchang Sun · Han Xuan · Yan Yan
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
Siteng Huang · Biao Gong · Yulin Pan · Jianwen Jiang · Yiliang Lv · Yuyuan Li · Donglin Wang
NeRF-Supervised Deep Stereo
Fabio Tosi · Alessio Tonioni · Daniele Gregorio · Matteo Poggi
Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding
Zihang Lin · Chaolei Tan · Jian-Fang Hu · Zhi Jin · Tiancai Ye · Wei-Shi Zheng
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding
Chaolei Tan · Zihang Lin · Jian-Fang Hu · Wei-Shi Zheng · Jianhuang Lai
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation
Ruixuan Cong · Da Yang · Rongshan Chen · Sizhe Wang · Zhenglong Cui · HaoSheng
Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions
Yong Guo · David Stutz · Bernt Schiele
DF-Platter: Multi-Face Heterogeneous Deepfake Dataset
Kartik Narayan · Harsh Agarwal · Kartik Thakral · Surbhi Mittal · Mayank Vatsa · Richa Singh
Metadata-Based RAW Reconstruction via Implicit Neural Functions
Leyi Li · Huijie Qiao · Qi Ye · Qinmin Yang
I
2
-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs
Jingsen Zhu · Yuchi Huo · Qi Ye · Fujun Luan · Jifan Li · Dianbing Xi · Lisha Wang · Rui Tang · Wei Hua · Hujun Bao · Rui Wang
Polarized Color Image Denoising
Zhuoxiao Li · Haiyang Jiang · Mingdeng Cao · Yinqiang Zheng
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination
Haoqian Wu · Zhipeng Hu · Lincheng Li · Yongqiang Zhang · Changjie Fan · Xin Yu
Balanced Energy Regularization Loss for Out-of-distribution Detection
Hyunjun Choi · Hawook Jeong · Jin Choi
DeCo : Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-to-Fine Contrastive Ranking
Lijin Yang · Quan Kong · Hsuan-Kung Yang · Wadim Kehl · Yoichi Sato · Norimasa Kobori
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma · Jerry Hong · Mustafa Omer Gul · Mona Gandhi · Irena Gao · Ranjay Krishna
Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask
Shangzhan Zhang · Sida Peng · Tianrun Chen · Linzhan Mou · Haotong Lin · Kaicheng Yu · Yiyi Liao · Xiaowei Zhou
Learning 3D-aware Image Synthesis with Unknown Pose Distribution
Zifan Shi · Yujun Shen · Yinghao Xu · Sida Peng · Yiyi Liao · Sheng Guo · Qifeng Chen · Dit-Yan Yeung
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Jiazhi Guan · Zhanwang Zhang · Hang Zhou · Tianshu Hu · Kaisiyuan Wang · Dongliang He · Haocheng Feng · Jingtuo Liu · Errui Ding · Ziwei Liu · Jingdong Wang
A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others
Zhiheng Li · Ivan Evtimov · Albert Gordo · Caner Hazirbas · Tal Hassner · Cristian Canton · Chenliang Xu · Mark Ibrahim
Cooperation or Competition: Avoiding Player Domination for Multi-target Robustness by Adaptive Budgets
Yimu Wang · Dinghuai Zhang · Yihan Wu · Heng Huang · Hongyang Zhang
Gated Stereo: Joint Depth Estimation from Gated and Wide-Baseline Active Stereo Cues
Stefanie Walz · Mario Bijelic · Andrea Ramazzina · Amanpreet Walia · Fahim Mannan · Felix Heide
SliceMatch: Geometry-guided Aggregation for Cross-View Pose Estimation
Zimin Xia · Holger Caesar · Julian Kooij · Ted Lentsch
Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations
Lei Hsiung · Yun-Yun Tsai · Pin-Yu Chen · Tsung-Yi Ho
StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer
Sasikarn Khwanmuang · Pakkapon Phongthawee · Patsorn Sangkloy · Supasorn Suwajanakorn
Learning Geometric-aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs
Pattaramanee Arsomngern · Sarana Nutanong · Supasorn Suwajanakorn
Visibility Constrained Wide-band Illumination Spectrum Design for Seeing-in-the-Dark
Muyao Niu · Zhuoxiao Li · Zhihang Zhong · Yinqiang Zheng
ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling
Xinglin Li · Jiajing Chen · Jinhui Ouyang · Hanhui Deng · Senem Velipasalar · Di Wu
AUNet: Learning Relations Between Action Units for Face Forgery Detection
Weiming Bai · Yufan Liu · Zhipeng Zhang · Bing Li · Weiming Hu
Physical-World Optical Adversarial Attacks on 3D Face Recognition
Yanjie Li · Yiquan Li · Xuelong Dai · Songtao Guo · Bin Xiao
Robust Single Image Reflection Removal Against Adversarial Attacks
Zhenbo Song · Zhenyuan Zhang · Kaihao Zhang · Wenhan Luo · Zhaoxin Fan · Wenqi Ren · Jianfeng Lu
The Enemy of My Enemy is My Friend: Exploring Inverse Adversaries for Improving Adversarial Training
Junhao Dong · Seyed-Mohsen Moosavi-Dezfooli · Jianhuang Lai · Xiaohua Xie
Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation
Bo Huang · Mingyang Chen · Yi Wang · JUNDA LU · Minhao Cheng · Wei Wang
Introducing Competition to Boost the Transferability of Targeted Adversarial Examples through Clean Feature Mixup
Junyoung Byun · Myung-Joon Kwon · Seungju Cho · Yoonji Kim · Changick Kim
Angelic Patches for Improving Third-Party Object Detector Performance
Wenwen Si · Shuo Li · Sangdon Park · Insup Lee · Osbert Bastani
Sibling-Attack: Rethinking Transferable Adversarial Attacks against Face Recognition
Zexin Li · Bangjie Yin · Taiping Yao · Junfeng Guo · Shouhong Ding · Simin Chen · Cong Liu
A Practical Upper Bound for the Worst-Case Attribution Deviations
Fan Wang · Adams Kong
You Are Catching My Attention: Are Vision Transformers Bad Learners under Backdoor Attacks?
Zenghui Yuan · Pan Zhou · Kai Zou · Yu Cheng
Architectural Backdoors in Neural Networks
Mikel Bober-Irizar · Ilia Shumailov · Yiren Zhao · Robert Mullins · Nicolas Papernot
The Dark Side of Dynamic Routing Neural Networks: Towards Efficiency Backdoor Injection
Simin Chen · Hanlin Chen · Mirazul Haque · Cong Liu · Wei Yang
StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning
Yuqian Fu · YU XIE · Yanwei Fu · Yu-Gang Jiang
Rethinking Domain Generalization for Face Anti-spoofing: Separability and Alignment
Yiyou Sun · Yaojie Liu · Xiaoming Liu · Yixuan Li · Vincent Chu
Make Landscape Flatter in Differentially Private Federated Learning
Yifan Shi · Yingqi Liu · Kang Wei · Li Shen · Xueqian Wang · Dacheng Tao
Confidence-aware Personalized Federated Learning via Variational Expectation Maximization
Junyi Zhu · Xingchen Ma · Matthew Blaschko
ScaleFL: Resource-Adaptive Federated Learning with Heterogeneous Clients
Fatih Ilhan · Gong Su · Ling Liu
MetaMix: Towards Corruption-Robust Continual Learning with Temporally Self-Adaptive Data Transformation
Zhenyi Wang · Li Shen · Donglin Zhan · Qiuling Suo · Yanjun Zhu · Tiehang Duan · Mingchen Gao
Revisiting Reverse Distillation for Anomaly Detection
Tran Dinh Tien · Anh Tuan Nguyen · Nguyen Tran · Huy Ta · Soan Duong · Chanh Nguyen · Steven Truong
Generating Anomalies for Video Anomaly Detection with Prompt-based Feature Mapping
Zuhao Liu · Xiao-Ming Wu · Dian Zheng · Kun-Yu Lin · Wei-Shi Zheng
Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection
Xincheng Yao · Ruoqi Li · Jing Zhang · Jun Sun · Chongyang Zhang
Towards Universal Fake Image Detectors that Generalize Across Generative Models
Utkarsh Ojha · Yuheng Li · Yong Jae Lee
Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision
Aditay Tripathi · Rishubh Singh · Anirban Chakraborty · Pradeep Shenoy
Sequential training of GANs against GAN-classifiers reveals correlated “knowledge gaps ” present among independently trained GAN instances
Arkanath Pathak · Nicholas Dufour
Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond
Zhengcong Fei · Mingyuan Fan · Li Zhu · Junshi Huang · Xiaoming Wei · Xiaolin Wei
Vector Quantization with Self-attention for Quality-independent Representation Learning
zhou yang · Weisheng Dong · Xin Li · Mengluan Huang · Yulin Sun · Guangming Shi
PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
Jiawei Liu · Lin Niu · Zhihang Yuan · Dawei Yang · Xinggang Wang · Wenyu Liu
Hard Sample Matters a Lot in Zero-Shot Quantization
Huantong Li · Xiangmiao Wu · fanbing Lv · Daihai Liao · Thomas Li · Yonggang Zhang · Bo Han · Mingkui Tan
Fair Scratch Tickets: Finding Fair Sparse Networks without Weight Training
Pengwei Tang · Wei Yao · Zhicong Li · Yong Liu
Understanding Deep Generative Models with Generalized Empirical Likelihoods
Suman Ravuri · Mélanie Rey · Shakir Mohamed · Marc Deisenroth
Deep Deterministic Uncertainty: A New Simple Baseline
Jishnu Mukhoti · Andreas Kirsch · Joost van Amersfoort · Philip Torr · Yarin Gal
Compacting Binary Neural Networks by Sparse Kernel Selection
Yikai Wang · Wenbing Huang · Yinpeng Dong · Fuchun Sun · Anbang Yao
Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures
Eugenia Iofinova · Alexandra Peste · Dan Alistarh
X-Pruner: eXplainable Pruning for Vision Transformers
Lu Yu · Wei Xiang
Deep Graph Reprogramming
Yongcheng Jing · Chongbin Yuan · Li Ju · Yiding Yang · Xinchao Wang · Dacheng Tao
FlowGrad: Controlling the Output of Generative ODEs with Gradients
Xingchao Liu · Lemeng Wu · Shujian Zhang · Chengyue Gong · Wei Ping · qiang liu
Exploring Data Geometry for Continual Learning
Zhi Gao · Chen Xu · Feng Li · Yunde Jia · Mehrtash Harandi · Yuwei Wu
Improving Generalization with Domain Convex Game
Fangrui Lv · Jian Liang · Shuang Li · Jinming Zhang · Di Liu
SLACK: Stable Learning of Augmentations with Cold-start and KL regularization
Juliette Marrie · Michael Arbel · Diane Larlus · Julien Mairal
Critical Learning Periods for Multisensory Integration in Deep Networks
Michael Kleinman · Alessandro Achille · Stefano Soatto
Preserving Linear Separability in Continual Learning by Backward Feature Projection
Qiao Gu · Dongsub Shim · Florian Shkurti
Multi-level Logit Distillation
Ying Jin · Jiaqi Wang · Dahua Lin
Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint
Shikang Yu · Jiachen Chen · Hu Han · Shuqiang Jiang
Masked Autoencoders Enable Efficient Knowledge Distillers
Yutong Bai · Zeyu Wang · Junfei Xiao · Chen Wei · Huiyu Wang · Alan Yuille · Yuyin Zhou · Cihang Xie
DKT: Diverse Knowledge Transfer Transformer for Class Incremental Learning
Xinyuan Gao · Yuhang He · SongLin Dong · Jie Cheng · Xing Wei · Yihong Gong
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh · Hyeji Hwang · Hee-young Lee · YongTaek Lim · Geunyoung Jung · Jiyoung Jung · Hosik Choi · Kyungwoo Song
PIVOT: Prompting for Video Continual Learning
Andres Villa · Juan Leon Alcazar · Motasem Alfarra · Kumail Alhamoud · Julio Hurtado · Fabian Caba · Alvaro Soto · Bernard Ghanem
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering
Jingjing Jiang · Nanning Zheng
NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging
Karim Guirguis · Johannes Meier · George Eskandar · Matthias Kayser · Bin Yang · Jürgen Beyerer
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning
Zeyin Song · Yifan Zhao · Yujun Shi · Peixi Peng · Li Yuan · Yonghong Tian
Improved Test-Time Adaptation for Domain Generalization
Liang Chen · Yong Zhang · Yibing Song · Ying Shan · Lingqiao Liu
TIPI: Test Time Adaptation with Transformation Invariance
Anh Tuan Nguyen · Thanh Nguyen-Tang · Ser-Nam Lim · Philip Torr
ActMAD: Activation Matching to Align Distributions for Test-Time-Training
Muhammad Mirza Mirza · Pol Jane Soneira · Wei Lin · Mateusz Kozinski · Horst Possegger · Horst Bischof
Modality-Agnostic Debiasing for Single Domain Generalization
Sanqing Qu · Yingwei Pan · Guang Chen · Ting Yao · changjun jiang · Tao Mei
ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-frequency Transform for Domain Generalization
Jintao Guo · Na Wang · Lei Qi · Yinghuan Shi
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation
Nazmul Karim · Niluthpol Chowdhury Mithun · Abhinav Rajvanshi · Han-pang Chiu · Supun Samarasekera · Nazanin Rahnavard
Adjustment and Alignment for Unbiased Open Set Domain Adaptation
Wuyang Li · Jie Liu · Bo Han · Yixuan Yuan
Semi-Supervised Domain Adaptation with Source Label Adaptation
Yu-Chu Yu · Hsuan-Tien Lin
Dynamically Instance-Guided Adaptation: A Backward-free Approach for Test-Time Domain Adaptive Semantic Segmentation
Wei Wang · Zhun Zhong · Weijie Wang · Xi Chen · Charles Ling · Boyu Wang · Nicu Sebe
FCC: Feature Clusters Compression for Long-Tailed Visual Recognition
Jian Li · Ziyao Meng · daqian Shi · Rui Song · Xiaolei Diao · Jingwen Wang · Hao Xu
DISC: Learning from Noisy Labels via Dynamic Instance-Specific Selection and Correction
Yifan Li · Hu Han · Shiguang Shan · Xilin CHEN
Superclass Learning with Representation Enhancement
Zeyu Gan · Suyun Zhao · Jinlong Kang · Liyuan Shang · Hong Chen · Cuiping Li
Improving Selective Visual Question Answering by Learning from Your Peers
Corentin Dancette · Spencer Whitehead · Rishabh Maheshwary · Shanmukha Ramakrishna Vedantam · Stefan Scherer · Xinlei Chen · Matthieu CORD · Marcus Rohrbach
Difficulty-based Sampling for Debiased Contrastive Representation Learning
Taeuk Jang · Xiaoqian Wang
Token Boosting for Robust Self-Supervised Visual Transformer Pre-training
Tianjiao Li · Lin Geng Foo · Ping Hu · Xindi Shang · Hossein Rahmani · Zehuan Yuan · Jun Liu
HyperMatch: Noise-Tolerant Semi-Supervised Learning via Relaxed Contrastive Constraint
Beitong Zhou · Jing Lu · Kerui Liu · Yunlu Xu · Zhanzhan Cheng · Yi Niu
Open-Set Likelihood Maximization for Few-Shot Learning
Malik Boudiaf · Etienne Bennequin · Myriam Tami · Antoine Toubhans · Pablo Piantanida · CELINE HUDELOT · Ismail Ayed
Transductive Few-Shot Learning with Prototypes Label-Propagation by Iterative Graph Refinement
Hao Zhu · Piotr Koniusz
Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric
Pengxin Zeng · Yunfan Li · Peng Hu · Dezhong Peng · Jiancheng Lv · Xi Peng
On the Effects of Self-supervision and Contrastive Alignment in Deep Multi-view Clustering
Daniel J. Trosten · Sigurd Løkse · Robert Jenssen · Michael Kampffmeyer
Sample-level Multi-view Graph Clustering
Yuze Tan · Yixi Liu · Shudong Huang · Wentao Feng · Jiancheng Lv
Discriminating Known from Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder
Aming WU · Cheng Deng
GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection
Xixi Liu · Yaroslava Lochman · Christopher Zach
RankMix: Data Augmentation for Weakly Supervised Learning of Classifying Whole Slide Images with Diverse Sizes and Imbalanced Categories
Yuan-Chih Chen · Chun-Shien Lu
Best of Both Worlds: Multimodal Contrastive Learning with Tabular and Imaging Data
Paul Hager · Martin J. Menten · Daniel Rueckert
DeGPR: Deep Guided Posterior Regularisation For Multi-Class Cell Detection And Counting
Aayush Tyagi · Chirag Mohapatra · Prasenjit Das · Govind Makharia · Lalita Mehra · Prathosh AP · Mausam .
OCELOT: Overlapped Cell on Tissue Dataset for Histopathology
Jeongun Ryu · Aaron Valero Puche · JaeWoong Shin · Seonwook Park · Biagio Brattoli · Jinhee Lee · Wonkyung Jung · Soo Ick Cho · Kyunghyun Paeng · Chan-Young Ock · Donggeun Yoo · Sérgio Pereira
SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection
Tiange Xiang · Yixiao Zhang · Yongyi Lu · Alan Yuille · Chaoyi Zhang · Weidong Cai · Zongwei Zhou
Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization
Mingze Yuan · Yingda Xia · Hexin Dong · Zifan Chen · Jiawen Yao · Mingyan Qiu · Ke Yan · Xiaoli Yin · Yu Shi · Xin Chen · Zaiyi Liu · Bin Dong · Jingren Zhou · Le Lu · Ling Zhang · Li Zhang
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery
Duowen Chen · Yunhao Bai · Wei Shen · Qingli Li · Lequan Yu · Yan Wang
(ML)
2
P-Encoder: On Exploration of Channel-class Correlation for Multi-label Zero-shot Learning
Ziming Liu · Song Guo · Xiaocheng Lu · Jingcai Guo · Jiewei Zhang · Yue Zeng · Fushuo Huo
Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning
Yu Wang · Pengchong Qiao · Chang Liu · Guoli Song · Xiawu Zheng · Jie Chen
Contrastive Mean Teacher for Domain Adaptive Object Detectors
Shengcao Cao · Dhiraj Joshi · Liangyan Gui · Yu-Xiong Wang
Harmonious Teacher for Cross-domain Object Detection
Jinhong Deng · Dongli Xu · Wen Li · Lixin Duan
Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection
Chuandong Liu · CHENQIANG GAO · Fangcen Liu · Pengcheng Li · Deyu Meng · Xinbo Gao
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
Jiacheng Zhang · Xiangru Lin · Wei Zhang · Kuo Wang · Xiao Tan · Junyu Han · Errui Ding · Jingdong Wang · Guanbin Li
Continual Detection Transformer for Incremental Object Detection
Yaoyao Liu · Bernt Schiele · Andrea Vedaldi · Christian Rupprecht
DA-DETR: Domain Adaptive Detection Transformer with Information Fusion
Jingyi Zhang · Jiaxing Huang · Zhipeng Luo · Gongjie Zhang · Xiaoqin Zhang · Shijian Lu
CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection
Yabo Liu · Jinghua Wang · Chao Huang · Yaowei Wang · Yong Xu
Box-Level Active Detection
Mengyao Lyu · Jundong Zhou · Hui Chen · Yi-Jie Huang · Dongdong Yu · Yaqian Li · Yandong Guo · Yuchen Guo · Liuyu Xiang · Guiguang Ding
Enhanced Training of Query-Based Object Detection via Selective Query Recollection
Fangyi Chen · Han Zhang · Kai Hu · Yu-Kai Huang · Chenchen Zhu · Marios Savvides
Vision Transformers are Good Mask Auto-Labelers
Shiyi Lan · Xitong Yang · Zhiding Yu · Zuxuan Wu · Jose Alvarez · Anima Anandkumar
Weakly Supervised Posture Mining for Fine-grained Classification
Zhenchao Tang · Hualin Yang · Calvin Yu-Chian Chen
IDGI: A Framework to Eliminate Explanation Noise from Integrated Gradients
Ruo Yang · Binghui Wang · Mustafa Bilgic
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm
Yichen Xie · Han Lu · Junchi Yan · Xiaokang Yang · Masayoshi Tomizuka · Wei Zhan
Instance-specific and Model-adaptive Supervision for Semi-supervised Semantic Segmentation
Zhen Zhao · Sifan Long · Jimin Pi · Jingdong Wang · Luping Zhou
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation
Yan Jin · Mengke LI · Yang Lu · Yiu-ming Cheung · Hanzi Wang
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation
Chaohui Yu · Qiang Zhou · Jingliang Li · Jianlong Yuan · Zhibin Wang · Fan Wang
Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation
Zesen Cheng · Pengchong Qiao · Kehan Li · Siheng Li · Pengxu Wei · Xiangyang Ji · Li Yuan · Chang Liu · Jie Chen
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He · Pengyu Li · Yifeng Geng · Xuansong Xie
On Calibrating Semantic Segmentation Models: Analyses and An Algorithm
Dongdong Wang · Boqing Gong · Liqiang Wang
Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers
Chenyang Lu · Daan de Geus · Gijs Dubbelman
Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark
Deyi Ji · Feng Zhao · Hongtao Lu · Mingyuan Tao · Jieping Ye
Few-shot Semantic Image Synthesis with Class Affinity Transfer
Marlene Careil · Jakob Verbeek · Stéphane Lathuilière
Network-free, unsupervised semantic segmentation with synthetic images
Qianli Feng · Raghudeep Gadde · Wentong Liao · Eduard Ramon · Aleix Martinez
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence
Yixuan Sun · Yiwen Huang · HaiJing Guo · Yuzhou Zhao · Runmin Wu · Yizhou Yu · Weifeng Ge · Wenqiang Zhang
GRES: Generalized Referring Expression Segmentation
Chang Liu · Henghui Ding · Xudong Jiang
Semantic Prompt for Few-Shot Image Recognition
Wentao Chen · Chenyang Si · Zhang Zhang · Liang Wang · Zilei Wang · Tieniu Tan
Contrastive Grouping with Transformer for Referring Image Segmentation
Jiajin Tang · Ge Zheng · Cheng Shi · Sibei YANG
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning
Xiaocheng Lu · Song Guo · Ziming Liu · Jingcai Guo
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning
Zhenyu Xie · Zaiyu Huang · Xin Dong · Fuwei Zhao · Haoye Dong · Xijin Zhang · Feida Zhu · Xiaodan Liang
OvarNet: Towards Open-vocabulary Object Attribute Recognition
Keyan Chen · Xiaolong Jiang · Yao Hu · Xu Tang · Yan Gao · Jianqi Chen · Weidi Xie
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models
Shan Ning · Longtian Qiu · Yongfei Liu · Xuming He
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Lewei Yao · Jianhua Han · Xiaodan Liang · Dan Xu · Wei Zhang · Zhenguo Li · Hang Xu
Data-efficient Large Scale Place Recognition with Graded Similarity Supervision
Maria Leyva-Vallina · Nicola Strisciuglio · Nicolai Petkov
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing
Zequn Zeng · Hao Zhang · Zhengjue Wang · Ruiying Lu · Dongsheng Wang · Bo Chen
Deep Hashing with Minimal-Distance-Separated Hash Centers
Liangdao Wang · Yan Pan · Cong Liu · Hanjiang Lai · Jian Yin · Ye Liu
Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment
Runqi Wang · Hao ZHENG · Xiaoyue Duan · Jianzhuang Liu · Yuning Lu · Tian Wang · Songcen Xu · Baochang Zhang
Masked Autoencoding Does Not Help Natural Language Supervision at Scale
Floris Weers · Vaishaal Shankar · Angelos Katharopoulos · Yinfei Yang · Tom Gunter
Improving Cross-Modal Retrieval with Set of Diverse Embeddings
Dongwon Kim · Namyup Kim · Suha Kwak
Revisiting Self-Similarity: Structural Embedding for Image Retrieval
Seongwon Lee · Suhyeon Lee · Hongje Seong · Euntai Kim
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Jihye Park · Sunwoo Kim · Soohyun Kim · Seokju Cho · Jaejun Yoo · Youngjung Uh · Seungryong Kim
Scaling Language-Image Pre-training via Masking
Yanghao Li · Haoqi Fan · Ronghang Hu · Christoph Feichtenhofer · Kaiming He
Variational Distribution Learning for Unsupervised Text-to-Image Generation
MINSOO KANG · Doyup Lee · Jiseob Kim · Saehoon Kim · Bohyung Han
Semantic-Conditional Diffusion Networks for Image Captioning
Jianjie Luo · Yehao Li · Yingwei Pan · Ting Yao · Jianlin Feng · Hongyang Chao · Tao Mei
Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style
Fengyin Lin · Mingkang Li · Da Li · Timothy Hospedales · Yi-Zhe Song · Yonggang Qi
MAGVLT: Masked Generative Vision-and-Language Transformer
Sungwoong Kim · Daejin Jo · Donghoon Lee · Jongmin Kim
SketchXAI: A First Look at Explainability for Human Sketches
Zhiyu Qu · Yulia Gryaditskaya · Ke Li · Kaiyue Pang · Tao Xiang · Yi-Zhe Song
Learning Geometry-aware Representations by Sketching
Hyundo Lee · Inwoo Hwang · Hyunsung Go · Won-Seok Choi · Kibeom Kim · Byoung-Tak Zhang
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo · Jiabo Huang · Shaogang Gong · Hailin Jin · Yang Liu
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting
Syed Talal Wasim · Muhammad Muzammal Naseer · Salman Khan · Fahad Khan · Mubarak Shah
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon · Sangeek Hyun · SangUk Park · Dongchan Park · Jae-Pil Heo
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning
Wei Ji · Renjie Liang · Zhedong Zheng · Wenqiao Zhang · Shengyu Zhang · Juncheng Li · Mengze Li · Tat-Seng Chua
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels
Jingqiu Zhou · Linjiang Huang · Liang Wang · Si Liu · Hongsheng Li
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization
Mamshad Nayeem Rizve · Gaurav Mittal · Ye Yu · Matthew Hall · Sandra Sajeev · Mubarak Shah · Mei Chen
Open Set Action Recognition via Multi-Label Evidential Learning
Chen Zhao · Dawei Du · Anthony Hoogs · Christopher Funk
Object Discovery from Motion-Guided Tokens
Zhipeng Bao · Pavel Tokmakov · Yu-Xiong Wang · Adrien Gaidon · Martial Hebert
Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling
Ryo Hachiuma · Fumiaki Sato · Taiki Sekii
Video Test-Time Adaptation for Action Recognition
Wei Lin · Muhammad Mirza Mirza · Mateusz Kozinski · Horst Possegger · Hilde Kuehne · Horst Bischof
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Tiantian Geng · Teng WANG · Jinming Duan · Runmin Cong · Feng Zheng
A Light Weight Model for Active Speaker Detection
Junhua Liao · Haihan Duan · Kanghui Feng · WanBing Zhao · Yanbing Yang · Liangyin Chen
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
Paul Hongsuck Seo · Arsha Nagrani · Cordelia Schmid
Egocentric Audio-Visual Object Localization
Chao Huang · Yapeng Tian · Anurag Kumar · Chenliang Xu
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Tsu-Jui Fu · Linjie Li · Zhe Gan · Kevin Lin · William Yang Wang · Lijuan Wang · Zicheng Liu
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo · Semin Kim · Doyup Lee · Chiheon Kim · Seunghoon Hong
Unifying Short and Long-Term Tracking with Graph Hierarchies
Orcun Cetintas · Guillem Braso · Laura Leal-Taixé
Hierarchical Neural Memory Network for Low Latency Event Processing
Ryuhei Hamaguchi · Yasutaka Furukawa · Masaki Onishi · Ken Sakurada
Mask-Free Video Instance Segmentation
Lei Ke · Martin Danelljan · Henghui Ding · Yu-Wing Tai · Chi-Keung Tang · Fisher Yu
Hierarchical Semantic Contrast for Scene-aware Video Anomaly Detection
Shengyang Sun · Xiaojin Gong
Breaking the “Object ” in Video Object Segmentation
Pavel Tokmakov · Jie Li · Adrien Gaidon
VideoTrack: Learning to Track Objects via Video Transformer
Fei Xie · Lei Chu · Jiahao Li · Yan Lu · Chao Ma
Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models
Paul Micaelli · Arash Vahdat · Hongxu Yin · Jan Kautz · Pavlo Molchanov
Unbiased Scene Graph Generation in Videos
Sayak Nag · Kyle Min · Subarna Tripathi · Amit Roy-Chowdhury
Graph Representation for Order-aware Visual Transformation
Yue Qiu · Yanjun Sun · Fumiya Matsuzawa · Kenji Iwata · Hirokatsu Kataoka
Prototype-based Embedding Network for Scene Graph Generation
Chaofan Zheng · Xinyu Lyu · Lianli Gao · Bo Dai · Jingkuan Song
Efficient Mask Correction for Click-Based Interactive Image Segmentation
Fei Du · Jianlong Yuan · Zhibin Wang · Fan Wang
G-MSM: Unsupervised Multi-Shape Matching with Graph-based Affinity Priors
Marvin Eisenberger · Aysim Toker · Laura Leal-Taixé · Daniel Cremers
Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification
Jiawei Feng · Ancong Wu · Wei-Shi Zheng
Mixed Autoencoder for Self-supervised Visual Representation Learning
Kai Chen · Zhili LIU · Lanqing HONG · Hang Xu · Zhenguo Li · Dit-Yan Yeung
Stare at What You See: Masked Image Modeling without Reconstruction
Hongwei Xue · Peng Gao · Hongyang Li · Yu Qiao · Hao Sun · Houqiang Li · Jiebo Luo
ResFormer: Scaling ViTs with Multi-Resolution Training
Rui Tian · Zuxuan Wu · Qi Dai · Han Hu · Yu Qiao · Yu-Gang Jiang
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding
Zijiao Chen · Jiaxin Qing · Tiange Xiang · Wan Lin Yue · Juan Zhou Zhou
DropKey for Vision Transformer
Bonan Li · Yinhan Hu · Xuecheng Nie · Congying Han · Xiangjian Jiang · Tiande Guo · Luoqi Liu
Vision Transformer with Super Token Sampling
Huaibo Huang · Xiaoqiang Zhou · Jie Cao · Ran He · Tieniu Tan
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Cong Wei · Brendan Duke · Ruowei Jiang · Parham Aarabi · Graham Taylor · Florian Shkurti
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao · Shen Nie · Kaiwen Xue · Yue Cao · Chongxuan Li · Hang Su · Jun Zhu
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Chong Yu · Tao Chen · Zhongxue Gan · Jiayuan Fan
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
Yihao Chen · Xianbiao Qi · Jianan Wang · Lei Zhang
Structured Sparsity Learning for Efficient Video Super-Resolution
Bin Xia · Jingwen He · Yulun Zhang · Yitong Wang · Yapeng Tian · Wenming Yang · Luc Van Gool
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos
Yubin Hu · Yuze He · Yanghao Li · Jisheng Li · Yuxing Han · jiangtao wen · Yong-jin Liu
Neural Video Compression with Diverse Contexts
Jiahao Li · Bin Li · Yan Lu
Large-capacity and Flexible Video Steganography via Invertible Neural Network
Chong Mou · Youmin Xu · Jiechong Song · Chen Zhao · Bernard Ghanem · Jian Zhang
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang · Zhendong Mao · Zhuowei Chen · Yongdong Zhang
Binary Latent Diffusion
Ze Wang · Jiang Wang · Zicheng Liu · Qiang Qiu
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
Andreas Blattmann · Robin Rombach · Huan Ling · Tim Dockhorn · Seung Wook Kim · Sanja Fidler · Karsten Kreis
Diffusion Probabilistic Model Made Slim
Xingyi Yang · Daquan Zhou · Jiashi Feng · Xinchao Wang
Solving 3D Inverse Problems from Pre-trained 2D Diffusion Models
Hyungjin Chung · Dohoon Ryu · Michael McCann · Marc Klasky · Jong Ye
EDICT: Exact Diffusion Inversion via Coupled Transformations
Bram Wallace · Akash Gokul · Nikhil Naik
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
Patrick Schramowski · Manuel Brack · Björn Deiseroth · Kristian Kersting
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li · Haotian Liu · Qingyang Wu · Fangzhou Mu · Jianwei Yang · Jianfeng Gao · Chunyuan Li · Yong Jae Lee
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz · Yuanzhen Li · Varun Jampani · Yael Pritch · Michael Rubinstein · Kfir Aberman
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
Guangcong Zheng · Xianpan Zhou · Xuewei Li · Zhongang Qi · Ying Shan · Xi Li
Affordance Diffusion: Synthesizing Hand-Object Interactions
Yufei Ye · Xueting Li · Abhinav Gupta · Shalini De Mello · Stan Birchfield · Jiaming Song · Shubham Tulsiani · Sifei Liu
SceneComposer: Any-Level Semantic Image Synthesis
Yu Zeng · Zhe Lin · Jianming Zhang · Qing Liu · John Collomosse · Jason Kuen · Vishal Patel
Handwritten Text Generation from Visual Archetypes
Vittorio Pippi · Silvia Cascianelli · Rita Cucchiara
Referring Image Matting
Jizhizi Li · Jing Zhang · Dacheng Tao
Neural Transformation Fields for Arbitrary-Styled Font Generation
Bin Fu · Junjun He · Jianjun Wang · Yu Qiao
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Mode
Shaoan Xie · Zhifei Zhang · Zhe Lin · Tobias Hinz · Kun Zhang
Masked and Adaptive Transformer for Exemplar Based Image Translation
chang jiang · Fei Gao · Biao Ma · Lin Yuhao · Nannan Wang · Gang Xu
Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis
Thuan Nguyen · Thanh Le · Anh Tran
RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-ray Security Image Synthesis
luwen duan · Min Wu · Lijian Mao · Jun Yin · Xiong Jianping · Xi Li
Towards Artistic Image Aesthetics Assessment: a Large-scale Dataset and a New Method
Ran Yi · Haoyuan Tian · Zhihao Gu · Yu-Kun Lai · Paul Rosin
Omni Aggregation Networks for Lightweight Image Super-Resolution
Hang Wang · Xuanhong Chen · Bingbing Ni · Yutian Liu · Jinfan Liu
Activating More Pixels in Image Super-Resolution Transformer
Xiangyu Chen · Xintao Wang · Jiantao Zhou · Yu Qiao · Chao Dong
Spatial-Frequency Mutual Learning for Face Super-Resolution
Chenyang Wang · Junjun Jiang · Zhiwei Zhong · Xianming Liu
Kernel Aware Resampler
Michael Bernasconi · Abdelaziz Djelouah · Farnood Salehi · Markus Gross · Christopher Schroers
RGB no more: Minimally-decoded JPEG Vision Transformers
Jeongsoo Park · Justin Johnson
Multi-Realism Image Compression with a Conditional Generator
Eirikur Agustsson · David Minnen · George Toderici · Fabian Mentzer
Learning to Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization
Haina Qin · Longfei Han · Weihua Xiong · Juan Wang · Wentao Ma · Bing Li · Weiming Hu
Quality-aware Pre-trained Models for Blind Image Quality Assessment
Kai Zhao · Kun Yuan · Ming Sun · Mading Li · Xing Wen
Robust Unsupervised StyleGAN Image Restoration
Yohan Poirier-Ginter · Jean-Francois Lalonde
RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors
Rui-Qi Wu · Zheng-Peng Duan · Chunle Guo · Zhi Chai · Chongyi Li
Toward Stable, Interpretable, and Lightweight Hyperspectral Super-resolution
Wenjin Guo · Weiying Xie · Kai Jiang · Yunsong Li · Jie Lei · Leyuan Fang
Residual Degradation Learning Unfolding Framework with Mixing Priors across Spectral and Spatial for Compressive Spectral Imaging
Yubo Dong · Dahua Gao · Tian Qiu · Yuyan Li · Minxi Yang · Guangming Shi
Learning a Simple Low-light Image Enhancer from Paired Low-light Instances
Zhenqi Fu · Yan Yang · Xiaotong Tu · Yue Huang · Xinghao Ding · Kai-Kuang Ma
Learning a Deep Color Difference Metric for Photographic Images
Haoyu Chen · Zhihua Wang · Yang Yang · Qilin Sun · Kede Ma
Learning a Practical SDR-to-HDRTV Up-conversion using New Dataset and Degradation Models
Cheng Guo · Leidong Fan · Ziyu Xue · Xiuhua Jiang
BiasBed - Rigorous Texture Bias Evaluation
Nikolai Kalischek · Rodrigo Daudt · Torben Peters · Reinhard Furrer · Jan D. Wegner · Konrad Schindler
A Unified HDR Imaging Method with Pixel and Patch Level
Qingsen Yan · Weiye Chen · song zhang · Yu Zhu · Jinqiu Sun · Yanning Zhang
Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement
Nancy Mehta · Akshay Dudhane · Subrahmanyam Murala · Syed Waqas Zamir · Salman Khan · Fahad Khan
Deep Discriminative Spatial and Temporal Network for Efficient Video Deblurring
Jinshan Pan · Boming Xu · Jiangxin Dong · Jianjun Ge · Jinhui Tang
1000 FPS HDR Video with a Spike-RGB Hybrid Camera
Yakun Chang · Chu Zhou · Yuchen Hong · hu liwen · Chao Xu · Tiejun Huang · Boxin Shi
Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation
Kun Zhou · Wenbo Li · Xiaoguang Han · Jiangbo Lu
Range-nullspace Video Frame Interpolation with Focalized Motion Estimation
Zhiyang Yu · Yu Zhang · Dongqing Zou · Xijun Chen · Jimmy Ren · Shunqing Ren
Deep Polarization Reconstruction with PDAVIS Events
Haiyang Mei · Zuowen Wang · Xin Yang · Xiaopeng Wei · Tobi Delbruck
Unsupervised space-time network for temporally-consistent segmentation of multiple motions
Etienne Meunier · Patrick Bouthemy
NeMo: Learning 3D Neural Motion Fields from Multiple Video Instances of the Same Action
Kuan-Chieh Wang · Zhenzhen Weng · Maria Xenochristou · Joao Araujo · Jeffrey Gu · Karen Liu · Serena Yeung
TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompted Reconstruction for Person Re-Identification
Haocong Rao · Chunyan Miao
FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Yansong Tang · Jinpeng Liu · Aoyang Liu · Bin Yang · Wenxun Dai · Yongming Rao · Jiwen Lu · Jie Zhou · Xiu Li
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Bowen Zhang · Chenyang Qi · Pan Zhang · Bo Zhang · HsiangTao Wu · Dong Chen · Qifeng Chen · Yong Wang · Fang Wen
Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition
Zhijun Zhai · Jianhui Zhao · Chengjiang Long · Wenju Xu · He Shuangjiang · huijuan zhao
Clothing-Change Feature Augmentation for Person Re-Identification
Ke Han · Shaogang Gong · Yan Huang · Liang Wang · Tieniu Tan
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Yuang Zhang · Tiancai Wang · Xiangyu Zhang
Camouflaged Object Detection with Feature Decomposition and Edge Reconstruction
Chunming He · Kai Li · Yachao Zhang · Longxiang Tang · Yulun Zhang · Zhenhua Guo · Xiu Li
Source-free Adaptive Gaze Estimation with Uncertainty Reduction
Xin Cai · Jiabei Zeng · Shiguang Shan · Xilin CHEN
PyPose: A Library for Robot Learning with Physics-based Optimization
Chen Wang · Dasong Gao · Kuan Xu · Junyi Geng · Yaoyu Hu · Yuheng Qiu · Bowen Li · Fan Yang · Brady Moon · Abhinav Pandey · Aryan FNU · Jiahe Xu · Tianhao Wu · Haonan He · Daning Huang · Zhongqiang Ren · Shibo Zhao · Taimeng Fu · Pranay Reddy Anthireddy · Xiao Lin · Wenshan Wang · Jingnan Shi · Rajat Talak · Kun Cao · Yi Du · Han Wang · Huai Yu · Shanzhao Wang · Siyu Chen · Ananth Kashyap · Rohan Bandaru · Karthik Dantu · Jiajun Wu · Lihua Xie · Luca Carlone · Marco Hutter · Sebastian Scherer
Stimulus Verification is a Universal and Effective Sampler in Multi-modal Human Trajectory Prediction
Jianhua Sun · Yuxuan Li · Liang Chai · Cewu Lu
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments
Sean Kulinski · Nicholas Waytowich · James Hare · David I. Inouye
ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals
Xishun Wang · Tong Su · Fang Da · Xiaodong Yang
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving
Xiaosong Jia · Penghao Wu · Li Chen · Jiangwei Xie · Conghui He · Junchi Yan · Hongyang Li
HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining
SHIXIANG TANG · Cheng Chen · Meilin Chen · Qingsong Xie · Yizhou Wang · Yuanzheng Ci · LEI BAI · Feng Zhu · Haiyang Yang · Li Yi · Rui Zhao · Wanli Ouyang
BEV-Guided Multi-Modality Fusion for Driving Perception
Yunze Man · Liangyan Gui · Yu-Xiong Wang
Robust and Scalable Gaussian Process Regression and Its Applications
Yifan Lu · Jiayi Ma · Leyuan Fang · Xin Tian · Junjun Jiang
Tangentially Elongated Gaussian Belief Propagation for Event-based Incremental Optical Flow Estimation
Jun Nagata · Yusuke Sekikawa
Adaptive Annealing for Robust Geometric Estimation
Sidhartha Chitturi · Lalit Manam · Venu Madhav Govindu
Iterative Geometry Encoding Volume for Stereo Matching
Xu Gangwei · Xianqi Wang · Xiaohuan Ding · Xin Yang
PMatch: Paired Masked Image Modeling for Dense Geometric Matching
Shengjie Zhu · Xiaoming Liu
Adaptive Spot-Guided Transformer for Consistent Local Feature Matching
Jiahuan Yu · Jiahao Chang · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Feng Wu
Learning Rotation-Equivariant Features for Visual Correspondence
Jongmin Lee · Byungjin Kim · Seungwook Kim · Minsu Cho
UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement
Sisi You · Hantao Yao · Bing-Kun BAO · Changsheng Xu
Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching
Paul Rötzer · Zorah Laehner · Florian Bernard
LP-DIF: Learning Local Pattern-specific Deep Implicit Function for 3D Objects and Scenes
Meng Wang · Yushen Liu · Yue Gao · Kanle Shi · Yi Fang · Zhizhong Han
HGNet: Learning Hierarchical Geometry from Points, Edges, and Surfaces
Ting Yao · Yehao Li · Yingwei Pan · Tao Mei
Neural Intrinsic Embedding for Non-rigid Point Cloud Matching
puhua jiang · Mingze Sun · Ruqi Huang
PointClustering: Unsupervised Point Cloud Pre-training using Transformation Invariance in Clustering
Fuchen Long · Ting Yao · Zhaofan Qiu · Lusong Li · Tao Mei
Self-positioning Point-based Transformer for Point Cloud Understanding
Jinyoung Park · Sanghyeok Lee · Sihyeon Kim · Yunyang Xiong · Hyunwoo Kim
PointConvFormer: Revenge of the Point-Based Convolution
Wenxuan Wu · Li Fuxin · Qi Shan
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
Renrui Zhang · Liuhui Wang · Yu Qiao · Peng Gao · Hongsheng Li
Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation
Yuwei Yang · Munawar Hayat · Zhao Jin · Chao Ren · Yinjie Lei
Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions
Yurui Zhu · Tianyu Wang · Xueyang Fu · Xuanyu Yang · Xin Guo · Jifeng Dai · Yu Qiao · Xiaowei Hu
PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models
Minghua Liu · Yinhao Zhu · Hong Cai · Shizhong Han · Zhan Ling · Fatih Porikli · Hao Su
Semi-Weakly Supervised Object Kinematic Motion Prediction
Gengxin Liu · Qian Sun · Haibin Huang · Chongyang Ma · Yulan Guo · Li Yi · Hui Huang · Ruizhen Hu
Implicit Surface Contrastive Clustering for LiDAR Point Clouds
Zaiwei Zhang · Min Bai · Li Erran Li
LaserMix for Semi-Supervised LiDAR Semantic Segmentation
Lingdong Kong · Jiawei Ren · Liang Pan · Ziwei Liu
MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving
Jiale Li · Hang Dai · Hao Han · Yong Ding
GraVoS: Voxel Selection for 3D Point-Cloud Detection
Oren Shrout · Yizhak Ben-Shabat · Ayellet Tal
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
Yukang Chen · Jianhui Liu · Xiangyu Zhang · XIAOJUAN QI · Jiaya Jia
Virtual Sparse Convolution for Multimodal 3D Object Detection
Hai Wu · Chenglu Wen · Shaoshuai Shi · Xin Li · Cheng Wang
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection
Yang Jiao · ZEQUN JIE · Shaoxiang Chen · Jingjing Chen · Lin Ma · Yu-Gang Jiang
OrienterNet: Visual Localization in 2D Public Maps with Neural Matching
Paul-Edouard Sarlin · Daniel DeTone · Tsun-Yi Yang · Armen Avetisyan · Julian Straub · Tomasz Malisiewicz · Samuel Rota Bulò · Richard Newcombe · Peter Kontschieder · Vasileios Balntas
Uncertainty-aware Vision-based Metric Cross-view Geolocalization
Florian Fervers · Sebastian Bullinger · Christoph Bodensteiner · Michael Arens · Rainer Stiefelhagen
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection
Lei Yang · Kaicheng Yu · tao tang · Jun Li · Kun Yuan · Li Wang · Xinyu Zhang · Peng Chen
Understanding the Robustness of 3D Object Detection with Bird’s-Eye-View Representations in Autonomous Driving
Zijian Zhu · Yichi Zhang · Hai Chen · Yinpeng Dong · Shu Zhao · Wenbo Ding · Jiachen Zhong · Shibao Zheng
Object Detection with Self-Supervised Scene Adaptation
ZEKUN ZHANG · Minh Hoai
AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chengjian Feng · ZEQUN JIE · Yujie Zhong · Xiangxiang Chu · Lin Ma
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Kaixin Xiong · Shi Gong · Xiaoqing Ye · Xiao Tan · Ji Wan · Errui Ding · Jingdong Wang · Xiang Bai
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Ziqin Wang · Bowen Cheng · Lichen Zhao · Dong Xu · Yang Tang · Lyu Sheng
Modality-invariant Visual Odometry for Embodied Vision
Marius Memmel · Roman Bachmann · Amir Zamir
Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes
Rui Li · Dong Gong · Wei Yin · Hao Chen · Yu Zhu · Kaixuan Wang · Xiaozhi Chen · Jinqiu Sun · Yanning Zhang
OmniVidar: Omnidirectional Depth Estimation from Multi-Fisheye Images
Sheng Xie · Daochuan Wang · Yun-Hui Liu
DINN360: Deformable Invertible Neural Networks for Latitude-aware 360
\degree
Image Rescaling
Yichen Guo · Mai Xu · Lai Jiang · Ning Li · Leon Sigal · Yunjin Chen
GeoMVSNet: Learning Multi-View Stereo with Geometry Perception
Zhe Zhang · Rui Peng · Yuxi Hu · Ronggang Wang
A Practical Stereo Depth System for Smart Glasses
Jialiang Wang · Daniel Scharstein · Akash Bapat · Kevin Blackburn-Matzen · Matthew Yu · Jonathan Lehman · Suhib Alsisan · Yanghan Wang · Sam Tsai · Jan-Michael Frahm · Zijian He · Peter Vajda · Michael Cohen · Matt Uyttendaele
DC
2
: Dual-Camera Defocus Control by Learning to Refocus
Hadi AlZayer · Abdullah Abuolaim · Leung Chun Chan · Yang Yang · Ying Lou · Jia-Bin Huang · Abhishek Kar
iDisc: Internal Discretization for Monocular Depth Estimation
Luigi Piccinelli · Christos Sakaridis · Fisher Yu
SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks
Sergio Izquierdo · Javier Civera
Inverting the Imaging Process by Learning an Implicit Camera Model
Xin Huang · Qi Zhang · Ying Feng · Hongdong Li · Qing Wang
Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space
Tianxin Huang · Zhonggan Ding · Jiangning Zhang · Ying Tai · Zhenyu Zhang · Mingang Chen · Chengjie Wang · Yong Liu
Better “CMOS ” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution
Xuhai Chen · Jiangning Zhang · Chao Xu · Yabiao Wang · Chengjie Wang · Yong Liu
Delivering Arbitrary-Modal Semantic Segmentation
Jiaming Zhang · Ruiping Liu · Hao Shi · Kailun Yang · Simon Reiß · Haodong Fu · Kunyu Peng · Kaiwei Wang · Rainer Stiefelhagen
Efficient Hierarchical Entropy Model for Learned Point Cloud Compression
Rui Song · Chunyang Fu · Shan Liu · Ge Li
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring
Ruyang Liu · Jingjia Huang · Ge Li · Jiashi Feng · Xinglong Wu · Thomas Li
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Feng Liang · Bichen Wu · Xiaoliang Dai · Kunpeng Li · Yinan Zhao · Hang Zhang · Peizhao Zhang · Peter Vajda · Diana Marculescu
Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar · Shiran Zada · Oran Lang · Omer Tov · Huiwen Chang · Tali Dekel · Inbar Mosseri · michal Irani
Neumann Network with Recursive Kernels for Single Image Defocus Deblurring
Yuhui Quan · Zicong Wu · Hui Ji
Transfer4D: A framework for frugal motion capture and deformation transfer
Shubh Maheshwari · Rahul Narain · Ramya Hebbalaguppe
Iterative Proposal Refinement for Weakly-Supervised Video Grounding
Meng Cao · Fangyun Wei · Can Xu · Xiubo Geng · Long Chen · Can Zhang · Yuexian Zou · Tao Shen · Daxin Jiang
X
3
KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection
Marvin Klingner · Shubhankar Borse · Varun Ravi Kumar · Behnaz Rezaei · Venkatraman Narayanan · Senthil Yogamani · Fatih Porikli
AnyFlow: Arbitrary Scale Optical Flow with Implicit Neural Representation
Hyunyoung Jung · Zhuo Hui · Lei Luo · Haitao Yang · Feng Liu · Sungjoo Yoo · Rakesh Ranjan · Denis Demandolx
IterativePFN: True Iterative Point Cloud Filtering
Dasith de Silva Edirimuni · Xuequan Lu · Zhiwen Shao · Gang Li · Antonio Robles-Kelly · Ying He
Fake it till you make it: Learning transferable representations from synthetic ImageNet clones
Mert Bulent Sariyildiz · Karteek Alahari · Diane Larlus · Yannis Kalantidis
Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness
Zhijie Shen · Zishuo Zheng · Chunyu Lin · Lang Nie · Kang Liao · Shuai Zheng · Yao Zhao
Exploring Incompatible Knowledge Transfer in Few-shot Image Generation
Yunqing Zhao · Chao Du · Milad Abdollahzadeh · Tianyu Pang · Min Lin · Shuicheng YAN · Ngai-man Cheung
OmniObject3D: Large Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
Tong Wu · Jiarui Zhang · Xiao Fu · Yuxin WANG · Jiawei Ren · Liang Pan · Wenyan Wu · Lei Yang · Jiaqi Wang · Chen Qian · Dahua Lin · Ziwei Liu
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu · Hao Zhu · Liming Jiang · CHEN CHANGE LOY · Weidong Cai · Wenyan Wu
TensoIR: Tensorial Inverse Rendering
Haian Jin · Isabella Liu · Peijia Xu · Xiaoshuai Zhang · Songfang Han · Sai Bi · Xiaowei Zhou · Zexiang Xu · Hao Su
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation
Jiangwei Lao · Weixiang Hong · Xin Guo · Yingying Zhang · Wang Jian · Jingdong Chen · Wei Chu
Integral Neural Networks
Kirill Solodskikh · Azim Kurbanov · Ruslan Aydarkhanov · Irina Zhelavskaya · Yury Parfenov · Dehua Song · Stamatios Lefkimmiatis
FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework For Long-tail Trajectory Prediction
创心域SEO版权声明:以上内容作者已申请原创保护,未经允许不得转载,侵权必究!授权事宜、对本内容有异议或投诉,敬请联系网站管理员,我们将尽快回复您,谢谢合作!