• Skip to main content
  • Skip to primary sidebar
AAAI

AAAI

Association for the Advancement of Artificial Intelligence

    • AAAI

      AAAI

      Association for the Advancement of Artificial Intelligence

  • About AAAIAbout AAAI
    • News
    • Officers and Committees
    • Staff
    • Bylaws
    • Awards
      • Fellows Program
      • Classic Paper Award
      • Dissertation Award
      • Distinguished Service Award
      • Allen Newell Award
      • Outstanding Paper Award
      • AI for Humanity Award
      • Feigenbaum Prize
      • Patrick Henry Winston Outstanding Educator Award
      • Engelmore Award
      • AAAI ISEF Awards
      • Senior Member Status
      • Conference Awards
    • Partnerships
    • Resources
    • Mailing Lists
    • Past Presidential Addresses
    • AAAI 2025 Presidential Panel on the Future of AI Research
    • Presidential Panel on Long-Term AI Futures
    • Past Policy Reports
      • The Role of Intelligent Systems in the National Information Infrastructure (1995)
      • A Report to ARPA on Twenty-First Century Intelligent Systems (1994)
    • Logos
  • aaai-icon_ethics-diversity-line-yellowEthics & Diversity
  • Conference talk bubbleConferences & Symposia
    • AAAI Conference
    • AIES AAAI/ACM
    • AIIDE
    • EAAI
    • HCOMP
    • IAAI
    • ICWSM
    • Spring Symposia
    • Summer Symposia
    • Fall Symposia
    • Code of Conduct for Conferences and Events
  • PublicationsPublications
    • AI Magazine
    • Conference Proceedings
    • AAAI Publication Policies & Guidelines
    • Request to Reproduce Copyrighted Materials
    • Contribute
    • Order Proceedings
  • aaai-icon_ai-magazine-line-yellowAI Magazine
  • MembershipMembership
    • Member Login
    • Chapters

  • Career CenterAI Jobs
  • aaai-icon_ai-topics-line-yellowAITopics
  • aaai-icon_contact-line-yellowContact

  • Twitter
  • Facebook
  • LinkedIn
Home / Proceedings / Proceedings of the AAAI Conference on Artificial Intelligence, 36 /

No. 3: AAAI-22 Technical Tracks 3

AAAI Technical Track on Computer Vision III

  • Dual Decoupling Training for Semi-supervised Object Detection with Noise-Bypass Head

    Shida Zheng, Chenshu Chen, Xiaowei Cai, Tingqun Ye, Wenming Tan

    3526-3534

    PDF
  • SCALoss: Side and Corner Aligned Loss for Bounding Box Regression

    Tu Zheng, Shuai Zhao, Yang Liu, Zili Liu, Deng Cai

    3535-3543

    PDF
  • SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation

    Dongzhan Zhou, Xinchi Zhou, Di Hu, Hang Zhou, Lei Bai, Ziwei Liu, Wanli Ouyang

    3544-3552

    PDF
  • Pan-Sharpening with Customized Transformer and Invertible Neural Network

    Man Zhou, Jie Huang, Yanchi Fang, Xueyang Fu, Aiping Liu

    3553-3561

    PDF
  • Promoting Single-Modal Optical Flow Network for Diverse Cross-Modal Flow Estimation

    Shili Zhou, Weimin Tan, Bo Yan

    3562-3570

    PDF
  • Edge-Aware Guidance Fusion Network for RGB–Thermal Scene Parsing

    Wujie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian

    3571-3579

    PDF
  • TiGAN: Text-Based Interactive Image Generation and Manipulation

    Yufan Zhou, Ruiyi Zhang, Jiuxiang Gu, Chris Tensmeyer, Tong Yu, Changyou Chen, Jinhui Xu, Tong Sun

    3580-3588

    PDF
  • Cross-Domain Empirical Risk Minimization for Unbiased Long-Tailed Classification

    Beier Zhu, Yulei Niu, Xian-Sheng Hua, Hanwang Zhang

    3589-3597

    PDF
  • Deep Recurrent Neural Network with Multi-Scale Bi-directional Propagation for Video Deblurring

    Chao Zhu, Hang Dong, Jinshan Pan, Boyang Liang, Yuhao Huang, Lean Fu, Fei Wang

    3598-3607

    PDF
  • I Can Find You! Boundary-Guided Separated Attention Network for Camouflaged Object Detection

    Hongwei Zhu, Peng Li, Haoran Xie, Xuefeng Yan, Dong Liang, Dapeng Chen, Mingqiang Wei, Jing Qin

    3608-3616

    PDF
  • MoCaNet: Motion Retargeting In-the-Wild via Canonicalization Networks

    Wentao Zhu, Zhuoqian Yang, Ziang Di, Wayne Wu, Yizhou Wang, Chen Change Loy

    3617-3625

    PDF
  • Robust Depth Completion with Uncertainty-Driven Loss Functions

    Yufan Zhu, Weisheng Dong, Leida Li, Jinjian Wu, Xin Li, Guangming Shi

    3626-3634

    PDF
  • Efficient Model-Driven Network for Shadow Removal

    Yurui Zhu, Zeyu Xiao, Yanchi Fang, Xueyang Fu, Zhiwei Xiong, Zheng-Jun Zha

    3635-3643

    PDF
  • Learning Disentangled Classification and Localization Representations for Temporal Action Localization

    Zixin Zhu, Le Wang, Wei Tang, Ziyi Liu, Nanning Zheng, Gang Hua

    3644-3652

    PDF
  • ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation

    Chuanqing Zhuang, Zhengda Lu, Yiqun Wang, Jun Xiao, Ying Wang

    3653-3661

    PDF
  • Making Adversarial Examples More Transferable and Indistinguishable

    Junhua Zou, Yexin Duan, Boyu Li, Wu Zhang, Yu Pan, Zhisong Pan

    3662-3670

    PDF
  • Class Guided Channel Weighting Network for Fine-Grained Semantic Segmentation

    Xiang Zhang, Wanqing Zhao, Hangzai Luo, Jinye Peng, Jianping Fan

    3344-3352

    PDF
  • Context-Based Contrastive Learning for Scene Text Recognition

    Xinyun Zhang, Binwu Zhu, Xufeng Yao, Qi Sun, Ruiyu Li, Bei Yu

    3353-3361

    PDF
  • Learning Network Architecture for Open-Set Recognition

    Xuelin Zhang, Xuelian Cheng, Donghao Zhang, Paul Bonnington, Zongyuan Ge

    3362-3370

    PDF
  • An Adversarial Framework for Generating Unseen Images by Activation Maximization

    Yang Zhang, Wang Zhou, Gaoyuan Zhang, David Cox, Shiyu Chang

    3371-3379

    PDF
  • Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation

    Yujia Zhang, Lai-Man Po, Xuyuan Xu, Mengyang Liu, Yexin Wang, Weifeng Ou, Yuzhi Zhao, Wing-Yin Yu

    3380-3389

    PDF
  • Pose-Invariant Face Recognition via Adaptive Angular Distillation

    Zhenduo Zhang, Yongru Chen, Wenming Yang, Guijin Wang, Qingmin Liao

    3390-3398

    PDF
  • End-to-End Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

    Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

    3399-3407

    PDF
  • PetsGAN: Rethinking Priors for Single Image Generation

    Zicheng Zhang, Yinglu Liu, Congying Han, Hailin Shi, Tiande Guo, Bowen Zhou

    3408-3416

    PDF
  • Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

    Zizhao Zhang, Han Zhang, Long Zhao, Ting Chen, Sercan Ö. Arik, Tomas Pfister

    3417-3425

    PDF
  • OA-FSUI2IT: A Novel Few-Shot Cross Domain Object Detection Framework with Object-Aware Few-Shot Unsupervised Image-to-Image Translation

    Lifan Zhao, Yunlong Meng, Lin Xu

    3426-3435

    PDF
  • Static-Dynamic Co-teaching for Class-Incremental 3D Object Detection

    Na Zhao, Gim Hee Lee

    3436-3445

    PDF
  • Local Surface Descriptor for Geometry and Feature Preserved Mesh Denoising

    Wenbo Zhao, Xianming Liu, Junjun Jiang, Debin Zhao, Ge Li, Xiangyang Ji

    3446-3453

    PDF
  • Boosting Generative Zero-Shot Learning by Synthesizing Diverse Features with Attribute Augmentation

    Xiaojie Zhao, Yuming Shen, Shidong Wang, Haofeng Zhang

    3454-3462

    PDF
  • Self-Supervised Pretraining for RGB-D Salient Object Detection

    Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Xiang Ruan

    3463-3471

    PDF
  • Adaptive Logit Adjustment Loss for Long-Tailed Visual Recognition

    Yan Zhao, Weicong Chen, Xu Tan, Kai Huang, Jihong Zhu

    3472-3480

    PDF
  • CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving

    Yinuo Zhao, Kun Wu, Zhiyuan Xu, Zhengping Che, Qi Lu, Jian Tang, Chi Harold Liu

    3481-3489

    PDF
  • Learning from the Tangram to Solve Mini Visual Tasks

    Yizhou Zhao, Liang Qiu, Pan Lu, Feng Shi, Tian Han, Song-Chun Zhu

    3490-3498

    PDF
  • Handling Slice Permutations Variability in Tensor Recovery

    Jingjing Zheng, Xiaoqin Zhang, Wenzhe Wang, Xianta Jiang

    3499-3507

    PDF
  • Boosting Contrastive Learning with Relation Knowledge Distillation

    Kai Zheng, Yuanjiang Wang, Ye Yuan

    3508-3516

    PDF
  • Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining

    Minghang Zheng, Yanjie Huang, Qingchao Chen, Yang Liu

    3517-3525

    PDF
  • Self-Labeling Framework for Novel Category Discovery over Domains

    Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa

    3161-3169

    PDF
  • Efficient Compact Bilinear Pooling via Kronecker Product

    Tan Yu, Yunfeng Cai, Ping Li

    3170-3178

    PDF
  • Hybrid Graph Neural Networks for Few-Shot Learning

    Tianyuan Yu, Sen He, Yi-Zhe Song, Tao Xiang

    3179-3187

    PDF
  • SOIT: Segmenting Objects with Instance-Aware Transformers

    Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, Wenming Tan

    3188-3196

    PDF
  • MSML: Enhancing Occlusion-Robustness by Multi-Scale Segmentation-Based Mask Learning for Face Recognition

    Ge Yuan, Huicheng Zheng, Jiayu Dong

    3197-3205

    PDF
  • Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics

    Hangjie Yuan, Mang Wang, Dong Ni, Liangpeng Xu

    3206-3214

    PDF
  • Task-Level Self-Supervision for Cross-Domain Few-Shot Learning

    Wang Yuan, Zhizhong Zhang, Cong Wang, Haichuan Song, Yuan Xie, Lizhuang Ma

    3215-3223

    PDF
  • Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-Supervised Learning

    Ilwi Yun, Hyuk-Jae Lee, Chae Eun Rhee

    3224-3233

    PDF
  • Homography Decomposition Networks for Planar Object Tracking

    Xinrui Zhan, Yueran Liu, Jianke Zhu, Yang Li

    3234-3242

    PDF
  • Patch Diffusion: A General Module for Face Manipulation Detection

    Baogen Zhang, Sheng Li, Guorui Feng, Zhenxing Qian, Xinpeng Zhang

    3243-3251

    PDF
  • Semi-supervised Object Detection with Adaptive Class-Rebalancing Self-Training

    Fangyuan Zhang, Tianxiang Pan, Bin Wang

    3252-3261

    PDF
  • Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching

    Huatian Zhang, Zhendong Mao, Kun Zhang, Yongdong Zhang

    3262-3270

    PDF
  • SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-resolution

    Jiangning Zhang, Chao Xu, Jian Li, Yue Han, Yabiao Wang, Ying Tai, Yong Liu

    3271-3279

    PDF
  • Energy-Based Generative Cooperative Saliency Prediction

    Jing Zhang, Jianwen Xie, Zilong Zheng, Nick Barnes

    3280-3290

    PDF
  • Attention-Based Transformation from Latent Features to Point Clouds

    Kaiyi Zhang, Ximing Yang, Yuan Wu, Cheng Jin

    3291-3299

    PDF
  • Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning

    Manlin Zhang, Jinpeng Wang, Andy J. Ma

    3300-3308

    PDF
  • LGD: Label-Guided Self-Distillation for Object Detection

    Peizhen Zhang, Zijian Kang, Tong Yang, Xiangyu Zhang, Nanning Zheng, Jian Sun

    3309-3317

    PDF
  • Uncertainty Modeling with Second-Order Transformer for Group Re-identification

    Quan Zhang, Jian-Huang Lai, Zhanxiang Feng, Xiaohua Xie

    3318-3325

    PDF
  • Deep Spatial Adaptive Network for Real Image Demosaicing

    Tao Zhang, Ying Fu, Cheng Li

    3326-3334

    PDF
  • MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning

    Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang

    3335-3343

    PDF
  • Clinical-BERT: Vision-Language Pre-training for Radiograph Diagnosis and Reports Generation

    Bin Yan, Mingtao Pei

    2982-2990

    PDF
  • Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention

    Kun Yan, Chenbin Zhang, Jun Hou, Ping Wang, Zied Bouraoui, Shoaib Jameel, Steven Schockaert

    2991-2999

    PDF
  • Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning

    Pengxiang Yan, Ziyi Wu, Mengmeng Liu, Kun Zeng, Liang Lin, Guanbin Li

    3000-3008

    PDF
  • Transmission-Guided Bayesian Generative Model for Smoke Segmentation

    Siyuan Yan, Jing Zhang, Nick Barnes

    3009-3017

    PDF
  • Cross-Species 3D Face Morphing via Alignment-Aware Controller

    Xirui Yan, Zhenbo Yu, Bingbing Ni, Hang Wang

    3018-3026

    PDF
  • Exploring Visual Context for Weakly Supervised Person Search

    Yichao Yan, Jinpeng Li, Shengcai Liao, Jie Qin, Bingbing Ni, Ke Lu, Xiaokang Yang

    3027-3035

    PDF
  • Cross-Modal Mutual Learning for Audio-Visual Speech Recognition and Manipulation

    Chih-Chun Yang, Wan-Cyuan Fan, Cheng-Fu Yang, Yu-Chiang Frank Wang

    3036-3044

    PDF
  • Mutual Contrastive Learning for Visual Representation Learning

    Chuanguang Yang, Zhulin An, Linhang Cai, Yongjun Xu

    3045-3053

    PDF
  • Temporal Action Proposal Generation with Background Constraint

    Haosen Yang, Wenhao Wu, Lining Wang, Sheng Jin, Boyang Xia, Hongxun Yao, Hujie Huang

    3054-3062

    PDF
  • Cross-Modal Federated Human Activity Recognition via Modality-Agnostic and Modality-Specific Representation Learning

    Xiaoshan Yang, Baochen Xiong, Yi Huang, Changsheng Xu

    3063-3071

    PDF
  • Polygon-to-Polygon Distance Loss for Rotated Object Detection

    Yang Yang, Jifeng Chen, Xiaopin Zhong, Yuanlong Deng

    3072-3080

    PDF
  • An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

    Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Yumao Lu, Zicheng Liu, Lijuan Wang

    3081-3089

    PDF
  • ACGNet: Action Complement Graph Network for Weakly-Supervised Temporal Action Localization

    Zichen Yang, Jie Qin, Di Huang

    3090-3098

    PDF
  • Enhancing Pseudo Label Quality for Semi-supervised Domain-Generalized Medical Image Segmentation

    Huifeng Yao, Xiaowei Hu, Xiaomeng Li

    3099-3107

    PDF
  • Image Difference Captioning with Pre-training and Contrastive Learning

    Linli Yao, Weiying Wang, Qin Jin

    3108-3116

    PDF
  • Safe Distillation Box

    Jingwen Ye, Yining Mao, Jie Song, Xinchao Wang, Cheng Jin, Mingli Song

    3117-3124

    PDF
  • Joint Deep Multi-Graph Matching and 3D Geometry Learning from Inhomogeneous 2D Image Collections

    Zhenzhang Ye, Tarun Yenamandra, Florian Bernard, Daniel Cremers

    3125-3133

    PDF
  • Content-Variant Reference Image Quality Assessment via Knowledge Distillation

    Guanghao Yin, Wei Wang, Zehuan Yuan, Chuchu Han, Wei Ji, Shouqian Sun, Changhu Wang

    3134-3142

    PDF
  • Width & Depth Pruning for Vision Transformers

    Fang Yu, Kun Huang, Meng Wang, Yuan Cheng, Wei Chu, Li Cui

    3143-3151

    PDF
  • Anisotropic Fourier Features for Neural Image-Based Rendering and Relighting

    Huangjie Yu, Anpei Chen, Xin Chen, Lan Xu, Ziyu Shao, Jingyi Yu

    3152-3160

    PDF
  • Video as Conditional Graph Hierarchy for Multi-Granular Question Answering

    Junbin Xiao, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji, Tat-Seng Chua

    2804-2812

    PDF
  • AdaptivePose: Human Parts as Adaptive Points

    Yabo Xiao, Xiao Juan Wang, Dongdong Yu, Guoli Wang, Qian Zhang, Mingshu HE

    2813-2821

    PDF
  • Learning Quality-Aware Representation for Multi-Person Pose Regression

    Yabo Xiao, Dongdong Yu, Xiao Juan Wang, Lei Jin, Guoli Wang, Qian Zhang

    2822-2830

    PDF
  • Attribute-Based Progressive Fusion Network for RGBT Tracking

    Yun Xiao, MengMeng Yang, Chenglong Li, Lei Liu, Jin Tang

    2831-2838

    PDF
  • Detailed Facial Geometry Recovery from Multi-View Images by Learning an Implicit Function

    Yunze Xiao, Hao Zhu, Haotian Yang, Zhengyu Diao, Xiangju Lu, Xun Cao

    2839-2847

    PDF
  • FINet: Dual Branches Feature Interaction for Partial-to-Partial Point Cloud Registration

    Hao Xu, Nianjin Ye, Guanghui Liu, Bing Zeng, Shuaicheng Liu

    2848-2856

    PDF
  • Rendering-Aware HDR Environment Map Prediction from a Single Image

    Jun-Peng Xu, Chenyu Zuo, Fang-Lue Zhang, Miao Wang

    2857-2865

    PDF
  • Topology-Aware Convolutional Neural Network for Efficient Skeleton-Based Action Recognition

    Kailin Xu, Fanfan Ye, Qiaoyong Zhong, Di Xie

    2866-2874

    PDF
  • Transcoded Video Restoration by Temporal Spatial Auxiliary Network

    Li Xu, Gang He, Jinjia Zhou, Jie Lei, Weiying Xie, Yunsong Li, Yu-Wing Tai

    2875-2883

    PDF
  • DIRL: Domain-Invariant Representation Learning for Generalizable Semantic Segmentation

    Qi Xu, Liang Yao, Zhengkai Jiang, Guannan Jiang, Wenqing Chu, Wenhui Han, Wei Zhang, Chengjie Wang, Ying Tai

    2884-2892

    PDF
  • Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

    Qiangeng Xu, Yiqi Zhong, Ulrich Neumann

    2893-2901

    PDF
  • Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval

    Rui Xu, Zongyan Han, Le Hui, Jianjun Qian, Jin Xie

    2902-2910

    PDF
  • Dual Attention Networks for Few-Shot Fine-Grained Recognition

    Shu-Lin Xu, Faen Zhang, Xiu-Shen Wei, Jianhua Wang

    2911-2919

    PDF
  • Sparse Cross-Scale Attention Network for Efficient LiDAR Panoptic Segmentation

    Shuangjie Xu, Rui Wan, Maosheng Ye, Xiaoyi Zou, Tongyi Cao

    2920-2928

    PDF
  • Towards Fully Sparse Training: Information Restoration with Spatial Similarity

    Weixiang Xu, Xiangyu He, Ke Cheng, Peisong Wang, Jian Cheng

    2929-2937

    PDF
  • Hierarchical Image Generation via Transformer-Based Sequential Patch Selection

    Xiaogang Xu, Ning Xu

    2938-2945

    PDF
  • Reliable Propagation-Correction Modulation for Video Object Segmentation

    Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu

    2946-2954

    PDF
  • Adaptive Hypergraph Neural Network for Multi-Person Pose Estimation

    Xixia Xu, Qi Zou, Xue Lin

    2955-2963

    PDF
  • Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

    Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun

    2964-2972

    PDF
  • MobileFaceSwap: A Lightweight Framework for Video Face Swapping

    Zhiliang Xu, Zhibin Hong, Changxing Ding, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding

    2973-2981

    PDF
  • Texture Reformer: Towards Fast and Universal Interactive Texture Transfer

    Zhizhong Wang, Lei Zhao, Haibo Chen, Ailin Li, Zhiwen Zuo, Wei Xing, Dongming Lu

    2624-2632

    PDF
  • Interact, Embed, and EnlargE: Boosting Modality-Specific Representations for Multi-Modal Person Re-identification

    Zi Wang, Chenglong Li, Aihua Zheng, Ran He, Jin Tang

    2633-2641

    PDF
  • Can Semantic Labels Assist Self-Supervised Visual Representation Learning?

    Longhui Wei, Lingxi Xie, Jianzhong He, Xiaopeng Zhang, Qi Tian

    2642-2650

    PDF
  • Rethinking the Two-Stage Framework for Grounded Situation Recognition

    Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

    2651-2658

    PDF
  • Boosting the Transferability of Video Adversarial Examples via Temporal Translation

    Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

    2659-2667

    PDF
  • Towards Transferable Adversarial Attacks on Vision Transformers

    Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang

    2668-2676

    PDF
  • L-CoDe:Language-Based Colorization Using Color-Object Decoupled Conditions

    Shuchen Weng, Hao Wu, Zheng Chang, Jiajun Tang, Si Li, Boxin Shi

    2677-2684

    PDF
  • Neural Interferometry: Image Reconstruction from Astronomical Interferometers Using Transformer-Conditioned Neural Fields

    Benjamin Wu, Chao Liu, Benjamin Eckart, Jan Kautz

    2685-2693

    PDF
  • TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition

    Changjie Wu, Jun Du, Yunqing Li, Jianshu Zhang, Chen Yang, Bo Ren, Yiqing Hu

    2694-2702

    PDF
  • Learning Token-Based Representation for Image Retrieval

    Hui Wu, Min Wang, Wengang Zhou, Yang Hu, Houqiang Li

    2703-2711

    PDF
  • Multi-Modal Answer Validation for Knowledge-Based VQA

    Jialin Wu, Jiasen Lu, Ashish Sabharwal, Roozbeh Mottaghi

    2712-2721

    PDF
  • Neighborhood Consensus Contrastive Learning for Backward-Compatible Representation

    Shengsen Wu, Liang Chen, Yihang Lou, Yan Bai, Tao Bai, Minghua Deng, Ling-Yu Duan

    2722-2730

    PDF
  • Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped Attention

    Sitong Wu, Tianyi Wu, Haoru Tan, Guodong Guo

    2731-2739

    PDF
  • Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation

    Xinyi Wu, Zhenyao Wu, Yuhang Lu, Lili Ju, Song Wang

    2740-2749

    PDF
  • Multi-Centroid Representation Network for Domain Adaptive Person Re-ID

    Yuhang Wu, Tengteng Huang, Haotian Yao, Chi Zhang, Yuanjie Shao, Chuchu Han, Changxin Gao, Nong Sang

    2750-2758

    PDF
  • Efficient Non-local Contrastive Attention for Image Super-resolution

    Bin Xia, Yucheng Hang, Yapeng Tian, Wenming Yang, Qingmin Liao, Jie Zhou

    2759-2767

    PDF
  • Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-Based Super-resolution

    Bin Xia, Yapeng Tian, Yucheng Hang, Wenming Yang, Qingmin Liao, Jie Zhou

    2768-2776

    PDF
  • Cross-Domain Collaborative Normalization via Structural Knowledge

    Haifeng Xia, Zhengming Ding

    2777-2785

    PDF
  • ReMoNet: Recurrent Multi-Output Network for Efficient Video Denoising

    Liuyu Xiang, Jundong Zhou, Jirui Liu, Zerun Wang, Haidong Huang, Jie Hu, Jungong Han, Yuchen Guo, Guiguang Ding

    2786-2794

    PDF
  • Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation

    Aoran Xiao, Jiaxing Huang, Dayan Guan, Fangneng Zhan, Shijian Lu

    2795-2803

    PDF
  • UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-Wise Perspective with Transformer

    Haonan Wang, Peng Cao, Jiaqi Wang, Osmar R. Zaiane

    2441-2449

    PDF
  • Renovate Yourself: Calibrating Feature Representation of Misclassified Pixels for Semantic Segmentation

    Hualiang Wang, Huanpeng Chu, Siming FU, Zuozhu Liu, Haoji Hu

    2450-2458

    PDF
  • Separated Contrastive Learning for Organ-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

    Jiacheng Wang, Xiaomeng Li, Yiming Han, Jing Qin, Liansheng Wang, Zhou Qichao

    2459-2467

    PDF
  • Contrastive Quantization with Code Memory for Unsupervised Image Retrieval

    Jinpeng Wang, Ziyun Zeng, Bin Chen, Tao Dai, Shu-Tao Xia

    2468-2476

    PDF
  • Learning Temporally and Semantically Consistent Unpaired Video-to-Video Translation through Pseudo-Supervision from Synthetic Optical Flow

    Kaihong Wang, Kumar Akash, Teruhisa Misu

    2477-2486

    PDF
  • Cross-Dataset Collaborative Learning for Semantic Segmentation in Autonomous Driving

    Li Wang, Dong Li, Han Liu, JinZhang Peng, Lu Tian, Yi Shan

    2487-2494

    PDF
  • Scaled ReLU Matters for Training Vision Transformers

    Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin

    2495-2503

    PDF
  • CQA-Face: Contrastive Quality-Aware Attentions for Face Recognition

    Qiangchang Wang, Guodong Guo

    2504-2512

    PDF
  • Category-Specific Nuance Exploration Network for Fine-Grained Object Retrieval

    Shijie Wang, Zhihui Wang, Haojie Li, Wanli Ouyang

    2513-2521

    PDF
  • Detail-Preserving Transformer for Light Field Image Super-resolution

    Shunzhou Wang, Tianfei Zhou, Yao Lu, Huijun Di

    2522-2530

    PDF
  • One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning

    Suzhen Wang, Lincheng Li, Yu Ding, Xin Yu

    2531-2539

    PDF
  • Pose-Guided Feature Disentangling for Occluded Person Re-identification Based on Transformer

    Tao Wang, Hong Liu, Pinhao Song, Tianyu Guo, Wei Shi

    2540-2549

    PDF
  • FFNet: Frequency Fusion Network for Semantic Scene Completion

    Xuzhi Wang, Di Lin, Liang Wan

    2550-2557

    PDF
  • Privacy-Preserving Face Recognition in the Frequency Domain

    Yinggui Wang, Jian Liu, Man Luo, Le Yang, Li Wang

    2558-2566

    PDF
  • Anchor DETR: Query Design for Transformer-Based Detector

    Yingming Wang, Xiangyu Zhang, Tong Yang, Jian Sun

    2567-2575

    PDF
  • Panini-Net: GAN Prior Based Degradation-Aware Feature Interpolation for Face Restoration

    Yinhuai Wang, Yujie Hu, Jian Zhang

    2576-2584

    PDF
  • End-to-End Transformer Based Model for Image Captioning

    Yiyu Wang, Jungang Xu, Yingfei Sun

    2585-2594

    PDF
  • Learning to Detect 3D Facial Landmarks via Heatmap Regression with Graph Convolutional Network

    Yuan Wang, Min Cao, Zhenfeng Fan, Silong Peng

    2595-2603

    PDF
  • Low-Light Image Enhancement with Normalizing Flow

    Yufei Wang, Renjie Wan, Wenhan Yang, Haoliang Li, Lap-Pui Chau, Alex Kot

    2604-2612

    PDF
  • Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

    Zhenzhi Wang, Limin Wang, Tao Wu, Tianhao Li, Gangshan Wu

    2613-2623

    PDF

Primary Sidebar