已发表成果:
WOK 论文 65 篇;
Towards Omni-supervised Referring Expression Segmentation
PixelFace plus : Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Towards Language-Guided Visual Recognition via Dynamic Convolutions
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
Approximated Prompt Tuning for Vision-Language Pre-trained Models
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting
Towards local visual modeling for image captioning
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
Active Teacher for Semi-Supervised Object Detection
Towards End-to-end Semi-supervised Learning for One-stage Object Detection
Towards Efficient Visual Adaption via Structural Re-parameterization
Towards Local Visual Modeling for Image Captioning
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Semantic-Guided Selective Representation for Image Captioning
HSM-QA: Question Answering System Based on Hierarchical Semantic Matching
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models