Published work:
65 WOK papers;
Deep hybrid transformer network for robust modulation classification in wireless communications
Image Captioning via Dynamic Path Customization
CycleTrans: Learning Neutral Yet Discriminative Features via Cycle Construction for Visible-Infrared Person Re-Identification
Deep Instruction Tuning for Segment Anything Model
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation