已发表成果:
WOK 论文 34 篇;
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Semi-Supervised Panoptic Narrative Grounding
Semi-Supervised Panoptic Narrative Grounding
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Towards local visual modeling for image captioning
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Towards Local Visual Modeling for Image Captioning
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance