学者信息

孙晓帅 (SUN XIAOSHUAI)

信息学院

ORCID:https://orcid.org/0000-0003-3912-9306

微软学者

合作者

已发表成果:

WOK 论文 113 篇;中文核心 1 篇;

  • Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

    arXiv,,2023-12-19.
    Liu, Sihan (1); Ma, Yiwei (1); Zhang, Xiaoqing (1); Wang, Haowei (1); Ji, Jiayi (1); Sun, Xiaoshuai...
    EI:20230459507   10.48550/arXiv.2312.12470
    收录情况:EI
  • X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation

    arXiv,,2023-11-30.
    Ma, Yiwei (1); Fan, Yijun (1); Ji, Jiayi (1); Wang, Haowei (1); Sun, Xiaoshuai (1); Jiang, Guannan ...
    EI:20230456530   10.48550/arXiv.2312.00085
    收录情况:EI
  • Towards Omni-supervised Referring Expression Segmentation

    arXiv,,2023-11-01.
    Huang, Minglang (1); Zhou, Yiyi (1, 2); Luo, Gen (1); Jiang, Guannan (3); Zhuang, Weilin (3); Sun, ...
    EI:20230413925   10.48550/arXiv.2311.00397
    收录情况:EI
  • Semi-Supervised Panoptic Narrative Grounding

    arXiv,,2023-10-27.
    Yang, Danni (1); Ji, Jiayi (1); Sun, Xiaoshuai (1); Wang, Haowei (1); Li, Yinan (1); Ma, Yiwei (1);...
    EI:20230388009   10.48550/arXiv.2310.18142
    收录情况:EI
  • Semi-Supervised Panoptic Narrative Grounding

    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia,,2023-10-26.
    Yang, Danni; Ji, Jiayi; Sun, Xiaoshuai; Wang, Haowei; Li, Yinan; Ma, Yiwei; Ji, Rongrong
    WOS:001199449107017   EI:20235015224670   10.1145/3581783.3612259
    收录情况:EI、CPCI-S
  • PixelFace plus : Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks

    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia,,2023-10-26.
    Du, Xiaoxiong; Peng, Jun; Zhou, Yiyi; Zhang, Jinlu; Chen, Siting; Jiang, Guannan; Sun, Xiaoshuai; J...
    WOS:001199449104073   EI:20235015224052   10.1145/3581783.3612067
    收录情况:EI、CPCI-S
  • Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia,,2023-10-26.
    Wang, Haowei; Tang, Jiji; Ji, Jiayi; Sun, Xiaoshuai; Zhang, Rongsheng; Ma, Yiwei; Zhao, Minda; Li, ...
    WOS:001199449103053   EI:20235015224377   10.1145/3581783.3611767
    收录情况:EI、CPCI-S
  • Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval

    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia,,2023-10-26.
    Ma, Yiwei; Sun, Xiaoshuai; Ji, Jiayi; Jiang, Guannan; Zhuang, Weilin; Ji, Rongrong
    WOS:001199449104023   EI:20235015224378   10.1145/3581783.3611768
    收录情况:EI、CPCI-S
  • JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues

    arXiv,,2023-10-14.
    Ji, Jiayi (1); Wang, Haowei (1); Wu, Changli (1); Ma, Yiwei (1); Sun, Xiaoshuai (1); Ji, Rongrong (...
    EI:20230383428   10.48550/arXiv.2310.09503
    收录情况:EI
  • Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models

    arXiv,,2023-09-04.
    Wu, Qiong (1, 2); Yu, Wei (1, 2); Zhou, Yiyi (1, 2); Huang, Shubin (1); Sun, Xiaoshuai (1, 2); Ji, ...
    EI:20230337832   10.48550/arXiv.2309.01479
    收录情况:EI
  • 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation

    arXiv,,2023-08-31.
    Wu, Changli (1); Ma, Yiwei (1); Chen, Qi (1); Wang, Haowei (1); Luo, Gen (1); Ji, Jiayi (1); Sun, X...
    EI:20230325309   10.48550/arXiv.2308.16632
    收录情况:EI
  • Towards Language-Guided Visual Recognition via Dynamic Convolutions

    INTERNATIONAL JOURNAL OF COMPUTER VISION,0920-5691,2023-08-16.
    Luo, Gen; Zhou, Yiyi; Sun, Xiaoshuai; Wu, Yongjian; Gao, Yue; Ji, Rongrong
    WOS:001049090600001   EI:20233414587774   10.1007/s11263-023-01871-1
    收录情况:SCIE、EI
  • Continual Face Forgery Detection via Historical Distribution Preserving

    arXiv,,2023-08-11.
    Sun, Ke (1); Chen, Shen (2); Yao, Taiping (2); Sun, Xiaoshuai (1); Ding, Shouhong (2); Ji, Rongrong...
    EI:20230295620   10.48550/arXiv.2308.06217
    收录情况:EI
  • Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

    arXiv,,2023-08-05.
    Wang, Haowei (1); Tang, Jiji (2); Ji, Jiayi (1); Sun, Xiaoshuai (1); Zhang, Rongsheng (2); Ma, Yiwe...
    EI:20230296229   10.48550/arXiv.2308.02982
    收录情况:EI
  • Towards General Visual-Linguistic Face Forgery Detection

    arXiv,,2023-07-31.
    Sun, Ke (1); Chen, Shen (2); Yao, Taiping (2); Sun, Xiaoshuai (1); Ding, Shouhong (2); Ji, Rongrong...
    EI:20230280486   10.48550/arXiv.2307.16545
    收录情况:EI
  • Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer

    arXiv,,2023-06-30.
    Mi, Peng (1); Shen, Li (2); Ren, Tianhe (1); Zhou, Yiyi (1); Xu, Tianshuo (1); Sun, Xiaoshuai (1); ...
    EI:20230242795  
    收录情况:EI
  • End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation

    Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023,,2023-06-27.
    Wu, Mingrui (1, 2); Gu, Jiaxin (3); Shen, Yunhang (2); Lin, Mingbao (2); Chen, Chao (2); Sun, Xiaos...
    EI:20233414583424  
    收录情况:EI
  • Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network

    Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023,,2023-06-27.
    Wang, Haowei (1); Ji, Jiayi (1); Zhou, Yiyi (1, 2); Wu, Yongjian (4); Sun, Xiaoshuai (1, 2, 3)
    EI:20233314551712  
    收录情况:EI
  • Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting

    arXiv,,2023-06-01.
    Huang, Shubin (1, 2); Wu, Qiong (1, 2); Zhou, Yiyi (1, 2); Chen, Weijie (3); Zhang, Rongsheng (3); ...
    EI:20230213973  
    收录情况:EI
  • Towards local visual modeling for image captioning

    Pattern Recognition,0031-3203,2023-06.
    Ma, Yiwei; Ji, Jiayi; Sun, Xiaoshuai; Zhou, Yiyi; Ji, Rongrong
    WOS:000942420500001   EI:20230713597890   10.1016/j.patcog.2023.109420
    收录情况:SCIE、EI
  • Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models

    arXiv,,2023-05-24.
    Luo, Gen (1); Zhou, Yiyi (1, 2); Ren, Tianhe (1); Chen, Shengxin (1); Sun, Xiaoshuai (1, 2); Ji, Ro...
    EI:20230199631  
    收录情况:EI
  • X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance

    arXiv,,2023-03-28.
    Ma, Yiwei (1); Zhang, Xiaoqing (1); Sun, Xiaoshuai (1, 2); Ji, Jiayi (1); Wang, Haowei (1); Jiang, ...
    EI:20230111116   10.48550/arXiv.2303.15764
    收录情况:EI
  • Active Teacher for Semi-Supervised Object Detection

    arXiv,,2023-03-14.
    Mi, Peng (1); Lin, Jianghang (1); Zhou, Yiyi (1); Shen, Yunhang (1); Luo, Gen (1); Sun, Xiaoshuai (...
    EI:20230092703   10.48550/arXiv.2303.08348
    收录情况:EI
  • Towards End-to-end Semi-supervised Learning for One-stage Object Detection

    arXiv,,2023-02-22.
    Luo, Gen (1); Zhou, Yiyi (1); Jin, Lei (1); Sun, Xiaoshuai (1); Ji, Rongrong (1)
    EI:20230066928   10.48550/arXiv.2302.11299
    收录情况:EI
  • Towards Efficient Visual Adaption via Structural Re-parameterization

    arXiv,,2023-02-16.
    Luo, Gen (1); Huang, Minglang (1); Zhou, Yiyi (1, 2); Sun, Xiaoshuai (1, 2); Jiang, Guannan (3); Wa...
    EI:20230066333   10.48550/arXiv.2302.08106
    收录情况:EI
  • Towards Local Visual Modeling for Image Captioning

    arXiv,,2023-02-12.
    Ma, Yiwei (1); Ji, Jiayi (1); Sun, Xiaoshuai (1, 2); Zhou, Yiyi (1); Ji, Rongrong (1, 2, 3)
    EI:20230058187   10.48550/arXiv.2302.06098
    收录情况:EI
  • Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network

    arXiv,,2023-01-08.
    Wang, Haowei (1); Ji, Jiayi (1); Zhou, Yiyi (1, 2); Wu, Yongjian (4); Sun, Xiaoshuai (1, 2, 3)
    EI:20230010575   10.48550/arXiv.2301.03160
    收录情况:EI
  • HSM-QA: Question Answering System Based on Hierarchical Semantic Matching

    IEEE Access,2169-3536,2023.
    Zhang, Jinlu; He, Jing; Zhou, Yiyi; Sun, Xiaoshuai; Yu, Xiao
    WOS:001041925600001   EI:20233014430112   10.1109/ACCESS.2023.3296850
    收录情况:SCIE、EI
  • A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension

    IEEE Transactions on Multimedia,1520-9210,2023.
    Luo, Gen; Zhou, Yiyi; Sun, Jiamu; Sun, Xiaoshuai; Ji, Rongrong
    WOS:001165348200006   EI:20233914778693   10.1109/TMM.2023.3314153
    收录情况:SCIE、EI
  • RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension

    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,1063-6919,2023.
    Sun, Jiamu; Luo, Gen; Zhou, Yiyi; Sun, Xiaoshuai; Jiang, Guannan; Wang, Zhiyu; Ji, Rongrong
    WOS:001062531303044   EI:20234114868473   10.1109/CVPR52729.2023.01835
    收录情况:EI、CPCI-S
  • RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension

    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,1063-6919,2023.
    Jin, Lei; Luo, Gen; Zhou, Yiyi; Sun, Xiaoshuai; Jiang, Guannan; Shu, Annan; Ji, Rongrong
    WOS:001058542603001   EI:20234114867429   10.1109/CVPR52729.2023.00263
    收录情况:EI、CPCI-S
  • Clover : Towards A Unified Video-Language Alignment and Fusion Model

    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),1063-6919,2023.
    Huang, Jingjia; Li, Yinan; Feng, Jiashi; Wu, Xinglong; Sun, Xiaoshuai; Ji, Rongrong
    WOS:001062522107018   EI:20250817912034   10.1109/CVPR52729.2023.01427
    收录情况:EI、CPCI-S
  • X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance

    Proceedings of the IEEE International Conference on Computer Vision,1550-5499,2023.
    Ma, Yiwei; Zhang, Xiaoqing; Sun, Xiaoshuai; Ji, Jiayi; Wang, Haowei; Jiang, Guannan; Zhuang, Weilin...
    WOS:001159644303001   EI:20240915636013   10.1109/ICCV51070.2023.00258
    收录情况:EI、CPCI-S
  • Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models

    Advances in Neural Information Processing Systems,1049-5258,2023.
    Wu, Qiong (1, 2); Yu, Wei (1, 2); Zhou, Yiyi (1, 2); Huang, Shubin (1); Sun, Xiaoshuai (1, 2); Ji, ...
    EI:20241715986574  
    收录情况:EI
  • Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models

    Advances in Neural Information Processing Systems,1049-5258,2023.
    Luo, Gen (1, 3); Zhou, Yiyi (1, 2); Ren, Tianhe (1); Chen, Shengxin (1); Sun, Xiaoshuai (1, 2); Ji,...
    EI:20241715985774  
    收录情况:EI