Published work:
WOK papers: 83; Chinese core journal papers: 12; other papers: 1; invention patents: 6.
Dynamic Language Group-Based MoE: Enhancing Efficiency and Flexibility for Code-Switching Speech Recognition
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis
Improving Multi-Speaker ASR with Overlap-Aware Encoding and Monotonic Attention
The XMUSPEECH System for Audio-Visual Target Speaker Extraction in MISP 2023 Challenge
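Semi-Supervised Speaker Verification System Based on Pre-Trained Models
Journal of Tsinghua University (Science and Technology), 1000-0054, 2024-07-31.
Self-Supervised Model Transfer Learning for the Minnan Dialect
Journal of Xiamen University (Natural Science), 0438-0479, 2024-07-28.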