已发表成果:
WOK 论文 30 篇;中文核心 6 篇;
Balancing exploration and exploitation in episodic reinforcement learning
SEQUENTIAL ACTION-INDUCED INVARIANT REPRESENTATION FOR REINFORCEMENT LEARNING