已发表成果:
WOK 论文 30 篇;中文核心 6 篇;
The treatment of sepsis: an episodic memory-assisted deep reinforcement learning approach
Learning and planning in partially observable environments without prior domain knowledge
Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning
Sequential Decision Making with "Sequential Information" in Deep Reinforcement Learning
局部可观测环境下未来信息辅助的无模型深度强化学习
南京大学学报(自然科学),0469-5097,2022-09-30.