Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang Advances in Neural Information Processing Systems (NeurIPS), 2023 | 17* | 2023 |
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang arXiv preprint arXiv:2309.17382, 2023 | 10* | 2023 |
Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning S Zhang, L Shen, L Han, L Shen Conference on Lifelong Learning Agents (CoLLAs), 2021 | 6 | 2021 |
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning S Zhang Advances in Neural Information Processing Systems (NeurIPS), 2022 | 5 | 2022 |
Asking Before Action: Gather Information in Embodied Decision Making with Language Models X Chen, S Zhang, P Zhang, L Zhao, J Chen arXiv preprint arXiv:2305.15695, 2023 | 4 | 2023 |
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms S Zhang, B Liu, Z Wang, T Zhao Advances in Neural Information Processing Systems (NeurIPS), 2023 | 2 | 2023 |
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics S Zhang, W Jin, Z Wang International Conference on Machine Learning (ICML), 2023 | 2 | 2023 |
Structure-Regularized Attention for Deformable Object Representation S Zhang, L Shen, Z Li, W Liu NeurIPS 2020 Workshop on Object Representations for Learning and Reasoning, 2021 | 1 | 2021 |