Follow
Qingfeng Lan
Title
Cited by
Cited by
Year
Maxmin Q-learning: Controlling the estimation bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations (ICLR), 2020
1982020
Variational quantum soft actor-critic
Q Lan
arXiv preprint arXiv:2112.11921, 2021
202021
Loss of plasticity in deep continual learning
S Dohare, JF Hernandez-Garcia, Q Lan, P Rahman, AR Mahmood, ...
Nature 632 (8026), 768-774, 2024
182024
A deep top-k relevance matching model for ad-hoc retrieval
Z Yang, Q Lan, J Guo, Y Fan, X Zhu, Y Lan, Y Wang, X Cheng
China Conference on Information Retrieval (CCIR), 2018
162018
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
H Ishfaq*, Q Lan*, P Xu, AR Mahmood, D Precup, A Anandkumar, ...
International Conference on Learning Representations (ICLR), 2024
132024
Overcoming policy collapse in deep reinforcement learning
S Dohare, Q Lan, AR Mahmood
European Workshop on Reinforcement Learning (EWRL), 2023
102023
Memory-efficient reinforcement learning with value-based knowledge consolidation
Q Lan, Y Pan, J Luo, AR Mahmood
Transactions on Machine Learning Research (TMLR), 2023
10*2023
Model-free Policy Learning with Reward Gradients
Q Lan, S Tosatto, H Farrahi, AR Mahmood
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
102022
Reducing selection bias in counterfactual reasoning for individual treatment effects estimation
Z Zhang, Q Lan, L Ding, Y Wang, N Hassanpour, R Greiner
NeurIPS 2019 CausalML Workshop, 2019
92019
Learning to Optimize for Reinforcement Learning
Q Lan, AR Mahmood, S Yan, Z Xu
Reinforcement Learning Conference (RLC), 2024
72024
A PyTorch Reinforcement Learning Framework for Exploring New Ideas
Q Lan
https://github.com/qlan3/Explorer, 2019
72019
Elephant Neural Networks: Born to Be a Continual Learner
Q Lan, AR Mahmood
ICML Workshop on High-dimensional Learning Dynamics, 2023
22023
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
H Ishfaq, Y Tan, Y Yang, Q Lan, J Lu, AR Mahmood, D Precup, P Xu
Reinforcement Learning Conference (RLC), 2024
12024
Predictive Representation Learning for Language Modeling
Q Lan, L Kumar, M White, A Fyshe
arXiv preprint arXiv:2105.14214, 2021
12021
Weight Clipping for Deep Continual and Reinforcement Learning
M Elsayed, Q Lan, C Lyle, AR Mahmood
Reinforcement Learning Conference (RLC), 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–15