Jia Yuan Yu

Cited by

	All	Since 2019
Citations	1375	918
h-index	18	15
i10-index	32	24

180

135

200920102011201220132014201520162017201820192020202120222023202415 17 16 25 33 48 52 60 78 105 142 176 179 152 180 88

Public access

View all

16 articles

6 articles

available

not available

Based on funding mandates

Jia Yuan Yu

Amazon

Verified email at amazon.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Markov decision processes with arbitrary reward processes JY Yu, S Mannor, N Shimkin Mathematics of Operations Research 34 (3), 737-757, 2009	131	2009
Online Learning with Sample Path Constraints. S Mannor, JN Tsitsiklis, JY Yu Journal of Machine Learning Research 10 (3), 2009	127	2009
Piecewise-stationary bandit problems with side observations JY Yu, S Mannor Proceedings of the 26th annual international conference on machine learning …, 2009	111	2009
Unimodal Bandits. JY Yu, S Mannor ICML, 41-48, 2011	103	2011
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network RF Atallah, CM Assi, JY Yu IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016	94	2016
Lipschitz bandits without the lipschitz constant S Bubeck, G Stoltz, JY Yu Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011	89	2011
Online learning in Markov decision processes with arbitrarily changing rewards and transitions JY Yu, S Mannor 2009 international conference on game theory for networks, 314-322, 2009	52	2009
On the design of campus parking systems with QoS guarantees W Griggs, JY Yu, F Wirth, F Häusler, R Shorten IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015	50	2015
Sample Complexity of Risk-Averse Bandit-Arm Selection. JY Yu, E Nikolova IJCAI, 2576-2582, 2013	48	2013
Arbitrarily modulated Markov decision processes JY Yu, S Mannor Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held …, 2009	45	2009
Distributed parking space detection, characterization, advertisement, and enforcement RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ... US Patent 9,601,018, 2017	37	2017
Data-driven distributionally robust polynomial optimization M Mevissen, E Ragnoli, JY Yu Advances in Neural Information Processing Systems 26, 2013	30	2013
Reward modeling for mitigating toxicity in transformer-based language models F Faal, K Schmitt, JY Yu Applied Intelligence 53 (7), 8421-8435, 2023	22	2023
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations L Hou, S Ma, J Yan, C Wang, JY Yu 2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020	21	2020
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization F Wirth, S Stuedli, JY Yu, M Corless, R Shorten arXiv preprint arXiv:1404.5064, 2014	21	2014
Mean field equilibria of multi armed bandit games R Gummadi, R Johari, JY Yu 2012 50th Annual Allerton Conference on Communication, Control, and …, 2012	21	2012
Mean field analysis of multi-armed bandit games R Gummadi, R Johari, S Schmit, JY Yu Available at SSRN 2045842, 2013	20	2013
Online Learning with Expert Advice and Finite-Horizon Constraints. B Kveton, JY Yu, G Theocharous, S Mannor AAAI, 331-336, 2008	18	2008
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten Journal of the ACM (JACM) 66 (4), 1-37, 2019	16	2019
Communication-efficient distributed multi-resource allocation SE Alam, R Shorten, F Wirth, JY Yu 2018 IEEE International Smart Cities Conference (ISC2), 1-8, 2018	16	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by