Learning parameterized skills BC da Silva, G Konidaris, A Barto International Conference on Machine Learning (ICML 2012), 2012 | 217 | 2012 |
Dealing with non-stationary environments using context detection BC da Silva, EW Basso, ALC Bazzan, PM Engel International Conference on Machine Learning (ICML 2006), 217-224, 2006 | 208 | 2006 |
Preventing undesirable behavior of intelligent machines PS Thomas, B Castro da Silva, AG Barto, S Giguere, Y Brun, E Brunskill Science 366 (6468), 999-1004, 2019 | 158 | 2019 |
Learning in groups of traffic signals ALC Bazzan, D De Oliveira, BC da Silva Engineering Applications of Artificial Intelligence 23 (4), 560-568, 2010 | 126 | 2010 |
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator. D de Oliveira, ALC Bazzan, BC da Silva, EW Basso, L Nunes, R Rossetti, ... 4th European Workshop on Multi-Agent Systems (EUMAS 2006), 2006 | 86 | 2006 |
Gaussian Processes for Learning and Control: A Tutorial with Examples M Liu, G Chowdhary, BC Da Silva, SY Liu, JP How IEEE Control Systems Magazine 38 (5), 53-86, 2018 | 80 | 2018 |
ITSUMO: an intelligent transportation system for urban mobility BC Da Silva, R Junges, D de Oliveira, ALC Bazzan [Demonstration Track] (AAMAS 2006) - Proceedings of the 5th International …, 2006 | 74 | 2006 |
Learning parameterized motor skills on a humanoid robot BC Da Silva, G Baldassarre, G Konidaris, A Barto IEEE International Conference on Robotics and Automation (ICRA 2014), 5239-5244, 2014 | 56 | 2014 |
A task-and-technique centered survey on visual analytics for deep learning model engineering R Garcia, AC Telea, BC da Silva, J Tørresen, JLD Comba Computers & Graphics 77, 30-49, 2018 | 49 | 2018 |
Universal off-policy evaluation Y Chandak, S Niekum, B da Silva, E Learned-Miller, E Brunskill, ... Advances in Neural Information Processing Systems (NeurIPS 2021) 34, 27475-27490, 2021 | 39 | 2021 |
Adaptive traffic control with reinforcement learning B da Silva, D Oliveira, AL Bazzan, EW Basso 4th Workshop on Agents in Traffic and Transportation (ATT@AAMAS 2006), 80-86, 2006 | 37 | 2006 |
Analysing the impact of travel information for minimising the regret of route choice GO Ramos, ALC Bazzan, BC da Silva Transportation Research Part C: Emerging Technologies 88, 257-271, 2018 | 36 | 2018 |
Active learning of parameterized skills B Da Silva, G Konidaris, A Barto International Conference on Machine Learning (ICML 2014), 1737-1745, 2014 | 27 | 2014 |
Fairness Guarantees under Demographic Shift S Giguere, B Metevier, BC da Silva, Y Brun, PS Thomas, S Niekum International Conference on Learning Representations (ICLR 2022), 2022 | 26 | 2022 |
Improving reinforcement learning with context detection BC Da Silva, EW Basso, FS Perotto, AL C Bazzan, PM Engel (AAMAS 2006) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2006 | 26 | 2006 |
Autonomous Reinforcement Learning of Multiple Interrelated Tasks VG Santucci, E Cartoni, BC da Silva, G Baldassarre International Conference on Development and Learning (ICDL 2019), 2019 | 24 | 2019 |
Energetic natural gradient descent P Thomas, BC Silva, C Dann, E Brunskill International Conference on Machine Learning (ICML 2016), 2887-2895, 2016 | 20 | 2016 |
Learning to minimise regret in route choice GO Ramos, BC da Silva, ALC Bazzan (AAMAS 2017) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2017 | 19 | 2017 |
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection LN Alegre, ALC Bazzan, BC da Silva (AAMAS 2021) Intl. Conference on Autonomous Agents and Multiagent Systems …, 2021 | 17 | 2021 |
On ensuring that intelligent machines are well-behaved PS Thomas, BC da Silva, AG Barto, E Brunskill arXiv preprint arXiv:1708.05448, 2017 | 17 | 2017 |