Follow
Pedro A. Ortega
Pedro A. Ortega
Artificial Intelligence & Machine Learning
Verified email at adaptiveagents.org - Homepage
Title
Cited by
Cited by
Year
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
3032019
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
2662017
Thermodynamics as a theory of decision-making with information-processing costs
PA Ortega, DA Braun
Proceedings of the Royal Society A: Mathematical, Physical and Engineering …, 2013
2412013
A Medical Claim Fraud/Abuse Detection System based on Data Mining: A Case Study in Chile.
PA Ortega, CJ Figueroa, GA Ruz
DMIN 6, 26-29, 2006
1522006
Nash equilibria in multi-agent motor interactions
DA Braun, PA Ortega, DM Wolpert
PLoS computational biology 5 (8), e1000468, 2009
1202009
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
972019
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
942019
A minimum relative entropy principle for learning and acting
PA Ortega, DA Braun
Journal of Artificial Intelligence Research 38, 475-511, 2010
712010
Information, utility and bounded rationality
DA Ortega, PA Braun
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
692011
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
592019
Path integral control and bounded rationality
DA Braun, PA Ortega, E Theodorou, S Schaal
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
572011
From Poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
502021
Intrinsic social motivation via causal influence in multi-agent RL
N Jaques, A Lazaridou, E Hughes, C Gulcehre, PA Ortega, DJ Strouse, ...
502018
Laser processing of Al2O3/a‐SiCx:H stacks: a feasible solution for the rear surface of high‐efficiency p‐type c‐Si solar cells
I Martín, P Ortega, M Colina, A Orpella, G López, R Alcubilla
Progress in Photovoltaics: Research and Applications 21 (5), 1171-1175, 2013
462013
Generalized Thompson sampling for sequential decision-making and causal inference
PA Ortega, DA Braun
Complex Adaptive Systems Modeling 2 (2), 2014
442014
Action and perception as divergence minimization
D Hafner, PA Ortega, J Ba, T Parr, K Friston, N Heess
arXiv preprint arXiv:2009.01791, 2020
382020
Human decision-making under limited time
PA Ortega, AA Stocker
Advances in Neural Information Processing Systems 29, 2016
372016
Information-Theoretic Bounded Rationality
PA Ortega, DA Braun, JS Dyer, KE Kim, N Tishby
arXiv preprint arXiv:1512.06789, 2015
362015
Motor coordination: when two have to act as one
DA Braun, PA Ortega, DM Wolpert
Experimental brain research 211, 631-641, 2011
332011
-type emitter surface passivation in solar cells by means of antireflective amorphous silicon carbide layers
R Ferre, I Martín, P Ortega, M Vetter, I Torres, R Alcubilla
Journal of applied physics 100 (7), 073703, 2006
262006
The system can't perform the operation now. Try again later.
Articles 1–20