Follow
John Pennycook
Title
Cited by
Cited by
Year
Exploring SIMD for Molecular Dynamics, Using Intel Xeon Processors and Intel Xeon Phi Coprocessors
SJ Pennycook, CJ Hughes, M Smelyanskiy, SA Jarvis
IEEE International Parallel & Distributed Processing Symposium, 2013
2012013
CosmoFlow: Using deep learning to learn the universe at scale
A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
1132018
Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark
SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige
ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011
922011
An investigation of the performance portability of OpenCL
SJ Pennycook, SD Hammond, SA Wright, JA Herdman, I Miller, SA Jarvis
Journal of Parallel and Distributed Computing 73 (11), 1439-1450, 2013
862013
A metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
arXiv preprint arXiv:1611.07409, 2016
662016
Implications of a metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
Future Generation Computer Systems 92, 947-958, 2019
652019
Parallel file system analysis through application I/O tracing
SA Wright, SD Hammond, SJ Pennycook, RF Bird, JA Herdman, I Miller, ...
The Computer Journal 56 (2), 141-155, 2013
362013
Effective performance portability
SL Harrell, J Kitson, R Bird, SJ Pennycook, J Sewall, D Jacobsen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
312018
On the acceleration of wavefront applications using distributed many-core architectures
SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis
The Computer Journal 55 (2), 138-153, 2012
312012
Developing performance-portable molecular dynamics kernels in OpenCL
SJ Pennycook, SA Jarvis
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
222012
Methods and apparatus for multi-load and multi-store vector instructions
L Meadows, A Duran, S Pennycook, J Sewall
US Patent App. 15/859,033, 2019
162019
Evaluating the impact of proposed openmp 5.0 features on performance, portability and productivity
SJ Pennycook, JD Sewall, JR Hammond
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
142018
Ldplfs: Improving i/o performance without application modification
SA Wright, SD Hammond, SJ Pennycook, I Miller, JA Herdman, SA Jarvis
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
132012
Interpreting and visualizing performance portability metrics
J Sewall, SJ Pennycook, D Jacobsen, T Deakin, S McIntosh-Smith
2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020
92020
Unveiling the Early Universe: Optimizing Cosmology Workloads for Intel Xeon Phi Coprocessors in an SGI UV2000 System
J Briggs, SJ Pennycook, EPS Shellard, C Martins, M Woodacre, K Feind
Tech. Rep.(SGI/Intel White Paper, 2014), 2014
72014
Towards a portable and future-proof particle-in-cell plasma physics code
RF Bird, SJ Pennycook, SA Wright, SA Jarvis
72013
Light-weight parallel I/O analysis at scale
SA Wright, SD Hammond, SJ Pennycook, SA Jarvis
Computer Performance Engineering: 8th European Performance Engineering …, 2011
72011
Navigating performance, portability, and productivity
SJ Pennycook, JD Sewall, DW Jacobsen, T Deakin, S McIntosh-Smith
Computing in Science & Engineering 23 (5), 28-38, 2021
62021
WMTrace-A Lightweight Memory Allocation Tracker and Analysis Framework
O Perks, SD Hammond, SJ Pennycook, SA Jarvis
Proceedings of the UK Performance Engineering Workshop (UKPEW 2011), 2011
62011
Model-led optimisation of a geometric multigrid application
R Bunt, S Pennycook, S Jarvis, L Lapworth, Y Ho
2013 IEEE 10th International Conference on High Performance Computing and …, 2013
52013
The system can't perform the operation now. Try again later.
Articles 1–20