Exploring SIMD for Molecular Dynamics, Using Intel Xeon Processors and Intel Xeon Phi Coprocessors SJ Pennycook, CJ Hughes, M Smelyanskiy, SA Jarvis IEEE International Parallel & Distributed Processing Symposium, 2013 | 201 | 2013 |
CosmoFlow: Using deep learning to learn the universe at scale A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ... SC18: International Conference for High Performance Computing, Networking …, 2018 | 113 | 2018 |
Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011 | 92 | 2011 |
An investigation of the performance portability of OpenCL SJ Pennycook, SD Hammond, SA Wright, JA Herdman, I Miller, SA Jarvis Journal of Parallel and Distributed Computing 73 (11), 1439-1450, 2013 | 86 | 2013 |
A metric for performance portability SJ Pennycook, JD Sewall, VW Lee arXiv preprint arXiv:1611.07409, 2016 | 66 | 2016 |
Implications of a metric for performance portability SJ Pennycook, JD Sewall, VW Lee Future Generation Computer Systems 92, 947-958, 2019 | 65 | 2019 |
Parallel file system analysis through application I/O tracing SA Wright, SD Hammond, SJ Pennycook, RF Bird, JA Herdman, I Miller, ... The Computer Journal 56 (2), 141-155, 2013 | 36 | 2013 |
Effective performance portability SL Harrell, J Kitson, R Bird, SJ Pennycook, J Sewall, D Jacobsen, ... 2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018 | 31 | 2018 |
On the acceleration of wavefront applications using distributed many-core architectures SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis The Computer Journal 55 (2), 138-153, 2012 | 31 | 2012 |
Developing performance-portable molecular dynamics kernels in OpenCL SJ Pennycook, SA Jarvis 2012 SC Companion: High Performance Computing, Networking Storage and …, 2012 | 22 | 2012 |
Methods and apparatus for multi-load and multi-store vector instructions L Meadows, A Duran, S Pennycook, J Sewall US Patent App. 15/859,033, 2019 | 16 | 2019 |
Evaluating the impact of proposed openmp 5.0 features on performance, portability and productivity SJ Pennycook, JD Sewall, JR Hammond 2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018 | 14 | 2018 |
Ldplfs: Improving i/o performance without application modification SA Wright, SD Hammond, SJ Pennycook, I Miller, JA Herdman, SA Jarvis 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 13 | 2012 |
Interpreting and visualizing performance portability metrics J Sewall, SJ Pennycook, D Jacobsen, T Deakin, S McIntosh-Smith 2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020 | 9 | 2020 |
Unveiling the Early Universe: Optimizing Cosmology Workloads for Intel Xeon Phi Coprocessors in an SGI UV2000 System J Briggs, SJ Pennycook, EPS Shellard, C Martins, M Woodacre, K Feind Tech. Rep.(SGI/Intel White Paper, 2014), 2014 | 7 | 2014 |
Towards a portable and future-proof particle-in-cell plasma physics code RF Bird, SJ Pennycook, SA Wright, SA Jarvis | 7 | 2013 |
Light-weight parallel I/O analysis at scale SA Wright, SD Hammond, SJ Pennycook, SA Jarvis Computer Performance Engineering: 8th European Performance Engineering …, 2011 | 7 | 2011 |
Navigating performance, portability, and productivity SJ Pennycook, JD Sewall, DW Jacobsen, T Deakin, S McIntosh-Smith Computing in Science & Engineering 23 (5), 28-38, 2021 | 6 | 2021 |
WMTrace-A Lightweight Memory Allocation Tracker and Analysis Framework O Perks, SD Hammond, SJ Pennycook, SA Jarvis Proceedings of the UK Performance Engineering Workshop (UKPEW 2011), 2011 | 6 | 2011 |
Model-led optimisation of a geometric multigrid application R Bunt, S Pennycook, S Jarvis, L Lapworth, Y Ho 2013 IEEE 10th International Conference on High Performance Computing and …, 2013 | 5 | 2013 |