Observing Performance Dynamics Using Parallel Profile Snapshots

Alan Morris , Wyatt Spear , Allen D. Malony , Sameer Shende
european conference on parallel processing 162 -171

5
2008
Multi-Level Performance Instrumentation for Kokkos Applications Using TAU

Sameer Shende , Nicholas Chaimov , Allen Malony , Neena Imam
2019 IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools)

4
2019
Multi-Platform SYCL Profiling with TAU

Nicholas Chaimov , Sameer Shende , Allen D. Malony
international workshop on opencl

2020
Research Initiatives for Plug-and-Play Scientific Computing

Lois Curfman McInnes , Tamara Dahlgren , Jarek Nieplocha , David Bernholdt
Journal of Physics: Conference Series 78 ( 1) 012046

2007
MPI performance engineering with the MPI tool interface: the integration of MVAPICH and TAU

Srinivasan Ramesh , Aurèle Mahéo , Sameer Shende , Allen D Malony
Proceedings of the 24th European MPI Users' Group Meeting 16

10
2017
Introducing Task-Containers as an Alternative to Runtime-Stacking

Jean-Baptiste Besnard , Julien Adam , Sameer Shende , Marc Pérache
Proceedings of the 23rd European MPI Users' Group Meeting 51 -63

6
2016
Parametric Studies in Eclipse with TAU and PerfExplorer

Kevin A. Huck , Wyatt Spear , Allen D. Malony , Sameer Shende
Euro-Par 2008 Workshops - Parallel Processing 283 -294

3
2009
TAUg: Runtime Global Performance Data Access Using MPI

Kevin A. Huck , Allen D. Malony , Sameer Shende , Alan Morris
Recent Advances in Parallel Virtual Machine and Message Passing Interface 313 -321

18
2006
An MPI Halo-Cell Implementation for Zero-Copy Abstraction

Jean-Baptiste Besnard , Allen Malony , Sameer Shende , Marc Pérache
Proceedings of the 22nd European MPI Users' Group Meeting 3

10
2015
Performance Analysis Integration in the Uintah Software Development Cycle

J. Davison de St. Germain , Alan Morris , Steven G. Parker , Allen D. Malony
International Journal of Parallel Programming 31 ( 1) 35 -53

8
2003
Kernel-Level Measurement for Integrated Parallel Performance Views: the KTAU Project

Aroon Nataraj , Allen Malony , Sameer Shende , Alan Morris
international conference on cluster computing 1 -12

13
2006
Unifying the Analysis of Performance Event Streams at the Consumer Interface Level

Jean-Baptiste Besnard , Allen D. Malony , Sameer Shende , Marc Pérache
Tools for High Performance Computing 2017 57 -71

2019
OpenSHMEM Specification 1.4

Matthew B. Baker , Swen Boehm , Aurelien Bouteiller , Barbara Chapman
Office of Scientific and Technical Information (OSTI)

3
2017
XPRESS: eXascale PRogramming Environment and System Software, Final Report

Allen Malony , Sameer Shende
Office of Scientific and Technical Information (OSTI)

2017
Early Experiences with KTAU on the IBM BG/L

Aroon Nataraj , Allen D. Malony , Alan Morris , Sameer Shende
Euro-Par 2006 Parallel Processing 99 -110

5
2006
Dynamic Performance Callstack Sampling: Merging TAU and DAQV

Sameer Shende , Allen D. Malony , Steven T. Hackstadt
parallel computing 515 -520

10
1998
Performance Visualization for TAU Instrumented Scientific Workflows.

Cong Xie , Wei Xu , Sungsoo Ha , Kevin Huck
international joint conference on computer vision imaging and computer graphics theory and applications 333 -340

4
2018
A scalable approach to MPI application performance analysis

Shirley Moore , Felix Wolf , Jack Dongarra , Sameer Shende
european pvm/mpi users group meeting on recent advances in parallel virtual machine and message passing interface 309 -316

25
2005
Trace-Based Parallel Performance Overhead Compensation

Felix Wolf , Allen D. Malony , Sameer Shende , Alan Morris
High Performance Computing and Communications 617 -628

11
2005
Hands-on practical hybrid parallel application performance engineering

Markus Geimer , Michael Gerndt , Sameer Shende , Bert Wesarg
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface 15 -15

2
2012