Browse publications
Publications are listed in reverse chronological order.
-
2009
-
Albert Hartono and Ponnuswamy Sadayappan, "Annotation-Based Empirical Performance Tuning Using Orio", 23rd IEEE International Parallel & Distributed Processing Symposium (IPDPS), Rome, Italy, May 2009 (to appear). BibTeX
-
S. Alam, R. Barrett, H. Jagode, J. Kuehn, S. Poole and R. Sankaran, "Impact of Quad-core Cray XT4 System and Software Stack on Scientific Computation", IEEE IPDPS09, Rome, Italy, May 2009. BibTeX
-
Ananta Tiwari, Chun Chen, Jacqueline Chame, Mary Hall and Jeff Hollingsworth, "Scalable Autotuning Framework for Compiler Optimization", Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS’09), Rome, Italy, May 2009 (to appear). BibTeX
-
Boyana Norris, Albert Hartono, Elizabeth Jessup and Jeremy Siek, "Generating Empirically Optimized Composed Matrix Kernels from MATLAB Prototypes", Proceedings of the International Conference on Computational Science 2009, Baton Rouge, Louisiana, U.S.A., Preprint ANL/MCS-P1581-0209, May 2009 (to appear). BibTeX
-
Dong H. Ahn, "Overcoming Scalability Challenges for Tool Daemon Launching", 2008 International Conference on Parallel Processing (ICPP-08), Portland, OR, USA, January 2009. BibTeX
-
2008
-
H. Jagode and J. Hein, "Custom assignment of MPI ranks for parallel multi-dimensional FFTs: Evaluation of BG/P versus BG/L", Proceedings of the 2008 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA-08), Sydney, Australia, Springer, InderScience, December 2008. BibTeX
-
M. Bast, J. Kuehn, C. McCurdy, J. Rogers, C. Roth and W. Yu, "Early Evaluation of IBM BlueGene/P", SC08, Austin, TX, USA, November 2008. BibTeX
-
Kevin Huck, Oscar Hernandez, Van Bui, Sunita Chandrasekaran, Barbara Chapman, Allen Malony, Lois Curfman McInnes and Boyana Norris, "Capturing Performance Knowledge for Automated Analysis", Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'08), Austin, Texas, November 2008. BibTeX
-
H. Jagode, S. Alam, C. Lively, J. Vetter and J. Dongarra, "Modeling Assertions for Petascale Applications and Systems", ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC08), Austin, TX, November 2008. BibTeX
-
Todd Gamblin, Bronis R. de Supinski, Martin Schulz, Rob Fowler and Daniel A. Reed, "Scalable Load-Balance Measurement for SPMD Codes", Proceedings of Supercomputing 2008, Austin, TX, November 2008. BibTeX
-
L. Carrington, D. Komatitsch, M. Laurenzano, M. Tikir, D. Miche, N. Le Goff, A. Snavely and J. Tromp, "High-Frequency Simulations of Global Seismic Wave Propagation Using SPECFEM3D_GLOBE", ACM/IEEE SC08 International Conference for High Performance Computing, November 2008. BibTeX
-
Lin-Wang Wang, Byounghak Lee, Hongzhang Shan, Zhengji Zhao, Juan Meza, Erich Strohmaier and David Bailey, "Linearly Scaling 3D Fragment Method for Large-Scale Electronic Structure Calculations", SC08, Austin, TX, USA, Recipient of the 2008 ACM Gordon Bell Prize in the "Special" category for "algorithmic innovation.", November 2008. BibTeX
-
C. Lively, S. Alam, V. Taylor and J. Vetter, "A Methodology for Developing High Fidelity Communication Models for Large-scale Applications Targeted on Multicore Systems", Proceedings of the 20th International Symposium on Computer Architecture and High Performance Computing, Campo Grande, Mato Grosso do Sul, Brazil, October 2008. BibTeX
-
Van Bui, Boyana Norris, Kevin Huck, Lois Curfman McInnes, Li Li, Oscar Hernandez and Barbara Chapman, "A Component Infrastructure for Performance and Power Modeling of Parallel Scientific Applications", Component-Based High Performance Computing Workshop, October 14-17, 2008, Karlsruhe, Germany, October 2008. BibTeX
-
Rob Fowler, Todd Gamblin, Allan K Porterfield, Patrick Dreher, Song Huang and Balint Joo, "Performance engineering challenges: the view from RENCI", J. Phys: Conf. Ser., October 2008, 5 pp. BibTeX
-
Bob Lucas, "Performance Engineering Research Institute 2008 Annual Report", September 2008. BibTeX
-
Jeffery L. Tilson, Mark S.C. Reed and Robert J. Fowler, "Workflows for Performance Evaluation and Tuning", Proceedings 2008 IEEE International Conference on Cluster Computing (Cluster 2008), Tsukuba, Japan, September 2008. BibTeX
-
Laksono Adhianto, Sinchan Banerjee, Michael Fagan, Mark Krentel, Gabriel Marin, John Mellor-Crummey and Nathan Tallent, "HPCToolkit: Tools for performance analysis of optimized parallel programs", Concurrency and Computation: Practice and Experience, August 2008. BibTeX
-
Alan Morris, Wyatt Spear, Allen D. Malony and Sameer Shende, "Observing Performance Dynamics using Parallel Profile Snapshots", European Conference on Parallel Processing (EuroPar 2008), August 2008. BibTeX
-
Kevin A. Huck, Wyatt Spear, Allen D. Malony, Sameer Shende and Alan Morris, "Parametric Studies in Eclipse with TAU and PerfExplorer", Workshop on Productivity and Performance Tools for HPC Application Development, August 2008. BibTeX
-
Samuel Williams, Kaushik Datta, Jonathan Carter, Leonid Oliker, John Shalf and Katherine Yelick, "PERI: Auto-tuning Memory Intensive Kernels for Multicore", Journal of Physics: Conference Series, Vol. 125, July 2008, pp. 012038. BibTeX
-
Bronis R. de Supinski, Rob Fowler, Todd Gamblin, Frank Mueller, Prasun Ratn and Martin Schulz, "An Open Infrastructure for Scalable, Reconfigurable Analysis", International Workshop on Scalable Tools for High-End Computing (STHEC 2008), ACM/SIGARCH, Kos, Greece, July 2008. BibTeX
-
Allen D. Malony, Sameer Shende, Alan Morris, S. Biersdorff, Wyatt Spear, Kevin A. Huck and Aroon Nataraj, "Evolution of a Parallel Performance System", 2nd International Workshop on Tools for High Performance Computing, July 2008. BibTeX
-
Allan Porterfield, Robert Fowler and Mark Neyer, "MAESTRO: Dynamic Runtime Power Control", Workshop on Managed Multicore systems (MMCS), Boston, MA, June 2008. BibTeX
-
S. Alam, R. Barrett, M. Eisenbach, M. Fahey, R. Hartman-Baker, J. Kuehn, S. Poole, R. Sankaran and P. Worley, "The Cray XT4 Quad-core: A First Look", Proceedings of the 50th Cray User Group Conference, Helsinki, Finland, May 2008. BibTeX
-
J. Hein, H. Jagode, U. Sigrist, A. Simpson and A. Trew, "Parallel 3D-FFTs for multi-core nodes on a mesh communication network", Proceedings of Cray User Group Conference (CUG 2008), Helsinki, Finland, May 2008. BibTeX
-
P. Worley, "Early Evaluation of the IBM BG/P", Proceedings of the LCI International Conference on High Performance Clustered Computing, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL, April 2008. BibTeX
-
Todd Gamblin, Rob Fowler and Daniel A Reed, "Scalable Methods for Monitoring and Detecting Behavioral Classes in Scientific Codes", IPDPS 2008, Miami, FL, April 2008 (to appear). BibTeX
-
P. Agarwal, H. Ong and S. Hampton, "Impact of multicores on large-scale molecular dynamics simulations", IEEE International Workshop on High Performance Computational Biology (HiCOMB), in conjunction with IPDPS, Miami, FL, USA, April 2008. BibTeX
-
Robert J. Fowler, Lavanya Ramakrishnan and Steven R. Thorpe, "Stateful Grid Resource Selection for Related Asynchronous Tasks", Technical Report, RENCI, Chapel Hill, NC, April 2008. BibTeX
-
R. F. Barrett and M. R. Fahey et al., "An Evaluation of the Oak Ridge National Laboratory Cray XT3", International Journal of High Performance Computing Applications, Vol. 22, no. 1, February 2008, pp. 52-80. BibTeX
-
E. Ipek, S. A. McKee, K. Singh, R. Caruana, B. R. de Supinski and M. Schulz, "Efficient Architectural Design Space Exploration via Predictive Modeling", ACM Transactions on Architecture and Code Optimization, January 2008 (to appear). BibTeX
-
J. Michalakes, J. Hacker, R. Loft, M. O. McCracken, A. Snavely, N. J. Wright, T. Spelce, B. Gorda and R. Walkup, "WRF Nature Run", 2008 Journal of Physics: Conf. Ser., Vol. 125, January 2008, pp. 012022. BibTeX
-
T. Chen, O. Khalili, R. L. Campbell Jr., L. Carrington, M. Tikir and A. Snavely, "Performance Prediction and Ranking of Supercomputers", 2008 Advances in Computers, Vol. 72, January 2008. BibTeX
-
Kevin A. Huck, Allen D. Malony, Sameer Shende and Alan Morris, "Knowledge Support and Automation for Performance Analysis with PerfExplorer 2.0", The Journal of Scientific Programming (special issue on Large-Scale Programming Tools and Environments), Vol. 16, no. 2-3, January 2008, pp. 123-134. BibTeX
-
2007
-
Barry Rountree, David K. Lowenthal, Shelby Funk, Vincent W. Freeh, Bronis R. de Supinski and Martin Schulz, "Bounding Energy Consumption in Large-Scale MPI Programs", SC2007, Reno, NV, November 2007. BibTeX
-
Martin Schulz and Bronis R. de Supinski, "P^nMPI Tools: A Whole Lot Greater Than the Sum of Their Parts", SC2007, Reno, NV, November 2007. BibTeX
-
Todd Gamblin, Prasun Ratn, Bronis R. de Supinski, Martin Schulz, Frank Mueller, Robert J. Fowler and Daniel Reed, "An Open Framework for Scalable, Reconfigurable Performance Analysis", SC2007, Reno, NV, Poster, November 2007. BibTeX
-
Robert Preissl, Martin Schulz, Dieter Kranzlmueller, Bronis R. de Supinski and Daniel J. Quinlan, "Using MPI Communication Patterns To Guide Source Code Transformations", SC2007, Reno, NV, Poster, November 2007. BibTeX
-
P. Roth, "Characterizing the I/O Behavior of Scientific Applications on the Cray XT", Proceedings of the Petascale Data Storage Workshop, Reno, NV, November 2007. BibTeX
-
David H. Bailey, Robert Lucas, Paul Hovland, Boyana Norris, Kathy Yelick, Bronis de Supinski, Dan Quinlan, Pat Worley, Jeff Vetter, Phil Roth, Allan Snavely, Dan Reed, Ying Zhang, Jacque Chame, Dan Gunter, John Mellor-Crummey, Jeffrey Hollingsworth, Robert J. Fowler, Mary Hall, Jack Dongarra and Shirley Moore, "Performance Engineering: Understanding and Improving the Performance of Large-Scale Codes", CTWatch Quarterly, Vol. 3, no. 4, November 2007, pp. 18-23. BibTeX
-
S. R. Alam, N. Bhatia and J. S. Vetter, "Sensitivity Analysis of Biomolecular Simulations using Symbolic Models", 7th International Conference on BioInformatics and BioEngineering, Boston, MA, USA, October 2007. BibTeX
-
S. R. Alam, N. Bhatia and J. S. Vetter, "An Exploration of Performance Attributes for Symbolic Modeling of Emerging Processing Devices", 3rd International High Performance Computation Conference (HPCC), Houston, TX, USA, September 2007. BibTeX
-
Matthew Curtis-Maury, Karan Singh, Sally A. McKee, Filip Blagojevic, Dimitrios S. Nikolopoulos, Bronis R. de Supinski and Martin Schulz, "Identifying Energy-Efficient Concurrency Levels Using Machine Learning", International Workshop on Green Computing (GreenCom'07), Austin, TX, September 2007. BibTeX
-
J. S. Meredith, "Balancing Productivity and Performance on the Cell Broadband Engine", IEEE Annual International Conference on Cluster Computing, Austin, TX, USA, September 2007. BibTeX
-
Fengguang Song, Shirley Moore and Jack Dongarra, "L2 Cache Modeling for Scientific Applications on Chip Multi-processors", International Conference on Parallel Processing (ICPP07), Xi'an, China, September 2007 (to appear). BibTeX
-
Gregory L. Lee, Dong H. Ahn, Dorian C. Arnold, Bronis R. de Supinski, Barton P. Miller and Martin Schulz, "Benchmarking the Stack Trace Analysis Tool for BlueGene/L", International Conference on Parallel Computing 2007 (ParCo 2007), Aachen, Germany, September 2007. BibTeX
-
Martin Schulz and Bronis R. de Supinski, "Practical Differential Profiling", Euro-Par 2007, Rennes, France, August 2007. BibTeX
-
Bronis R. de Supinski, Jeff Hollingsworth, Shirley Moore and Patrick Worley, "Results of the PERI Survey of SciDAC Applications", SciDAC 2007, Boston, MA, June 2007. BibTeX
-
Xingfu Wu and Valerie Taylor, "Performance Analysis and Modeling of the SciDAC GTC Code on Three Large-scale Computer Systems", June 2007. BibTeX
-
Felix Wolf, Bernd Mohr, Jack Dongarra and Shirley Moore, "Automatic Analysis of Inefficiency Patterns in Parallel Applications", Concurrency and Computation: Practice and Experience, Vol. 19, June 2007 (to appear). BibTeX
-
Xingfu Wu and Valerie Taylor, "Performance Analysis and Modeling of the SciDAC MILC Code on Four Large-scale Clusters", June 2007. BibTeX
-
B. Norris, A. Hartono and W. Gropp, "Annotations for Productivity and Performance Portability", Petascale Computing: Algorithms and Applications, Chapman & Hall / CRC Press, Taylor and Francis Group, Computational Science, 2007. Preprint, May 2007. BibTeX
-
K. Singh, E. Ipek, S. A. McKee, B. R. de Supinski and R. Caruana, "Predicting Parallel Application Performance via Machine Learning Approaches", Concurrency and Computation: Practice & Experience, Vol. 19, No. 17, May 2007, pp. 2219-2235. BibTeX
-
Greg Bronevetsky and Bronis R. de Supinski, "Soft Error Vulnerability of Iterative Linear Algebra Methods", The 2007 IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE 3), Austin, TX, April 2007. BibTeX
-
J. Marathe, F. Mueller, T. Mohan, S. A. McKee, B. R. de Supinski and A. Yoo, "METRIC: Memory Tracing via Dynamic Binary Rewriting to Identify Cache Inefficiencies", ACM Transactions on Programming Languages and Systems, Vol. 29, No. 2, April 2007. BibTeX
-
Q. Yi, K. Seymour, H. You, R. Vuduc and D. Quinlan, "POET: Parameterized Optimizations for Empirical Tuning", Workshop on Performance Optimization of High-Level Languages and Libraries (POHLL), March 2007. BibTeX
-
D. Arnold, D. H. Ahn, B. R. de Supinski, G. Lee, B. P. Miller and M. Schulz, "Stack Trace Analysis for Large Scale Debugging", Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, March 2007. BibTeX
-
M. Noeth, F. Mueller, M. Schulz and B. R. de Supinski, "Scalable Compression and Replay of Communication Traces in Massively Parallel Environments", Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, (Best Paper Award), March 2007. BibTeX
-
B. C. Lee, D. M. Brooks, B. R. de Supinski, M. Schulz, K. Singh and S. A. McKee, "Methods of Inference and Learning for Performance Modeling of Parallel Applications", ACM SIGPLAN 2007 Symposium on Principles and Practice of Parallel Programming (PPoPP 2007), San Jose, CA, March 2007. BibTeX
-
H. You, K. Seymour, J. Dongarra and S. Moore, "Empirical Tuning of a Multiresolution Analysis Kernel using a Specialized Code Generator", Innovative Computing Laboratory Technical Report, March 2007. BibTeX
-
H. You, J. Dongarra, S. Moore and K. Seymour, "Automated Empirical Tuning of a Multiresolution Analysis Kernel", Innovative Computing Laboratory Technical Report, February 2007. BibTeX
-
2006
-
J. Marathe, F. Mueller and B. R. de Supinski, "Analysis of Cache Coherence Bottlenecks with Hybrid Hardware/Software Techniques", ACM Transactions on Architecture and Code Optimization, Vol. 3, No. 4, December 2006, pp. 390-423. BibTeX
-
M. Schulz, B. R. de Supinski, B. Aichinger, D. Kranzlmueller, R. Preissl and T. Koeckerbauer, "Patterns in Parallel Programs - Towards High-level Understanding of Large-Scale Traces", SC2006, Tampa, FL, (poster), November 2006. BibTeX
-
E. Ipek, K. Singh, S. A. McKee, B. R. de Supinski, M. Schulz and R. Caruana, "Efficiently Exploring Architectural Design Spaces via Predictive Modeling", Twelfth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XII), San Jose, CA, October 2006. BibTeX
-
B. C. Lee, M. Schulz and B. R. de Supinski, "Regression Strategies for Parameter Space Exploration: A Case Study in Semicoarsening Multigrid and R", Lawrence Livermore Technical Report 224851, September 2006. BibTeX
-
M. Schulz, D. Kranzlmueller and B. R. de Supinski, "Exploring Unexpected Behavior in MPI", 2006 International Conference on High Performance Computing and Communications (HPCC-06), Munich, Germany, September 2006. BibTeX
-
G. Lee, M. Schulz, D. H. Ahn, A. Bernat, B. R. de Supinski, S. Ko and B. Rountree, "Dynamic Binary Instrumentation and Data Aggregation on Large Scale Systems", International Journal of Parallel Programming, September 2006 (to appear). BibTeX
-
M. Schulz and B. R. de Supinski, "A Flexible and Dynamic Infrastructure for MPI Tool Interoperability", 2006 International Conference on Parallel Processing (ICPP-06), Columbus, OH, August 2006. BibTeX
-
R. Vuduc, M. Schulz, D. Quinlan, B. R. de Supinski and A. Sæbjørnsen, "Improving Distributed Memory Applications Testing by Message Perturbation", Fourth Workshop on Parallel and Distributed Systems: Testing and Debugging (PADTAD - IV), Portland, ME, (Best Paper Award), July 2006. BibTeX
-
D. Quinlan, R. Vuduc, T. Panas, J. Härdtlein and A. Sæbjørnsen, "Support for whole-program analysis and verification of the One-Definition Rule in C++", Static Analysis Summit, Gaithersburg, MD, June 2006. BibTeX
