PERI Older Publications
From PERI
The main PERI publications page is here.
2007
| S. R. Alam, N. Bhatia and J. S. Vetter, Sensitivity Analysis of Biomolecular Simulations using Symbolic Models", 7th International Conference on BioInformatics and BioEngineering, Boston, MA, USA, October 2007. |
| S. R. Alam, N. Bhatia and J. S. Vetter, An Exploration of Performance Attributes for Symbolic Modeling of Emerging Processing Devices, 3rd International High Performance Computation Conference (HPCC), Houston, TX, USA, September 2007. |
| D. Arnold, D. H. Ahn, B. R. de Supinski, G. Lee, B. P. Miller and M. Schulz, Stack Trace Analysis for Large Scale Debugging, Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, March 2007. |
| David H. Bailey, Robert Lucas, Paul Hovland, Boyana Norris, Kathy Yelick, Bronis de Supinski, Dan Quinlan, Pat Worley, Jeff Vetter, Phil Roth, Allan Snavely, Dan Reed, Ying Zhang, Jacque Chame, Dan Gunter, John Mellor-Crummey, Jeffrey Hollingsworth, Robert J. Fowler, Mary Hall, Jack Dongarra and Shirley Moore, Performance Engineering: Understanding and Improving the Performance of Large-Scale Codes, CT Watch Quarterly, Vol. 3, no. 4, November 2007, pp. 18-23. |
| Greg Bronevetsky and Bronis R. de Supinski, Soft Error Vulnerability of Iterative Linear Algebra Methods, The 2007 IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE 3), Austin, TX, April 2007. |
| Matthew Curtis-Maury, Karan Singh, Sally A. McKee, Filip Blagojevic, Dimitrios S. Nikolopoulos, Bronis R. de Supinski and Martin Schulz, Identifying Energy-Efficient Concurrency Levels Using Machine Learning, International Workshop on Green Computing (GreenCom'07), Austin, TX, September 2007. |
| Bronis R. de Supinski, Jeff Hollingsworth, Shirley Moore and Patrick Worley, Results of the PERI Survey of SciDAC Applications, SciDAC 2007, Boston, MA, June 2007. |
| Jack Dongarra, Dennis Gannon, Geoffrey Fox, and Ken Kennedy. The Impact of Multicore on Computational Science Software, CTWatch Quarterly, vol. 3, no. 1, Feb 2007 |
| Todd Gamblin, Prasun Ratn, Bronis R. de Supinski, Martin Schulz, Frank Mueller, Robert J. Fowler and Daniel Reed, An Open Framework for Scalable, Reconfigurable Performance Analysis, SC2007, Reno, NV, Poster, November 2007. |
| B. C. Lee, D. M. Brooks, B. R. de Supinski, M. Schulz, K. Singh and S. A. McKee, "Methods of Inference and Learning for Performance Modeling of Parallel Applications", ACM SIGPLAN 2007 Symposium on Principles and Practice of Parallel Programming (PPoPP 2007), San Jose, CA, March 2007. |
| Gregory L. Lee, Dong H. Ahn, Dorian C. Arnold, Bronis R. de Supinski, Barton P. Miller and Martin Schulz, Benchmarking the Stack Trace Analysis Tool for BlueGene/L, International Conference on Parallel Computing 2007 (ParCo 2007), Aachen, Germany, September 2007. |
| Gregory Lee, Martin Schulz, Dong Ahn, Andrew Bernat, Bronis R. de Supinski, Steven Koand Barry Rountree. Dynamic Binary Instrumentation and Data Aggregation on Large Scale Systems, International Journal of Parallel Programming, vol. 35, no. 3, June 2007, pg. 207-232 |
| J. Marathe, F. Mueller, T. Mohan, S. A. McKee, B. R. de Supinski and A. Yoo, METRIC: Memory Tracing via Dynamic Binary Rewriting to Identify Cache Inefficiencies, ACM Transactions on Programming Languages and Systems, Vol. 29, No. 2, April 2007. |
| John Mellor-Crummey, Peter Beckman, Jack Dongarra, Ken Kennedy,Barton Miller and Katherine Yelick. Software for Leadership-Class Computing, SciDAC Review, Fall 2007, pg. 36-45, to appear, available at http://www.scidacreview.org. |
| John Mellor-Crummey. Harnessing the Power of Emerging Petascale Platforms, SciDAC 2007, Journal of Physics: Conference Series 78 (2007) 012048. |
| John Mellor-Crummey, Peter Beckman, Keith Cooper, Jack Dongarra,William Gropp, Ewing Lusk, Barton Miller, Katherine Yelick. Creating Software Tools and Libraries for Leadership Computing, CTWatch Quarterly, Nov 2007. |
| J. S. Meredith, Balancing Productivity and Performance on the Cell Broadband Engine, IEEE Annual International Conference on Cluster Computing, Austin, TX, USA, September 2007. |
| M. Noeth, F. Mueller, M. Schulz and B. R. de Supinski, "Scalable Compression and Replay of Communication Traces in Massively Parallel Environments", Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, (Best Paper Award), March 2007. |
| B. Norris, A. Hartono and W. Gropp, Annotations for Productivity and Performance Portability, Petascale computing: Algorithms and Applications, Chapman & Hall / CRC Press, Taylor and Francis Group, Computational Science, 2007. Preprint, May 2007. |
| Jelena Pjesivac-Grbovi'c, Thara Angskun, George Bosilca, Graham E. Fagg, EdgarGabriel, and Jack J. Dongarra. Performance Analysis of MPI Collective Operations, Cluster Computing Journal, vol. 10 (2007), pg. 127-143 |
| Robert Preissl, Martin Schulz, Dieter Kranzlmueller, Bronis R. de Supinski and Daniel J. Quinlan, Using MPI Communication Patterns To Guide Source Code Transformations, SC2007, Reno, NV, Poster, November 2007. |
| P. Roth. Characterizing the I/O Behavior of Scientific Applications on the Cray XT, Proceedings of the Petascale Data Storage Workshop, Reno, NV, November 2007. |
| P. C. Roth and J. S. Vetter. Intel Woodcrest: An Evaluation for Scientific Computing, 8th LCI International Conference on High-Performance Clustered Computing, 2007 |
| Barry Rountree, David K. Lowenthal, Shelby Funk, Vincent W. Freeh, Bronis R. de Supinski and Martin Schulz, Bounding Energy Consumption in Large-Scale MPI Programs, SC2007, Reno, NV, November 2007. |
| Martin Schulz and Bronis R. de Supinski, Practical Differential Profiling, Euro-Par 2007, Rennes, France, August 2007. |
| Martin Schulz and Bronis R. de Supinski, P^nMPI Tools: A Whole Lot Greater Than the Sum of Their Parts, SC2007, Reno, NV, November 2007. |
| Fengguang Song, Shirley Moore and Jack Dongarra. L2 Cache Modeling for Scientific Applications on Chip Multi-processors, International Conference on Parallel Processing (ICPP07), Xi'an, China, September 2007 (to appear). |
| K. Singh, E. Ipek, S. A. McKee, B. R. de Supinski and R. Caruana, Predicting Parallel Application Performance via Machine Learning Approaches, Concurrency and Computation: Practice & Experience, Vol. 19, No. 17, May 2007, pp. 2219-2235. |
| Vahid Tabatabaee, Jeffrey K. Hollingsworth. Automatic Software Interference Detection in Parallel Applications, SC07, Reno, NV, November 2007 |
| S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick and J. Demmel. Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms, SC07, ACM/IEEE, Nov 2007. |
| Xingfu Wu and Valerie Taylor, Performance Analysis and Modeling of the SciDAC GTC Code on Three Large-scale Computer Systems, June 2007. |
| Felix Wolf, Bernd Mohr, Jack Dongarra and Shirley Moore, Automatic Analysis of Inefficiency Patterns in Parallel Applications, Concurrency and Computation: Practice and Experience, Vol. 19, June 2007 (to appear). |
| Wu Xingfu and Valerie Taylor, Performance Analysis and Modeling of the SciDAC MILC Code on Four Large-scale Clusters, June 2007. |
| Q. Yi, K. Seymour, H. You, R. Vuduc and D. Quinlan, "POET: Parameterized Optimizations for Empirical Tuning", Workshop on Performance Optimization of High-Level Languages and Libraries (POHLL), March 2007. |
| H. You, K. Seymour, J. Dongarra and S. Moore, Empirical Tuning of a Multiresolution Analysis Kernel using a Specialized Code Generator, Innovative Computing Laboratory Technical Report, March 2007. |
| H. You, J. Dongarra, S. Moore and K. Seymour, Automated Empirical Tuning of a Multiresolution Analysis Kernel, Innovative Computing Laboratory Technical Report, February 2007. |
| W. Yu, S. Oral, J. Vetter and R. Barrett. Efficiency Evaluation of Cray XT Parallel IO Stack, Cray User Group Meeting (CUG 2007), Seattle, WA, 2007 |
2006
| E. Ipek, K. Singh, S. A. McKee, B. R. de Supinski, M. Schulz and R. Caruana, Efficiently Exploring Architectural Design Spaces via Predictive Modeling, Twelfth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XII), San Jose, CA, October 2006. |
| B. C. Lee, M. Schulz and B. R. de Supinski, Regression Strategies for Parameter Space Exploration: A Case Study in Semicoarsening Multigrid and R, Lawrence Livermore Technical Report 224851, September 2006. |
| G. Lee, M. Schulz, D. H. Ahn, A. Bernat, B. R. de Supinski, S. Ko and B. Rountree, Dynamic Binary Instrumentation and Data Aggregation on Large Scale Systems, International Journal of Parallel Programming, September 2006 (to appear). |
| J. Marathe, F. Mueller and B. R. de Supinski, Analysis of Cache Coherence Bottlenecks with Hybrid Hardware/Software Techniques, ACM Transactions on Architecture and Code Optimization, Vol. 3, No. 4, December 2006, pp. 390-423. |
| D. Quinlan, R. Vuduc, T. Panas, J. Härdtlein and A. Sæbørnsen, Support for whole-program analysis and verification of the One-Definition Rule in C++, Static Analysis Summit, Gaithersburg, MD, June 2006. |
| M. Schulz, B. R. de Supinski, B. Aichinger, D. Kranzlmueller, R. Preissl and T. Koeckerbauer, Patterns in Parallel Programs - Towards High-level Understanding of Large-Scale Traces, SC2006, Tampa, FL, (poster), November 2006. |
| M. Schulz, D. Kranzmueller and B. R. de Supinski, Exploring Unexpected Behavior in MPI, 2006 International Conference on High Performance Computing and Communications (HPCC-06), Munich, Germany, September 2006. |
| M. Schulz and B. R. de Supinski, A Flexible and Dynamic Infrastructure for MPI Tool Interoperability, 2006 International Conference on Parallel Processing (ICPP-06), Columbus, OH, August 2006. |
| M. Tikir, L. Carrington, E. Strohmaier, A. Snavely. A Genetic Algorithms Approach to Modeling the Performance of Memory-bound Computations, SC07, Nov 2007, Reno, pg. 82-94 |
| R. Vuduc, M. Schulz, D. Quinlan, B. R. de Supinski and A. Sæbørnsen, Improving Distributed Memory Applications Testing by Message Perturbation, Fourth Workshop on Parallel and Distributed Systems: Testing and Debugging (PADTAD - IV), Portland, ME, (Best Paper Award), July 2006. |
PERC-2
Publications from PERI's predecessor project, PERC-2, are available here.
