Performance modeling tools for parallel sparse linear algebra computations, Proceedings of the ParCo2009, September 1-4, 2009, Lyon.
Evaluation of sparse LU factorization and triangular solution on multicore platforms, Proceedings of the VECPAR'08, June 24-27, 2008, Toulouse. (Technical Report LBNL-571E)
Evaluation SuperLU on multicore architectures, SciDAC 2008, Journal of Physics: Conference Series 125 (2008) 012079, IOP Publishing.
Performance Evaluation of a Multilevel Sub-structuring Method for Sparse Eigenvalue Problems (with W. Gao, C. Yang and Z. Bai), 16th International Conference on Domain Decomposition Methods, January 12-15, 2005, New York University.
Performance Analysis of Parallel Right-Looking Sparse LU Factorization on Two Dimensional Grids of Processors (with L. Grigori), PARA'04 Workshop on State-of-the-art in Scientific Computing, June 20-23, 2004, Copenhagen, Denmark.
Performance Evaluation and Enhancement of SuperLU_DIST 2.0 (with Y. Wang), Technical report LBNL-53624, August 2003.
Effects of Ordering Strategies and Programming Paradigms on Sparse Matrix Computations (with L. Oliker, P. Husbands and R. Biswas), SIAM Review, Vol. 44, No. 3, pp. 373-393, September 2002.
Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines (with B.R. Gaeke, P. Husbands, L. Oliker, K.A. Yelick and R. Biswas), International Parallel and Distributed Processing Symposium (IPDPS 2002), April 15-19, 2002, Fort Lauderdale, Florida.