Performance modeling tools for parallel sparse linear algebra
computations,
Proceedings of the ParCo2009, September 1-4, 2009, Lyon.
Evaluation of sparse LU factorization and triangular solution on
multicore platforms,
Proceedings of the VECPAR'08, June 24-27, 2008, Toulouse.
(Technical Report LBNL-571E)
Evaluation SuperLU on multicore architectures,
SciDAC 2008, Journal of Physics: Conference Series 125 (2008) 012079,
IOP Publishing.
Performance Evaluation of a Multilevel Sub-structuring Method
for Sparse Eigenvalue Problems (with W. Gao, C. Yang and Z. Bai),
16th International Conference on Domain Decomposition Methods,
January 12-15, 2005, New York University.
Performance Analysis of Parallel Right-Looking Sparse LU Factorization
on Two Dimensional Grids of Processors (with L. Grigori),
PARA'04 Workshop on State-of-the-art in Scientific Computing,
June 20-23, 2004, Copenhagen, Denmark.
Performance Evaluation and Enhancement of SuperLU_DIST 2.0
(with Y. Wang), Technical report LBNL-53624, August 2003.
Effects of Ordering Strategies and Programming
Paradigms on Sparse Matrix Computations
(with L. Oliker, P. Husbands and R. Biswas),
SIAM Review, Vol. 44, No. 3, pp. 373-393, September 2002.
Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines
(with B.R. Gaeke, P. Husbands, L. Oliker, K.A. Yelick and R. Biswas),
International Parallel and Distributed Processing Symposium
(IPDPS 2002), April 15-19, 2002, Fort Lauderdale, Florida.