Optimization of sparse matrix-vector multiplication on emerging multicore platforms S Williams, L Oliker, R Vuduc, J Shalf, K Yelick, J Demmel Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, 1-12, 2007 | 1060 | 2007 |

Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures K Datta, M Murphy, V Volkov, S Williams, J Carter, L Oliker, D Patterson, ... SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 1-12, 2008 | 833 | 2008 |

The potential of the cell processor for scientific computing S Williams, J Shalf, L Oliker, S Kamil, P Husbands, K Yelick Proceedings of the 3rd Conference on Computing Frontiers, 9-20, 2006 | 488 | 2006 |

Optimization and performance modeling of stencil computations on modern microprocessors K Datta, S Kamil, S Williams, L Oliker, J Shalf, K Yelick SIAM review 51 (1), 129-159, 2009 | 317 | 2009 |

A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome JA Chapman, M Mascher, A Buluç, K Barry, E Georganas, A Session, ... Genome biology 16, 1-17, 2015 | 298 | 2015 |

An auto-tuning framework for parallel multicore stencil computations S Kamil, C Chan, L Oliker, J Shalf, S Williams 2010 IEEE international symposium on parallel & distributed processing …, 2010 | 295 | 2010 |

Implicit and explicit optimizations for stencil computations S Kamil, K Datta, S Williams, L Oliker, J Shalf, K Yelick Proceedings of the 2006 workshop on Memory system performance and …, 2006 | 197 | 2006 |

Critical assessment of metagenome interpretation: the second round of challenges F Meyer, A Fritz, ZL Deng, D Koslicki, TR Lesker, A Gurevich, G Robertson, ... Nature methods 19 (4), 429-440, 2022 | 189 | 2022 |

PLUM: Parallel load balancing for adaptive unstructured meshes L Oliker, R Biswas Journal of Parallel and Distributed Computing 52 (2), 150-177, 1998 | 181 | 1998 |

Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication A Buluç, S Williams, L Oliker, J Demmel 2011 IEEE International Parallel & Distributed Processing Symposium, 721-733, 2011 | 167 | 2011 |

Job superscheduler architecture and performance in computational grid environments H Shan, L Oliker, R Biswas Proceedings of the 2003 ACM/IEEE conference on Supercomputing, 44, 2003 | 151 | 2003 |

Lattice Boltzmann simulation optimization on leading multicore platforms S Williams, J Carter, L Oliker, J Shalf, K Yelick 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-14, 2008 | 146 | 2008 |

Scientific computing kernels on the cell processor S Williams, J Shalf, L Oliker, S Kamil, P Husbands, K Yelick International Journal of Parallel Programming 35 (3), 263-298, 2007 | 138 | 2007 |

Impact of modern memory subsystems on cache optimizations for stencil computations S Kamil, P Husbands, L Oliker, J Shalf, K Yelick Proceedings of the 2005 workshop on Memory system performance, 36-43, 2005 | 138 | 2005 |

Roofline model toolkit: A practical tool for architectural and program analysis YJ Lo, S Williams, B Van Straalen, TJ Ligocki, MJ Cordery, NJ Wright, ... High Performance Computing Systems. Performance Modeling, Benchmarking, and …, 2015 | 132 | 2015 |

Scientific computations on modern parallel vector systems L Oliker, A Canning, J Carter, J Shalf, S Ethier SC'04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, 10-10, 2004 | 114 | 2004 |

Parallel de bruijn graph construction and traversal for de novo genome assembly E Georganas, A Buluç, J Chapman, L Oliker, D Rokhsar, K Yelick SC'14: Proceedings of the International Conference for High Performance …, 2014 | 105 | 2014 |

Analyzing ultra-scale application communication requirements for a reconfigurable hybrid interconnect J Shalf, S Kamil, L Oliker, D Skinner SC'05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, 17-17, 2005 | 105 | 2005 |

Effects of ordering strategies and programming paradigms on sparse matrix computations L Oliker, X Li, P Husbands, R Biswas Siam Review 44 (3), 373-393, 2002 | 100 | 2002 |

Investigation of leading HPC I/O performance using a scientific-application derived benchmark J Borrill, L Oliker, J Shalf, H Shan Proceedings of the 2007 ACM/IEEE conference on Supercomputing, 1-12, 2007 | 94 | 2007 |