Follow
Suchita Pati
Suchita Pati
Verified email at cs.wisc.edu - Homepage
Title
Cited by
Cited by
Year
Analyzing machine learning workloads using a detailed GPU simulator
J Lew, DA Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ...
2019 IEEE international symposium on performance analysis of systems and …, 2019
772019
Demystifying bert: System design implications
S Pati, S Aga, N Jayasena, MD Sinclair
2022 IEEE International Symposium on Workload Characterization (IISWC), 296-309, 2022
132022
SeqPoint: Identifying representative iterations of sequence-based neural networks
S Pati, S Aga, MD Sinclair, N Jayasena
2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020
112020
Demystifying bert: Implications for accelerator design
S Pati, S Aga, N Jayasena, MD Sinclair
arXiv preprint arXiv:2104.08335, 2021
102021
Computation vs. Communication Scaling for Future Transformers on Future Hardware
S Pati, S Aga, M Islam, N Jayasena, MD Sinclair
arXiv preprint arXiv:2302.02825, 2023
5*2023
Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. CoRR abs/1811.08933 (2018)
J Lew, D Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ...
arXiv preprint arXiv:1811.08933, 2018
32018
Darts: Performance-counter driven sampling using binary translators
R Kumar, S Pati, K Lahiri
2017 IEEE International Symposium on Performance Analysis of Systems and …, 2017
32017
Just-in-time Quantization with Processing-In-Memory for Efficient ML Training
MA Ibrahim, S Aga, A Li, S Pati, M Islam
arXiv preprint arXiv:2311.05034, 2023
12023
Improving GPU Utilization in ML Workloads Through Finer-Grained Synchronization
R Kuper, S Pati, MD Sinclair
3rd Young Architects Workshop, 2021
12021
T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives
S Pati, S Aga, M Islam, N Jayasena, MD Sinclair
arXiv preprint arXiv:2401.16677, 2024
2024
Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware
S Pati, S Aga, M Islam, N Jayasena, MD Sinclair
2023 IEEE International Symposium on Workload Characterization (IISWC), 140-153, 2023
2023
Exploring GPU Architectural Optimizations for RNNs
S Pati
Young Architect Workshop (YArch), in conjunction with HPCA'19, 2019
2019
Effective Prefetching for Multicore/Multiprocessor Systems
S Pati, P Mahapatra
Transparent Compression for Flash SSDs
S Pati, Y Trivedi
The system can't perform the operation now. Try again later.
Articles 1–14