publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- [NSDI] Libra: Flexible Request Partitioning and Scheduling for Serving Unbalanced and Dynamic LLM Workloads. In Proc. of the USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2026
- [NSDI] Cortex: Achieving Low-Latency, Cost-Efficient Remote Data Access For LLM via Semantic-Aware Knowledge Caching. In Proc. of the USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2026
- [ICLR] Revisiting Parameter Server in LLM Post-Training. In Proc. of the International Conference on Learning Representations (ICLR), 2026
2025
- [VLDB] Adapting to Data Affinity Changes in Geo-Replicated Database via Row-Level Paxos-Group Affiliation Re-Assignment. Proc. VLDB Endow., 2025
- [ICCD] DHeLlam: General-Purpose, Automatic Micro-batch Co-execution for Distributed LLM Training. In Proc. of the IEEE International Conference on Computer Design (ICCD), 2025
2023
- [ASPLOS] Persistent Memory Disaggregation for Cloud-Native Relational Databases. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, Vancouver, BC, Canada, 2023
2021
- [VLDB] Towards cost-effective and elastic cloud database deployment via memory disaggregation. Proc. VLDB Endow., Jun 2021
- [FAST] SpanDB: A Fast, Cost-Effective LSM-tree Based KV Store on Hybrid Storage. In 19th USENIX Conference on File and Storage Technologies (FAST 21), Jun 2021
- PerfEstimator: A Generic and Extensible Performance Estimator for Data Parallel DNN Training. In 2021 IEEE/ACM International Workshop on Cloud Intelligence (CloudIntelligence), Oct 2021
2018
- [INFOCOM] AppDNA: App Behavior Profiling via Graph-based Deep Learning. In IEEE INFOCOM 2018 - IEEE Conference on Computer Communications, Oct 2018