Jeff A. Bilmes's Publications
• Sorted by Date • Classified by Publication Type • Classified by Research Category • Sorted by First Author Last Name • Classified by Author Last Name •
BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers
Lilly Kumari, Shengjie Wang, Tianyi Zhou, Nikhil Sarda, Anthony Rowe, and Jeff Bilmes. BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers. In First Conference on Language Modeling, Seattle, WA, 2024. Published as a conference paper at COLM 2024
Download
Abstract
(unavailable)
BibTeX
@inproceedings{kumari2024bumblebee,
title={BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers},
author={Kumari, Lilly and Wang, Shengjie and Zhou, Tianyi and Sarda, Nikhil and Rowe, Anthony and Bilmes, Jeff},
booktitle={Proceedings of the Conference on Learning Machines (COLM)},
year={2024},
address={Seattle, WA},
organization={COLM},
booktitle={First Conference on Language Modeling},
note={Published as a conference paper at COLM 2024},
url={https://openreview.net/pdf?id=8w0RApM5yG},
}
Share
Generated by bib2html.pl (written by Patrick Riley ) on Wed Nov 12, 2025 23:43:39