Jeff A. Bilmes's Publications
• Sorted by Date • Classified by Publication Type • Classified by Research Category • Sorted by First Author Last Name • Classified by Author Last Name •
BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers
Lilly Kumari, Shengjie Wang, Tianyi Zhou, Nikhil Sarda, Anthony Rowe, and Jeff Bilmes. BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers. In First Conference on Language Modeling, Seattle, WA, 2024. Published as a conference paper at COLM 2024
Download
Abstract
(unavailable)
BibTeX
@inproceedings{kumari2024bumblebee, title={BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers}, author={Kumari, Lilly and Wang, Shengjie and Zhou, Tianyi and Sarda, Nikhil and Rowe, Anthony and Bilmes, Jeff}, booktitle={Proceedings of the Conference on Learning Machines (COLM)}, year={2024}, address={Seattle, WA}, organization={COLM}, booktitle={First Conference on Language Modeling}, note={Published as a conference paper at COLM 2024}, url={https://openreview.net/pdf?id=8w0RApM5yG}, }
Share
Generated by bib2html.pl (written by Patrick Riley ) on Mon Oct 14, 2024 00:38:45