Jeff A. Bilmes's Publications


BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers

Lilly Kumari, Shengjie Wang, Tianyi Zhou, Nikhil Sarda, Anthony Rowe, and Jeff Bilmes. BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers. In First Conference on Language Modeling, Seattle, WA, 2024. Published as a conference paper at COLM 2024.

Download

[HTML] 

Abstract

(unavailable)

BibTeX

@inproceedings{kumari2024bumblebee,
  title={BumbleBee: Dynamic KV-Cache Streaming Submodular Summarization for Infinite-Context Transformers},
  author={Kumari, Lilly and Wang, Shengjie and Zhou, Tianyi and Sarda, Nikhil and Rowe, Anthony and Bilmes, Jeff},
  booktitle={First Conference on Language Modeling},
  year={2024},
  address={Seattle, WA},
  organization={COLM},
  note={Published as a conference paper at COLM 2024},
  url={https://openreview.net/pdf?id=8w0RApM5yG},
}


Generated by bib2html.pl (written by Patrick Riley) on Mon Oct 14, 2024 00:38:45