XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference (English)
Free access
- New search for: Monteiro, João
- New search for: Marcotte, Étienne
- New search for: Noël, Pierre-André
- New search for: Zantedeschi, Valentina
- New search for: Vázquez, David
- New search for: Chapados, Nicolas
- New search for: Pal, Christopher
- New search for: Taslakian, Perouz
- New search for: Monteiro, João
- New search for: Marcotte, Étienne
- New search for: Noël, Pierre-André
- New search for: Zantedeschi, Valentina
- New search for: Vázquez, David
- New search for: Chapados, Nicolas
- New search for: Pal, Christopher
- New search for: Taslakian, Perouz
2024
- Preprint / Electronic Resource
-
Title:XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
-
Contributors:Monteiro, João ( author ) / Marcotte, Étienne ( author ) / Noël, Pierre-André ( author ) / Zantedeschi, Valentina ( author ) / Vázquez, David ( author ) / Chapados, Nicolas ( author ) / Pal, Christopher ( author ) / Taslakian, Perouz ( author )
-
Publisher:
- New search for: arXiv
-
Publication date:2024
-
Type of media:Preprint
-
Type of material:Electronic Resource
-
Language:English
-
Licence:
-
Source: