Gaperon Collection • Our French-English LLM suite (SFT models are coming soon) • 16 items • Updated 8 days ago • 16
Gaperon: A Peppered English-French Generative Language Model Suite • Paper • 2510.25771 • Published Oct 29 • 15
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression • Paper • 2503.02812 • Published Mar 4 • 10
Headless Language Models: Learning without Predicting with Contrastive Weight Tying • Paper • 2309.08351 • Published Sep 15, 2023 • 3