Artificial Hippocampus Networks (AHNs) for Efficient Long-Context Modeling
-
ByteDance-Seed/AHN-Mamba2-for-Qwen-2.5-Instruct-14B
Text Generation • Updated • 77 • 9 -
ByteDance-Seed/AHN-Mamba2-for-Qwen-2.5-Instruct-7B
Text Generation • Updated • 73 • 2 -
ByteDance-Seed/AHN-Mamba2-for-Qwen-2.5-Instruct-3B
Text Generation • Updated • 374 • 4 -
ByteDance-Seed/AHN-GDN-for-Qwen-2.5-Instruct-14B
Text Generation • Updated • 71 • 4