Embed Mamba2 Collection Text embedding models based on Mamba2 with linear-time and constant-memory inference through vertical chunking. • 5 items • Updated 1 day ago • 2
REAM Collection Compressed MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 12 items • Updated 2 days ago • 6