Will there be a distilled model that fits inside 48GB VRAM (2x 3090)?

#29

by gameveloster - opened Jan 18, 2023

And maybe another that fits inside 96GB when using a node of 4x 3090?

Hope someone can help distil this, thanks!

BigScience Workshop org Jan 18, 2023

I'd recommend using bloomz-7b1 or mt0-xxl, which should work well for inference given your setup.

@ybelkada also ran distillation experiments on BLOOM - I'm not sure what the verdict was i.e. if it makes sense for models of this scale?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment