Domino: Decoupling Causal Modeling from Autoregressive Drafting in
Speculative Decoding
黄佳诺
Huang2020
AI & ML interests
None yet
Recent Activity
reacted to Banaxi-Tech's post with 🔥 about 10 hours ago
Hello AI Community! 👋
We have just released BananaMind 1.5 Base and it outperforms other models at its size.
It outperforms GPT 2 124M while being ~50M params smaller
Check it out: https://huggingface.co/BananaMind/BananaMind-1.5-Base
OLD POST CONTENTS EDITED:
We currently have a new AI Model and we are currently training it.
We are training it on 27B tokens and are currently 8% done.
Follow us to be notified when it releases 🚀
Some Info:
Parameters 75M
GPU: RTX Pro 6000
We expect to be able to release it in the coming dayshttps://huggingface.co/BananaMind/BananaMind-1.5-Base upvoted a collection 24 days ago
speculative_decoding upvoted a collection 24 days ago
InferenceOrganizations
None yet