Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding" • 8 items • Updated
README.md exists but content is empty.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
