VOOZH about

URL: https://github.com/SqueezeAILab

⇱ SqueezeAILab · GitHub


Skip to content

Popular repositories Loading

  1. [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.8k 129

  2. [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 715 50

  3. [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!

    Python 479 70

  4. KVQuant Public

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

    Python 419 39

  5. LLM2LLM Public

    [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    Python 195 15

  6. [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference

    Python 58 8

Repositories

Showing 10 of 16 repositories
You can’t perform that action at this time.