👁 InternLM
InternLM
InternLM 7B
Legacy1.4KDownloads96LikesJul 2023Released8K tokensContextApache 2.0License50 GoodQuality
InternLM 7B (7B parameters) requires approximately 13.9 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 16 GB of VRAM.
Get started
— copy & paste to run locallyCopy-paste commands to run InternLM 7B on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
--hf-repo "InternLM/InternLM-7B" \
--hf-file "InternLM-7B-Q4_K_M.gguf" \
-c 4096 -ngl 99Quick specs
Parameters7B
Architecturedense
Context8K tokens
Modalitytext
Min RAM2.7 GB
Rec. RAM4.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyInternLM
✓ Chat✓ Reasoning
About this model
- •It leverages trillions of high-quality tokens for training to establish a powerful knowledge base
- •It provides a versatile toolset for users to flexibly build their own workflows
Related models
Your hardware
Detecting...
Quick picks
Best hardware
Top picks for InternLM 7B
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 2.7 GB | Low | — |
Q3_K_S | 3 | 3.4 GB | Low | — |
NVFP4 | 4 | 3.9 GB | Medium | — |
Q4_K_M | 4 | 4.3 GB | Medium | — |
Q5_K_M | 5 | 5.0 GB | High | — |
Q6_K | 6 | 5.7 GB | High | — |
Q8_0 | 8 | 7.5 GB | Very High | — |
F16 | 16 | 14.3 GB | Maximum | — |
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Weights4.3 GB
KV Cache7.8 GB
Runtime1.2 GB
Headroom0.6 GB
Frequently asked questions
FAQ — InternLM 7B
See also
