2B • Updated • 2
Rui-Jie Zhu
ridger
👁 Image
21world's profile picture👁 Image
hanwenzhu's profile picture👁 Image
petro233's profile picture
21world's profile picture👁 Image
hanwenzhu's profile picture👁 Image
petro233's profile picture
·
AI & ML interests
None yet
Recent Activity
new activity 27 days ago
ByteDance/Ouro-1.4B-Thinking:Fix default RoPE init function reference in OuroRotaryEmbedding upvoted a paper about 2 months ago
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models upvoted a paper about 2 months ago
Large Language Models Explore by Latent DistillingOrganizations
0.3B • Updated • 1
0.3B • Updated • 1
1B • Updated • 3
1B • Updated • 2
Updated • 12
Updated • 38
Updated • 11
1B • Updated • 3
Text Generation • 1B • Updated • 43 • 5
Text Generation • 0.4B • Updated • 2.52k • 18
Text Generation • 3B • Updated • 2.42k • 36
Updated
Updated • 14
Text Generation • Updated • 21
