👁 Image Submitted by 👁 Image Vivien Cabannes 23 Efficient RL Training for LLMs with Experience Replay 👁 meta-llama Meta Llama 2
👁 Image Submitted by 👁 Image Niels Rogge 36 V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning 👁 meta-llama Meta Llama 2
👁 Image Submitted by 👁 Image Yixin Nie 7 CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production 👁 meta-llama Meta Llama 2
👁 Image Submitted by 👁 Image Deqing Fu 19 TLDR: Token-Level Detective Reward Model for Large Vision Language Models 👁 meta-llama Meta Llama 3