gabrielmbmb

👁 Image
Arshavir's profile picture 👁 Image
GloriaMK's profile picture 👁 Image
julien-c's profile picture

AI & ML interests

ML Engineer

Recent Activity

liked a model 7 days ago

unsloth/diffusiongemma-26B-A4B-it-GGUF

liked a model 8 days ago

CohereLabs/North-Mini-Code-1.0

liked a model 20 days ago

unsloth/Qwen3.6-35B-A3B-MTP-GGUF

View all activity

Organizations

👁 Spaces-explorers's profile picture
👁 SomosNLP's profile picture
👁 Blog-explorers's profile picture
👁 Social Post Explorers's profile picture
👁 Hugging Face Discord Community's profile picture
👁 LLHF's profile picture
👁 SLLHF's profile picture
👁 Data Is Better Together Contributor's profile picture
👁 Supersonik's profile picture
👁 Toad HF Inference Explorers's profile picture

Posts 4

view post

Yesterday @mattshumer released mattshumer/Reflection-Llama-3.1-70B, an impressive model that achieved incredible results in benchmarks like MMLU. The model was fine-tuned using Reflection-Tuning and the dataset used wasn't released, but I created a small recipe with distilabel that allows generating a dataset with a similar output format:

1. We use MagPie 🐦 in combination with https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct to generate reasoning instructions.
2. We generate a response again using https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct, but we steer the LLM to generate an specific output format using a custom system prompt. In the system prompt, we instruct the LLM that it will have first to think 💭 and have reflections that will help resolving ambiguities. After that, we instruct the LLM to generate an output based on the previous thinking

In this dataset gabrielmbmb/distilabel-reflection-tuning you can found 5 rows that I generated with this recipe. You can also found the code of the pipeline in the file called reflection.py.

View all Posts