Trained on the GSM8K and AQuA-RAT datasets for 3 epochs. Final loss: 3.8256