Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated
a model
5 days ago
ricdomolm/pythia-1.4b-sft-gsm8k-3e
published
a model
5 days ago
ricdomolm/pythia-1.4b-sft-gsm8k-3e
updated
a model
5 days ago
ricdomolm/pythia-1.4b-sft-gsm8k-1e
Organizations
None yet
Collections
1
models
67
ricdomolm/pythia-1.4b-sft-gsm8k-3e
Text Generation
•
Updated
•
327
ricdomolm/pythia-1.4b-sft-gsm8k-1e
Text Generation
•
Updated
•
17
ricdomolm/ml4331-reward-model
Text Generation
•
Updated
•
262
ricdomolm/ml4331-reward-model2
Text Generation
•
Updated
•
4
ricdomolm/ml4331-dpo-model
Text Generation
•
Updated
•
214
ricdomolm/ml4331-instruction-model
Text Generation
•
Updated
•
369
ricdomolm/test-model
Updated
ricdomolm/SmolLM2-135M-SFT-Alpaca
Updated
ricdomolm/reward-model-exercise
Updated
ricdomolm/lawma-8b
Text Generation
•
Updated
•
2.2k
•
6
datasets
15
ricdomolm/caselawqa_leaderboard_results
Updated
•
1.18k
ricdomolm/caselawqa_leaderboard_requests
Viewer
•
Updated
•
29
•
1.15k
ricdomolm/lawma-instructions_gemma2_8k
Viewer
•
Updated
•
554k
•
71
ricdomolm/lawma-instructions_llama3_16k
Viewer
•
Updated
•
554k
•
35
ricdomolm/lawma-instructions_llama3_8k
Viewer
•
Updated
•
554k
•
54
ricdomolm/lawma-instructions
Viewer
•
Updated
•
554k
•
41
ricdomolm/lawma-tasks
Viewer
•
Updated
•
692k
•
950
•
2
ricdomolm/lawma-task-files
Updated
•
31
ricdomolm/caselawqa-8k
Viewer
•
Updated
•
16.1k
•
38
•
2
ricdomolm/lawma-all-tasks
Viewer
•
Updated
•
575k
•
50