AI & ML interests
None defined yet.
Aletheia-Bench/DPO-Think-14B
Text Generation
• 15B • Updated • 17
• 1
Aletheia-Bench/DPO-Think-1.5B
Text Generation
• 2B • Updated • 31
• Aletheia-Bench/BatchOnline-GRPO-7B
Text Generation
• 8B • Updated • 25
• 1
Aletheia-Bench/BatchOnline-GRPO-14B
Text Generation
• 15B • Updated • 14
• 1
Aletheia-Bench/BatchOnline-GRPO-1.5B
Text Generation
• 2B • Updated • 12
Aletheia-Bench/GRPO-Think-14B-8k
Text Generation
• 15B • Updated • 12
• 1
Aletheia-Bench/GRPO-Think-7B-8k
Text Generation
• 8B • Updated • 25
Aletheia-Bench/GRPO-Think-14B-4k
Text Generation
• 15B • Updated • 18
8B • Updated • 22
Aletheia-Bench/GRPO-Think-1.5B-8k
Text Generation
• 2B • Updated • 10
Aletheia-Bench/GRPO-Think-7B-4k
Text Generation
• 8B • Updated • 23
Aletheia-Bench/GRPO-Think-1.5B-4k
Text Generation
• 2B • Updated • 14
15B • Updated • 13
Aletheia-Bench/GRPO-Instruct-14B
Text Generation
• 15B • Updated • 22
Aletheia-Bench/GRPO-Instruct-1.5B
Text Generation
• 2B • Updated • 13
Aletheia-Bench/GRPO-Instruct-7B
Text Generation
• 8B • Updated • 30
2B • Updated • 17
Aletheia-Bench/DPO-Think-7B
Text Generation
• 8B • Updated • 23
Aletheia-Bench/GRPO-Think-14B-16k
Text Generation
• 15B • Updated • 11
Aletheia-Bench/GRPO-Think-1.5B-16k
Text Generation
• 2B • Updated • 28
• Aletheia-Bench/GRPO-Think-7B-16k
Text Generation
• 8B • Updated • 22