t.d.a.g. PRO

sequelbox

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

Organizations

Valiant Labs's profile picture

sequelbox's activity

posted an update about 20 hours ago
view post
Post
997
SNEAK PREVIEW: Tachibana 2! A new high-difficulty code-reasoning dataset to use and challenge deepseek-ai/DeepSeek-R1 - harder prompts, complex requirements, deeper technical skill.

Link here: sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

All responses generated by DeepSeek's R1 model, all prompts synthetically generated by Llama 3.1 405b Instruct.

excited to bring out the full dataset for everyone's use as soon as I can! more to come soon.
New activity in open-thoughts/OpenThoughts-114k 11 days ago

license

5
#2 opened 28 days ago by
sequelbox
New activity in sequelbox/Raiden-DeepSeek-R1 13 days ago
posted an update 15 days ago
reacted to rubenroy's post with šŸš€ 20 days ago
view post
Post
2432
šŸ”„šŸš€ Hey everyone! I'm excited to share my latest LLM release: Gilgamesh 72B, a model built on Qwen 2.5-72B Instruct. Gilgamesh was trained on a couple of my GammaCorpus datasets, specifically:

- rubenroy/GammaCorpus-CoT-Math-170k
- rubenroy/GammaCorpus-v2-5m
- rubenroy/GammaCorpus-Fact-QA-450k

I've submitted GGM 72B to the Open LLM Leaderboard for benchmarking, I'll send an update post once the results are in!

You can try it out and share your feedback, check out the model page and see what it can do:
šŸ‘‰ rubenroy/Gilgamesh-72B

Would love to hear your thoughts!
New activity in sequelbox/Raiden-DSR1-PREVIEW 20 days ago
posted an update 22 days ago
view post
Post
1899
New sneak preview of my next release! Raiden is a deepseek-ai/DeepSeek-R1 synthetic dataset that uses creative-reasoning and analytic-reasoning prompts!

This preview release has the first 5.8k rows, all responses generated using DeepSeek's 685b parameter R1 model: sequelbox/Raiden-DSR1-PREVIEW

Enjoy this look at R1's reasoning skills! Full dataset coming soon.