81 8 151

t.d.a.g. PRO

sequelbox

sequelbox.bsky.social

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

posted an update about 20 hours ago

SNEAK PREVIEW: Tachibana 2! A new high-difficulty code-reasoning dataset to use and challenge https://huggingface.co/deepseek-ai/DeepSeek-R1 - harder prompts, complex requirements, deeper technical skill. Link here: https://huggingface.co/datasets/sequelbox/Tachibana2-DeepSeek-R1-PREVIEW All responses generated by DeepSeek's R1 model, all prompts synthetically generated by Llama 3.1 405b Instruct. excited to bring out the full dataset for everyone's use as soon as I can! more to come soon.

published a dataset about 20 hours ago

sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

updated a dataset about 20 hours ago

sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

View all activity

Organizations

sequelbox's activity

posted an update about 20 hours ago

Post

997

SNEAK PREVIEW: Tachibana 2! A new high-difficulty code-reasoning dataset to use and challenge deepseek-ai/DeepSeek-R1 - harder prompts, complex requirements, deeper technical skill.

Link here: sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

All responses generated by DeepSeek's R1 model, all prompts synthetically generated by Llama 3.1 405b Instruct.

excited to bring out the full dataset for everyone's use as soon as I can! more to come soon.

published a dataset about 20 hours ago

sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

Viewer • Updated about 20 hours ago • 6.12k • 6 • 1

updated a dataset about 20 hours ago

sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

Viewer • Updated about 20 hours ago • 6.12k • 6 • 1

liked a model 3 days ago

sometimesanotion/Lamarck-14B-v0.7

Text Generation • Updated 8 days ago • 6.55k • 37

liked a dataset 7 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated 7 days ago • 450k • 23.8k • 421

liked a model 10 days ago

open-thoughts/OpenThinker-32B

Text Generation • Updated 12 days ago • 2.33k • 153

New activity in open-thoughts/OpenThoughts-114k 11 days ago

license

#2 opened 28 days ago by

sequelbox

liked 2 models 13 days ago

nvidia/AceInstruct-7B

Text Generation • Updated Jan 16 • 458 • 16

nvidia/AceInstruct-72B

Text Generation • Updated Jan 16 • 167 • 14

New activity in sequelbox/Raiden-DeepSeek-R1 13 days ago

[bot] Conversion to Parquet

#1 opened 14 days ago by

parquet-converter

liked a dataset 14 days ago

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated 15 days ago • 62.9k • 771 • 38

posted an update 15 days ago

Post

2667

Raiden is here! 63k creative-reasoning and analytic-reasoning prompts answered by DeepSeek's 685b R1 model!

- All prompts from microsoft/orca-agentinstruct-1M-v1 and all responses from deepseek-ai/DeepSeek-R1
- A deep look at R1's reasoning skills! Use as you will.

Get it now: sequelbox/Raiden-DeepSeek-R1

for everyone :)

published a dataset 15 days ago

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated 15 days ago • 62.9k • 771 • 38

updated a dataset 15 days ago

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated 15 days ago • 62.9k • 771 • 38

liked a model 20 days ago

ibm-granite/granite-timeseries-ttm-r2

Time Series Forecasting • Updated 1 day ago • 823k • 51

reacted to rubenroy's post with 🚀 20 days ago

Post

2432

🔥🚀 Hey everyone! I'm excited to share my latest LLM release: Gilgamesh 72B, a model built on Qwen 2.5-72B Instruct. Gilgamesh was trained on a couple of my GammaCorpus datasets, specifically:

- rubenroy/GammaCorpus-CoT-Math-170k
- rubenroy/GammaCorpus-v2-5m
- rubenroy/GammaCorpus-Fact-QA-450k

I've submitted GGM 72B to the Open LLM Leaderboard for benchmarking, I'll send an update post once the results are in!

You can try it out and share your feedback, check out the model page and see what it can do:
👉 rubenroy/Gilgamesh-72B

Would love to hear your thoughts!

New activity in sequelbox/Raiden-DSR1-PREVIEW 20 days ago

[bot] Conversion to Parquet

#1 opened 21 days ago by

parquet-converter

posted an update 22 days ago

Post

1899

New sneak preview of my next release! Raiden is a deepseek-ai/DeepSeek-R1 synthetic dataset that uses creative-reasoning and analytic-reasoning prompts!

This preview release has the first 5.8k rows, all responses generated using DeepSeek's 685b parameter R1 model: sequelbox/Raiden-DSR1-PREVIEW

Enjoy this look at R1's reasoning skills! Full dataset coming soon.

published a dataset 22 days ago

sequelbox/Raiden-DSR1-PREVIEW

Viewer • Updated 22 days ago • 5.8k • 77 • 3

updated a dataset 22 days ago

sequelbox/Raiden-DSR1-PREVIEW

Viewer • Updated 22 days ago • 5.8k • 77 • 3