SB
shb777
AI & ML interests
LLMs, Speech-to-Speech, and Synthetic Data
Recent Activity
replied to csabakecskemeti's post
1 day ago
I've run the Open LLM Leaderboard evaluations plus HellaSwag on https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct. At first glance, R1 does not beat Llama overall.
If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results
Did I make a mistake, or is this model (at least this distilled version) no better than the competition?
I'll run the same on the Qwen 7B distilled version too.
updated a dataset
5 days ago
shb777/gemini-flash-2.0-speech
liked a Space
9 days ago
black-forest-labs/FLUX.1-dev
Organizations
shb777's activity
[bot] Conversion to Parquet
#1 opened 10 days ago by parquet-converter
Dataset License
#2 opened 14 days ago by shb777
[bot] Conversion to Parquet
#1 opened 18 days ago by parquet-converter
[bot] Conversion to Parquet
#1 opened 19 days ago by parquet-converter
[bot] Conversion to Parquet
#1 opened 21 days ago by parquet-converter
[bot] Conversion to Parquet
#1 opened about 2 months ago by parquet-converter
COLLAGE- Generated Images
9
#1 opened 8 months ago by prithivMLmods