Wei Xiong's picture

Wei Xiong

weqweasdas

·

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset about 9 hours ago

selfcorrexp/distill_40koldc2r_120kw_84kcorr

published a dataset about 9 hours ago

selfcorrexp/distill_40koldc2r_120kw_84kcorr

updated a dataset about 9 hours ago

selfcorrexp/distill_0kc2r_120kw_84kcorr

View all activity

Organizations

Papers 4

arxiv:2405.07863

arxiv:2312.11456

arxiv:2306.12420

arxiv:2304.06767

models 23

weqweasdas/zephyr-7b-dpo-full

Text Generation • Updated May 3, 2024 • 7

weqweasdas/zephyr-7b-gemma-dpo

Updated May 1, 2024

weqweasdas/zephyr-7b-sft-full

Updated Apr 30, 2024

weqweasdas/zephyr-7b-dpo-qlora

Updated Apr 30, 2024

weqweasdas/gpt2-cpt-dutch

Text Generation • Updated Apr 29, 2024 • 65

weqweasdas/zephyr-7b-gemma-sft

Updated Apr 29, 2024

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085

Text Generation • Updated Apr 16, 2024 • 4

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6

Text Generation • Updated Apr 16, 2024 • 5

weqweasdas/raft_baseline_zephyr_packing_model6

Text Generation • Updated Apr 15, 2024 • 4

weqweasdas/raft_baseline_openchat_llama13b_model1

Text Generation • Updated Apr 14, 2024 • 6

datasets 172

weqweasdas/ace_processed

Viewer • Updated about 20 hours ago • 5.18M

weqweasdas/llama31_70b_chosen_type12_mix

Viewer • Updated 8 days ago • 21.5k • 15

weqweasdas/prompt_math_test

Viewer • Updated 8 days ago • 15k • 19

weqweasdas/fixed05_llasft_math_7ktype2_7ktype3_ver2_150_tmp10_generation_with_rewards

Viewer • Updated 8 days ago • 30k • 30

weqweasdas/filtered_numia_prompt15k

Viewer • Updated 9 days ago • 15k • 18

weqweasdas/filtered_numia_prompt30k

Viewer • Updated 9 days ago • 30.6k • 17

weqweasdas/prompt_numinamath

Viewer • Updated 10 days ago • 119k • 19

weqweasdas/prompt_numinamath_with_gts

Viewer • Updated 10 days ago • 168k • 16

weqweasdas/fixed05_llasft_math_3ktype2_7ktype3_ver2_250_tmp10_generation_with_rewards

Viewer • Updated 10 days ago • 50k • 18

weqweasdas/fixed05_llasft_math_3ktype2_7ktype3_ver2_250_more_datatmp10_vllmexp_retest2_generation

Viewer • Updated 10 days ago • 50k • 16