loubnabnl HF staff commited on
Commit
2b36efb
·
verified ·
1 Parent(s): 2137483

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -75,7 +75,7 @@ We validated these classifiers by filtering Stack v2 data and testing on an inte
75
 
76
  The table below shows Stack-Edu dataset statistics and MultiPL-E scores for the top 4 (in terms of size) programming languages. We use HumanEval for Python evaluation. For the ablation, we started from a mid-training checkpoint of SmolLM2 at 3T tokens which was trained primarily on web data, and perform linear annealing on 200B tokens, uniformly distributed across 15 of the most commonly used programming languages (~14B tokens each).
77
 
78
- | Language | Before filtering (B tokens) | After filtering (B tokens) | MultiPL-E (Original → Filtered) |
79
  |------------|-------------------------|---------------------|-------------------------------|
80
  | Python | 50.6 | 21.8 | 20.7 → 25.6 |
81
  | C++ | 69.7 | 16.0 | 16.7 → 24.8 |
 
75
 
76
  The table below shows Stack-Edu dataset statistics and MultiPL-E scores for the top 4 (in terms of size) programming languages. We use HumanEval for Python evaluation. For the ablation, we started from a mid-training checkpoint of SmolLM2 at 3T tokens which was trained primarily on web data, and perform linear annealing on 200B tokens, uniformly distributed across 15 of the most commonly used programming languages (~14B tokens each).
77
 
78
+ | Language | Size before filtering (B tokens) | Size after filtering (B tokens) | MultiPL-E score (Original → Filtered) |
79
  |------------|-------------------------|---------------------|-------------------------------|
80
  | Python | 50.6 | 21.8 | 20.7 → 25.6 |
81
  | C++ | 69.7 | 16.0 | 16.7 → 24.8 |