Update README.md
README.md
@@ -16,14 +16,6 @@ tags:
 ---
 
-
-The only difference from Llama-3.2-1B-chatml-tool-v1 is that it uses AlternateTokenizer, which does not define tool-related tokens (<tools>, <tool_call>, <tool_response>).
-
-In the case of the existing tool-AlternateTokenizer, the <tool_call> tag was not properly generated before the function call, but in v2, it was observed that it performed well when trained with the general AlternateTokenizer.
-
-need to check whether this phenomenon is repeated in larger models (3B, 8B).
-
-
 ## Model Performance Comparison (BFCL)
 
 | task name | minpeter/Llama-3.2-1B-chatml-tool-v2 | meta-llama/Llama-3.2-1B-Instruct (measure) | meta-llama/Llama-3.2-1B-Instruct (Reported) |
@@ -33,5 +25,12 @@ need to check whether this phenomenon is repeated in larger models (3B, 8B).
 | simple | **0.72** | 0.215 | 0.2925 |
 | multiple | **0.695** | 0.17 | 0.335 |
 
-
 *Parallel calls are not taken into account. 0 points are expected. We plan to fix this in v3.
+
+### Note
+
+The only difference from Llama-3.2-1B-chatml-tool-v1 is that it uses AlternateTokenizer, which does not define the tool-related tokens (`<tools>`, `<tool_call>`, `<tool_response>`) as special tokens.
+
+With the existing tool-AlternateTokenizer, the `<tool_call>` tag was not reliably generated before the function call, whereas in v2 the model performed well when trained with the general AlternateTokenizer.
+
+We need to check whether this behavior also holds for larger models (3B, 8B).
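The note above describes assistant outputs that wrap each function call in plain-text `<tool_call>` tags. As a minimal sketch of how a caller might consume such outputs, the snippet below extracts the JSON payloads between those tags; `extract_tool_calls` and the sample output string are hypothetical illustrations, not code shipped with the model, and the exact payload schema is assumed to be `{"name": ..., "arguments": ...}`.

```python
import json
import re

# Matches chatml-style tool calls of the assumed form:
# <tool_call>{"name": ..., "arguments": ...}</tool_call>
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(\{.*?\})\s*</tool_call>", re.DOTALL)

def extract_tool_calls(text: str):
    """Return the parsed JSON payloads of all well-formed <tool_call> blocks."""
    calls = []
    for match in TOOL_CALL_RE.finditer(text):
        try:
            calls.append(json.loads(match.group(1)))
        except json.JSONDecodeError:
            pass  # skip malformed payloads rather than failing the whole parse
    return calls

# Hypothetical model output for illustration.
output = (
    "Sure, checking the weather now.\n"
    '<tool_call>{"name": "get_weather", "arguments": {"city": "Seoul"}}</tool_call>'
)
print(extract_tool_calls(output))
# → [{'name': 'get_weather', 'arguments': {'city': 'Seoul'}}]
```

Since the tags are ordinary text rather than special tokens under AlternateTokenizer, a parser like this only works when the model actually emits the tags — which is exactly the failure the note reports for v1 and the improvement observed in v2.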