Update README.md
Browse files
README.md
CHANGED
@@ -9,14 +9,16 @@ base_model:
|
|
9 |
# Hammer2.0-1.5b Function Calling Model
|
10 |
|
11 |
## Introduction
|
12 |
-
|
|
|
|
|
|
|
13 |
|
14 |
## Model Details
|
15 |
-
|
16 |
-
|
17 |
|
18 |
## Tuning Details
|
19 |
-
|
20 |
|
21 |
## Evaluation
|
22 |
The evaluation results of Hammer 2.0 series on the Berkeley Function-Calling Leaderboard (BFCL) are presented in the following table:
|
@@ -31,6 +33,7 @@ In addition, we evaluated Hammer2.0 on other academic benchmarks to further show
|
|
31 |
</div>
|
32 |
|
33 |
On comparison, Hammer 2.0 outperforms models with similar sizes and even surpass many larger models overall.
|
|
|
34 |
## Requiements
|
35 |
The code of Hammer2.0-1.5b has been in the latest Hugging face transformers and we advise you to install `transformers>=4.37.0`.
|
36 |
|
|
|
9 |
# Hammer2.0-1.5b Function Calling Model
|
10 |
|
11 |
## Introduction
|
12 |
+
We're excited to introduce Hammer 2.0, the latest in our Hammer Large Language Models series designed to enhance AI function calling. Differing from existing models focusing on training data refinement, Hammer optimizes performance primarily through advanced training techniques. In this version, we release a number of models with sizes ranging from 0.5B to 7B:
|
13 |
+
[0.5B](https://huggingface.co/MadeAgents/Hammer2.0-0.5b),
|
14 |
+
[1.5B](https://huggingface.co/MadeAgents/Hammer2.0-1.5b),
|
15 |
+
[4B](https://huggingface.co/MadeAgents/Hammer2.0-3b), and [7B](https://huggingface.co/MadeAgents/Hammer2.0-0.5b).
|
16 |
|
17 |
## Model Details
|
18 |
+
Hammer2.0 finetuned based on [Qwen 2.5 series](https://huggingface.co/collections/Qwen/qwen25-66e81a666513e518adb90d9e) and [Qwen 2.5 coder series](https://huggingface.co/collections/Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f). It's trained using the [APIGen Function Calling Datasets](https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k) containing 60,000 samples, supplemented by [7,500 irrelevance detection data](https://huggingface.co/datasets/MadeAgents/XLAM-7.5k-Irrelevance) we generated. Employing innovative training techniques of function masking, function shuffling, and prompt optimization, Hammer2.0 has achieved exceptional performances across numerous benchmarks including [Berkley Function Calling Leaderboard](https://gorilla.cs.berkeley.edu/leaderboard.html), [API-Bank](https://arxiv.org/abs/2304.08244), [Tool-Alpaca](https://arxiv.org/abs/2306.05301), [Nexus Raven](https://github.com/nexusflowai/NexusRaven-V2) and [Seal-Tools](https://arxiv.org/abs/2405.08355).
|
|
|
19 |
|
20 |
## Tuning Details
|
21 |
+
We will soon release a report detailing the technical aspects of our models. Stay tuned!
|
22 |
|
23 |
## Evaluation
|
24 |
The evaluation results of Hammer 2.0 series on the Berkeley Function-Calling Leaderboard (BFCL) are presented in the following table:
|
|
|
33 |
</div>
|
34 |
|
35 |
On comparison, Hammer 2.0 outperforms models with similar sizes and even surpass many larger models overall.
|
36 |
+
|
37 |
## Requiements
|
38 |
The code of Hammer2.0-1.5b has been in the latest Hugging face transformers and we advise you to install `transformers>=4.37.0`.
|
39 |
|