prithivMLmods
committed on
Update README.md
README.md
CHANGED
@@ -14,3 +14,7 @@ tags:
---

![python.gif](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/4nYxcbXSfu2Q0fIXul41e.gif)
# **PyThagorean-10B**
PyThagorean [Python + Math] is a model for solving mathematical problems with Python code and libraries. It is built on the LLaMA architecture and has been fine-tuned on 1.5 million entries. The model is available in several parameter sizes: 10B, 3B, and 1B (Tiny). These instruction-tuned, text-only models are optimized for multilingual dialogue use cases, including agent-based retrieval and summarization tasks. PyThagorean is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to align with human preferences for helpfulness and safety.
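
To make the intended use concrete, the sketch below shows the kind of Python-backed solution the description refers to: a word problem answered by a short script rather than by free-form arithmetic. It is an illustration written for this README, not actual model output. The prompt text and the function name `solve_ladder_problem` are hypothetical.

```python
import math

# Hypothetical prompt of the kind the model is described as handling.
PROMPT = (
    "A 13 m ladder leans against a wall with its foot 5 m from the wall. "
    "How high up the wall does it reach?"
)


def solve_ladder_problem(ladder_length: float, base_distance: float) -> float:
    """Return the height reached, using the Pythagorean theorem a^2 + b^2 = c^2."""
    if base_distance > ladder_length:
        raise ValueError("the base distance cannot exceed the ladder length")
    # Solve for the missing leg: height = sqrt(c^2 - b^2).
    return math.sqrt(ladder_length**2 - base_distance**2)


if __name__ == "__main__":
    height = solve_ladder_problem(13, 5)
    print(PROMPT)
    print(f"Answer: {height:.1f} m")
```

Putting the computation in a small function keeps the answer checkable: the same call can be re-run with different inputs, or asserted against a known result.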