prithivMLmods committed on
Commit 304eea4 · verified · 1 Parent(s): ec4f088

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -14,3 +14,7 @@ tags:
 ---
 
 ![python.gif](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/4nYxcbXSfu2Q0fIXul41e.gif)
+
+# **PyThagorean-10B**
+
+PyThagorean [Python + Math] is a Python- and mathematics-focused model designed to solve mathematical problems using Python libraries and code. It has been fine-tuned on 1.5 million entries and is built on the LLaMA architecture. The model is available in several parameter sizes, including 10B, 3B, and 1B (Tiny). These instruction-tuned, text-only models are optimized for multilingual dialogue use cases, including agent-based retrieval and summarization tasks. PyThagorean is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions employ supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to align with human preferences for helpfulness and safety.
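
The added description says the model solves mathematical problems by writing Python. As a rough illustration of that style of answer, here is a minimal, hand-written sketch using only the standard library. It is not output from PyThagorean, and the function names `hypotenuse` and `solve_linear` are hypothetical, chosen only for this example.

```python
import math
from fractions import Fraction


def hypotenuse(a: float, b: float) -> float:
    """Return the hypotenuse of a right triangle with legs a and b."""
    # math.hypot computes sqrt(a*a + b*b) without intermediate overflow.
    return math.hypot(a, b)


def solve_linear(a: Fraction, b: Fraction) -> Fraction:
    """Solve a*x + b = 0 exactly for x; a must be non-zero."""
    if a == 0:
        raise ValueError("coefficient a must be non-zero")
    # Fractions keep the result exact instead of a rounded float.
    return -b / a


if __name__ == "__main__":
    print(hypotenuse(3, 4))                         # legs 3 and 4
    print(solve_linear(Fraction(2), Fraction(-3)))  # 2x - 3 = 0
```

Using `Fraction` for the linear equation keeps the answer exact, which is often what a math-oriented response needs; a floating-point version would work the same way with `float` arguments.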