prithivMLmods
/

PyThagorean-10B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

prithivMLmods commited on 14 days ago

Commit

304eea4

·

verified ·

1 Parent(s): ec4f088

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -14,3 +14,7 @@ tags:
 ---
 ![python.gif](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/4nYxcbXSfu2Q0fIXul41e.gif)

 ---
 ![python.gif](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/4nYxcbXSfu2Q0fIXul41e.gif)
+# **PyThagorean-10B**
+PyThagorean [Python + Math] is a Python and mathematics-based model designed to solve mathematical problems using Python libraries and coding. It has been fine-tuned on 1.5 million entries and is built on LLaMA's architecture. The model supports different parameter sizes, including 10B, 3B, and 1B (Tiny). These instruction-tuned, text-only models are optimized for multilingual dialogue use cases, including agent-based retrieval and summarization tasks. PyThagorean leverages an auto-regressive language model that uses an optimized transformer architecture. The tuned versions employ supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.