prithivMLmods's picture
Update README.md
28cd639 verified
|
raw
history blame
695 Bytes
metadata
library_name: transformers
license: llama3.1
language:
  - en
base_model:
  - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
pipeline_tag: text-generation
tags:
  - llama
  - deepseek

Llama-8B-Distill-CoT

Llama-8B-Distill-CoT is based on the Llama [ KT ] model, distilled by DeepSeek-R1-Distill-Llama-8B. It has been fine-tuned on the long chain-of-thought reasoning model and specialized datasets, focusing on chain-of-thought (CoT) reasoning for problem-solving. This model is optimized for tasks requiring logical reasoning, detailed explanations, and multi-step problem-solving, making it ideal for applications such as instruction-following, text generation, and complex reasoning tasks.