sn2234

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Undi95/MistralThinker-v1.1

liked a model 1 day ago

TheDrummer/Cydonia-24B-v2.1

reacted to Undi95's post with 👍 4 days ago

Hi there! If you want to create your own thinking model or do a better MistralThinker, I just uploaded my entire dataset made on Deepseek R1 and the axolotl config. (well I made them public) Axolotl config : https://huggingface.co/Undi95/MistralThinker-v1.1/blob/main/axolotl_config_l6xu8_mg.yml The dataset : https://huggingface.co/datasets/Undi95/R1-RP-ShareGPT3 You can also read all I did on those two discord screenshot from two days ago, I'm a little lazy to rewrite all kek. Hope you will use them!

View all activity

Organizations

None yet

sn2234's activity

liked 2 models 1 day ago

Undi95/MistralThinker-v1.1

Updated 4 days ago • 444 • 29

TheDrummer/Cydonia-24B-v2.1

Updated 1 day ago • 47 • 25

reacted to Undi95's post with 👍 4 days ago

Post

4165

Hi there!

If you want to create your own thinking model or do a better MistralThinker, I just uploaded my entire dataset made on Deepseek R1 and the axolotl config. (well I made them public)

Axolotl config : Undi95/MistralThinker-v1.1

The dataset : Undi95/R1-RP-ShareGPT3

You can also read all I did on those two discord screenshot from two days ago, I'm a little lazy to rewrite all kek.

Hope you will use them!

5 replies

reacted to Kseniase's post with 👍 6 days ago

Post

5996

9 types of "Chain-of-..." approaches:

Chain-of-Thought (CoT) prompting enhances reasoning in AI models by breaking down complex problems into step-by-step logical sequences. It continues proving its effectiveness, especially in top-performing reasoning models. However, there are other similar methods, that expand CoT and can be used for different purposes. Here are 9 of them:

1. Chain-of-Action-Thought (COAT) -> Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search (2502.02508)
Helps model decide when to keep thinking, double-check their work, or try a different approach, using special guiding tokens.

2. Chain of Draft (CoD) -> Chain of Draft: Thinking Faster by Writing Less (2502.18600)
It helps model generate short but meaningful reasoning steps, cutting costs and making processing faster

3. Chain-of-Agents -> Chain of Agents: Large Language Models Collaborating on Long-Context Tasks (2406.02818)
Uses multi-agent collaboration: Worker agents process text parts in a structured chain, and manager agent summarizes the results

4. Chain-of-RAG ->https://huggingface.co/papers/2501.14342
Creates retrieval chains, instead of retrieving all info at once. It can dynamically adjust its search process and its parameters like step number

5. Chain-of-Shot Prompting (CoS) -> CoS: Chain-of-Shot Prompting for Long Video Understanding (2502.06428)
Helps models pick frames crucial for understanding a video, using a binary video summary and video co-reasoning module.

6. Chain of Hindsight (CoH) -> Chain of Hindsight Aligns Language Models with Feedback (2302.02676)
Converts all feedback into sequences to fine-tune the model and refine outputs

7. Chain-of-Note (CoN) -> Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models (2311.09210)
Generates sequential reading notes for each retrieved document to assess relevance before integrating info into the final answer

8. Chain of Diagnosis (CoD) -> CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis (2407.13301)
Transforms the diagnostic process into a diagnostic chain

9. Chain(s)-of-Knowledge -> https://www.turingpost.com/p/cok
Enhance LLMs by dynamically pulling in external knowledge to improve accuracy and reduce errors

liked 2 models 11 days ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 11 days ago • 182k • • 937

arcee-ai/Arcee-Blitz

Text Generation • Updated 10 days ago • 2.7k • 62

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 13 days ago • 3.89M • • 11k

updated a collection 3 months ago

VisionLLMs

Collection

11 items • Updated Dec 4, 2024 • 1

updated a collection 5 months ago

General Vision Models

Collection

2 items • Updated Oct 9, 2024

reacted to merve's post with 🔥 5 months ago

Post

3797

Meta AI vision has been cooking @facebook
They shipped multiple models and demos for their papers at @ECCV 🤗

Here's a compilation of my top picks:
- Sapiens is family of foundation models for human-centric depth estimation, segmentation and more, all models have open weights and demos 👏

All models have their demos and even torchscript checkpoints!
A collection of models and demos: facebook/sapiens-66d22047daa6402d565cb2fc
- VFusion3D is state-of-the-art consistent 3D generation model from images

Model: facebook/vfusion3d
Demo: facebook/VFusion3D

- CoTracker is the state-of-the-art point (pixel) tracking model

Demo: facebook/cotracker
Model: facebook/cotracker

updated 2 collections 5 months ago

Small LLM's

Collection

8 items • Updated Oct 9, 2024 • 3

General Vision Models

Collection

2 items • Updated Oct 9, 2024

updated a collection 6 months ago

VisionLLMs

Collection

11 items • Updated Dec 4, 2024 • 1