6 1

nb

ndvb

AI & ML interests

None yet

Recent Activity

commented on an article 10 days ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

updated a model 5 months ago

ndvb/segformer-b0-finetuned-segments-sidewalk-oct-22

updated a collection about 1 year ago

Text to image

View all activity

Organizations

None yet

ndvb's activity

commented on From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning 10 days ago

Do you understand how the reward model is built there? They say it's formed a rule-based on correctness, so is it only applied to prompts taken from math problems and leet-code problems? How were the prompts chosen/generated in the RL phase?

updated a model 5 months ago

ndvb/segformer-b0-finetuned-segments-sidewalk-oct-22

Updated Sep 17, 2024

updated a collection about 1 year ago

Text to image

Collection

1 item • Updated Dec 10, 2023

upvoted a paper about 1 year ago

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

Paper • 2311.10093 • Published Nov 16, 2023 • 58

New activity in tuetschek/e2e_nlg over 1 year ago

Code to run the benchmark

#2 opened over 1 year ago by

ndvb

New activity in evaluate-metric/glue over 1 year ago

What about training?

#2 opened over 1 year ago by

ndvb

New activity in nyu-mll/glue over 1 year ago

Code to run the Glue on Huggingface models?

#11 opened over 1 year ago by

ndvb

New activity in ItbearZhang/facebook-opt-125m-with-alpacadataset over 1 year ago

How can we see the code that does the training?

#2 opened over 1 year ago by

ndvb

Adding `safetensors` variant of this model

#1 opened over 1 year ago by

SFconvertbot

New activity in ramsrigouthamg/t5-large-paraphraser-diverse-high-quality about 2 years ago

How do I export it to torchscript?

#2 opened over 2 years ago by

elavneet