126 77 1938

Nicky

NickyNicky

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

HKUSTAudio/Llasa-1B-Multilingual

liked a dataset 1 day ago

Nitral-AI/toxic-qa-style

liked a dataset 1 day ago

AymanTarig/xlam-function-calling-60k-llama-format

View all activity

Organizations

NickyNicky's activity

upvoted an article 2 days ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

•

Sep 27, 2024

• 40

upvoted a paper 3 days ago

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Paper • 2305.17679 • Published May 28, 2023 • 2

upvoted an article 3 days ago

Article

Open-source DeepResearch – Freeing our search agents

6 days ago

• 809

upvoted an article 4 days ago

Article

Welcome to Inference Providers on the Hub 🔥

13 days ago

• 294

upvoted an article 7 days ago

Article

The AI tools for Art Newsletter - Issue 1

10 days ago

• 48

upvoted an article 12 days ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

• 34

upvoted 3 articles 13 days ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

•

16 days ago

• 12

Article

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

•

17 days ago

• 1

Article

We now support VLMs in smolagents!

17 days ago

• 74

upvoted a collection 16 days ago

ProLIP

Collection

Official ProLIP weights • 4 items • Updated Dec 9, 2024 • 6

upvoted 6 articles 17 days ago

Article

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

•

Dec 13, 2024

• 4

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

•

21 days ago

• 13

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

20 days ago

• 34

Article

Yay! Organizations can now publish blog Articles

and 3 others •

20 days ago

• 32

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

17 days ago

• 61

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

20 days ago

• 60

upvoted a paper 17 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 18 days ago • 305

upvoted 2 articles 18 days ago

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Dec 23, 2024

• 39

Article

Faster Text Generation with Self-Speculative Decoding

Nov 20, 2024

• 51

upvoted a paper 23 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 26 days ago • 53