Nikita

PQlet

AI & ML interests

None yet

Organizations

None yet

PQlet's activity

upvoted a paper 2 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 5 days ago • 135

liked a Space 6 days ago

🌌 The Ultra-Scale Playbook

Space • nanotron/ultrascale-playbook • 1.6k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 14 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published 16 days ago • 85

upvoted an article about 1 month ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 558

liked a dataset 2 months ago

roneneldan/TinyStories

Viewer • Updated Aug 12, 2024 • 2.14M • 22.2k • 621

liked 2 models 4 months ago

BAAI/bge-m3

jinaai/jina-embeddings-v2-base-en

Feature Extraction • Updated Jan 6 • 260k • 712

upvoted a paper 4 months ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 201

liked a dataset 4 months ago

laion/laion-coco

Viewer • Updated Jul 14, 2024 • 641M • 2.54k • 79

upvoted an article 4 months ago

Article

Understanding InstaFlow/Rectified Flow

Oct 6, 2023

• 27

liked a dataset 4 months ago

Rowan/hellaswag

Viewer • Updated Sep 28, 2023 • 60k • 256k • 110

upvoted a paper 4 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 18

updated a model 5 months ago

PQlet/SkGenAI-InternVL2_1B-demo

Updated Oct 12, 2024

upvoted a collection 5 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 102 items • Updated 9 days ago • 97

updated 2 models 5 months ago

PQlet/SkGenAI-InternVL2_1B-demobest_model

Updated Oct 5, 2024

PQlet/test1

Updated Oct 3, 2024

liked a model 5 months ago

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated 21 days ago • 89.4k • 62

upvoted an article 5 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 240

upvoted a paper 7 months ago

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13, 2024 • 32

upvoted a paper 8 months ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 80