Jay P's picture

Jay P

jayomb

·

AI & ML interests

None yet

Recent Activity

liked a dataset 30 minutes ago

stallone/glaive-code-assistant-v3

liked a dataset 40 minutes ago

mrcuddle/Human-Like-Alpaca

liked a dataset about 11 hours ago

allenai/olmOCR-mix-0225

View all activity

Organizations

jayomb's activity

upvoted a collection 7 days ago

Synthetic Data and Self-Improvement

70 items • Updated 5 days ago • 6

upvoted 3 papers 7 days ago

Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data

Paper • 2409.00096 • Published Aug 27, 2024 • 1

RNR: Teaching Large Language Models to Follow Roles and Rules

Paper • 2409.13733 • Published Sep 10, 2024 • 1

Response Tuning: Aligning Large Language Models without Instruction

Paper • 2410.02465 • Published Oct 3, 2024 • 13

upvoted a paper 10 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 22 days ago • 194

upvoted a paper 15 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 17 days ago • 32

upvoted a collection 28 days ago

cool datasets

142 items • Updated 3 days ago • 14

upvoted a paper about 2 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted a collection 9 months ago

abliterated-v3

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 115

upvoted a paper 9 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

upvoted a paper 10 months ago

Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

Paper • 2405.06932 • Published May 11, 2024 • 20

upvoted an article 10 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 233

upvoted 2 papers 11 months ago

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 33

upvoted a collection 11 months ago

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 124

upvoted 2 papers 12 months ago

Knowledge Fusion of Large Language Models

Paper • 2401.10491 • Published Jan 19, 2024 • 5

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26, 2024 • 27