M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 2024 • 10
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content Paper • 2410.10783 • Published Oct 2024 • 25
Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation Paper • 2407.13696 • Published Jul 18 • 5
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP Paper • 2407.00402 • Published Jun 29 • 22
BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ Article • By xhluca • Jul 9 • 36 (usage sketch after this list)
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7 • 26
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 55 (toy sketch after this list)
Large Language Model Confidence Estimation via Black-Box Access Paper • 2406.04370 • Published Jun 2024 • 19
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Article • Mar 20 • 65
Granite Code Models: A Family of Open Foundation Models for Code Intelligence Paper • 2405.04324 • Published May 7 • 22
Releasing Common Corpus: the largest public domain dataset for training LLMs Article • By Pclanglais • Mar 20 • 17
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Paper • 2402.10210 • Published Feb 15 • 29
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9 • 54
Genie: Achieving Human Parity in Content-Grounded Datasets Generation Paper • 2401.14367 • Published Jan 25 • 6
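The BM25S article above describes a pure-Python BM25 library that keeps dependencies minimal while scoring quickly. A minimal usage sketch, adapted from the library's published quick-start (the corpus and query strings are illustrative; check the bm25s repository for the current API):

```python
import bm25s  # pip install bm25s

# Illustrative corpus; any list of strings works.
corpus = [
    "a cat is a feline and likes to purr",
    "a dog is the human's best friend and loves to play",
    "a bird is a beautiful animal that can fly",
    "a fish is a creature that lives in water and swims",
]

# Tokenize and index the corpus once.
retriever = bm25s.BM25()
retriever.index(bm25s.tokenize(corpus))

# Retrieve the top-k documents for a query; results and scores
# come back as (n_queries, k) arrays.
query = "does the fish purr like a cat?"
results, scores = retriever.retrieve(bm25s.tokenize(query), corpus=corpus, k=2)

for i in range(results.shape[1]):
    print(f"Rank {i + 1} (score {scores[0, i]:.2f}): {results[0, i]}")
```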
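The Mixture-of-Agents paper proposes layering several LLMs so that each layer's models answer the query while conditioning on the previous layer's responses, with a final aggregator model synthesizing one reply. A toy sketch of that layered pattern, not the paper's implementation: the `LLM` callable, the prompt templates, and the echo models below are all illustrative assumptions.

```python
from typing import Callable, List

# Stand-in for any chat-completion call: prompt in, text out.
LLM = Callable[[str], str]

def mixture_of_agents(query: str, layers: List[List[LLM]], aggregator: LLM) -> str:
    answers: List[str] = []
    for models in layers:
        # Each layer sees the previous layer's answers as auxiliary context.
        prompt = query if not answers else (
            query + "\n\nPrevious responses to consider:\n" + "\n---\n".join(answers)
        )
        answers = [model(prompt) for model in models]
    # A final aggregator fuses the last layer's answers into one response.
    final_prompt = query + "\n\nSynthesize these responses:\n" + "\n---\n".join(answers)
    return aggregator(final_prompt)

if __name__ == "__main__":
    # Dummy "models" that just echo, to show the data flow.
    echo = lambda tag: (lambda p: f"[{tag}] answer to: {p.splitlines()[0]}")
    print(mixture_of_agents(
        "What is BM25?",
        layers=[[echo("A"), echo("B")], [echo("C")]],
        aggregator=echo("agg"),
    ))
```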