Papers4Reading - a omarcevi Collection

omarcevi 's Collections

Papers4Reading

updated about 14 hours ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published 22 days ago • 198
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published 15 days ago • 52
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published 14 days ago • 57
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 10 days ago • 44
Benchmarking and Dissecting the Nvidia Hopper GPU Architecture

Paper • 2402.13499 • Published Feb 21
Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published 7 days ago • 16
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 7 days ago • 96
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published 3 days ago • 52
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 14
Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 159