Abdullah Abdelrhim

abdullah · abodacs

AI & ML interests: None yet

abdullah's activity

upvoted a paper 4 days ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2 • 10

upvoted a paper 7 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 48

upvoted an article 9 days ago

Decoding Strategies in Large Language Models

Article • 11 days ago • 34

upvoted 2 articles 13 days ago

Visually Multilingual: Introducing mcdse-2b

Article • 13 days ago • 37

Everything About Long Context Fine-tuning

Article • May 10 • 30

upvoted a collection 14 days ago

LongVU

7 items • Updated 9 days ago • 26

upvoted an article 19 days ago

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Article • 20 days ago • 30

upvoted a paper about 1 month ago

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Paper • 2410.03017 • Published Oct 3 • 25

upvoted 2 collections about 1 month ago

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated 11 days ago • 13

Llama 3.2 3B & 1B GGUF Quants

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated Sep 26 • 46

upvoted an article about 1 month ago

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Article • Sep 27 • 35

upvoted a collection about 1 month ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Sep 26 • 269

upvoted an article about 2 months ago

Document Similarity Search with ColPali

Article • Sep 21 • 46

upvoted 4 papers about 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 125

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16 • 38

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 37

upvoted a collection about 2 months ago

MagpieLM

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated Sep 22 • 15

upvoted 2 papers 2 months ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27 • 13

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23 • 21