ICML2023
AI & ML interests
None defined yet.
Recent Activity
ICML2023's activity
ameerazam08 posted an update 4 days ago
Post
1588
R1 is out! And with a lot of other R1 related models...
hysts updated a Space 23 days ago
vwxyzjn authored 5 papers 29 days ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper • 2403.17031 • Published • 5
A2C is a special case of PPO
Paper • 2205.09123 • Published
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Paper • 2410.18252 • Published • 5
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 59
2 OLMo 2 Furious
Paper • 2501.00656 • Published • 16
mbrack authored a paper about 1 month ago
Post
7808
Google drops Gemini 2.0 Flash Thinking
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
Kameshr authored a paper about 2 months ago
Post
8798
QwQ-32B-Preview is now available in anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
Post
3929
Post
2888
anychat
supports ChatGPT, Gemini, Perplexity, Claude, Meta Llama, and Grok, all in one app
try it out there: akhaliq/anychat
xzyao authored a paper 3 months ago
Lupin1998 authored 2 papers 4 months ago