Berkeley-Nest

non-profit

https://starling.cs.berkeley.edu

AI & ML interests

None defined yet.

Recent Activity

hanlinzhu authored a paper 19 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

natolambert authored a paper about 2 months ago

Objective Mismatch in Model-based Reinforcement Learning

natolambert authored a paper about 2 months ago

Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

berkeley-nest's activity

hanlinzhu

authored a paper 19 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 20 days ago • 13

natolambert

authored 9 papers about 2 months ago

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13

Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence

Paper • 2405.15802 • Published May 17, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 16

natolambert

authored a paper 3 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 59

natolambert

authored a paper 4 months ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20, 2024 • 12

natolambert

authored 2 papers 6 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 78

Self-Directed Synthetic Dialogues and Revisions Technical Report

Paper • 2407.18421 • Published Jul 25, 2024

evan-nexusflow

updated a model 7 months ago

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 19 • 102

banghua

authored a paper 8 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 6

evanfrick

authored a paper 8 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 6

Timmli

authored a paper 8 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 6

banghua

authored a paper 10 months ago

The Effective Horizon Explains Deep RL Performance in Stochastic Environments

Paper • 2312.08369 • Published Dec 13, 2023

hanlinzhu

authored a paper 10 months ago

On Representation Complexity of Model-based and Model-free Reinforcement Learning

Paper • 2310.01706 • Published Oct 3, 2023

Team members 9
