Yuuraa (Yura Choi)

upvoted 3 papers 6 days ago

upvoted 11 papers about 1 month ago

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Paper • 2409.11564 • Published Sep 17 • 19

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18 • 36

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18 • 43

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 73

Language Models Learn to Mislead Humans via RLHF

Paper • 2409.12822 • Published Sep 19 • 9

Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

Paper • 2409.12903 • Published Sep 19 • 21

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19 • 36

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 23

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 99

upvoted 3 papers 3 months ago

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 40

TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation Models

Paper • 2408.11318 • Published Aug 21 • 54

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22 • 62

upvoted an article 4 months ago

How OpenGPT 4o works

Article • Jul 17 • 36

Yuuraa's activity

Decoding Reading Goals from Eye Movements

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Learning Video Representations without Natural Videos
