-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 76 -
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Paper • 2404.02258 • Published • 104 -
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Paper • 2404.14619 • Published • 127 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 256
HAN JUNGU
JUNGU
AI & ML interests
None yet
Recent Activity
updated
a model
1 day ago
JUNGU/llama3.1-8b-grpo-test
published
a model
1 day ago
JUNGU/llama3.1-8b-grpo-test
liked
a Space
2 days ago
JUNGU/make-article-agent
Organizations
Collections
3
-
MVDream: Multi-view Diffusion for 3D Generation
Paper • 2308.16512 • Published • 102 -
Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals
Paper • 2308.02510 • Published • 22 -
421
ICON - Clothed Human Digitization
🤼 -
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning
Paper • 2404.16994 • Published • 36
spaces
45
models
20
JUNGU/llama3.1-8b-grpo-test
Updated
•
24
JUNGU/phi-4-Q4-mlx
Text Generation
•
Updated
•
9
JUNGU/Llama-3.1-8b-kr
Updated
JUNGU/lora_model
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0-retry1
Reinforcement Learning
•
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0-retry
Reinforcement Learning
•
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
JUNGU/Reinforce-CartPole-v1-RETRY
Reinforcement Learning
•
Updated
JUNGU/qlora-koalpaca-polyglot-12.8b-50step
Updated
•
1
JUNGU/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
3
datasets
None public yet