-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 76 -
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Paper • 2404.02258 • Published • 105 -
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Paper • 2404.14619 • Published • 127 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 258
HAN JUNGU PRO
JUNGU
AI & ML interests
None yet
Recent Activity
updated
a Space
about 14 hours ago
conanssam/schoolrecord_gen
liked
a model
10 days ago
LGAI-EXAONE/EXAONE-Deep-32B-AWQ
upvoted
a
collection
11 days ago
EXAONE-Deep
Organizations
Collections
3
-
MVDream: Multi-view Diffusion for 3D Generation
Paper • 2308.16512 • Published • 102 -
Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals
Paper • 2308.02510 • Published • 22 -
420
ICON - Clothed Human Digitization
🤼 -
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning
Paper • 2404.16994 • Published • 36
spaces
45
models
20
JUNGU/llama3.1-8b-grpo-test
Updated
•
108
JUNGU/phi-4-Q4-mlx
Text Generation
•
Updated
•
25
JUNGU/Llama-3.1-8b-kr
Updated
JUNGU/lora_model
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0-retry1
Reinforcement Learning
•
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0-retry
Reinforcement Learning
•
Updated
JUNGU/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
JUNGU/Reinforce-CartPole-v1-RETRY
Reinforcement Learning
•
Updated
JUNGU/qlora-koalpaca-polyglot-12.8b-50step
Updated
•
1
JUNGU/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
datasets
None public yet