TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 58
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 72
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Paper • 2411.06208 • Published Nov 9, 2024 • 19
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published Oct 24, 2024 • 42
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published Oct 24, 2024 • 11
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 136
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published Sep 4, 2024 • 72
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Article • Published Jun 13, 2024 • 45
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 124
Course-Correction: Safety Alignment Using Synthetic Preferences Paper • 2407.16637 • Published Jul 23, 2024 • 26
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 70
Towards Building Specialized Generalist AI with System 1 and System 2 Fusion Paper • 2407.08642 • Published Jul 11, 2024 • 9