Max's picture

3 3

Max

ZhMax

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

OpenCoder-LLM/opc-sft-stage2

liked a dataset about 2 months ago

nvidia/OpenCodeGeneticInstruct

upvoted a paper 3 months ago

Risk-Averse Reinforcement Learning with Itakura-Saito Loss

View all activity

Organizations

None yet

upvoted a paper 3 months ago

Risk-Averse Reinforcement Learning with Itakura-Saito Loss

Paper • 2505.16925 • Published May 22 • 26

upvoted a paper 12 months ago

GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs

Paper • 2408.15300 • Published Aug 27, 2024 • 3

upvoted an article over 1 year ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 239