Std Yossh

yosshstd

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 9 days ago • 144

liked 2 models about 2 months ago

cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese

Text Generation • 15B • Updated Jan 27 • 1.37k • 90

cl-nagoya/ruri-v3-310m

liked a dataset about 2 months ago

msfm/ichikara-instruction-all

Viewer • Updated Dec 11, 2024 • 4.8k • 41 • 1

liked a model 3 months ago

RaphaelMourad/Mistral-Prot-v1-417M

Text Generation • 0.4B • Updated May 30, 2024 • 6 • 1

liked a model 5 months ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated May 16 • 40.8k • 1.53k

updated a model 5 months ago

yosshstd/Transporter-GPT

Updated Mar 23 • 2

published a model 5 months ago

yosshstd/Transporter-GPT

Updated Mar 23 • 2

updated a dataset 5 months ago

yosshstd/transporter_instruction_dataset

Viewer • Updated Mar 21 • 214k • 6

published a dataset 5 months ago

yosshstd/transporter_instruction_dataset

Viewer • Updated Mar 21 • 214k • 6
