Ali Bidaran

alibidaran

AI & ML interests

LLMs, Computer Vision, Generative AI, NLP, Machine/Deep Learning, Reinforcement Learning

Recent Activity

liked a dataset about 14 hours ago

angie-chen55/python-github-code

liked a dataset about 14 hours ago

dipesh/python-code-ds-mini

reacted to sergiopaniego's post with 👍 14 days ago

Just included example scripts for aligning models using GSPO (including VLM example) 🙆‍♂️🙆‍♂️ GSPO is the latest RL alignment algo by @Alibaba_Qwen and it's already supported in the latest TRL v0.20 release. Super-easy-to-get-started example scripts below, GO run them!👩‍💻👩‍💻

🧑‍🎨 Script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo.py

🦄 VLM script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo_vlm.py

🧩 More TRL examples: https://huggingface.co/docs/trl/main/en/example_overview

🧙‍♂️ GSPO paper: https://huggingface.co/papers/2507.18071

Organizations

None yet

New activity in alibidaran/Gemma2_Farsi 8 months ago

fine tuning

#2 opened 8 months ago by

arshiaafshani

New activity in alibidaran/Mental_health_detection about 1 year ago

License?

#2 opened about 1 year ago by

nofarb17

New activity in alibidaran/Gemma2_Farsi over 1 year ago

Samples aren't working

#1 opened over 1 year ago by

hossainiir

New activity in mostafaamiri/persian_llama_7B_merged over 1 year ago

Questions for training

#1 opened over 1 year ago by

alibidaran

New activity in alibidaran/llama-2-7b-virtual_doctor over 1 year ago

What dataset did you used for fine tuning ?

#1 opened over 1 year ago by

AiModelsMarket

New activity in alibidaran/medical_transcription_generator over 1 year ago

How to fine tuning ?

#3 opened over 1 year ago by

cuongtk2002

New activity in alibidaran/medical_transcription_generator about 2 years ago

Adding `safetensors` variant of this model

#1 opened about 2 years ago by

alibidaran
