Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Nguyễn Minh Phúc
DatPySci
Follow
Oztobuzz's profile picture
1 follower
·
1 following
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a model
about 2 months ago
DatPySci/Qwen-2.5-7B-Simple-RL
published
a model
about 2 months ago
DatPySci/Qwen-2.5-7B-Simple-RL
published
a model
about 2 months ago
DatPySci/Llama-3.2-3B-sft-mixture
View all activity
Organizations
DatPySci
's models
90
Sort: Recently updated
DatPySci/EleutherAI_pythia-1b-deduped__SNIS_off_policy_0.1_1e-6__tldr
Updated
Aug 23, 2024
DatPySci/EleutherAI_pythia-1b-deduped__off_policy_length_IS_0.05_1e-6__tldr
Updated
Aug 23, 2024
DatPySci/EleutherAI_pythia-1b-deduped__off_policy_clipped_0.05_1e-6__tldr
Updated
Aug 22, 2024
DatPySci/EleutherAI_pythia-1b-deduped__off_policy_length_IS_0.1_1e-6__tldr
Updated
Aug 22, 2024
DatPySci/EleutherAI_pythia-1b-deduped__reward__tldr
Updated
Aug 11, 2024
DatPySci/mistral7b_principle
Text Generation
•
Updated
Jun 10, 2024
•
11
DatPySci/mistral7b_rlcd
Text Generation
•
Updated
Jun 10, 2024
•
10
DatPySci/pythia-2_8b_sft-gpt-turbo
Text Generation
•
Updated
Jun 2, 2024
•
14
DatPySci/pythia-2_8b_sft-gpt4
Text Generation
•
Updated
May 31, 2024
•
11
DatPySci/pythia-2_8b_sft
Text Generation
•
Updated
May 31, 2024
•
12
DatPySci/zephyr-7b-dpo-full
Updated
May 13, 2024
DatPySci/phi-2-sft-full
Updated
Apr 20, 2024
DatPySci/pythia-410m-sft-full
Text Generation
•
Updated
Apr 20, 2024
•
8
DatPySci/zephyr-7b-kto-iter0-des133
Text Generation
•
Updated
Apr 11, 2024
•
11
DatPySci/zephyr-7b-kto-iter0
Text Generation
•
Updated
Apr 9, 2024
•
12
DatPySci/zephyr-7b-kto-iter1
Text Generation
•
Updated
Apr 9, 2024
•
27
DatPySci/tiny-llama-sft
Text Generation
•
Updated
Mar 20, 2024
•
36
DatPySci/tiny-llama-spin-iter2
Text Generation
•
Updated
Mar 17, 2024
•
20
DatPySci/tiny-llama-kto-iter2
Text Generation
•
Updated
Mar 15, 2024
•
13
DatPySci/tiny-llama-kto-iter1
Text Generation
•
Updated
Mar 15, 2024
•
19
DatPySci/tiny-llama-kto-iter0-0.6-epoch1
Updated
Mar 11, 2024
DatPySci/tiny-llama-kto-iter0-0.5-epoch1
Text Generation
•
Updated
Mar 11, 2024
•
29
DatPySci/tiny-llama-kto-iter0-0.3-epoch1
Text Generation
•
Updated
Mar 10, 2024
•
15
DatPySci/tiny-llama-kto-iter0-0.2-epoch1
Text Generation
•
Updated
Mar 10, 2024
•
12
DatPySci/tiny-llama-kto-iter0-0.1-epoch1
Text Generation
•
Updated
Mar 9, 2024
•
39
DatPySci/zephyr-7b-kto-iter0-0.2-epoch1
Updated
Mar 8, 2024
DatPySci/pythia-160m-sft-full
Updated
Mar 1, 2024
DatPySci/pythia-1b-kto-iter0
Text Generation
•
Updated
Feb 27, 2024
•
13
DatPySci/pythia-1b-self-kto-iter1
Text Generation
•
Updated
Feb 27, 2024
•
16
DatPySci/pythia-1b-self-kto-iter0
Text Generation
•
Updated
Feb 26, 2024
•
16
Previous
1
2
3
Next