Nguyễn Minh Phúc
DatPySci
AI & ML interests
Reinforcement learning, NLP
Recent Activity
published a dataset 2 days ago: DatPySci/gpt2_dpo_tldr
updated a dataset 7 days ago: DatPySci/tldr_synthetic_llama3_3b_32
published a dataset 7 days ago: DatPySci/tldr_synthetic_llama3_3b_32
Collections
1
models
95
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_72000__tldr
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_72000__tldr
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr
DatPySci/llama3-1b_reward_tldr • Text Classification • 105
DatPySci/EleutherAI_pythia-2.8b-deduped__ipo_pythia-2.8b_beta-0.1__tldr
DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.05__tldr
DatPySci/EleutherAI_pythia-2.8b-deduped__length_IS_pythia-2.8b_beta-0.05__tldr
datasets
55
DatPySci/tldr_synthetic_llama3_3b_32 • 5.47k • 49
DatPySci/llama3_3b_sft_tldr_synthetic • 5.47k • 92
DatPySci/weak_gpt2_large_dpo_hh • 8k • 43
DatPySci/weak_gpt2_medium_dpo_hh • 8k • 44
DatPySci/weak_gpt2_dpo_hh • 8k • 44
DatPySci/Llama-3.2-3B_refine_gpt2-large_tldr • 8k • 77
DatPySci/Llama-3.2-3B_refine_gpt2-medium_tldr • 8k • 78
DatPySci/Llama-3.2-3B_refine_gpt2_tldr • 8k • 69
DatPySci/Llama-3.2-1B_refine_gpt2-large_tldr • 8k • 44
DatPySci/Llama-3.2-1B_refine_gpt2-medium_tldr • 8k • 43