haoran
haorannlp
ยท
AI & ML interests
nlp, language model
Recent Activity
liked
a dataset
4 days ago
decube/synthetic-complex-Text-to-SQL
liked
a dataset
4 days ago
yuyijiong/multi-doc-qa-zh-translated
commented on
a paper
4 days ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification