Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
5
10
Miguel Gargallo
miguelgargallo
Follow
21world's profile picture
1 follower
ยท
6 following
https://itamaesan.org
miguelgargallo
miguelgargallo
AI & ML interests
Master in Engineer Informatics, UPC CATALUNYA BARCELONA
Recent Activity
reacted
to
m-ric
's
post
with ๐ฅ
about 1 month ago
A non-Instruct LLM assistant is mostly useless. ๐ง Since it's mostly a model trained to complete text, when you ask it a question like "What to do during a stopover in Paris?", it can just go on and on adding more details to your question instead of answering, which would be valid to complete text from its training corpus, but not to answer questions. โก๏ธ So the post-training stage includes an important Instruction tuning step where you teach your model how to be useful : answer questions, be concise, be polite... RLHF is a well known technique for this. For people interested to understand how this step works, the folks at Adaptive ML have made a great guide! Read it here ๐ https://www.adaptive-ml.com/post/from-zero-to-ppo
reacted
to
m-ric
's
post
with ๐
about 1 month ago
A non-Instruct LLM assistant is mostly useless. ๐ง Since it's mostly a model trained to complete text, when you ask it a question like "What to do during a stopover in Paris?", it can just go on and on adding more details to your question instead of answering, which would be valid to complete text from its training corpus, but not to answer questions. โก๏ธ So the post-training stage includes an important Instruction tuning step where you teach your model how to be useful : answer questions, be concise, be polite... RLHF is a well known technique for this. For people interested to understand how this step works, the folks at Adaptive ML have made a great guide! Read it here ๐ https://www.adaptive-ml.com/post/from-zero-to-ppo
View all activity
Organizations
models
2
Sort:ย Recently updated
miguelgargallo/huggingtweets
Text Generation
โข
Updated
May 2, 2023
โข
15
โข
1
miguelgargallo/chatPylarAI-v0.0.1
Updated
Jan 26, 2023
โข
1
datasets
None public yet