Miguel Gargallo

miguelgargallo

AI & ML interests

Master's in Informatics Engineering, UPC (Universitat Politècnica de Catalunya), Barcelona

Recent Activity


Organizations

superdatas, Hugging Face Discord Community

miguelgargallo's activity

reacted to m-ric's post with 🔥👀 about 1 month ago
A non-Instruct LLM assistant is mostly useless. 🧐

Since it's mostly a model trained to complete text, when you ask it a question like "What to do during a stopover in Paris?", it can just go on and on adding more details to your question instead of answering. That would be a valid way to complete text from its training corpus, but not a way to answer questions.

āž”ļø So the post-training stage includes an important Instruction tuning step where you teach your model how to be useful : answer questions, be concise, be polite... RLHF is a well known technique for this.

For people interested in understanding how this step works, the folks at Adaptive ML have made a great guide!

Read it here 👉 https://www.adaptive-ml.com/post/from-zero-to-ppo
updated a Space 7 months ago
updated a Space 9 months ago
updated a Space 11 months ago
liked a Space almost 2 years ago