Haitham Bou Ammar

hba123

AI & ML interests

LLMs, VLMs, Robotics, Reinforcement Learning, Bayesian Optimisation

Recent Activity

Articles

Organizations

None yet

hba123's activity

reacted to their post with 🚀 4 days ago
upvoted an article 5 days ago
view article
Article

Accelerating Language Model Inference with Mixture of Attentions

By hba123 •
• 24
posted an update 5 days ago
published an article 5 days ago
view article
Article

Accelerating Language Model Inference with Mixture of Attentions

By hba123 •
• 24
reacted to their post with 🚀 16 days ago
view post
Post
1793
Blindly applying algorithms without understanding the math behind them is not a good idea frmpv. So, I am on a quest to fix this!

I wrote my first hugging face article on how you would derive closed-form solutions for KL-regularised reinforcement learning problems - what is used for DPO.


Check it out: https://huggingface.co/blog/hba123/derivingdpo
posted an update 19 days ago
view post
Post
1793
Blindly applying algorithms without understanding the math behind them is not a good idea frmpv. So, I am on a quest to fix this!

I wrote my first hugging face article on how you would derive closed-form solutions for KL-regularised reinforcement learning problems - what is used for DPO.


Check it out: https://huggingface.co/blog/hba123/derivingdpo
upvoted an article 19 days ago