Paul Lemaistre PRO

paulml

AI & ML interests

TTS - LLM - Agents | Lead Dev @ TW3 Partners

Recent Activity

liked a model 2 months ago
ahmed-masry/ColFlor
updated a model 3 months ago
paulml/mcdse-2b-v1-AWQ
updated a model 4 months ago
paulml/Qwen2.5-Coder-78B-Instruct
View all activity

Organizations

TW3 Partners's profile picture racine.ai's profile picture AI Starter Pack's profile picture

paulml's activity

New activity in paulml/ECE-ILAB-Q1 6 months ago
New activity in sam-paech/mmlu-pro-nomath-sml 6 months ago
reacted to FeYuan's post with πŸ‘ 6 months ago
view post
Post
4766
Hi everyone,

I am excited to introduce our latest work, LLaMAX. 😁😁😁

LLaMAX is a powerful language model created specifically for multilingual scenarios. Built upon Meta's LLaMA series models, LLaMAX undergoes extensive training across more than 100 languages.

Remarkably, it enhances its multilingual capabilities without compromising its generalization ability, surpassing existing LLMs.

✨Highlights:

🎈 LLaMAX supports the 102 languages covered by Flores-101, and its performance in translating between low-resource languages far surpasses other decoder-only LLMs.

🎈 Even for languages not covered in Flores-200, LLaMAX still shows significant improvements in translation performance.

🎈 By performing simple SFT on English task data, LLaMAX demonstrates impressive multilingual transfer abilities in downstream tasks.

🎈 In our paper, we discuss effective methods for enhancing the multilingual capabilities of LLMs during the continued training phase.

We welcome you to use our model and provide feedback.

More Details:

πŸŽ‰ Code: https://github.com/CONE-MT/LLaMAX/

πŸŽ‰ Model: https://huggingface.co/LLaMAX/
Β·
reacted to clem's post with ❀️ 7 months ago
view post
Post
5781
5,000 new repos (models, datasets, spaces) are created EVERY DAY on HF now. The community is amazing!
New activity in paulml/DPOB-INMTOB-7B 11 months ago

Great model

2
#4 opened 11 months ago by
RedLeader721
New activity in yam-peleg/Experiment26-7B 11 months ago

What a score!

#2 opened 11 months ago by
paulml
New activity in CultriX/NeuralTrix-7B-dpo 11 months ago