Murat Jumashev

murat

AI & ML interests

NLP, Kyrgyz corpus, sentiment analysis, NER, typo correction

Recent Activity

Organizations

None yet

murat's activity

New activity in facebook/mms-1b 5 days ago
New activity in facebook/seamless-m4t-v2-large 3 months ago

Demo is not working :pray:

#16 opened 3 months ago by
murat
New activity in Nexusflow/Athene-70B 5 months ago
reacted to FremyCompany's post with ❤️ 9 months ago
view post
Post
2250
Today, April 26, is the Day of the Tatar Language! 🌟
To celebrate, we release our new language model, Tweety Tatar 🐣

https://huggingface.co/Tweeties/tweety-tatar-base-7b-2024-v1

The model was converted from Mistral Instruct v0.2 using a novel technique called trans-tokenization. As a result, the model uses a brand-new tokenizer, fully tailored for the Tatar language.

We also release a model which can be finetuned for translation of English or Russian into Tatar, and achieves a performance similar to commercial offerings:

https://huggingface.co/Tweeties/tweety-tatar-hydra-base-7b-2024-v1

More details in our upcoming paper 👀
François REMY, Pieter Delobelle, Alfiya Khabibullina

Татар теле көне белән!
·
New activity in legacy-datasets/mc4 almost 2 years ago

How can I help with a cleanup?

#6 opened almost 2 years ago by
murat
New activity in murat/kyrgyz_language_NER almost 2 years ago

Project continuation

1
#2 opened almost 2 years ago by
alymbeks