malteos

https://ostendorff.org

Recent Activity

updated a dataset 1 day ago

malteos/images

updated a Space 2 days ago

malteos/seed-crawl-annotator

updated a dataset 2 days ago

malteos/seed-crawl-urls

Articles

Announcing Occiglot-Fineweb

malteos's activity

upvoted a collection 18 days ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 27 days ago • 62

upvoted an article 7 months ago

Announcing Occiglot-Fineweb

Article • Jun 4 • 6

upvoted a collection 7 months ago

Wikimedia Datasets

Wikimedia datasets, across languages and modalities, from different Wikimedia projects, on the hub. Not all tested. • 19 items • Updated May 16 • 10

upvoted a collection 10 months ago

occiglot-eu5-7b-v0.1

First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. • 10 items • Updated Mar 7 • 21

upvoted a paper 12 months ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79

upvoted a paper about 1 year ago

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning

Paper • 2301.09626 • Published Jan 23, 2023 • 2