LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper β’ 2501.03895 β’ Published Jan 7 β’ 49
CamemBERT 2.0: A Smarter French Language Model Aged to Perfection Paper β’ 2411.08868 β’ Published Nov 13, 2024 β’ 12
Running 22 22 Common Crawl Pipeline Creator πΈ Create and customize a data processing pipeline for Common Crawl data
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper β’ 2410.05993 β’ Published Oct 8, 2024 β’ 108
Running 100 100 TxT360: Trillion Extracted Text π Create a large, deduplicated dataset for LLM pre-training