Joao Pedro Silva Dias Moura Mesquita's picture

Joao Pedro Silva Dias Moura Mesquita

inkasaras

·

joaopedrosdmm

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

upvoted a collection 1 day ago

updated a collection 3 days ago

View all activity

Organizations

None yet

inkasaras's activity

upvoted a collection 1 day ago

Step-Audio

Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 1 day ago • 16

upvoted an article 3 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

22 days ago

• 755

upvoted an article 4 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

15 days ago

• 101

upvoted a paper 7 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 9 days ago • 31

upvoted an article 7 days ago

Article

Open R1: Update #2

By

and 6 others •

8 days ago

• 174

upvoted a paper 7 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 11 days ago • 107

upvoted a collection 12 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 12 days ago • 33

upvoted an article 14 days ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

18 days ago

• 34

upvoted a paper 14 days ago

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published 20 days ago • 24

upvoted an article 14 days ago

Article

Open-source DeepResearch – Freeing our search agents

15 days ago

• 1.03k

upvoted an article 20 days ago

Article

Janus Pro: DeepSeek's Revolutionary Multimodal AI Model

By

•

22 days ago

• 31

upvoted a collection 26 days ago

Albertina

Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2

upvoted an article 27 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

27 days ago

• 62

upvoted a collection about 1 month ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 261

upvoted 2 collections about 2 months ago

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 43

upvoted a collection 2 months ago

DeepSeek-VL2

5 items • Updated 10 days ago • 69

upvoted 3 collections 3 months ago

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 29 days ago • 23

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 8 days ago • 59

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 8 days ago • 82