Junxiong Wang's picture

7 4 2

Junxiong Wang PRO

JunxiongWang

·

https://www.cs.cornell.edu/~junxiong/

jxiw

AI & ML interests

Attention Free Model / Subquadratic Language Models

Recent Activity

updated a model 14 days ago

togethercomputer/M1-3B

upvoted a paper about 1 month ago

MambaByte: Token-free Selective State Space Model

upvoted a paper about 1 month ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

View all activity

Organizations

Collections 9

View 9 collections

Papers 4

arxiv:2504.10449

arxiv:2408.15237

arxiv:2401.13660

arxiv:2212.10544

models 51

JunxiongWang/M1-3B

Text Generation • 3B • Updated Apr 16 • 1.35k • 1

JunxiongWang/M1-3B-SFT

Text Generation • 3B • Updated Apr 16 • 6 • 1

JunxiongWang/MambaInLlama1B_SFT_MATH

1B • Updated Feb 11 • 2

JunxiongWang/MambaInLlama3B_SFT_MATH

3B • Updated Feb 7 • 6

JunxiongWang/MambaInLlama3B_DPO2

3B • Updated Feb 5 • 2

JunxiongWang/MambaInLlama3B_DPO1

3B • Updated Feb 5 • 1

JunxiongWang/MambaInLlama3B_Distill_MATH

3B • Updated Jan 27 • 250

JunxiongWang/MambaInLlama3B_v3

3B • Updated Jan 25 • 2

JunxiongWang/MambaInLlama1B_Distill_MATH

1B • Updated Jan 23 • 3

JunxiongWang/mamba_0_5_distill

Updated Dec 25, 2024 • 3

datasets 20

JunxiongWang/QwenFineMATH

Viewer • Updated Jun 18 • 6.71M • 319

JunxiongWang/R1_GR_SFT

Viewer • Updated Apr 16 • 44k • 22

JunxiongWang/R1_SFT

Updated Apr 16 • 48

JunxiongWang/R1_Sythetic_SFT

Viewer • Updated Apr 16 • 1M • 45

JunxiongWang/MATH_SFT

Viewer • Updated Apr 15 • 19.1M • 87

JunxiongWang/R1_OpenThoughts_SFT

Viewer • Updated Apr 7 • 862k • 15

JunxiongWang/R1_am_SFT

Viewer • Updated Apr 1 • 1.4M • 45

JunxiongWang/qwen1b_it_math

Viewer • Updated Feb 15 • 19.1M • 104

JunxiongWang/test_math

Viewer • Updated Feb 3 • 89.1k • 316

JunxiongWang/FineMathV4

Viewer • Updated Jan 29 • 6.7M • 86

View 20 datasets