yuyijiong

AI & ML interests

NLP, sentiment analysis, long-context models

Recent Activity

updated a model 17 days ago

opencsg/OpenCSG-Qwen2.5-3B-GUI

updated a model 18 days ago

opencsg/OpenCSG-Qwen2.5-7B-GUI

published a model 19 days ago

opencsg/OpenCSG-Qwen2.5-7B-GUI

yuyijiong's activity

upvoted a paper 2 months ago

OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training

Paper • 2501.08197 • Published Jan 14 • 8

upvoted a collection 3 months ago

high-quality Chinese training datasets

A suite of high-quality Chinese datasets for pretraining, fine-tuning, or preference alignment, along with the models trained on them. • 13 items • Updated 7 days ago • 11

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 354

upvoted a paper 4 months ago

LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 10

upvoted a collection 4 months ago

Chinese pretrain datasets

3 items • Updated Nov 26, 2024 • 1

upvoted a paper 4 months ago

Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published Nov 20, 2024 • 7

upvoted a paper 5 months ago

Hyper-multi-step: The Truth Behind Difficult Long-context Tasks

Paper • 2410.04422 • Published Oct 6, 2024 • 7

upvoted a collection 7 months ago

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 51

upvoted 2 collections 8 months ago

MAmmoTH

The datasets and models for the MAmmoTH project • 9 items • Updated Apr 19, 2024 • 2

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 26 days ago • 218

upvoted a paper 9 months ago

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Paper • 2407.02490 • Published Jul 2, 2024 • 25