Yifan Zhang's picture

Yifan Zhang

yifAI

·

https://github.com/yifanzhang-pro

yifanzhang-pro

AI & ML interests

Language

Recent Activity

authored a paper 6 days ago

A Markov Categorical Framework for Language Modeling

liked a model 7 days ago

openai/gpt-oss-120b

liked a model 7 days ago

openai/gpt-oss-20b

View all activity

Organizations

New activity in math-ai/AutoMathText about 2 months ago

Inquiry About 0-byte Files in data/arxiv/0.90-1.00 Directory

#3 opened 2 months ago by

commented 2 papers 3 months ago

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Paper • 2505.17508 • Published May 23 • 5 •

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Paper • 2402.07625 • Published Feb 12, 2024 • 15 •

New activity in math-ai/amc23 5 months ago

[bot] Conversion to Parquet

#2 opened 6 months ago by

parquet-converter

New activity in math-ai/TemplateGSM 5 months ago

Minor improvements to dataset card

#2 opened 5 months ago by

commented a paper 6 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 38 •

New activity in math-ai/AutoMathText 6 months ago

Add link to paper

#2 opened 6 months ago by

commented a paper 6 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 123 •

New activity in math-ai/amc23 6 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 6 months ago by

commented a paper 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •

New activity in math-ai/olympiadbench 6 months ago

[bot] Conversion to Parquet

#1 opened 6 months ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 6 months ago by

commented 2 papers 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •

commented a paper 7 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89 •

commented a paper 8 months ago

Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published Dec 3, 2024 • 10 •

commented a paper 9 months ago

Training and Evaluating Language Models with Template-based Data Generation

Paper • 2411.18104 • Published Nov 27, 2024 • 3 •

New activity in math-ai/StackMathQA 9 months ago

Update README.md

#4 opened 10 months ago by

commented a paper 10 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9 •

commented a paper 11 months ago

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 14 •