Fangyuan Yu's picture

Fangyuan Yu PRO

Ksgk-fy

·

fangyuan-ksgk

AI & ML interests

AGI

Recent Activity

updated a model about 18 hours ago

Ksgk-fy/kanji_i2t_finetune

published a model about 21 hours ago

Ksgk-fy/kanji_i2t_finetune

updated a model 4 days ago

Ksgk-fy/kanji_inception_finetune_v4

View all activity

Organizations

Ksgk-fy's activity

upvoted a paper 5 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 9 days ago • 22

upvoted 2 papers 19 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 20 days ago • 81

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 20 days ago • 249

upvoted a paper 25 days ago

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published 28 days ago • 41

upvoted a paper 27 days ago

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 28 days ago • 21

upvoted 3 papers about 1 month ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 15

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Paper • 2412.13180 • Published Dec 17, 2024 • 13

Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers

Paper • 2412.12276 • Published Dec 16, 2024 • 15

upvoted 12 papers about 2 months ago

Human Expertise in Algorithmic Prediction

Paper • 2402.00793 • Published Feb 1, 2024 • 1

AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment

Paper • 2411.10606 • Published Nov 15, 2024 • 1

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Paper • 2412.08542 • Published Dec 11, 2024 • 1

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Paper • 2412.07393 • Published Dec 10, 2024 • 2

Video Token Merging for Long-form Video Understanding

Paper • 2410.23782 • Published Oct 31, 2024 • 2

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 59

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 38

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 21

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 75

Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction

Paper • 2411.14762 • Published Nov 22, 2024 • 11

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Paper • 2406.16218 • Published Jun 23, 2024 • 2

Combining Induction and Transduction for Abstract Reasoning

Paper • 2411.02272 • Published Nov 4, 2024 • 1