Weizhe Yuan

weizhey

AI & ML interests

NLP

weizhey's activity

liked a dataset about 1 month ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 12.3k • 472

updated a dataset about 1 month ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 12.3k • 472

upvoted 2 papers about 2 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 58

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 60

upvoted a paper 4 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48

authored a paper 4 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48

upvoted a paper 4 months ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

authored a paper 6 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 19

authored 8 papers 8 months ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5, 2024 • 29

BARTScore: Evaluating Generated Text as Text Generation

Paper • 2106.11520 • Published Jun 22, 2021 • 2

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

Paper • 2307.13528 • Published Jul 25, 2023

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Paper • 2107.13586 • Published Jul 28, 2021

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

upvoted a paper 8 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

liked a Space 9 months ago

701

Tile Upscaler

🚀

Enhance and upscale images with controlnet guidance

authored a paper 11 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 49

reacted to xiaohk's post with 🤗 about 1 year ago

Post

Hello world!