Xiao Liu

ShawLiu

https://github.com/xiao9905

ShawLiu's activity

upvoted a paper 5 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 9 days ago • 15

updated a model 16 days ago

THUDM/webrl-orm-llama-3.1-8b

Updated 16 days ago • 19

updated a model about 1 month ago

THUDM/webrl-llama-3.1-70b

Updated Nov 12 • 13 • 4

liked a model about 2 months ago

THUDM/webrl-llama-3.1-8b

Updated Nov 6 • 158 • 3

updated a model about 2 months ago

THUDM/webrl-llama-3.1-8b

Updated Nov 6 • 158 • 3

liked a model about 2 months ago

THUDM/webrl-glm-4-9b

Updated Nov 5 • 47 • 8

authored 14 papers about 2 months ago

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

Paper • 2304.05977 • Published Apr 12, 2023 • 1

Self-supervised Learning: Generative or Contrastive

Paper • 2006.08218 • Published Jun 15, 2020

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Paper • 2311.04155 • Published Nov 7, 2023 • 1

Language Models are Open Knowledge Graphs

Paper • 2010.11967 • Published Oct 22, 2020

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Paper • 2311.18702 • Published Nov 30, 2023

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

Paper • 2110.07602 • Published Oct 14, 2021

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions

Paper • 2309.07045 • Published Sep 13, 2023

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Paper • 2308.14508 • Published Aug 28, 2023 • 2

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

Paper • 2402.14672 • Published Feb 22

GraphMAE: Self-Supervised Masked Graph Autoencoders

Paper • 2205.10803 • Published May 22, 2022