Yuan He's picture

Yuan He

heyuan

·

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

OpenDriveLab/Vista

liked a model 2 months ago

briaai/RMBG-2.0

liked a model 2 months ago

si-pbc/hertz-dev

View all activity

Organizations

heyuan's activity

upvoted 2 papers 8 months ago

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7, 2024 • 68

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 29

upvoted 3 papers 9 months ago

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 31

NExT: Teaching Large Language Models to Reason about Code Execution

Paper • 2404.14662 • Published Apr 23, 2024 • 4

Wukong: Towards a Scaling Law for Large-Scale Recommendation

Paper • 2403.02545 • Published Mar 4, 2024 • 17

upvoted 6 papers 11 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 126

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 54

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 24

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

Paper • 2311.07138 • Published Nov 13, 2023 • 2

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 31

upvoted a collection 11 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 226

upvoted 3 papers about 1 year ago

DocGraphLM: Documental Graph Language Model for Information Extraction

Paper • 2401.02823 • Published Jan 5, 2024 • 36

City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web

Paper • 2312.16457 • Published Dec 27, 2023 • 14

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Paper • 2311.10794 • Published Nov 17, 2023 • 25

upvoted a collection about 1 year ago

Zephyr 7B

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 147