title: README
emoji: 🐠
colorFrom: gray
colorTo: green
sdk: static
pinned: false

Discord | Paper

  • Base Models (Yi-6B/9B/34B)
  • Chat Models (Yi-6B/9B/34B Chat)
  • Multimodal Models (Yi-VL-6B/34B)

Welcome to Yi! 😘

The Yi model family is a series of language and multimodal models. It is built on 6B and 34B pretrained language models, which were then extended to chat models, 200K long-context models, depth-upscaled models (9B), and vision-language models.

⭐️ Highlights

  • Strong performance: Yi-1.5-34B is on par with or surpasses GPT-3.5 in commonsense reasoning, college exams, math, coding, reading comprehension, and human preference win-rate on multiple evaluation benchmarks.

  • Cost-effective: You can run inference with the 6B, 9B, and 34B models on consumer-grade hardware (like the RTX 4090). Additionally, 34B is large enough to exhibit complex reasoning and emergent abilities, giving a good balance between performance and cost.
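
To make the consumer-hardware point concrete, here is a rough back-of-the-envelope sketch of how much memory the model weights alone would need at different precisions, compared against the 24 GB of VRAM on an RTX 4090. The precisions listed (fp16, int8, int4) and the use of nominal parameter counts (6, 9, 34 billion) are assumptions made for illustration and are not stated in this README. The estimate ignores the KV cache, activations, and framework overhead, so real requirements will be higher.

```python
# Rough, weight-only memory estimate. This is an illustrative sketch, not a
# measurement: it ignores KV cache, activations, and framework overhead.

# Assumed precisions (not taken from the README): bytes needed per parameter.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

# RTX 4090 VRAM in GB.
GPU_MEMORY_GB = 24.0


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billions * BYTES_PER_PARAM[precision]


def fits(params_billions: float, precision: str, gpu_gb: float = GPU_MEMORY_GB) -> bool:
    """True if the weights alone are smaller than the GPU memory budget."""
    return weight_memory_gb(params_billions, precision) < gpu_gb


if __name__ == "__main__":
    # Nominal parameter counts in billions; actual counts differ slightly.
    for size in (6, 9, 34):
        for precision in BYTES_PER_PARAM:
            gb = weight_memory_gb(size, precision)
            verdict = "fits" if fits(size, precision) else "does not fit"
            print(f"{size}B @ {precision}: ~{gb:.1f} GB weights, {verdict} in {GPU_MEMORY_GB:.0f} GB")
```

Under these assumptions, the 6B and 9B weights fit on a single 24 GB card even at fp16, while the 34B weights would need a lower precision such as 4-bit quantization (or multiple GPUs) to fit.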

📊 Benchmarks

TBD

📰 News

  • 2024-03-16: Yi-9B-200K is open-sourced and available to the public.

  • 2024-03-08: The Yi Tech Report is published!

  • 2024-03-07: The long-text capability of Yi-34B-200K has been enhanced.

For the complete news history, see News.