metadata
title: README
emoji: 🐠
colorFrom: gray
colorTo: green
sdk: static
pinned: false

Discord | Paper

Base Models
Yi-6B/9B/34B
Chat Models
Yi-6B/9B/34B Chat
Multimodal Models
Yi-VL-6B/34B

Welcome to Yi! 😘

The Yi model family is a series of language and multimodal models. It is built on 6B and 34B pretrained language models, which are then extended to chat models, 200K long-context models, depth-upscaled models (9B), and vision-language models.

⭐️ Highlights

  • Strong performance: Yi-1.5-34B is on par with or surpasses GPT-3.5 in commonsense reasoning, college exams, math, coding, reading comprehension, and human preference win-rate on multiple evaluation benchmarks.

  • Cost-effective: You can run inference with the 6B, 9B, and 34B models on consumer-grade hardware (such as the RTX 4090). In addition, 34B is large enough to exhibit complex reasoning and emergent abilities, giving a good balance between performance and cost.
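
To make the hardware claim above concrete, here is a rough back-of-the-envelope sketch of how much memory the model weights alone would need at different precisions. It is not an official sizing guide: the parameter counts are the nominal values from the model names, the 24 GB figure is the RTX 4090's published VRAM, and the helper names (`weight_memory_gib`, `fits_on_gpu`) are hypothetical. The estimate ignores the KV cache, activations, and framework overhead, so real usage will be higher.

```python
# Weights-only memory estimate (assumption: ignores KV cache, activations, overhead).
GIB = 1024 ** 3
RTX_4090_VRAM_GIB = 24  # published VRAM of the RTX 4090

# Nominal parameter counts read off the model names (assumption: real counts differ slightly).
MODEL_PARAMS = {"Yi-6B": 6e9, "Yi-9B": 9e9, "Yi-34B": 34e9}

# Bytes needed to store one parameter at common weight precisions.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params, precision):
    """Return the approximate size of the weights in GiB."""
    return num_params * BYTES_PER_PARAM[precision] / GIB


def fits_on_gpu(num_params, precision, vram_gib=RTX_4090_VRAM_GIB):
    """Return True if the weights alone fit in the given amount of VRAM."""
    return weight_memory_gib(num_params, precision) < vram_gib


if __name__ == "__main__":
    for name, params in MODEL_PARAMS.items():
        for precision in BYTES_PER_PARAM:
            size = weight_memory_gib(params, precision)
            verdict = "fits" if fits_on_gpu(params, precision) else "does not fit"
            print(f"{name} @ {precision}: {size:.1f} GiB, {verdict} in {RTX_4090_VRAM_GIB} GiB")
```

Under these assumptions, the 6B and 9B weights fit on a single 24 GB card at 16-bit precision, while the 34B weights fit only with roughly 4-bit quantization (or when split across several cards).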

📊 Benchmarks

TBD

📰 News

  • 2024-03-16: Yi-9B-200K is open-sourced and publicly available.

  • 2024-03-08: The Yi Tech Report is published!

  • 2024-03-07: The long-text capability of Yi-34B-200K has been enhanced.

For the complete news history, see News.