	---
	title: README
	emoji: 🚀
	colorFrom: blue
	colorTo: blue
	sdk: static
	pinned: false
	---

	<style>
	a {
	text-decoration: none; /* Removes the default underline */
	color: inherit; /* Inherits the color from the parent element */
	}

	a:hover, a:focus {
	text-decoration: underline; /* Adds underline on hover or focus for better accessibility */
	}

	.link-text {
	display: inline-block;
	vertical-align: middle;
	font-size: 1.26em;
	margin-right: 10px;
	margin-left: 0px;
	margin-top: 0px;
	margin-bottom: 0px;
	}
	</style>

	<style>
	.common-button {
	background-color: white;
	color: #3c72db;
	border: 2px solid #3c72db;
	padding: 10px 25px;
	border-radius: 0.3em;
	margin: 2px;
	font-size: 1em;
	text-decoration: none;
	}
	</style>

	<link href='https://fonts.googleapis.com/css?family=Source+Sans+Pro' rel='stylesheet' type='text/css'>

	<div align="center">
	<a>
	<img src="https://github.com/imoneoi/openchat/raw/master/assets/logo_new.png" style="width: 55%">
	</a>
	<h1 style="font-size: 1.30em; ">Advancing Open-source Language Models with Mixed-Quality Data</h1>
	</div>

	<p align="center" style="margin-top: 0px;">
	<a href="https://openchat.team">
	<img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
	<span class="link-text">Online Demo</span>
	</a> |
	<a href="https://github.com/imoneoi/openchat">
	<img src="https://camo.githubusercontent.com/4133dc1cd4511d4a292b84ce10e52e4ed92569fb2a8165381c9c47be5edc2796/68747470733a2f2f6564656e742e6769746875622e696f2f537570657254696e7949636f6e732f696d616765732f706e672f6769746875622e706e67" alt="GitHub Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
	<span class="link-text">GitHub</span>
	</a> |
	<a href="https://arxiv.org/pdf/2309.11235.pdf">
	<img src="https://github.com/alpayariyak/openchat/blob/master/assets/arxiv-logomark-small-square-border.png?raw=true" alt="ArXiv Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
	<span class="link-text">Paper</span>
	</a> |
	<a href="https://discord.gg/pQjnXvNKHY">
	<img src="https://cloud.githubusercontent.com/assets/6291467/26705903/96c2d66e-477c-11e7-9f4e-f3c0efe96c9a.png" alt="Discord Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
	<span class="link-text">Discord</span>
	</a>
	</p>


	<hr>
	<div style="background-color: white; padding: 0.7em; border-radius: 0.5em; color: black; display: flex; flex-direction: column; justify-content: center; text-align: center;">
	<a href="https://huggingface.co/openchat/openchat-3.5-1210" style="text-decoration: none; color: black;">
	<span style="font-size: 1.7em; font-family: 'Helvetica'; letter-spacing: 0.1em; font-weight: bold; color: black;">OPENCHAT</span><span style="font-size: 1.8em; font-family: 'Helvetica'; color: #3c72db; ">3.5</span>
	<span style="font-size: 0.7em; font-family: 'Helvetica'; color: white; vertical-align: top; background-color:red; border-radius: 6em; padding: 0.066em 0.4em; letter-spacing: 0.1em; font-weight: bold;">1210</span>
	<span style="font-size: 0.85em; font-family: 'Helvetica'; color: black;">
	<br> 🏆 The Overall Best Performing Open Source 7B Model 🏆
	<br> 🤖 Outperforms <span style="font-weight: bold;">ChatGPT</span> (March) and <span style="font-weight: bold;">Grok-1</span> 🤖
	<br> 🚀 <span style="font-size: 1em; font-family: 'Helvetica'; color: black; font-weight: bold;">15</span>-point improvement in Coding over <span style="font-size: 0.9em; font-family: 'Helvetica'; color: black; font-weight: bold;">OpenChat-3.5</span> 🚀
	<br><br><span style="font-size: 1em; font-family: 'Helvetica'; color: #3c72db; font-weight: bold;">New Features</span>
	<br> 💡 2 Modes: Coding + Generalist, Mathematical Reasoning 💡
	<br> 🧑‍⚖️ Experimental support for Evaluator and Feedback capabilities 🧑‍⚖️
	</span>
	</a>
	</div>

	<div style="display: flex; justify-content: center; align-items: center">
	<img src="https://github.com/alpayariyak/openchat/blob/master/assets/1210bench.png?raw=true" style="width: 100%; border-radius: 1em">
	</div>

	<h1 style="vertical-align: middle;">
	<img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
	</h1>

	- OpenChat is a library of open-source language models fine-tuned with [C-RLFT](https://arxiv.org/pdf/2309.11235.pdf), a strategy inspired by offline reinforcement learning.
	- Our models learn from mixed-quality data without preference labels and deliver performance on par with `ChatGPT`, even at `7B` parameters, a size that can run on a consumer GPU (e.g. RTX 3090).
	- Despite our simple approach, we are committed to developing a high-performance, commercially viable, open-source large language model, and we continue to make significant strides toward this vision.
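
As a rough check on the consumer-GPU point above, the sketch below estimates the memory needed just to hold the model weights at different precisions. It assumes a nominal 7e9 parameters for the `7B` model and 24 GB of VRAM for an RTX 3090; the function and constant names are illustrative, and the estimate ignores activations and the KV cache, which add to the total.

```python
# Back-of-the-envelope weight-memory estimate (illustrative, not a measurement).
NOMINAL_PARAMS = 7e9     # assumption: "7B" taken as exactly 7 billion parameters
RTX_3090_VRAM_GIB = 24   # assumption: 24 GB card, treated as GiB for simplicity

BYTES_PER_PARAM = {"fp32": 4, "fp16/bf16": 2, "int8": 1}


def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to store the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


def fits_on_gpu(num_params: float, bytes_per_param: int, vram_gib: float) -> bool:
    """True if the weights alone fit; activations and KV cache need extra room."""
    return weight_memory_gib(num_params, bytes_per_param) < vram_gib


if __name__ == "__main__":
    for dtype, nbytes in BYTES_PER_PARAM.items():
        gib = weight_memory_gib(NOMINAL_PARAMS, nbytes)
        ok = fits_on_gpu(NOMINAL_PARAMS, nbytes, RTX_3090_VRAM_GIB)
        verdict = "fits" if ok else "does not fit"
        print(f"{dtype:>9}: {gib:5.1f} GiB of weights, {verdict} in {RTX_3090_VRAM_GIB} GiB")
```

Under these assumptions, half-precision weights take about 13 GiB, which leaves headroom on a 24 GB card, while full fp32 weights would not fit.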

	# 📰 News

	- [2023/12/10] We released the [OpenChat-3.5-1210](https://huggingface.co/openchat/openchat-3.5-1210) model, with a 15-point improvement in coding.

	- [2023/11/01] We released the [OpenChat-3.5-7B](https://huggingface.co/openchat/openchat_3.5) model, surpassing ChatGPT on various benchmarks 🔥.

	- [2023/09/21] We released our paper [OpenChat: Advancing Open-source Language Models with Mixed-Quality Data](https://arxiv.org/pdf/2309.11235.pdf).

	# 📊 Benchmarks

	| Model | # Params | Average | MT-Bench | HumanEval | BBH MC | AGIEval | TruthfulQA | MMLU | GSM8K | BBH CoT |
	|--------------------|----------|----------|--------------|-----------------|----------|----------|---------------|--------------|--------------|-------------|
	| OpenChat-3.5-1210 | 7B | 63.8 | 7.76 | 68.9 | 49.5 | 48.0 | 61.8 | 65.3 | 77.3 | 61.8 |
	| OpenChat-3.5 | 7B | 61.6 | 7.81 | 55.5 | 47.6 | 47.4 | 59.1 | 64.3 | 77.3 | 63.5 |
	| ChatGPT (March)* | ? | 61.5 | 7.94 | 48.1 | 47.6 | 47.1 | 57.7 | 67.3 | 74.9 | 70.1 |
	| | | | | | | | | | | |
	| OpenHermes 2.5 | 7B | 59.3 | 7.54 | 48.2 | 49.4 | 46.5 | 57.5 | 63.8 | 73.5 | 59.9 |
	| OpenOrca Mistral | 7B | 52.7 | 6.86 | 38.4 | 49.4 | 42.9 | 45.9 | 59.3 | 59.1 | 58.1 |
	| Zephyr-β^ | 7B | 34.6 | 7.34 | 22.0 | 40.6 | 39.0 | 40.8 | 39.8 | 5.1 | 16.0 |
	| Mistral | 7B | - | 6.84 | 30.5 | 39.0 | 38.0 | - | 60.1 | 52.2 | - |
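
The table does not say how the Average column is computed. The arithmetic below reproduces the listed averages for every row with complete data if MT-Bench (scored out of 10) is multiplied by 10 and then averaged with the seven percentage-based benchmarks. Treat that formula as an inference from the numbers, not a documented definition; the helper name `average_score` is illustrative.

```python
# Each entry: (listed Average, MT-Bench, seven percentage scores in column order:
# HumanEval, BBH MC, AGIEval, TruthfulQA, MMLU, GSM8K, BBH CoT).
ROWS = {
    "OpenChat-3.5-1210": (63.8, 7.76, [68.9, 49.5, 48.0, 61.8, 65.3, 77.3, 61.8]),
    "OpenChat-3.5": (61.6, 7.81, [55.5, 47.6, 47.4, 59.1, 64.3, 77.3, 63.5]),
    "ChatGPT (March)": (61.5, 7.94, [48.1, 47.6, 47.1, 57.7, 67.3, 74.9, 70.1]),
    "OpenHermes 2.5": (59.3, 7.54, [48.2, 49.4, 46.5, 57.5, 63.8, 73.5, 59.9]),
    "OpenOrca Mistral": (52.7, 6.86, [38.4, 49.4, 42.9, 45.9, 59.3, 59.1, 58.1]),
    "Zephyr-β": (34.6, 7.34, [22.0, 40.6, 39.0, 40.8, 39.8, 5.1, 16.0]),
}

# Half a unit in the last printed digit, plus a little floating-point slack.
TOLERANCE = 0.05 + 1e-9


def average_score(mt_bench: float, percent_scores: list[float]) -> float:
    """Assumed formula: rescale MT-Bench from 0-10 to 0-100, then take the mean."""
    scores = [mt_bench * 10] + list(percent_scores)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    for model, (listed, mt_bench, percent_scores) in ROWS.items():
        computed = average_score(mt_bench, percent_scores)
        status = "matches" if abs(computed - listed) <= TOLERANCE else "differs"
        print(f"{model:<18} listed {listed:5.1f}  computed {computed:7.3f}  {status}")
```

The Mistral row is left out because several of its scores are not reported, which is also why its Average is blank.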

	## 𝕏 Comparison with [X.AI Grok](https://x.ai/)

	| Model | License | # Params | Average | MMLU | HumanEval | MATH | GSM8K |
	|-------------------|-------------|---------|----------|------|-----------|----------|----------|
	| OpenChat 3.5 1210 | Apache-2.0 | 7B | 60.1 | 65.3 | 68.9 | 28.9 | 77.3 |
	| OpenChat 3.5 | Apache-2.0 | 7B | 56.4 | 64.3 | 55.5 | 28.6 | 77.3 |
	| Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
	| Grok-1 | Proprietary | ???B | 55.8 | 73 | 63.2 | 23.9 | 62.9 |
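
To make the comparison concrete, the sketch below takes the scores from the table above and prints the per-benchmark margin of OpenChat 3.5 1210 over Grok-1. It also checks that the Average column here is consistent with a plain mean of the four benchmarks, which is an inference from the numbers and not a stated definition. All names in the code are illustrative.

```python
BENCHMARKS = ["MMLU", "HumanEval", "MATH", "GSM8K"]

# Scores copied from the table, in BENCHMARKS order.
SCORES = {
    "OpenChat 3.5 1210": [65.3, 68.9, 28.9, 77.3],
    "OpenChat 3.5": [64.3, 55.5, 28.6, 77.3],
    "Grok-0": [65.7, 39.7, 15.7, 56.8],
    "Grok-1": [73.0, 63.2, 23.9, 62.9],
}
LISTED_AVERAGE = {
    "OpenChat 3.5 1210": 60.1,
    "OpenChat 3.5": 56.4,
    "Grok-0": 44.5,
    "Grok-1": 55.8,
}

# Half a unit in the last printed digit, plus a little floating-point slack.
TOLERANCE = 0.05 + 1e-9


def mean(values: list[float]) -> float:
    return sum(values) / len(values)


def margins(model: str, baseline: str) -> dict[str, float]:
    """Per-benchmark score difference (model minus baseline), in points."""
    return {
        name: SCORES[model][i] - SCORES[baseline][i]
        for i, name in enumerate(BENCHMARKS)
    }


if __name__ == "__main__":
    for name, delta in margins("OpenChat 3.5 1210", "Grok-1").items():
        print(f"{name:<10} {delta:+.1f}")
    for model, listed in LISTED_AVERAGE.items():
        consistent = abs(mean(SCORES[model]) - listed) <= TOLERANCE
        print(f"{model:<18} listed average {listed:.1f} consistent: {consistent}")
```

On these numbers, OpenChat 3.5 1210 leads Grok-1 on HumanEval, MATH, GSM8K, and the four-benchmark average, and trails it on MMLU.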

	# 💌 Contact

	We are a student team working on OpenChat, a project that needs additional computing power and LLM API keys for further development. If you are interested in our project and would like to offer support, please feel free to reach out to us:

	* Wang Guan [[email protected]]
	* Cheng Sijie [[email protected]]
	* Alpay Ariyak [[email protected]]

	We look forward to hearing from you and collaborating on this exciting project!