
	---
	license: other
	pipeline_tag: text-generation
	tags:
	- cortex.cpp
	---

	## Overview
	Marco-o1 focuses not only on disciplines with standard answers, such as mathematics, physics, and coding, which are well suited to reinforcement learning (RL), but also places greater emphasis on open-ended resolutions. We aim to address the question: "Can the o1 model effectively generalize to broader domains where clear standards are absent and rewards are challenging to quantify?"

	Currently, the Marco-o1 large language model (LLM) is powered by Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and innovative reasoning strategies, all optimized for complex real-world problem-solving tasks.

	## Variants

	| No | Variant | Cortex CLI command |
	| --- | --- | --- |
	| 1 | [Marco-o1-8b](https://huggingface.co/cortexso/marco-o1/tree/8b) | `cortex run marco-o1:8b` |

	## Use it with Jan (UI)

	1. Install Jan using [Quickstart](https://jan.ai/docs/quickstart)
	2. Use this model ID in the Jan model Hub:
	```text
	cortexhub/marco-o1
	```

	## Use it with Cortex (CLI)

	1. Install Cortex using [Quickstart](https://cortex.jan.ai/docs/quickstart)
	2. Run the model with the command:
	```bash
	cortex run marco-o1
	```
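
To run a specific variant from the table above, append its tag to the model name. The sketch below wraps that in two small shell functions. `build_run_command` and `run_marco` are hypothetical helper names, not part of Cortex. The only Cortex commands used are the two shown in this README (`cortex run marco-o1` and `cortex run marco-o1:8b`).

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of Cortex): a sketch built only on the
# `cortex run marco-o1[:<variant>]` commands shown in this README.

# Print the run command for an optional variant tag (for example "8b").
build_run_command() {
  local variant="${1:-}"
  if [ -n "$variant" ]; then
    printf 'cortex run marco-o1:%s\n' "$variant"
  else
    printf 'cortex run marco-o1\n'
  fi
}

# Run the model only if the cortex CLI is on PATH; otherwise print the
# command that would have been run.
run_marco() {
  local cmd
  cmd="$(build_run_command "${1:-}")"
  if command -v cortex >/dev/null 2>&1; then
    $cmd   # intentionally unquoted so the string splits into command + args
  else
    echo "cortex not found on PATH; would run: $cmd"
  fi
}
```

For example, `run_marco 8b` starts the 8b variant, and `run_marco` with no argument falls back to the default `cortex run marco-o1`.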

	## Credits

	- Author: AIDC-AI
	- Converter: [Homebrew](https://homebrew.ltd/)
	- Original License: [License](https://huggingface.co/AIDC-AI/Marco-o1/blob/main/LICENSE)
	- Paper: [Paper](https://arxiv.org/abs/2411.14405)