Support ollama?

#14
by skju - opened

I converted this model to a .gguf file (and also tried a .gguf file from another link),
but it is not supported in ollama (version 0.3.8):

ollama create : success
ollama run : failed

LG AI Research org
•
edited Sep 9

Here are simple guidelines for using the EXAONE model on ollama:

  1. Download the EXAONE model from HuggingFace, and save it to /path/to/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct.

  2. Llamafy the EXAONE model by referring to one of the following checkpoints.
    - maywell/EXAONE-3.0-7.8B-Instruct-Llamafied
    - CarrotAI/EXAONE-3.0-7.8B-Instruct-Llamafied-cpu
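A quick way to fetch one of these llamafied checkpoints is the huggingface-cli tool; this is only a sketch, and the local directory name is an assumption you should match to the FROM path used in the Modelfile below.

```shell
# Sketch: download the llamafied weights with huggingface-cli
# (install via: pip install -U "huggingface_hub[cli]").
# The --local-dir path below is an example; use your own.
huggingface-cli download maywell/EXAONE-3.0-7.8B-Instruct-Llamafied \
    --local-dir /path/to/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct-Llamafied
```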

  3. Create the EXAONE Modelfile. See https://github.com/ollama/ollama/blob/main/docs/modelfile.md for more information. This is an example of the EXAONE Modelfile.


# Set the base model.
FROM /path/to/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct-Llamafied

# Set the parameter values according to your application.
PARAMETER stop "[|endofturn|]"
PARAMETER num_predict -2
PARAMETER top_k 1

# Set the template.
TEMPLATE """{{ if .System }}[|system|]{{ .System }}[|endofturn|]
{{ end }}{{ if .Prompt }}[|user|]{{ .Prompt }}
{{ end }}[|assistant|]{{ .Response }}[|endofturn|]
"""

# Set the system prompt.
SYSTEM """You are EXAONE model from LG AI Research, a helpful assistant."""

# Set the license.
LICENSE """EXAONE AI Model License Agreement 1.1 - NC """
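To see what that TEMPLATE produces, here is a small Python sketch that assembles the prompt the same way; it is a stand-in for illustration only, not ollama's actual Go-template engine, and the function name is hypothetical.

```python
def render_exaone_prompt(system: str, prompt: str, response: str = "") -> str:
    """Mirror the EXAONE TEMPLATE above: optional [|system|] turn,
    optional [|user|] turn, then the [|assistant|] turn."""
    parts = []
    if system:
        parts.append(f"[|system|]{system}[|endofturn|]\n")
    if prompt:
        parts.append(f"[|user|]{prompt}\n")
    parts.append(f"[|assistant|]{response}[|endofturn|]\n")
    return "".join(parts)

print(render_exaone_prompt(
    "You are EXAONE model from LG AI Research, a helpful assistant.",
    "Hello!",
))
```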
  4. Convert the EXAONE model saved as PyTorch safetensors to ollama. To quantize the EXAONE model, you can add the --quantize flag. Please refer to https://github.com/ollama/ollama/blob/main/docs/import.md for the quantization flag.
    $ ollama create exaone3 -f <the EXAONE Modelfile>
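For example, a quantized variant can be created in one step; the q4_K_M level here is just an example choice, and the Modelfile path is a placeholder.

```shell
# Sketch: create a quantized build, then run it.
ollama create exaone3 --quantize q4_K_M -f ./Modelfile
ollama run exaone3
```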

Good luck to you.

@yireun Thank you for your reply. But when I tried again, the following message occurred.

ollama create : ok
ollama run : failed
"Error: llama runner process has terminated: this model is not supported by your version of Ollama. You may need to upgrade"

I think the llama.cpp library inside ollama should be updated.

LG AI Research org

@skju
Sorry, I missed the step of llamafying the EXAONE model.
I have updated the guidelines posted above. Would you try again?

Thank you, but... ㅜ.ㅜ
I received the "maywell/EXAONE-3.0-7.8B-Instruct-Llamafied" weights
[screenshot: the downloaded model files]

but I got an error message:
Error: no safetensors or torch files found

The files are already there, and the Modelfile path is right!

Please provide a .gguf file for ollama.

LG AI Research org

Would you check your path again?
According to https://github.com/ollama/ollama/blob/4a8069f9c4c8cb761cd6c10ca5f4be6af21fa0ae/cmd/cmd.go#L222,
the error "Error: no safetensors or torch files found" occurs when ollama cannot find any files matching "model*.safetensors".
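That check behaves roughly like this Python sketch, a simplified stand-in for the Go code linked above (the function name is hypothetical):

```python
from pathlib import Path

def has_safetensors(model_dir: str) -> bool:
    # True if the directory contains at least one file matching
    # "model*.safetensors" -- the pattern ollama looks for.
    return any(Path(model_dir).glob("model*.safetensors"))
```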

When I used the maywell/EXAONE-3.0-7.8B-Instruct-Llamafied weights, no errors occurred.

If you use an ollama Docker container, these two paths should point to paths inside the container:
- FROM <the EXAONE-Llamafied model path>
- $ ollama create exaone3 -f <the EXAONE Modelfile>

Good luck to you.

Thank you for your help!
My mistake was the Modelfile path.

I succeeded in loading the EXAONE model in ollama:

ollama create -q q4_K_M EXAONE-3.0 -f EXAONE-3.0-7.8B-Instruct-Llamafied/Modelfile
ollama run EXAONE-3.0:latest
