Method to get 8bit quantized model
#1
by
kitaharatomoyo
- opened
Can you tell me how you got the 8-bit quantized model from falcon-7b?
I want to create my own 8-bit quantized model from a finetuned falcon-7b model.
Hi @kitaharatomoyo , you can load it in 8-bit directly from the Hugging Face Hub as below:
from transformers import AutoModelForCausalLM

model_id = "tiiuae/falcon-7b"  # or the path to your finetuned checkpoint

# load_in_8bit requires the bitsandbytes package to be installed
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", load_in_8bit=True, trust_remote_code=True
)

MODEL_SAVE_FOLDER_NAME = "falcon-7b-8bit"
model.save_pretrained(MODEL_SAVE_FOLDER_NAME)
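Note that on newer transformers versions, passing `load_in_8bit=True` directly is deprecated in favor of a `BitsAndBytesConfig` object. A minimal sketch of that pattern, assuming a recent transformers release with bitsandbytes installed (the `quantize_and_save` helper and the save directory name are illustrative, not from the original post):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# The 8-bit option now lives in a quantization config object
# (still backed by the bitsandbytes library under the hood).
quant_config = BitsAndBytesConfig(load_in_8bit=True)

def quantize_and_save(model_id: str, save_dir: str) -> None:
    """Load a checkpoint in 8-bit and save it locally (downloads the weights)."""
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="auto",
        quantization_config=quant_config,
        trust_remote_code=True,
    )
    model.save_pretrained(save_dir)

# Example call (uncomment to run; requires a GPU and disk space for the model):
# quantize_and_save("tiiuae/falcon-7b", "falcon-7b-8bit")
```

This is equivalent to the `load_in_8bit=True` kwarg above, just in the currently recommended form.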