基于 https://huggingface.co/THUDM/chatglm2-6b 采用AutoModelForCausalLM.from_pretrained load_in_8bit自量化 上传自用 --- license: apache-2.0 ---