Qwen/Qwen-7B-Chat-Int8
Tags: Text Generation, Transformers, Safetensors, Chinese, English, qwen, custom_code, 8-bit precision, gptq, arxiv: 5 papers
Commit History for Qwen-7B-Chat-Int8 (branch: main)
update wechat (e531de0), yangapku committed on Dec 13, 2023
update wechat (2aa673f), yangapku committed on Dec 13, 2023
update modeling_qwen.py (c6b0a09), yangapku committed on Dec 7, 2023
update modeling_qwen.py (de89198), yangapku committed on Dec 6, 2023
update modeling_qwen.py (c04bccd), yangapku committed on Dec 4, 2023
update modeling_qwen.py (75cd2af), yangapku committed on Dec 3, 2023
update (01803f3), yangapku committed on Nov 30, 2023
remove fix-sized causal mask (dcef457), yangapku committed on Nov 14, 2023
update wechat.png (1b74d25), yangapku committed on Nov 14, 2023
add kernel file check in modeling_qwen.py (c94803d), yangapku committed on Nov 5, 2023
update modeling.py (24ac14a), yangapku committed on Oct 26, 2023
Upload 3 files (7a7e29c), yangapku committed on Oct 17, 2023
update int8 quantization info (e5b2289), yangapku committed on Oct 17, 2023
update modeling_qwen.py (502a463), yangapku committed on Oct 16, 2023
update batch inference (1241954), yangapku committed on Oct 14, 2023
update default generate hyperparams (5db622a), yangapku committed on Oct 13, 2023
upload model (ce1512e), yangapku committed on Oct 11, 2023
initial commit (8be21a7), yangapku committed on Oct 11, 2023