Robert Shaw
robertgshaw2
AI & ML interests
None yet
Organizations
Update tokenizer_config.json
5
#3 opened 6 months ago
by
erichartford

What is an actorder group and what are the advantages of running this in vLLM?
4
#1 opened 8 months ago
by
nickandbro
Can I apply a LoRA?
2
#1 opened 8 months ago
by
RonanMcGovern

Nice model, any info on scripts used to quantize?
1
#1 opened 8 months ago
by
RonanMcGovern

How to download the model with transformer library
5
#6 opened 10 months ago
by
Rick10
Update README.md
3
#25 opened 10 months ago
by
robertgshaw2
Issue running on vLLM using FP8
2
#3 opened 10 months ago
by
ffleandro
vllm says the requested model does not exist
2
#1 opened 12 months ago
by
shivams101
Storage format differs from other w4a16 models
2
#2 opened 12 months ago
by
timdettmers

Model weights are not loaded
4
#3 opened 12 months ago
by
MarvelousMouse

Can not be inferenced with vllm openai server
1
#1 opened about 1 year ago
by
jjqsdq
Code example request with vllm
2
#1 opened about 1 year ago
by
ShiningJazz
4bit quantisation does not reduce vram usage.
1
#2 opened about 1 year ago
by
fu-man
How to run Meta-Llama-3-70B-Instruct-FP8 using several devices?
5
#3 opened about 1 year ago
by
Fertel
Reproduction
2
#792 opened about 1 year ago
by
robertgshaw2
Fails to run with nm-vllm
1
#1 opened over 1 year ago
by
clintonruairi
Update chart template
#2 opened over 1 year ago
by
robertgshaw2