runtime error
Exit code: 1. Reason:

model-00002-of-00002.safetensors: 100%|██████████| 2.20G/2.20G [00:07<00:00, 307MB/s]
Downloading shards: 100%|██████████| 2/2 [00:18<00:00, 9.48s/it]
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 39945.75it/s]
generation_config.json: 100%|██████████| 242/242 [00:00<00:00, 1.52MB/s]

Traceback (most recent call last):
  File "/home/user/app/app.py", line 578, in <module>
    chat_model_state = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4323, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 498, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
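The ValueError is raised by accelerate's dispatch_model when device_map="auto" ends up assigning every module to disk, which happens when the container has no GPU and not enough free RAM for the checkpoint; the message itself points at accelerate's disk_offload as one option. Below is a minimal sketch of a simpler workaround, assuming the container has no accelerator but enough CPU RAM to hold the weights in half precision. The model id shown is a hypothetical placeholder; app.py sets the real value of model_name.

import torch
from transformers import AutoModelForCausalLM

model_name = "your-org/your-model"  # hypothetical placeholder; use the same id as in app.py

# Dropping device_map="auto" keeps the whole model on CPU, so accelerate never
# builds a device map that tries to offload everything to disk.
chat_model_state = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,   # fp16 roughly halves the RAM footprint vs. fp32
    low_cpu_mem_usage=True,      # avoid allocating a second full copy of the weights while loading
)

Alternatively, running the Space on hardware with a GPU or more RAM lets device_map="auto" place the model without falling back to full disk offload.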