runtime error
e-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn( Some weights of BertModel were not initialized from the model checkpoint at language-ml-lab/AzerBert and are newly initialized: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight'] You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference. Map: 0%| | 0/271 [00:00<?, ? examples/s][A Map: 0%| | 0/271 [00:00<?, ? examples/s] Traceback (most recent call last): File "/home/user/app/app.py", line 41, in <module> tr.init() File "/home/user/app/train.py", line 39, in init train_dataset = prepare_text_dataset(constitution_text, chunk_size) File "/home/user/app/train.py", line 15, in prepare_text_dataset formatted_dataset = formatted_dataset.map( File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 602, in wrapper out: Union["Dataset", "DatasetDict"] = func(self, *args, **kwargs) File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 567, in wrapper out: Union["Dataset", "DatasetDict"] = func(self, *args, **kwargs) File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3161, in map for rank, done, content in Dataset._map_single(**dataset_kwargs): File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3552, in _map_single batch = apply_function_on_filtered_inputs( File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3421, in apply_function_on_filtered_inputs processed_inputs = function(*fn_args, *additional_args, **fn_kwargs) File "/home/user/app/train.py", line 16, in <lambda> lambda x: tokenizer(x["text"], truncation=True, padding="max_length"), NameError: name 'tokenizer' is not defined. Did you mean: 'BertTokenizer'?
Container logs:
Fetching error logs...