2024-06-11 19:24:05,913 - INFO: Calling run..
2024-06-11 19:24:05,913 - INFO: Problem Type: text_causal_classification_modeling
2024-06-11 19:24:05,913 - INFO: Global random seed: 849738
2024-06-11 19:24:05,913 - INFO: Preparing the data...
2024-06-11 19:24:05,913 - INFO: Setting up automatic validation split...
2024-06-11 19:24:05,955 - INFO: Preparing train and validation data
2024-06-11 19:24:05,955 - INFO: Loading train dataset...
2024-06-11 19:24:10,161 - INFO: Loading validation dataset...
2024-06-11 19:24:10,266 - INFO: Number of observations in train dataset: 9600
2024-06-11 19:24:10,267 - INFO: Number of observations in validation dataset: 2400
2024-06-11 19:24:10,464 - WARNING: EOS token id not matching between config and tokenizer. Overwriting with tokenizer id.
2024-06-11 19:24:10,465 - WARNING: PAD token id not matching between config and tokenizer. Overwriting with tokenizer id.
2024-06-11 19:24:10,469 - INFO: Using bfloat16 for backbone
2024-06-11 19:24:10,469 - INFO: Loading tiiuae/falcon-rw-1b. This may take a while.
2024-06-11 19:24:22,250 - INFO: Loaded tiiuae/falcon-rw-1b.
2024-06-11 19:24:22,253 - WARNING: EOS token id not matching between generation config and tokenizer. Overwriting with tokenizer id.
2024-06-11 19:24:22,253 - WARNING: PAD token id not matching between generation config and tokenizer. Overwriting with tokenizer id.
2024-06-11 19:24:22,253 - INFO: Lora module names: ['query_key_value', 'dense', 'dense_h_to_4h', 'dense_4h_to_h']
2024-06-11 19:24:22,374 - INFO: Enough space available for saving model weights.Required space: 2591.11MB, Available space: 995995.53MB.
2024-06-11 19:24:22,379 - INFO: Optimizer AdamW has been provided with parameters {'weight_decay': 0.0, 'eps': 1e-08, 'betas': (0.8999999762, 0.9990000129), 'lr': 0.0001}
2024-06-11 19:24:22,520 - WARNING: No order set for keys: ['answer_column_label', 'num_classes'].
2024-06-11 19:24:22,535 - WARNING: No order set for keys: ['answer_column_label', 'num_classes'].
2024-06-11 19:24:23,373 - INFO: started process: 0, can_track: True, tracking_mode: TrackingMode.DURING_EPOCH
2024-06-11 19:24:23,374 - INFO: Training Epoch: 1 / 1
2024-06-11 19:24:23,374 - INFO: train loss: 0%| | 0/600 [00:00