2024-06-11 18:12:02,678 - INFO: Calling run.. 2024-06-11 18:12:02,678 - INFO: Problem Type: text_causal_classification_modeling 2024-06-11 18:12:02,678 - INFO: Global random seed: 5001 2024-06-11 18:12:02,678 - INFO: Preparing the data... 2024-06-11 18:12:02,679 - INFO: Setting up automatic validation split... 2024-06-11 18:12:02,720 - INFO: Preparing train and validation data 2024-06-11 18:12:02,721 - INFO: Loading train dataset... 2024-06-11 18:12:03,290 - INFO: Loading validation dataset... 2024-06-11 18:12:03,398 - INFO: Number of observations in train dataset: 9600 2024-06-11 18:12:03,399 - INFO: Number of observations in validation dataset: 2400 2024-06-11 18:12:03,566 - WARNING: EOS token id not matching between config and tokenizer. Overwriting with tokenizer id. 2024-06-11 18:12:03,566 - WARNING: PAD token id not matching between config and tokenizer. Overwriting with tokenizer id. 2024-06-11 18:12:03,571 - INFO: Using bfloat16 for backbone 2024-06-11 18:12:03,571 - INFO: Loading tiiuae/falcon-rw-1b. This may take a while. 2024-06-11 18:13:13,609 - INFO: Loaded tiiuae/falcon-rw-1b. 2024-06-11 18:13:13,612 - WARNING: EOS token id not matching between generation config and tokenizer. Overwriting with tokenizer id. 2024-06-11 18:13:13,612 - WARNING: PAD token id not matching between generation config and tokenizer. Overwriting with tokenizer id. 2024-06-11 18:13:13,612 - INFO: Lora module names: ['query_key_value', 'dense', 'dense_h_to_4h', 'dense_4h_to_h'] 2024-06-11 18:13:13,747 - INFO: Enough space available for saving model weights.Required space: 2591.11MB, Available space: 996275.30MB. 2024-06-11 18:13:13,752 - INFO: Optimizer AdamW has been provided with parameters {'weight_decay': 0.0, 'eps': 1e-08, 'betas': (0.8999999762, 0.9990000129), 'lr': 0.0001} 2024-06-11 18:13:13,897 - WARNING: No order set for keys: ['answer_column_label', 'num_classes']. 2024-06-11 18:13:13,912 - WARNING: No order set for keys: ['answer_column_label', 'num_classes']. 2024-06-11 18:13:14,944 - INFO: started process: 0, can_track: True, tracking_mode: TrackingMode.DURING_EPOCH 2024-06-11 18:13:14,945 - INFO: Training Epoch: 1 / 1 2024-06-11 18:13:14,945 - INFO: train loss: 0%| | 0/600 [00:00