SUMMARY MODEL: Model Params Size: 60492288 Model Params Size Formatted: 60.49 M Model Disk Size: 242030465 Model Disk Size Formatted: 242.03 MB TRAINING AND VALIDATION RESULTS: Training batch size: 4 Validation batch size: 8 Total expected epochs: 4 Total expected trainig steps: 15052 Total expected trainig steps 2: 15052 Total trained epochs: 4.0 Total trained steps: 15052 Elapsed time: 7849.268085718155 seconds Elapsed time (formatted): 02:10:49 Total flos: 8148659183026176.0 Total flos (formatted): 8.148659e+15 Best epoch val_loss: 0.5480290651321411 Best model checkpoint: E:/000_Tesis/test_executions/pretrain_utg4java\checkpoint-15052 SUMMARY DATASETS: Loaded Dataset: DatasetDict({ train: Dataset({ features: ['text'], num_rows: 15052 }) valid: Dataset({ features: ['text'], num_rows: 1881 }) test: Dataset({ features: ['text'], num_rows: 1882 }) }) Tokenized Dataset: DatasetDict({ train: Dataset({ features: ['input_ids'], num_rows: 15052 }) valid: Dataset({ features: ['input_ids'], num_rows: 1881 }) test: Dataset({ features: ['input_ids'], num_rows: 1882 }) })