lahirum committed on
Commit 8bbd0ac
1 Parent(s): 94befc5

Upload folder using huggingface_hub

Files changed (3)
  1. README.md +1 -1
  2. replay.mp4 +2 -2
  3. sf_log.txt +380 -0
README.md CHANGED
@@ -15,7 +15,7 @@ model-index:
15
  type: doom_health_gathering_supreme
16
  metrics:
17
  - type: mean_reward
18
- value: 9.05 +/- 4.30
18
+ value: 9.93 +/- 4.61
19
  name: mean_reward
20
  verified: false
21
  ---
replay.mp4 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c5eeae3e9e853c5717bdda01fa509af78baf4b7f7af2d3df9b1069df39fa24e2
2
+ oid sha256:7e98b913f21245b8f3b491a6d7b7242865e392ec317123bb9c0952cf4a35c48b
3
- size 17258602
3
+ size 18314288
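Note on the replay.mp4 change: the file is stored with Git LFS, so the diff touches only the three-line pointer (spec version, SHA-256 object ID, size in bytes), not the video itself. A minimal sketch of how such a pointer is derived from file contents follows. The helper name `lfs_pointer` is hypothetical, and the sample bytes stand in for the real video, which is not reproduced here.

```python
import hashlib


def lfs_pointer(data: bytes) -> str:
    """Build a Git LFS v1 pointer for the given file contents.

    Hypothetical helper for illustration: a real tool would stream a
    large file in chunks instead of holding it all in memory.
    """
    oid = hashlib.sha256(data).hexdigest()  # object ID is the SHA-256 of the content
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {len(data)}\n"  # size is the byte length of the content
    )


# Stand-in content; the real replay.mp4 is about 18 MB.
pointer = lfs_pointer(b"hello")
print(pointer)
```

Any change to the video bytes changes both the `oid` and, usually, the `size` line, which is what the two modified lines in this hunk reflect.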
sf_log.txt CHANGED
@@ -701,3 +701,383 @@ main_loop: 242.3498
701
  [2024-11-09 15:33:46,073][00359] Avg episode rewards: #0: 19.548, true rewards: #0: 9.048
702
  [2024-11-09 15:33:46,074][00359] Avg episode reward: 19.548, avg true_objective: 9.048
703
  [2024-11-09 15:34:07,689][00359] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
704
+ [2024-11-09 15:34:13,175][00359] The model has been pushed to https://huggingface.co/lahirum/rl_course_vizdoom_health_gathering_supreme
705
+ [2024-11-09 15:35:07,244][00359] Loading legacy config file train_dir/doom_health_gathering_supreme_2222/cfg.json instead of train_dir/doom_health_gathering_supreme_2222/config.json
706
+ [2024-11-09 15:35:07,246][00359] Loading existing experiment configuration from train_dir/doom_health_gathering_supreme_2222/config.json
707
+ [2024-11-09 15:35:07,247][00359] Overriding arg 'experiment' with value 'doom_health_gathering_supreme_2222' passed from command line
708
+ [2024-11-09 15:35:07,249][00359] Overriding arg 'train_dir' with value 'train_dir' passed from command line
709
+ [2024-11-09 15:35:07,250][00359] Overriding arg 'num_workers' with value 1 passed from command line
710
+ [2024-11-09 15:35:07,251][00359] Adding new argument 'lr_adaptive_min'=1e-06 that is not in the saved config file!
711
+ [2024-11-09 15:35:07,253][00359] Adding new argument 'lr_adaptive_max'=0.01 that is not in the saved config file!
712
+ [2024-11-09 15:35:07,254][00359] Adding new argument 'env_gpu_observations'=True that is not in the saved config file!
713
+ [2024-11-09 15:35:07,255][00359] Adding new argument 'no_render'=True that is not in the saved config file!
714
+ [2024-11-09 15:35:07,256][00359] Adding new argument 'save_video'=True that is not in the saved config file!
715
+ [2024-11-09 15:35:07,258][00359] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
716
+ [2024-11-09 15:35:07,259][00359] Adding new argument 'video_name'=None that is not in the saved config file!
717
+ [2024-11-09 15:35:07,260][00359] Adding new argument 'max_num_frames'=1000000000.0 that is not in the saved config file!
718
+ [2024-11-09 15:35:07,261][00359] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
719
+ [2024-11-09 15:35:07,262][00359] Adding new argument 'push_to_hub'=False that is not in the saved config file!
720
+ [2024-11-09 15:35:07,264][00359] Adding new argument 'hf_repository'=None that is not in the saved config file!
721
+ [2024-11-09 15:35:07,265][00359] Adding new argument 'policy_index'=0 that is not in the saved config file!
722
+ [2024-11-09 15:35:07,266][00359] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
723
+ [2024-11-09 15:35:07,267][00359] Adding new argument 'train_script'=None that is not in the saved config file!
724
+ [2024-11-09 15:35:07,269][00359] Adding new argument 'enjoy_script'=None that is not in the saved config file!
725
+ [2024-11-09 15:35:07,270][00359] Using frameskip 1 and render_action_repeat=4 for evaluation
726
+ [2024-11-09 15:35:07,294][00359] RunningMeanStd input shape: (3, 72, 128)
727
+ [2024-11-09 15:35:07,296][00359] RunningMeanStd input shape: (1,)
728
+ [2024-11-09 15:35:07,310][00359] ConvEncoder: input_channels=3
729
+ [2024-11-09 15:35:07,361][00359] Conv encoder output size: 512
730
+ [2024-11-09 15:35:07,363][00359] Policy head output size: 512
731
+ [2024-11-09 15:35:07,386][00359] Loading state from checkpoint train_dir/doom_health_gathering_supreme_2222/checkpoint_p0/checkpoint_000539850_4422451200.pth...
732
+ [2024-11-09 15:35:07,818][00359] Num frames 100...
733
+ [2024-11-09 15:35:07,941][00359] Num frames 200...
734
+ [2024-11-09 15:35:08,063][00359] Num frames 300...
735
+ [2024-11-09 15:35:08,183][00359] Num frames 400...
736
+ [2024-11-09 15:35:08,305][00359] Num frames 500...
737
+ [2024-11-09 15:35:08,428][00359] Num frames 600...
738
+ [2024-11-09 15:35:08,549][00359] Num frames 700...
739
+ [2024-11-09 15:35:08,672][00359] Num frames 800...
740
+ [2024-11-09 15:35:08,795][00359] Num frames 900...
741
+ [2024-11-09 15:35:08,918][00359] Num frames 1000...
742
+ [2024-11-09 15:35:09,040][00359] Num frames 1100...
743
+ [2024-11-09 15:35:09,161][00359] Num frames 1200...
744
+ [2024-11-09 15:35:09,282][00359] Num frames 1300...
745
+ [2024-11-09 15:35:09,406][00359] Num frames 1400...
746
+ [2024-11-09 15:35:09,532][00359] Num frames 1500...
747
+ [2024-11-09 15:35:09,659][00359] Num frames 1600...
748
+ [2024-11-09 15:35:09,792][00359] Num frames 1700...
749
+ [2024-11-09 15:35:09,932][00359] Num frames 1800...
750
+ [2024-11-09 15:35:10,059][00359] Num frames 1900...
751
+ [2024-11-09 15:35:10,220][00359] Avg episode rewards: #0: 61.839, true rewards: #0: 19.840
752
+ [2024-11-09 15:35:10,222][00359] Avg episode reward: 61.839, avg true_objective: 19.840
753
+ [2024-11-09 15:35:10,245][00359] Num frames 2000...
754
+ [2024-11-09 15:35:10,370][00359] Num frames 2100...
755
+ [2024-11-09 15:35:10,500][00359] Num frames 2200...
756
+ [2024-11-09 15:35:10,631][00359] Num frames 2300...
757
+ [2024-11-09 15:35:10,764][00359] Num frames 2400...
758
+ [2024-11-09 15:35:10,894][00359] Num frames 2500...
759
+ [2024-11-09 15:35:11,024][00359] Num frames 2600...
760
+ [2024-11-09 15:35:11,155][00359] Num frames 2700...
761
+ [2024-11-09 15:35:11,282][00359] Num frames 2800...
762
+ [2024-11-09 15:35:11,433][00359] Num frames 2900...
763
+ [2024-11-09 15:35:11,563][00359] Num frames 3000...
764
+ [2024-11-09 15:35:11,691][00359] Num frames 3100...
765
+ [2024-11-09 15:35:11,816][00359] Num frames 3200...
766
+ [2024-11-09 15:35:11,943][00359] Num frames 3300...
767
+ [2024-11-09 15:35:12,067][00359] Num frames 3400...
768
+ [2024-11-09 15:35:12,190][00359] Num frames 3500...
769
+ [2024-11-09 15:35:12,315][00359] Num frames 3600...
770
+ [2024-11-09 15:35:12,440][00359] Num frames 3700...
771
+ [2024-11-09 15:35:12,568][00359] Num frames 3800...
772
+ [2024-11-09 15:35:12,691][00359] Num frames 3900...
773
+ [2024-11-09 15:35:12,815][00359] Num frames 4000...
774
+ [2024-11-09 15:35:12,977][00359] Avg episode rewards: #0: 61.919, true rewards: #0: 20.420
775
+ [2024-11-09 15:35:12,980][00359] Avg episode reward: 61.919, avg true_objective: 20.420
776
+ [2024-11-09 15:35:13,001][00359] Num frames 4100...
777
+ [2024-11-09 15:35:13,125][00359] Num frames 4200...
778
+ [2024-11-09 15:35:13,246][00359] Num frames 4300...
779
+ [2024-11-09 15:35:13,371][00359] Num frames 4400...
780
+ [2024-11-09 15:35:13,494][00359] Num frames 4500...
781
+ [2024-11-09 15:35:13,618][00359] Num frames 4600...
782
+ [2024-11-09 15:35:13,743][00359] Num frames 4700...
783
+ [2024-11-09 15:35:13,868][00359] Num frames 4800...
784
+ [2024-11-09 15:35:13,993][00359] Num frames 4900...
785
+ [2024-11-09 15:35:14,121][00359] Num frames 5000...
786
+ [2024-11-09 15:35:14,244][00359] Num frames 5100...
787
+ [2024-11-09 15:35:14,368][00359] Num frames 5200...
788
+ [2024-11-09 15:35:14,492][00359] Num frames 5300...
789
+ [2024-11-09 15:35:14,618][00359] Num frames 5400...
790
+ [2024-11-09 15:35:14,744][00359] Num frames 5500...
791
+ [2024-11-09 15:35:14,868][00359] Num frames 5600...
792
+ [2024-11-09 15:35:14,992][00359] Num frames 5700...
793
+ [2024-11-09 15:35:15,119][00359] Num frames 5800...
794
+ [2024-11-09 15:35:15,245][00359] Num frames 5900...
795
+ [2024-11-09 15:35:15,368][00359] Num frames 6000...
796
+ [2024-11-09 15:35:15,494][00359] Num frames 6100...
797
+ [2024-11-09 15:35:15,654][00359] Avg episode rewards: #0: 62.279, true rewards: #0: 20.613
798
+ [2024-11-09 15:35:15,656][00359] Avg episode reward: 62.279, avg true_objective: 20.613
799
+ [2024-11-09 15:35:15,678][00359] Num frames 6200...
800
+ [2024-11-09 15:35:15,803][00359] Num frames 6300...
801
+ [2024-11-09 15:35:15,928][00359] Num frames 6400...
802
+ [2024-11-09 15:35:16,051][00359] Num frames 6500...
803
+ [2024-11-09 15:35:16,174][00359] Num frames 6600...
804
+ [2024-11-09 15:35:16,297][00359] Num frames 6700...
805
+ [2024-11-09 15:35:16,420][00359] Num frames 6800...
806
+ [2024-11-09 15:35:16,547][00359] Num frames 6900...
807
+ [2024-11-09 15:35:16,670][00359] Num frames 7000...
808
+ [2024-11-09 15:35:16,797][00359] Num frames 7100...
809
+ [2024-11-09 15:35:16,923][00359] Num frames 7200...
810
+ [2024-11-09 15:35:17,048][00359] Num frames 7300...
811
+ [2024-11-09 15:35:17,172][00359] Num frames 7400...
812
+ [2024-11-09 15:35:17,297][00359] Num frames 7500...
813
+ [2024-11-09 15:35:17,422][00359] Num frames 7600...
814
+ [2024-11-09 15:35:17,547][00359] Num frames 7700...
815
+ [2024-11-09 15:35:17,670][00359] Num frames 7800...
816
+ [2024-11-09 15:35:17,799][00359] Num frames 7900...
817
+ [2024-11-09 15:35:17,925][00359] Num frames 8000...
818
+ [2024-11-09 15:35:18,050][00359] Num frames 8100...
819
+ [2024-11-09 15:35:18,175][00359] Num frames 8200...
820
+ [2024-11-09 15:35:18,335][00359] Avg episode rewards: #0: 61.959, true rewards: #0: 20.710
821
+ [2024-11-09 15:35:18,337][00359] Avg episode reward: 61.959, avg true_objective: 20.710
822
+ [2024-11-09 15:35:18,359][00359] Num frames 8300...
823
+ [2024-11-09 15:35:18,484][00359] Num frames 8400...
824
+ [2024-11-09 15:35:18,607][00359] Num frames 8500...
825
+ [2024-11-09 15:35:18,732][00359] Num frames 8600...
826
+ [2024-11-09 15:35:18,858][00359] Num frames 8700...
827
+ [2024-11-09 15:35:18,983][00359] Num frames 8800...
828
+ [2024-11-09 15:35:19,106][00359] Num frames 8900...
829
+ [2024-11-09 15:35:19,230][00359] Num frames 9000...
830
+ [2024-11-09 15:35:19,355][00359] Num frames 9100...
831
+ [2024-11-09 15:35:19,478][00359] Num frames 9200...
832
+ [2024-11-09 15:35:19,601][00359] Num frames 9300...
833
+ [2024-11-09 15:35:19,726][00359] Num frames 9400...
834
+ [2024-11-09 15:35:19,855][00359] Num frames 9500...
835
+ [2024-11-09 15:35:19,982][00359] Num frames 9600...
836
+ [2024-11-09 15:35:20,107][00359] Num frames 9700...
837
+ [2024-11-09 15:35:20,234][00359] Num frames 9800...
838
+ [2024-11-09 15:35:20,363][00359] Num frames 9900...
839
+ [2024-11-09 15:35:20,492][00359] Num frames 10000...
840
+ [2024-11-09 15:35:20,617][00359] Num frames 10100...
841
+ [2024-11-09 15:35:20,744][00359] Num frames 10200...
842
+ [2024-11-09 15:35:20,871][00359] Num frames 10300...
843
+ [2024-11-09 15:35:21,032][00359] Avg episode rewards: #0: 62.167, true rewards: #0: 20.768
844
+ [2024-11-09 15:35:21,035][00359] Avg episode reward: 62.167, avg true_objective: 20.768
845
+ [2024-11-09 15:35:21,058][00359] Num frames 10400...
846
+ [2024-11-09 15:35:21,183][00359] Num frames 10500...
847
+ [2024-11-09 15:35:21,309][00359] Num frames 10600...
848
+ [2024-11-09 15:35:21,435][00359] Num frames 10700...
849
+ [2024-11-09 15:35:21,563][00359] Num frames 10800...
850
+ [2024-11-09 15:35:21,691][00359] Num frames 10900...
851
+ [2024-11-09 15:35:21,823][00359] Num frames 11000...
852
+ [2024-11-09 15:35:21,953][00359] Num frames 11100...
853
+ [2024-11-09 15:35:22,081][00359] Num frames 11200...
854
+ [2024-11-09 15:35:22,213][00359] Num frames 11300...
855
+ [2024-11-09 15:35:22,343][00359] Num frames 11400...
856
+ [2024-11-09 15:35:22,479][00359] Num frames 11500...
857
+ [2024-11-09 15:35:22,614][00359] Num frames 11600...
858
+ [2024-11-09 15:35:22,745][00359] Num frames 11700...
859
+ [2024-11-09 15:35:22,881][00359] Num frames 11800...
860
+ [2024-11-09 15:35:23,016][00359] Num frames 11900...
861
+ [2024-11-09 15:35:23,146][00359] Num frames 12000...
862
+ [2024-11-09 15:35:23,271][00359] Num frames 12100...
863
+ [2024-11-09 15:35:23,399][00359] Num frames 12200...
864
+ [2024-11-09 15:35:23,525][00359] Num frames 12300...
865
+ [2024-11-09 15:35:23,649][00359] Num frames 12400...
866
+ [2024-11-09 15:35:23,808][00359] Avg episode rewards: #0: 62.805, true rewards: #0: 20.807
867
+ [2024-11-09 15:35:23,810][00359] Avg episode reward: 62.805, avg true_objective: 20.807
868
+ [2024-11-09 15:35:23,833][00359] Num frames 12500...
869
+ [2024-11-09 15:35:23,959][00359] Num frames 12600...
870
+ [2024-11-09 15:35:24,083][00359] Num frames 12700...
871
+ [2024-11-09 15:35:24,209][00359] Num frames 12800...
872
+ [2024-11-09 15:35:24,336][00359] Num frames 12900...
873
+ [2024-11-09 15:35:24,462][00359] Num frames 13000...
874
+ [2024-11-09 15:35:24,587][00359] Num frames 13100...
875
+ [2024-11-09 15:35:24,712][00359] Num frames 13200...
876
+ [2024-11-09 15:35:24,836][00359] Num frames 13300...
877
+ [2024-11-09 15:35:24,963][00359] Num frames 13400...
878
+ [2024-11-09 15:35:25,088][00359] Num frames 13500...
879
+ [2024-11-09 15:35:25,213][00359] Num frames 13600...
880
+ [2024-11-09 15:35:25,337][00359] Num frames 13700...
881
+ [2024-11-09 15:35:25,464][00359] Num frames 13800...
882
+ [2024-11-09 15:35:25,591][00359] Num frames 13900...
883
+ [2024-11-09 15:35:25,716][00359] Num frames 14000...
884
+ [2024-11-09 15:35:25,842][00359] Num frames 14100...
885
+ [2024-11-09 15:35:25,967][00359] Num frames 14200...
886
+ [2024-11-09 15:35:26,095][00359] Num frames 14300...
887
+ [2024-11-09 15:35:26,219][00359] Num frames 14400...
888
+ [2024-11-09 15:35:26,345][00359] Num frames 14500...
889
+ [2024-11-09 15:35:26,507][00359] Avg episode rewards: #0: 63.404, true rewards: #0: 20.834
890
+ [2024-11-09 15:35:26,508][00359] Avg episode reward: 63.404, avg true_objective: 20.834
891
+ [2024-11-09 15:35:26,532][00359] Num frames 14600...
892
+ [2024-11-09 15:35:26,663][00359] Num frames 14700...
893
+ [2024-11-09 15:35:26,794][00359] Num frames 14800...
894
+ [2024-11-09 15:35:26,926][00359] Num frames 14900...
895
+ [2024-11-09 15:35:27,056][00359] Num frames 15000...
896
+ [2024-11-09 15:35:27,187][00359] Num frames 15100...
897
+ [2024-11-09 15:35:27,319][00359] Num frames 15200...
898
+ [2024-11-09 15:35:27,450][00359] Num frames 15300...
899
+ [2024-11-09 15:35:27,583][00359] Num frames 15400...
900
+ [2024-11-09 15:35:27,712][00359] Num frames 15500...
901
+ [2024-11-09 15:35:27,835][00359] Num frames 15600...
902
+ [2024-11-09 15:35:27,960][00359] Num frames 15700...
903
+ [2024-11-09 15:35:28,086][00359] Num frames 15800...
904
+ [2024-11-09 15:35:28,210][00359] Num frames 15900...
905
+ [2024-11-09 15:35:28,332][00359] Num frames 16000...
906
+ [2024-11-09 15:35:28,457][00359] Num frames 16100...
907
+ [2024-11-09 15:35:28,582][00359] Num frames 16200...
908
+ [2024-11-09 15:35:28,709][00359] Num frames 16300...
909
+ [2024-11-09 15:35:28,836][00359] Num frames 16400...
910
+ [2024-11-09 15:35:28,934][00359] Avg episode rewards: #0: 62.665, true rewards: #0: 20.541
911
+ [2024-11-09 15:35:28,936][00359] Avg episode reward: 62.665, avg true_objective: 20.541
912
+ [2024-11-09 15:35:29,019][00359] Num frames 16500...
913
+ [2024-11-09 15:35:29,144][00359] Num frames 16600...
914
+ [2024-11-09 15:35:29,273][00359] Num frames 16700...
915
+ [2024-11-09 15:35:29,404][00359] Num frames 16800...
916
+ [2024-11-09 15:35:29,536][00359] Num frames 16900...
917
+ [2024-11-09 15:35:29,664][00359] Num frames 17000...
918
+ [2024-11-09 15:35:29,788][00359] Num frames 17100...
919
+ [2024-11-09 15:35:29,914][00359] Num frames 17200...
920
+ [2024-11-09 15:35:30,038][00359] Num frames 17300...
921
+ [2024-11-09 15:35:30,161][00359] Num frames 17400...
922
+ [2024-11-09 15:35:30,287][00359] Num frames 17500...
923
+ [2024-11-09 15:35:30,415][00359] Num frames 17600...
924
+ [2024-11-09 15:35:30,540][00359] Num frames 17700...
925
+ [2024-11-09 15:35:30,665][00359] Num frames 17800...
926
+ [2024-11-09 15:35:30,787][00359] Num frames 17900...
927
+ [2024-11-09 15:35:30,913][00359] Num frames 18000...
928
+ [2024-11-09 15:35:31,036][00359] Num frames 18100...
929
+ [2024-11-09 15:35:31,160][00359] Num frames 18200...
930
+ [2024-11-09 15:35:31,285][00359] Num frames 18300...
931
+ [2024-11-09 15:35:31,410][00359] Num frames 18400...
932
+ [2024-11-09 15:35:31,539][00359] Num frames 18500...
933
+ [2024-11-09 15:35:31,639][00359] Avg episode rewards: #0: 62.258, true rewards: #0: 20.592
934
+ [2024-11-09 15:35:31,641][00359] Avg episode reward: 62.258, avg true_objective: 20.592
935
+ [2024-11-09 15:35:31,726][00359] Num frames 18600...
936
+ [2024-11-09 15:35:31,856][00359] Num frames 18700...
937
+ [2024-11-09 15:35:31,982][00359] Num frames 18800...
938
+ [2024-11-09 15:35:32,107][00359] Num frames 18900...
939
+ [2024-11-09 15:35:32,233][00359] Num frames 19000...
940
+ [2024-11-09 15:35:32,387][00359] Avg episode rewards: #0: 57.176, true rewards: #0: 19.077
941
+ [2024-11-09 15:35:32,389][00359] Avg episode reward: 57.176, avg true_objective: 19.077
942
+ [2024-11-09 15:36:17,821][00359] Replay video saved to train_dir/doom_health_gathering_supreme_2222/replay.mp4!
943
+ [2024-11-09 15:37:17,872][00359] Loading existing experiment configuration from /content/train_dir/default_experiment/config.json
944
+ [2024-11-09 15:37:17,873][00359] Overriding arg 'num_workers' with value 1 passed from command line
945
+ [2024-11-09 15:37:17,875][00359] Adding new argument 'no_render'=True that is not in the saved config file!
946
+ [2024-11-09 15:37:17,876][00359] Adding new argument 'save_video'=True that is not in the saved config file!
947
+ [2024-11-09 15:37:17,877][00359] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
948
+ [2024-11-09 15:37:17,879][00359] Adding new argument 'video_name'=None that is not in the saved config file!
949
+ [2024-11-09 15:37:17,880][00359] Adding new argument 'max_num_frames'=100000 that is not in the saved config file!
950
+ [2024-11-09 15:37:17,881][00359] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
951
+ [2024-11-09 15:37:17,883][00359] Adding new argument 'push_to_hub'=True that is not in the saved config file!
952
+ [2024-11-09 15:37:17,884][00359] Adding new argument 'hf_repository'='lahirum/rl_course_vizdoom_health_gathering_supreme' that is not in the saved config file!
953
+ [2024-11-09 15:37:17,885][00359] Adding new argument 'policy_index'=0 that is not in the saved config file!
954
+ [2024-11-09 15:37:17,887][00359] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
955
+ [2024-11-09 15:37:17,888][00359] Adding new argument 'train_script'=None that is not in the saved config file!
956
+ [2024-11-09 15:37:17,889][00359] Adding new argument 'enjoy_script'=None that is not in the saved config file!
957
+ [2024-11-09 15:37:17,890][00359] Using frameskip 1 and render_action_repeat=4 for evaluation
958
+ [2024-11-09 15:37:17,914][00359] RunningMeanStd input shape: (3, 72, 128)
959
+ [2024-11-09 15:37:17,916][00359] RunningMeanStd input shape: (1,)
960
+ [2024-11-09 15:37:17,929][00359] ConvEncoder: input_channels=3
961
+ [2024-11-09 15:37:17,968][00359] Conv encoder output size: 512
962
+ [2024-11-09 15:37:17,970][00359] Policy head output size: 512
963
+ [2024-11-09 15:37:17,990][00359] Loading state from checkpoint /content/train_dir/default_experiment/checkpoint_p0/checkpoint_000000978_4005888.pth...
964
+ [2024-11-09 15:37:18,428][00359] Num frames 100...
965
+ [2024-11-09 15:37:18,554][00359] Num frames 200...
966
+ [2024-11-09 15:37:18,674][00359] Num frames 300...
967
+ [2024-11-09 15:37:18,795][00359] Num frames 400...
968
+ [2024-11-09 15:37:18,920][00359] Num frames 500...
969
+ [2024-11-09 15:37:19,047][00359] Num frames 600...
970
+ [2024-11-09 15:37:19,171][00359] Num frames 700...
971
+ [2024-11-09 15:37:19,294][00359] Num frames 800...
972
+ [2024-11-09 15:37:19,424][00359] Num frames 900...
973
+ [2024-11-09 15:37:19,562][00359] Num frames 1000...
974
+ [2024-11-09 15:37:19,695][00359] Avg episode rewards: #0: 22.590, true rewards: #0: 10.590
975
+ [2024-11-09 15:37:19,696][00359] Avg episode reward: 22.590, avg true_objective: 10.590
976
+ [2024-11-09 15:37:19,749][00359] Num frames 1100...
977
+ [2024-11-09 15:37:19,872][00359] Num frames 1200...
978
+ [2024-11-09 15:37:19,994][00359] Num frames 1300...
979
+ [2024-11-09 15:37:20,116][00359] Num frames 1400...
980
+ [2024-11-09 15:37:20,236][00359] Num frames 1500...
981
+ [2024-11-09 15:37:20,361][00359] Num frames 1600...
982
+ [2024-11-09 15:37:20,483][00359] Num frames 1700...
983
+ [2024-11-09 15:37:20,653][00359] Avg episode rewards: #0: 19.475, true rewards: #0: 8.975
984
+ [2024-11-09 15:37:20,655][00359] Avg episode reward: 19.475, avg true_objective: 8.975
985
+ [2024-11-09 15:37:20,664][00359] Num frames 1800...
986
+ [2024-11-09 15:37:20,787][00359] Num frames 1900...
987
+ [2024-11-09 15:37:20,910][00359] Num frames 2000...
988
+ [2024-11-09 15:37:21,029][00359] Num frames 2100...
989
+ [2024-11-09 15:37:21,147][00359] Num frames 2200...
990
+ [2024-11-09 15:37:21,265][00359] Num frames 2300...
991
+ [2024-11-09 15:37:21,385][00359] Num frames 2400...
992
+ [2024-11-09 15:37:21,518][00359] Avg episode rewards: #0: 16.557, true rewards: #0: 8.223
993
+ [2024-11-09 15:37:21,520][00359] Avg episode reward: 16.557, avg true_objective: 8.223
994
+ [2024-11-09 15:37:21,561][00359] Num frames 2500...
995
+ [2024-11-09 15:37:21,681][00359] Num frames 2600...
996
+ [2024-11-09 15:37:21,804][00359] Num frames 2700...
997
+ [2024-11-09 15:37:21,927][00359] Num frames 2800...
998
+ [2024-11-09 15:37:22,049][00359] Num frames 2900...
999
+ [2024-11-09 15:37:22,170][00359] Num frames 3000...
1000
+ [2024-11-09 15:37:22,292][00359] Num frames 3100...
1001
+ [2024-11-09 15:37:22,414][00359] Num frames 3200...
1002
+ [2024-11-09 15:37:22,542][00359] Num frames 3300...
1003
+ [2024-11-09 15:37:22,665][00359] Num frames 3400...
1004
+ [2024-11-09 15:37:22,788][00359] Num frames 3500...
1005
+ [2024-11-09 15:37:22,909][00359] Num frames 3600...
1006
+ [2024-11-09 15:37:23,031][00359] Num frames 3700...
1007
+ [2024-11-09 15:37:23,153][00359] Num frames 3800...
1008
+ [2024-11-09 15:37:23,274][00359] Num frames 3900...
1009
+ [2024-11-09 15:37:23,393][00359] Num frames 4000...
1010
+ [2024-11-09 15:37:23,516][00359] Num frames 4100...
1011
+ [2024-11-09 15:37:23,639][00359] Num frames 4200...
1012
+ [2024-11-09 15:37:23,763][00359] Num frames 4300...
1013
+ [2024-11-09 15:37:23,857][00359] Avg episode rewards: #0: 26.080, true rewards: #0: 10.830
1014
+ [2024-11-09 15:37:23,859][00359] Avg episode reward: 26.080, avg true_objective: 10.830
1015
+ [2024-11-09 15:37:23,943][00359] Num frames 4400...
1016
+ [2024-11-09 15:37:24,064][00359] Num frames 4500...
1017
+ [2024-11-09 15:37:24,184][00359] Num frames 4600...
1018
+ [2024-11-09 15:37:24,307][00359] Num frames 4700...
1019
+ [2024-11-09 15:37:24,430][00359] Num frames 4800...
1020
+ [2024-11-09 15:37:24,517][00359] Avg episode rewards: #0: 22.452, true rewards: #0: 9.652
1021
+ [2024-11-09 15:37:24,519][00359] Avg episode reward: 22.452, avg true_objective: 9.652
1022
+ [2024-11-09 15:37:24,613][00359] Num frames 4900...
1023
+ [2024-11-09 15:37:24,731][00359] Num frames 5000...
1024
+ [2024-11-09 15:37:24,853][00359] Num frames 5100...
1025
+ [2024-11-09 15:37:24,973][00359] Num frames 5200...
1026
+ [2024-11-09 15:37:25,093][00359] Num frames 5300...
1027
+ [2024-11-09 15:37:25,214][00359] Num frames 5400...
1028
+ [2024-11-09 15:37:25,335][00359] Num frames 5500...
1029
+ [2024-11-09 15:37:25,455][00359] Num frames 5600...
1030
+ [2024-11-09 15:37:25,635][00359] Avg episode rewards: #0: 21.665, true rewards: #0: 9.498
1031
+ [2024-11-09 15:37:25,637][00359] Avg episode reward: 21.665, avg true_objective: 9.498
1032
+ [2024-11-09 15:37:25,640][00359] Num frames 5700...
1033
+ [2024-11-09 15:37:25,760][00359] Num frames 5800...
1034
+ [2024-11-09 15:37:25,884][00359] Num frames 5900...
1035
+ [2024-11-09 15:37:26,007][00359] Num frames 6000...
1036
+ [2024-11-09 15:37:26,127][00359] Num frames 6100...
1037
+ [2024-11-09 15:37:26,247][00359] Num frames 6200...
1038
+ [2024-11-09 15:37:26,368][00359] Num frames 6300...
1039
+ [2024-11-09 15:37:26,493][00359] Num frames 6400...
1040
+ [2024-11-09 15:37:26,614][00359] Num frames 6500...
1041
+ [2024-11-09 15:37:26,734][00359] Num frames 6600...
1042
+ [2024-11-09 15:37:26,858][00359] Num frames 6700...
1043
+ [2024-11-09 15:37:26,979][00359] Num frames 6800...
1044
+ [2024-11-09 15:37:27,100][00359] Num frames 6900...
1045
+ [2024-11-09 15:37:27,220][00359] Num frames 7000...
1046
+ [2024-11-09 15:37:27,378][00359] Avg episode rewards: #0: 23.981, true rewards: #0: 10.124
1047
+ [2024-11-09 15:37:27,380][00359] Avg episode reward: 23.981, avg true_objective: 10.124
1048
+ [2024-11-09 15:37:27,398][00359] Num frames 7100...
1049
+ [2024-11-09 15:37:27,524][00359] Num frames 7200...
1050
+ [2024-11-09 15:37:27,646][00359] Num frames 7300...
1051
+ [2024-11-09 15:37:27,764][00359] Num frames 7400...
1052
+ [2024-11-09 15:37:27,885][00359] Num frames 7500...
1053
+ [2024-11-09 15:37:27,945][00359] Avg episode rewards: #0: 21.629, true rewards: #0: 9.379
1054
+ [2024-11-09 15:37:27,947][00359] Avg episode reward: 21.629, avg true_objective: 9.379
1055
+ [2024-11-09 15:37:28,065][00359] Num frames 7600...
1056
+ [2024-11-09 15:37:28,184][00359] Num frames 7700...
1057
+ [2024-11-09 15:37:28,304][00359] Num frames 7800...
1058
+ [2024-11-09 15:37:28,426][00359] Num frames 7900...
1059
+ [2024-11-09 15:37:28,551][00359] Num frames 8000...
1060
+ [2024-11-09 15:37:28,680][00359] Num frames 8100...
1061
+ [2024-11-09 15:37:28,808][00359] Num frames 8200...
1062
+ [2024-11-09 15:37:28,938][00359] Num frames 8300...
1063
+ [2024-11-09 15:37:28,998][00359] Avg episode rewards: #0: 21.003, true rewards: #0: 9.226
1064
+ [2024-11-09 15:37:28,999][00359] Avg episode reward: 21.003, avg true_objective: 9.226
1065
+ [2024-11-09 15:37:29,120][00359] Num frames 8400...
1066
+ [2024-11-09 15:37:29,242][00359] Num frames 8500...
1067
+ [2024-11-09 15:37:29,363][00359] Num frames 8600...
1068
+ [2024-11-09 15:37:29,485][00359] Num frames 8700...
1069
+ [2024-11-09 15:37:29,605][00359] Num frames 8800...
1070
+ [2024-11-09 15:37:29,734][00359] Num frames 8900...
1071
+ [2024-11-09 15:37:29,861][00359] Num frames 9000...
1072
+ [2024-11-09 15:37:29,987][00359] Num frames 9100...
1073
+ [2024-11-09 15:37:30,113][00359] Num frames 9200...
1074
+ [2024-11-09 15:37:30,242][00359] Num frames 9300...
1075
+ [2024-11-09 15:37:30,371][00359] Num frames 9400...
1076
+ [2024-11-09 15:37:30,502][00359] Num frames 9500...
1077
+ [2024-11-09 15:37:30,631][00359] Num frames 9600...
1078
+ [2024-11-09 15:37:30,760][00359] Num frames 9700...
1079
+ [2024-11-09 15:37:30,889][00359] Num frames 9800...
1080
+ [2024-11-09 15:37:31,014][00359] Num frames 9900...
1081
+ [2024-11-09 15:37:31,104][00359] Avg episode rewards: #0: 23.328, true rewards: #0: 9.928
1082
+ [2024-11-09 15:37:31,106][00359] Avg episode reward: 23.328, avg true_objective: 9.928
1083
+ [2024-11-09 15:37:54,373][00359] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
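Note on the README change: the new `mean_reward` value of 9.93 +/- 4.61 appears to come from the final ten-episode evaluation logged above. The sketch below assumes that each `avg true_objective` value is a running average over the episodes completed so far, and that the reported spread is a population standard deviation. Both are assumptions inferred from the numbers, not confirmed by the log. The log lines are trimmed to the relevant part, and the function names are illustrative.

```python
import re
from statistics import fmean, pstdev

# The ten "avg true_objective" lines from the final evaluation run in
# sf_log.txt, with timestamps and process IDs trimmed.
LOG_EXCERPT = """\
Avg episode reward: 22.590, avg true_objective: 10.590
Avg episode reward: 19.475, avg true_objective: 8.975
Avg episode reward: 16.557, avg true_objective: 8.223
Avg episode reward: 26.080, avg true_objective: 10.830
Avg episode reward: 22.452, avg true_objective: 9.652
Avg episode reward: 21.665, avg true_objective: 9.498
Avg episode reward: 23.981, avg true_objective: 10.124
Avg episode reward: 21.629, avg true_objective: 9.379
Avg episode reward: 21.003, avg true_objective: 9.226
Avg episode reward: 23.328, avg true_objective: 9.928
"""

PATTERN = re.compile(r"avg true_objective: ([0-9.]+)")


def running_averages(log_text: str) -> list[float]:
    """Extract the logged running averages in order of appearance."""
    return [float(value) for value in PATTERN.findall(log_text)]


def per_episode(averages: list[float]) -> list[float]:
    """Undo the running average: r_n = n * avg_n - (n - 1) * avg_(n-1)."""
    episodes = []
    previous_total = 0.0
    for n, average in enumerate(averages, start=1):
        total = n * average
        episodes.append(total - previous_total)
        previous_total = total
    return episodes


averages = running_averages(LOG_EXCERPT)
episodes = per_episode(averages)
summary = f"{fmean(episodes):.2f} +/- {pstdev(episodes):.2f}"
print(summary)  # 9.93 +/- 4.61
```

Because the logged averages are rounded to three decimals, the recovered per-episode values are approximate, but they are precise enough to reproduce the two-decimal figures in the README.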