Add checkpoint files
ff9fa11
verified
bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_32_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_33_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_34_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_35_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_36_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_37_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_38_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_39_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_40_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_41_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_42_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "collections.OrderedDict",
- "torch.FloatStorage",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_43_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_44_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_45_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_46_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_47_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_48_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_49_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_50_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_51_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "collections.OrderedDict",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "torch._utils._rebuild_tensor_v2",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch.FloatStorage"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_52_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_53_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
1.2 GB
Add checkpoint files
bf16_zero_pp_rank_54_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "collections.OrderedDict",
- "deepspeed.runtime.fp16.loss_scaler.LossScaler"
How to fix it?
1.2 GB
Add checkpoint files