Fsoft-AIC/Phi3.5-Siglip-MoE
FPT Software AI Center
Tags: Image-Text-to-Text, TensorBoard, Safetensors, English
arXiv: 2411.00918
License: apache-2.0
refs/pr/6154
Phi3.5-Siglip-MoE/sft_pretrain/Full_smoe_sharev3/checkpoint-5198/global_step5198
10.1 GB, 1 contributor
History: 1 commit
DavidNguyen: Upload folder using huggingface_hub (a57ee2b, verified, 6 months ago)
bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt (pickle, 397 MB)
Detected Pickle imports (7): "deepspeed.runtime.fp16.loss_scaler.LossScaler", "torch.Tensor", "collections.OrderedDict", "deepspeed.runtime.zero.config.ZeroStageEnum", "torch._utils._rebuild_tensor_v2", "torch._tensor._rebuild_from_type_v2", "torch.FloatStorage"
bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt (pickle, 397 MB)
Detected Pickle imports (7): "torch._tensor._rebuild_from_type_v2", "deepspeed.runtime.zero.config.ZeroStageEnum", "torch.Tensor", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "deepspeed.runtime.fp16.loss_scaler.LossScaler", "collections.OrderedDict"
bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt (pickle, 397 MB)
Detected Pickle imports (7): "torch._tensor._rebuild_from_type_v2", "torch.FloatStorage", "deepspeed.runtime.fp16.loss_scaler.LossScaler", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "deepspeed.runtime.zero.config.ZeroStageEnum", "torch.Tensor"
bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt (pickle, 397 MB)
Detected Pickle imports (7): "deepspeed.runtime.zero.config.ZeroStageEnum", "torch.Tensor", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch._tensor._rebuild_from_type_v2", "deepspeed.runtime.fp16.loss_scaler.LossScaler"
zero_pp_rank_0_mp_rank_00_model_states.pt (pickle, 2.12 GB)
Detected Pickle imports (5): "__builtin__.set", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.BFloat16Storage", "torch.Size"
zero_pp_rank_1_mp_rank_00_model_states.pt (pickle, 2.12 GB)
Detected Pickle imports (5): "torch._utils._rebuild_tensor_v2", "torch.Size", "__builtin__.set", "collections.OrderedDict", "torch.BFloat16Storage"
zero_pp_rank_2_mp_rank_00_model_states.pt (pickle, 2.12 GB)
Detected Pickle imports (5): "torch.Size", "torch._utils._rebuild_tensor_v2", "__builtin__.set", "collections.OrderedDict", "torch.BFloat16Storage"
zero_pp_rank_3_mp_rank_00_model_states.pt (pickle, 2.12 GB)
Detected Pickle imports (5): "torch.Size", "torch._utils._rebuild_tensor_v2", "__builtin__.set", "collections.OrderedDict", "torch.BFloat16Storage"