Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bobox
/
DeBERTa-ST-AllLayers-v3.1bis
like
0
Sentence Similarity
sentence-transformers
PyTorch
13 datasets
English
deberta-v2
feature-extraction
Generated from Trainer
dataset_size:165061
loss:AdaptiveLayerLoss
loss:GISTEmbedLoss
loss:OnlineContrastiveLoss
loss:MultipleNegativesSymmetricRankingLoss
loss:MultipleNegativesRankingLoss
Eval Results
arXiv:
4 papers
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
DeBERTa-ST-AllLayers-v3.1bis
577 MB
1 contributor
History:
2 commits
bobox
KL divergence loss layers selfdistill....Multi step multi task training.
869170b
verified
over 1 year ago
1_Pooling
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
README.md
Safe
408 kB
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
added_tokens.json
Safe
23 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
config.json
Safe
879 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
config_sentence_transformers.json
Safe
195 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
modules.json
Safe
229 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
565 MB
xet
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
sentence_bert_config.json
Safe
53 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
special_tokens_map.json
Safe
970 Bytes
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
spm.model
Safe
2.46 MB
xet
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
tokenizer.json
Safe
8.65 MB
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago
tokenizer_config.json
Safe
1.48 kB
KL divergence loss layers selfdistill....Multi step multi task training.
over 1 year ago