SentenceTransformer
This is a sentence-transformers model trained. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 tokens
- Similarity Function: Cosine Similarity
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: DistilBertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'100% Cotton Throw Blanket for Couch Sofa Bed Outdoors Hypoallergenic 83"x70" Brown',
'Hetao 100% Cotton Handmade Crochet Round Tablecloth Doilies Lace Table Covers,Beige, 27 Inch\n',
'Life Clothing Co. Womens Tops Jade Tie Dye Hoodie (XL)\n',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Training Details
Training Dataset
Unnamed Dataset
- Size: 1,830 training samples
- Columns:
sentence_0
,sentence_1
, andsentence_2
- Approximate statistics based on the first 1000 samples:
sentence_0 sentence_1 sentence_2 type string string string details - min: 7 tokens
- mean: 16.37 tokens
- max: 23 tokens
- min: 2 tokens
- mean: 161.41 tokens
- max: 512 tokens
- min: 2 tokens
- mean: 74.05 tokens
- max: 512 tokens
- Samples:
sentence_0 sentence_1 sentence_2 "acrylic lollipop holder cake pop stand with sticks, bags, and twist ties"
Zealax 15pcs Treat Bags Gold Polka Dot Print Drawstring Plastic Party Favors for Cookie Roasting Treat Candy Buffet Gift Wrapping Goodies Package, 4.6 inches x 6.7 inches
Twinkle Star Solid Brass Heavy Duty Adjustable Twist Hose Nozzle Jet Sweeper Nozzle, TWIS3231
Product Description Shut Off Valve Shut Off Valve Adjustable Hose Nozzle Adjustable Twist Hose Nozzle Jet Sweeper Screw Threads 3/4" 3/4" 3/4" 3/4" 3/4" Shut-Off Valve YES YES YES YES NO Material Brass Brass Brass Brass Brass Package Includes 2 Pack 1 Pack 1 Pack 2 Pack 2 Pack Garden Hose Quick Connect Set Garden Hose Quick Connect Set Hose Caps Double Female Swivel Connectors Double Male Quick Connectors Screw Threads 3/4" 3/4" 3/4" 3/4" 3/4" Material Aluminum Brass Brass Brass Brass Package Includes 4 Sets 4 Sets 4 Pack 2 Pack 2 Pack Specifications: Body Material: Brass Package Includes: 1 x adjustable nozzle, 1 x jet sweeper nozzle. Twinkle Star Solid Brass Heavy Duty Adjustable Twist Hose Nozzle Jet Sweeper Nozzle Heavy-duty solid brass construction. With 4 holes at the tip for maximum pressure and water flow, fitted with O-ring seals at the back and front to prevent any leaks. Twisting barrel to adjusts water from a fine mist to a powerful jet stream. Fits standard garden hose, great for watering gardens, car washing, deck, siding & driveway cleaning and more. Notes: 1. Please choose the correct hose size. 2. To prevent leakage, make sure the jet has rubber ring. 3. If water leaks after a long period of use, please replace with a new washer. Read more Adjustable jet rotates from a light stream to powerful stream. Heavy duty brass 3/4”female thread. Solid brass integral inner core, anti - damage, anti - rust, anti - leakage, durable. Read moreNFL Women's OTS Fleece Hoodie
Ultra Game NFL womens Fleece Hoodie Pullover Sweatshirt Tie Neck
Womens Antler Evolution Whitetail Tee Short Sleeve
"Belkin QODE Ultimate Pro Keyboard Case for iPad Air 2 White"
iPad Pro Guide
Under Armour Boys' Prototype Short
From the manufacturer Read more Read more Read more - Loss:
TripletLoss
with these parameters:{ "distance_metric": "TripletDistanceMetric.EUCLIDEAN", "triplet_margin": 25.0 }
Training Hyperparameters
Non-Default Hyperparameters
per_device_train_batch_size
: 54per_device_eval_batch_size
: 54num_train_epochs
: 5multi_dataset_batch_sampler
: round_robin
All Hyperparameters
Click to expand
overwrite_output_dir
: Falsedo_predict
: Falseeval_strategy
: noprediction_loss_only
: Trueper_device_train_batch_size
: 54per_device_eval_batch_size
: 54per_gpu_train_batch_size
: Noneper_gpu_eval_batch_size
: Nonegradient_accumulation_steps
: 1eval_accumulation_steps
: Nonetorch_empty_cache_steps
: Nonelearning_rate
: 5e-05weight_decay
: 0.0adam_beta1
: 0.9adam_beta2
: 0.999adam_epsilon
: 1e-08max_grad_norm
: 1num_train_epochs
: 5max_steps
: -1lr_scheduler_type
: linearlr_scheduler_kwargs
: {}warmup_ratio
: 0.0warmup_steps
: 0log_level
: passivelog_level_replica
: warninglog_on_each_node
: Truelogging_nan_inf_filter
: Truesave_safetensors
: Truesave_on_each_node
: Falsesave_only_model
: Falserestore_callback_states_from_checkpoint
: Falseno_cuda
: Falseuse_cpu
: Falseuse_mps_device
: Falseseed
: 42data_seed
: Nonejit_mode_eval
: Falseuse_ipex
: Falsebf16
: Falsefp16
: Falsefp16_opt_level
: O1half_precision_backend
: autobf16_full_eval
: Falsefp16_full_eval
: Falsetf32
: Nonelocal_rank
: 0ddp_backend
: Nonetpu_num_cores
: Nonetpu_metrics_debug
: Falsedebug
: []dataloader_drop_last
: Falsedataloader_num_workers
: 0dataloader_prefetch_factor
: Nonepast_index
: -1disable_tqdm
: Falseremove_unused_columns
: Truelabel_names
: Noneload_best_model_at_end
: Falseignore_data_skip
: Falsefsdp
: []fsdp_min_num_params
: 0fsdp_config
: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap
: Noneaccelerator_config
: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed
: Nonelabel_smoothing_factor
: 0.0optim
: adamw_torchoptim_args
: Noneadafactor
: Falsegroup_by_length
: Falselength_column_name
: lengthddp_find_unused_parameters
: Noneddp_bucket_cap_mb
: Noneddp_broadcast_buffers
: Falsedataloader_pin_memory
: Truedataloader_persistent_workers
: Falseskip_memory_metrics
: Trueuse_legacy_prediction_loop
: Falsepush_to_hub
: Falseresume_from_checkpoint
: Nonehub_model_id
: Nonehub_strategy
: every_savehub_private_repo
: Falsehub_always_push
: Falsegradient_checkpointing
: Falsegradient_checkpointing_kwargs
: Noneinclude_inputs_for_metrics
: Falseeval_do_concat_batches
: Truefp16_backend
: autopush_to_hub_model_id
: Nonepush_to_hub_organization
: Nonemp_parameters
:auto_find_batch_size
: Falsefull_determinism
: Falsetorchdynamo
: Noneray_scope
: lastddp_timeout
: 1800torch_compile
: Falsetorch_compile_backend
: Nonetorch_compile_mode
: Nonedispatch_batches
: Nonesplit_batches
: Noneinclude_tokens_per_second
: Falseinclude_num_input_tokens_seen
: Falseneftune_noise_alpha
: Noneoptim_target_modules
: Nonebatch_eval_metrics
: Falseeval_on_start
: Falseeval_use_gather_object
: Falsebatch_sampler
: batch_samplermulti_dataset_batch_sampler
: round_robin
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.1.1
- Transformers: 4.44.2
- PyTorch: 2.4.1+cu121
- Accelerate: 0.34.2
- Datasets: 3.0.0
- Tokenizers: 0.19.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
TripletLoss
@misc{hermans2017defense,
title={In Defense of the Triplet Loss for Person Re-Identification},
author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
year={2017},
eprint={1703.07737},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
- Downloads last month
- 13
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.