metadata
language:
- en
library_name: sentence-transformers
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:689221
- loss:MultipleNegativesRankingLoss
- loss:CoSENTLoss
- loss:GISTEmbedLoss
- loss:OnlineContrastiveLoss
- loss:MultipleNegativesSymmetricRankingLoss
base_model: microsoft/deberta-v3-small
datasets:
- sentence-transformers/all-nli
- sentence-transformers/stsb
- tals/vitaminc
- nyu-mll/glue
- allenai/scitail
- sentence-transformers/xsum
- sentence-transformers/sentence-compression
widget:
- source_sentence: >-
What are the exceptions in the constitution that require special
considerations to amend?
sentences:
- The river makes a distinctive turn to the north near Chur.
- >-
The Victorian Constitution can be amended by the Parliament of Victoria,
except for certain "entrenched" provisions that require either an
absolute majority in both houses, a three-fifths majority in both
houses, or the approval of the Victorian people in a referendum,
depending on the provision.
- >-
A new arrangement of the theme, once again by Gold, was introduced in
the 2007 Christmas special episode, "Voyage of the Damned"; Gold
returned as composer for the 2010 series.
- source_sentence: What is the name of a Bodhisattva vow?
sentences:
- >-
In Tibetan Buddhism the teachers of Dharma in Tibet are most commonly
called a Lama.
- >-
This origin of chloroplasts was first suggested by the Russian biologist
Konstantin Mereschkowski in 1905 after Andreas Schimper observed in 1883
that chloroplasts closely resemble cyanobacteria.
- >-
The announcement came a day after Setanta Sports confirmed that it would
launch in March as a subscription service on the digital terrestrial
platform, and on the same day that NTL's services re-branded as Virgin
Media.
- source_sentence: Two dogs run around inside a fence.
sentences:
- A young woman tennis player have many tennis balls.
- Two dogs are inside a fence.
- A little girl in red plays tennis.
- source_sentence: >-
A little boy wearing a blue stiped shirt has a party hat on his head and
is playing in a puddle.
sentences:
- The party boy is playing in a puddle.
- There is a crowd
- Four people are skiing
- source_sentence: Two wrestlers jump in a ring while an official watches.
sentences:
- The man was walking.
- Two men are dressed in makeup
- Two wrestlers were just tagged in on a tag team match.
pipeline_tag: sentence-similarity
SentenceTransformer based on microsoft/deberta-v3-small
This is a sentence-transformers model finetuned from microsoft/deberta-v3-small on the nli-pairs, sts-label, vitaminc-pairs, qnli-contrastive, scitail-pairs-qa, scitail-pairs-pos, xsum-pairs and compression-pairs datasets. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: microsoft/deberta-v3-small
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 tokens
- Similarity Function: Cosine Similarity
- Training Datasets:
- Language: en
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: DebertaV2Model
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("bobox/DeBERTaV3-small-GeneralSentenceTransformer-checkpoints-tmp")
# Run inference
sentences = [
'Two wrestlers jump in a ring while an official watches.',
'Two wrestlers were just tagged in on a tag team match.',
'Two men are dressed in makeup',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Training Details
Training Datasets
nli-pairs
- Dataset: nli-pairs at d482672
- Size: 150,000 training samples
- Columns:
sentence1
andsentence2
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 type string string details - min: 5 tokens
- mean: 16.62 tokens
- max: 62 tokens
- min: 4 tokens
- mean: 9.46 tokens
- max: 29 tokens
- Samples:
sentence1 sentence2 A person on a horse jumps over a broken down airplane.
A person is outdoors, on a horse.
Children smiling and waving at camera
There are children present
A boy is jumping on skateboard in the middle of a red bridge.
The boy does a skateboarding trick.
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
sts-label
- Dataset: sts-label at ab7a5ac
- Size: 5,749 training samples
- Columns:
sentence1
,sentence2
, andscore
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 score type string string float details - min: 6 tokens
- mean: 9.81 tokens
- max: 27 tokens
- min: 5 tokens
- mean: 9.74 tokens
- max: 25 tokens
- min: 0.0
- mean: 0.54
- max: 1.0
- Samples:
sentence1 sentence2 score A plane is taking off.
An air plane is taking off.
1.0
A man is playing a large flute.
A man is playing a flute.
0.76
A man is spreading shreded cheese on a pizza.
A man is spreading shredded cheese on an uncooked pizza.
0.76
- Loss:
CoSENTLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "pairwise_cos_sim" }
vitaminc-pairs
- Dataset: vitaminc-pairs at be6febb
- Size: 75,142 training samples
- Columns:
label
,sentence1
, andsentence2
- Approximate statistics based on the first 1000 samples:
label sentence1 sentence2 type int string string details - 1: 100.00%
- min: 7 tokens
- mean: 17.44 tokens
- max: 53 tokens
- min: 8 tokens
- mean: 38.0 tokens
- max: 151 tokens
- Samples:
label sentence1 sentence2 1
Penguins has a rating of less than 92 % , defined by more than 20 reviews on Rotten Tomatoes .
On review aggregator Rotten Tomatoes , the film holds an approval rating of 91 % based on 22 reviews , with an average rating of 7.14/10 .
1
Fluoxetine , acts as a positive allosteric modulator of the GABAA receptor at high concentrations , as does norfluoxetine though more potently .
In addition , it acts as a positive allosteric modulator of the GABAA receptor at high concentrations , and norfluoxetine does the same but more potently , actions which may be clinically-relevant .
1
Andrew Robertson is considered by many experts to be one of the best left backs .
He is considered by many pundits to be one of the best left backs in the world due to his pace and crossing ability.
- Loss:
GISTEmbedLoss
with these parameters:{'guide': SentenceTransformer( (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True}) (2): Normalize() ), 'temperature': 0.05}
qnli-contrastive
- Dataset: qnli-contrastive at bcdcba7
- Size: 104,743 training samples
- Columns:
sentence1
,sentence2
, andlabel
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 label type string string int details - min: 3 tokens
- mean: 13.82 tokens
- max: 39 tokens
- min: 5 tokens
- mean: 34.56 tokens
- max: 110 tokens
- 0: 100.00%
- Samples:
sentence1 sentence2 label Which Formula One racing team developed the C-X75's used for filming.
The C-X75s used for filming were developed by the engineering division of Formula One racing team Williams, who built the original C-X75 prototype for Jaguar.
0
When did the University of Michigan leave Detroit?
In June 2009, the Michigan State University College of Osteopathic Medicine which is based in East Lansing opened a satellite campus located at the Detroit Medical Center.
0
When did the Vlachs migrate into the region?
The Gorals of southern Poland and northern Slovakia are partially descended from Romance-speaking Vlachs who migrated into the region from the 14th to 17th centuries and were absorbed into the local population.
0
- Loss:
OnlineContrastiveLoss
scitail-pairs-qa
- Dataset: scitail-pairs-qa at 0cc4353
- Size: 14,987 training samples
- Columns:
sentence2
andsentence1
- Approximate statistics based on the first 1000 samples:
sentence2 sentence1 type string string details - min: 7 tokens
- mean: 16.04 tokens
- max: 41 tokens
- min: 6 tokens
- mean: 15.14 tokens
- max: 34 tokens
- Samples:
sentence2 sentence1 Voltage is not the same as energy, but means the energy per unit charge.
What term is not the same as energy, but means the energy per unit charge?
A jellyfish does not have a circulatory system.
Name the type of system that a jellyfish does not have?
Insight learning is based on past experience and reasoning.
What type of learning is based on past experience and reasoning?
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
scitail-pairs-pos
- Dataset: scitail-pairs-pos at 0cc4353
- Size: 8,600 training samples
- Columns:
sentence1
andsentence2
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 type string string details - min: 6 tokens
- mean: 23.99 tokens
- max: 65 tokens
- min: 7 tokens
- mean: 15.54 tokens
- max: 39 tokens
- Samples:
sentence1 sentence2 A) A calorie is a unit of measure used to express the amount of energy a food produces in the body.
Another unit of energy, used widely in the health professions and everyday life, is calorie ( cal )?
solid 1 A state that retains shape independent of the shape of the container it occupies.
Solid takes neither the shape nor the volume of its container.
Sometimes the two sides of a fracture moved due to the pressure and a fault was formed.
A fault is the fracture caused when rocks on both sides move.
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
xsum-pairs
- Dataset: xsum-pairs at 788ddaf
- Size: 150,000 training samples
- Columns:
sentence1
andsentence2
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 type string string details - min: 13 tokens
- mean: 346.32 tokens
- max: 512 tokens
- min: 7 tokens
- mean: 26.95 tokens
- max: 66 tokens
- Samples:
sentence1 sentence2 Jacob Murphy fired in his 10th goal of the season from inside the box to give the Canaries the lead at the break.
Adam Hammill, Angus MacDonald and Marley Watkins all went close for the visitors after the restart.
Norwich then stretched their lead thanks to MacDonald's own goal to leave them five points behind sixth-placed Sheffield Wednesday.
Victory means caretaker boss Alan Irvine has now claimed four points from a possible six since the departure of Alex Neil.
The hosts dominated the early proceedings, with Jonny Howson and Alex Pritchard both being denied by Barnsley keeper Adam Davies.
After Hammill had a goal ruled out for a clear offside at the other end, winger Murphy gave the Canaries a deserved lead moments before the break when, having being picked out by Cameron Jerome, he drilled a shot low and into the corner of the net.
Hammill was unlucky to not get a strong enough flick on Andy Yiadom's cross to make it 1-1 after the restart and MacDonald saw a close-range effort well saved by Michael McGovern from the resulting corner.
But, after Steven Naismith fired over for the Canaries with just the keeper to beat, they doubled their lead in fortunate circumstances as an effort from Murphy deflected off MacDonald into the net.
Jerome and Howson then went close to adding a third as Norwich coasted to three points.
Norwich caretaker manager Alan Irvine:
"I was asked to take charge for two games and I have done that. I haven't heard anything more about what happens going forward, but I should imagine I will be speaking to someone soon to find out what happens next week.
"If that is to be my last game in charge it was a good way to finish - and the win certainly makes it interesting as far as the play-offs are concerned.
"Being five points behind sounds a lot better than being eight points behind - and as I said last week there are still plenty of points to play for."
Barnsley manager Paul Heckingbottom:
"The take-away message from that game is hit the target, score goals.
"There were plenty of positives to take away from it, but if you are going to get anything in this league you have got to be clinical in front of goal.
"It's frustrating, but there is still plenty to play for. We will keep striving to get that perfect performance and obviously want to win as many games as possible between now and the end of the season."
Match ends, Norwich City 2, Barnsley 0.
Second Half ends, Norwich City 2, Barnsley 0.
Hand ball by Nélson Oliveira (Norwich City).
Attempt missed. Ryan Kent (Barnsley) right footed shot from the centre of the box is close, but misses to the left. Assisted by Ryan Hedges with a cross.
Attempt saved. Nélson Oliveira (Norwich City) left footed shot from outside the box is saved in the centre of the goal.
Alex Pritchard (Norwich City) wins a free kick in the attacking half.
Foul by Alex Mowatt (Barnsley).
Corner, Barnsley. Conceded by Jonny Howson.
Foul by Graham Dorrans (Norwich City).
Matthew James (Barnsley) wins a free kick in the defensive half.
Attempt missed. Tom Bradshaw (Barnsley) left footed shot from the centre of the box is too high. Assisted by Gethin Jones with a cross.
Attempt missed. Steven Naismith (Norwich City) right footed shot from the right side of the box misses to the left. Assisted by Alex Pritchard.
Corner, Norwich City. Conceded by Angus MacDonald.
Attempt blocked. Jonny Howson (Norwich City) right footed shot from the right side of the box is blocked. Assisted by Graham Dorrans with a through ball.
Substitution, Norwich City. Graham Dorrans replaces Jacob Murphy.
Substitution, Norwich City. Nélson Oliveira replaces Cameron Jerome.
Substitution, Barnsley. Ryan Hedges replaces Adam Hammill.
Attempt missed. Ryan Kent (Barnsley) left footed shot from the centre of the box is high and wide to the left. Assisted by Matthew James with a cross.
Attempt saved. Cameron Jerome (Norwich City) right footed shot from the centre of the box is saved in the bottom right corner. Assisted by Jacob Murphy with a through ball.
Substitution, Barnsley. Alex Mowatt replaces Marley Watkins.
Corner, Barnsley. Conceded by Ivo Pinto.
Corner, Barnsley. Conceded by Russell Martin.
Attempt blocked. Tom Bradshaw (Barnsley) right footed shot from the right side of the box is blocked. Assisted by Ryan Kent.
Own Goal by Angus MacDonald, Barnsley. Norwich City 2, Barnsley 0.
Attempt saved. Jacob Murphy (Norwich City) right footed shot from the centre of the box is saved in the bottom right corner. Assisted by Alex Pritchard.
Attempt saved. Steven Naismith (Norwich City) left footed shot from the centre of the box is saved in the bottom left corner. Assisted by Steven Whittaker with a cross.
Ivo Pinto (Norwich City) wins a free kick in the defensive half.
Foul by Adam Hammill (Barnsley).
Attempt saved. Ryan Kent (Barnsley) right footed shot from outside the box is saved in the centre of the goal. Assisted by Marley Watkins.
Attempt missed. Josh Scowen (Barnsley) right footed shot from outside the box is high and wide to the right. Assisted by Adam Hammill.
Jacob Murphy (Norwich City) wins a free kick in the attacking half.
Foul by Angus MacDonald (Barnsley).
Ryan Bennett (Norwich City) wins a free kick in the defensive half.
Foul by Marc Roberts (Barnsley).
Ivo Pinto (Norwich City) is shown the yellow card for a bad foul.
Foul by Ivo Pinto (Norwich City).
Ryan Kent (Barnsley) wins a free kick in the attacking half.
Foul by Ryan Bennett (Norwich City).
Tom Bradshaw (Barnsley) wins a free kick in the attacking half.
Attempt missed. Steven Naismith (Norwich City) left footed shot from the left side of the box is too high. Assisted by Alex Pritchard.Norwich City kept their Championship play-off hopes alive by beating Barnsley at Carrow Road.
Political reporter Samantha Maiden said the offensive text, which also contained strong language, was intended for disgraced ex-minister Jamie Briggs.
She said Mr Dutton apologised for the message about her article referring to Mr Briggs' recent resignation.
The BBC has approached Mr Dutton's office for comment.
He reportedly told News Corp in a statement he is expecting a "tough time" in Ms Maiden's next article.
"Sam and I have exchanged some robust language over the years so we had a laugh after this and I apologised to her straightaway, which she took in good faith," Mr Dutton was quoted as saying.
Former Cities Minister Jamie Briggs resigned last week following a complaint from a female public servant over his alleged conduct during a night out in Hong Kong.Australia's Immigration Minister Peter Dutton has reportedly apologised for mistakenly sending an SMS to a journalist, calling her a "mad witch".
Demonstrators have moved around several sites since April to highlight a crisis in temporary housing.
The council's lawyer told the court "trespass, highways and planning laws" were the grounds for the case.
The cost to the council in terms of additional policing, security and legal costs has exceeded £100,000, he added.
Ahead of the hearing, tents were set up and a banner reading "The homeless resistance" was hung outside Manchester Civil Justice Centre.
'Grave and serious'
Protesters said they hoped to be offered "permanent, suitable accommodation".
Some had earlier refused temporary accommodation offered by the council because they said it was "not suitable" and they felt unsafe.
The council said it had engaged with the protestors and had offered them support, but it could not accept anti-social behaviour and disruption to residents and businesses.
Councillor Nigel Murphy added the exclusion order was "designed to prevent the recurrence of camps and not targeted at individual rough sleepers".
He said the council would work with police and court bailiffs to "regain possession" of areas taken over by camps in St Ann's Square and Castlefield as soon as possible.
John Clegg, from Unison's community branch, said there was a lack of social housing in Manchester.
He added: "There is a large amount of money for building private flats, more hotels are going up all the time, but there are no plans to build any social housing. That's wrong. That's absolutely wrong."
"In our view an injunction is a form of gating, and sending out a message that poor people are not wanted and should not be coming in to the city centre."A Manchester City Council application for an injunction to stop the setting up of homeless camps in the city centre has been granted.
- Loss:
MultipleNegativesSymmetricRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
compression-pairs
- Dataset: compression-pairs at 605bc91
- Size: 180,000 training samples
- Columns:
sentence1
andsentence2
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 type string string details - min: 10 tokens
- mean: 31.89 tokens
- max: 125 tokens
- min: 5 tokens
- mean: 10.21 tokens
- max: 28 tokens
- Samples:
sentence1 sentence2 The USHL completed an expansion draft on Monday as 10 players who were on the rosters of USHL teams during the 2009-10 season were selected by the League's two newest entries, the Muskegon Lumberjacks and Dubuque Fighting Saints.
USHL completes expansion draft
Major League Baseball Commissioner Bud Selig will be speaking at St. Norbert College next month.
Bud Selig to speak at St. Norbert College
It's fresh cherry time in Michigan and the best time to enjoy this delicious and nutritious fruit.
It's cherry time
- Loss:
MultipleNegativesSymmetricRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
Evaluation Datasets
nli-pairs
- Dataset: nli-pairs at d482672
- Size: 6,808 evaluation samples
- Columns:
sentence1
andsentence2
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 type string string details - min: 5 tokens
- mean: 17.64 tokens
- max: 63 tokens
- min: 4 tokens
- mean: 9.67 tokens
- max: 29 tokens
- Samples:
sentence1 sentence2 Two women are embracing while holding to go packages.
Two woman are holding packages.
Two young children in blue jerseys, one with the number 9 and one with the number 2 are standing on wooden steps in a bathroom and washing their hands in a sink.
Two kids in numbered jerseys wash their hands.
A man selling donuts to a customer during a world exhibition event held in the city of Angeles
A man selling donuts to a customer.
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
qnli-contrastive
- Dataset: qnli-contrastive at bcdcba7
- Size: 5,463 evaluation samples
- Columns:
sentence1
,sentence2
, andlabel
- Approximate statistics based on the first 1000 samples:
sentence1 sentence2 label type string string int details - min: 6 tokens
- mean: 14.13 tokens
- max: 36 tokens
- min: 4 tokens
- mean: 36.58 tokens
- max: 225 tokens
- 0: 100.00%
- Samples:
sentence1 sentence2 label What came into force after the new constitution was herald?
As of that day, the new constitution heralding the Second Republic came into force.
0
What is the first major city in the stream of the Rhine?
The most important tributaries in this area are the Ill below of Strasbourg, the Neckar in Mannheim and the Main across from Mainz.
0
What is the minimum required if you want to teach in Canada?
In most provinces a second Bachelor's Degree such as a Bachelor of Education is required to become a qualified teacher.
0
- Loss:
OnlineContrastiveLoss
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy
: stepsper_device_train_batch_size
: 94per_device_eval_batch_size
: 32learning_rate
: 2e-05weight_decay
: 1e-10num_train_epochs
: 2lr_scheduler_type
: cosinewarmup_ratio
: 0.33save_safetensors
: Falsefp16
: Truepush_to_hub
: Truehub_model_id
: bobox/DeBERTaV3-small-GeneralSentenceTransformer-checkpoints-tmphub_strategy
: checkpointbatch_sampler
: no_duplicates
All Hyperparameters
Click to expand
overwrite_output_dir
: Falsedo_predict
: Falseeval_strategy
: stepsprediction_loss_only
: Trueper_device_train_batch_size
: 94per_device_eval_batch_size
: 32per_gpu_train_batch_size
: Noneper_gpu_eval_batch_size
: Nonegradient_accumulation_steps
: 1eval_accumulation_steps
: Nonelearning_rate
: 2e-05weight_decay
: 1e-10adam_beta1
: 0.9adam_beta2
: 0.999adam_epsilon
: 1e-08max_grad_norm
: 1.0num_train_epochs
: 2max_steps
: -1lr_scheduler_type
: cosinelr_scheduler_kwargs
: {}warmup_ratio
: 0.33warmup_steps
: 0log_level
: passivelog_level_replica
: warninglog_on_each_node
: Truelogging_nan_inf_filter
: Truesave_safetensors
: Falsesave_on_each_node
: Falsesave_only_model
: Falserestore_callback_states_from_checkpoint
: Falseno_cuda
: Falseuse_cpu
: Falseuse_mps_device
: Falseseed
: 42data_seed
: Nonejit_mode_eval
: Falseuse_ipex
: Falsebf16
: Falsefp16
: Truefp16_opt_level
: O1half_precision_backend
: autobf16_full_eval
: Falsefp16_full_eval
: Falsetf32
: Nonelocal_rank
: 0ddp_backend
: Nonetpu_num_cores
: Nonetpu_metrics_debug
: Falsedebug
: []dataloader_drop_last
: Falsedataloader_num_workers
: 0dataloader_prefetch_factor
: Nonepast_index
: -1disable_tqdm
: Falseremove_unused_columns
: Truelabel_names
: Noneload_best_model_at_end
: Falseignore_data_skip
: Falsefsdp
: []fsdp_min_num_params
: 0fsdp_config
: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap
: Noneaccelerator_config
: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed
: Nonelabel_smoothing_factor
: 0.0optim
: adamw_torchoptim_args
: Noneadafactor
: Falsegroup_by_length
: Falselength_column_name
: lengthddp_find_unused_parameters
: Noneddp_bucket_cap_mb
: Noneddp_broadcast_buffers
: Falsedataloader_pin_memory
: Truedataloader_persistent_workers
: Falseskip_memory_metrics
: Trueuse_legacy_prediction_loop
: Falsepush_to_hub
: Trueresume_from_checkpoint
: Nonehub_model_id
: bobox/DeBERTaV3-small-GeneralSentenceTransformer-checkpoints-tmphub_strategy
: checkpointhub_private_repo
: Falsehub_always_push
: Falsegradient_checkpointing
: Falsegradient_checkpointing_kwargs
: Noneinclude_inputs_for_metrics
: Falseeval_do_concat_batches
: Truefp16_backend
: autopush_to_hub_model_id
: Nonepush_to_hub_organization
: Nonemp_parameters
:auto_find_batch_size
: Falsefull_determinism
: Falsetorchdynamo
: Noneray_scope
: lastddp_timeout
: 1800torch_compile
: Falsetorch_compile_backend
: Nonetorch_compile_mode
: Nonedispatch_batches
: Nonesplit_batches
: Noneinclude_tokens_per_second
: Falseinclude_num_input_tokens_seen
: Falseneftune_noise_alpha
: Noneoptim_target_modules
: Nonebatch_eval_metrics
: Falsebatch_sampler
: no_duplicatesmulti_dataset_batch_sampler
: proportional
Training Logs
Epoch | Step | Training Loss | qnli-contrastive loss | nli-pairs loss |
---|---|---|---|---|
None | 0 | - | 20.1737 | 4.0959 |
0.1001 | 734 | 4.796 | - | - |
0.2001 | 1468 | 1.3015 | 0.0358 | 0.9115 |
0.3002 | 2202 | 0.89 | - | - |
0.4002 | 2936 | 0.716 | 0.0168 | 0.5944 |
0.5003 | 3670 | 0.6365 | - | - |
0.6003 | 4404 | 0.5883 | 0.0164 | 0.4975 |
0.7004 | 5138 | 0.5192 | - | - |
0.8004 | 5872 | 0.4961 | 0.0288 | 0.4450 |
0.9005 | 6606 | 0.6035 | - | - |
1.0005 | 7340 | 0.4733 | 0.0110 | 0.4215 |
1.1006 | 8074 | 0.4002 | - | - |
1.2007 | 8808 | 0.3929 | 0.0454 | 0.3796 |
1.3007 | 9542 | 0.3826 | - | - |
1.4008 | 10276 | 0.3522 | 0.0178 | 0.3714 |
1.5008 | 11010 | 0.3627 | - | - |
1.6009 | 11744 | 0.3553 | 0.0257 | 0.3629 |
1.7009 | 12478 | 0.3406 | - | - |
1.8010 | 13212 | 0.3288 | 0.0289 | 0.3575 |
1.9010 | 13946 | 0.4563 | - | - |
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.0.1
- Transformers: 4.41.2
- PyTorch: 2.3.0+cu121
- Accelerate: 0.31.0
- Datasets: 2.20.0
- Tokenizers: 0.19.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
CoSENTLoss
@online{kexuefm-8847,
title={CoSENT: A more efficient sentence vector scheme than Sentence-BERT},
author={Su Jianlin},
year={2022},
month={Jan},
url={https://kexue.fm/archives/8847},
}
GISTEmbedLoss
@misc{solatorio2024gistembed,
title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning},
author={Aivin V. Solatorio},
year={2024},
eprint={2402.16829},
archivePrefix={arXiv},
primaryClass={cs.LG}
}