Daxtra's picture
Add new SentenceTransformer model
0dc49b0 verified
metadata
base_model: sentence-transformers/all-MiniLM-L6-v2
library_name: sentence-transformers
metrics:
  - cosine_accuracy@10
  - cosine_precision@10
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@10
pipeline_tag: sentence-similarity
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:149352
  - loss:MultipleNegativesRankingLoss
widget:
  - source_sentence: >-
      - Digital PR Specialist, requiring 2-4 years of experience in digital PR,
      preferably from an agency or startup environment.

      - Craft and execute digital PR campaigns aligned with SEO goals and brand
      messaging.

      - Develop and pitch newsworthy content to media outlets.

      - Lead client calls and utilize digital marketing analytics tools for
      campaign optimization.

      - Maintain up-to-date knowledge of industry trends and competitive
      landscape.

      - Proficient in SEO, digital marketing analytics (e.g., Google Analytics,
      Ahrefs, SEMrush).

      - Expertise with media databases and PR tools, confident in client
      interactions.

      - Remote work experience essential.

      - Ability to engage with large brand clients in Europe and the US.

      - Valid work permit to operate in the UK.
    sentences:
      - >-
        - Digital Marketing and Social Media Specialist with 8+ years of
        experience managing digital marketing campaigns and social media
        accounts for the automotive industry.

        - Successfully increased online visibility and customer engagement for
        VW, Ford, Fiat, and ISUZU franchises through SEO, content creation, and
        social media strategy.

        - Developed and executed digital marketing plans, enhanced online
        reputation, and managed multiple online platforms for customer
        engagement.

        - Expertise in Google Analytics, SEMrush, Moz, Canva Content Planner,
        and Zoho CRM for targeted campaigns and customer relationship
        management.

        - Strong skills in content production, email marketing, website
        optimization, and digital campaign analysis.
      - >-
        - Data and Business Analyst with expertise in SQL, Python, and Excel,
        seeking a Graduate Finance Analyst role.

        - Developed data querying, manipulation, and database management skills
        using SQL; enhanced analysis with Python and Power BI.

        - Achieved 20% improvement in event safety and addressed safety concerns
        proactively.

        - Strong problem-solving, time management, and team collaboration
        skills.

        - Holds a Bachelor of Science in Finance with Mathematics, demonstrating
        analytical and financial expertise.

        - Proficient in data analysis, financial modelling, and business
        operations; seeks to apply skills in finance.
      - >-
        - Digital Marketing Executive with extensive experience in rebranding
        social media and websites for clients to enhance online presence.

        - Successfully created content calendars and blogs for client websites,
        and managed social media campaigns on platforms like TikTok and
        Instagram.

        - Bachelor of Arts in Business and Management with a focus on ethics,
        finance, and sociology.

        - Strong skills in verbal communication, teamwork, and problem-solving
        under pressure.

        - Experience in COVID testing site operations, including health and
        hazard training, and maintaining a high standard of detail and
        initiative.

        - Proficient in Microsoft Excel, PowerPoint, and Word; skilled in
        analytics, market research, and communication.
  - source_sentence: >-
      - Residential and Commercial Roofers needed for tear-off, roofing
      installation, and debris removal.

      - Responsibilities include installation of underlayment and roofing
      systems, loading and unloading materials, and cleanup.

      - Preferred: Valid Driver's License; Preferred experience in roofing,
      entry-level with construction experience.

      - Basic knowledge of tools required; ability to read tape measures.

      - Must meet at the shop and drive in company trucks to job sites.
    sentences:
      - >-
        - Experienced Registered Nurse with a strong background in both
        inpatient and outpatient settings.

        - Currently providing comprehensive care at Water Gap Wellness Treatment
        Center, focusing on patients with psychiatric conditions.

        - Skills include monitoring patient wellbeing, medication
        administration, treatment planning, and collaborating with healthcare
        professionals.

        - Proficient in patient behavior management, team collaboration, and
        creating high-quality care plans.

        - Equipped with Basic Life Support Certification, CPR Certification, and
        Immunization Certification.

        - Holds a High School Diploma and Registered Nurse certification,
        specializing in RN roles.

        - Capable of driving, managing schedules, and delivering emotional
        support.

        - Team player with excellent attention to detail, capable of
        multi-tasking, and working independently.
      - >-
        - Senior Software Engineer with over 10 years of experience in
        full-stack development and a passion for ReactJS and cutting-edge
        technologies.

        - Currently a Senior Backend Engineer specializing in AWS and Azure
        environments, using tools like AWS Fargate, CloudWatch, and Cloud
        Formation.

        - Experienced in managing web applications, configuration, and databases
        (e.g., MongoDB, MySQL, PostgreSQL).

        - Skilled in automation, performance improvement, and scalability,
        achieving a 30% reduction in infrastructure costs.

        - Proficient in Python, JavaScript, ReactJS, React Native, HTML5, CSS,
        Python Programming, and AWS services.

        - Notable expertise in containerization, Docker, and cloud
        infrastructure management.

        - Holds Bachelor's degrees in Computer Science and Computer Engineering.
      - >-
        - Caregiver with extensive experience in patient care, including
        Alzheimer's disease and autism, and experience in various industries.

        - Certified Home Health Aide, Registered Dental Assistant, and trained
        in First Aid, CPR, and Basic Life Support.

        - Proficient in handling EMR, Foodservice, and Hospitality Industry,
        along with janitorial and patient monitoring.

        - Skilled in communication, quick learning, and handling vital signs;
        experience with dental and dietary departments.

        - Possesses forklift, safety, and other certifications relevant to
        caregiving and healthcare environments.
  - source_sentence: >-
      - Legal Assistant role for candidates with legal experience or schooling,
      ideal for growth opportunities.

      - Act as primary contact for clients and manage a busy phone system.

      - Organize client documents, data entry, and handle courier packages.

      - Requires Criminal Justice degree or paralegal degree; Criminal Justice
      or Legal Schooling preferred.

      - Must be computer savvy, open to learning new programs for the legal
      industry.

      - Preferred bilingual (Spanish).

      - Reception or Office Assistant experience is a plus.
    sentences:
      - >-
        - UI / Frontend Developer with a strong background in creating digital
        frontends and web-based user interfaces for pharmaceutical companies.

        - Successfully developed company websites, custom HTML emails, and
        web-based learning modules with WCAG Level A compliance.

        - Led web accessibility testing and QA, ensuring usability and
        functionality.

        - Collaborated with designers to convert designs into digital frontends
        and conducted development review processes (Alpha, Beta).

        - Enhanced website SEO and user retention by 30% and managed
        client-facing projects, meeting deadlines.

        - Holds a Bachelor of Science in Information Technology and a UI
        Developer certification.

        - Proficient in HTML, CSS, JavaScript, and MJML; experienced with CSS3,
        TypeScript, React, and Git.
      - >-
        - Service Project Manager with experience in mechanical engineering and
        project management roles.

        - Currently a Project Engineer, specializing in project coordination and
        management.

        - Proficient in Microsoft Excel, PowerPoint, and Office Suite tools.

        - Holds Bachelor of Science in Mechanical Engineering.

        - Previous roles include Service Sales Engineer, Mechanical Design
        Engineer, and Project Engineer Intern.

        - Strong project management skills and experience in team coordination.
      - >-
        - Experienced Sales Representative specializing in inside and outside
        sales, including cold calling and field visits.

        - Currently assisting in construction as a Creative Homeworks
        contractor, handling site preparation, equipment maintenance, problem
        management, and basic plumbing and electrical work.

        - Skilled in project management and handling specifications.

        - Previous experience includes contractor assistance and inside/outside
        sales roles.

        - Proficient in sales techniques and customer relationship management.

        - Holds certifications in various construction and sales-related areas.
  - source_sentence: >-
      - Program Manager (Senior) for CMS Star Ratings, with 5+ years of relevant
      professional experience in healthcare, preferred with a degree in
      business, nursing, or public health.

      - Lead design, development, implementation, and evaluation of CMS Star
      Ratings programs to meet organizational and market requirements.

      - Collaborate with cross-functional teams to advance strategic priorities
      and manage programs with tight deadlines.

      - Responsible for resolving complex business issues, managing change, and
      liaising between departments.

      - Requires strong knowledge of CMS Star Ratings measures and experience
      with the Medicare population.

      - Essential skills: organizational relationship management, critical
      thinking, problem-solving, and effective communication.

      - Must be adaptable, self-motivated, and capable of working under pressure
      in a dynamic environment.
    sentences:
      - >-
        - Project Coordinator with strong business management and military
        background, skilled in data analytics, scheduling, and leadership.

        - Currently coordinating projects for operational and capital projects
        at Seattle Children's Hospital, managing stakeholders and ensuring
        timely execution.

        - Expertise in Microsoft Office and various other systems like GTIMS and
        Smartsheet; proficient in vendor management and scheduling.

        - Previous roles include operations scheduling manager and vendor
        manager, with experience in large-scale military exercises and
        international projects.

        - Skilled in communication, critical thinking, multi-tasking, and
        negotiation, with a keen eye for detail and strong analytical abilities.
      - >-
        - Legal Nurse Consultant with experience in medical analysis and
        consultation for legal teams, specializing in healthcare-related legal
        strategies.

        - Led Legal Nurse roles at Kaiser Permanente and Kaiser Permanente,
        managing quality improvement initiatives and educating families.

        - Served as a Clinical Nurse II, providing direct care in Neonatal
        Intensive Care and Urgent Care, and maintaining medical records.

        - Holds a Master of Science in Health Care Administration and a Bachelor
        of Science in Nursing from San Diego State University.

        - Certified Six Sigma Green Belt, skilled in clinical data analysis and
        patient care planning.

        - Proficient in electronic medical records and patient safety, with
        expertise in critical care nursing and legal consultation.
      - >-
        - SEO Manager with 7 years of experience, specializing in traffic growth
        and content strategy.

        - Achieved a 45% increase in web traffic through dynamic keyword
        insertion and SEO management.

        - Managed freelancers for technical blog content, creating and
        overseeing a content marketing strategy.

        - Efficiently managed social media accounts, increasing engagement by
        200% over three months.

        - Skilled in database management, A/B testing, analytics, and SEO tools
        like Google Analytics, Google AdWords, and Google Ads.

        - Holds a Bachelor's Degree in Classical Archaeology; strong in team
        management and content development.

        - Proficient in Microsoft Excel, Word, Outlook, and PowerPoint.

        - Strong analytical, communication, and multichannel marketing skills.
  - source_sentence: >-
      - Data Management Specialist (Junior/Intermediate), requiring a Bachelor's
      degree in computer science, information systems, or related field.

      - Must have a solid understanding of web- and app-based platforms.

      - Required experience with SQL, Data warehousing, and Tableau.

      - Ability to analyze, interpret, and organize large data sets.

      - Proficiency in Microsoft Suite, G-Suite, Slack, Zoom, ZenDesk, and
      Monday.com is essential.
    sentences:
      - >-
        - Experienced Technical Project Manager with 12 years in IT
        infrastructure and project management.

        - Expertise in IT project planning, budgeting, resource allocation, and
        stakeholder engagement.

        - Led successful IT infrastructure upgrades, including EFT and Manta
        application migrations to AWS.

        - Managed cloud and on-premise infrastructure projects, ensuring
        compliance and minimal disruption.

        - Holds PMP certification and proficiency in Agile and Waterfall
        methodologies.

        - Master's in Business Administration with a focus on Marketing and User
        Experience.

        - Strong analytical skills, stakeholder management, and continuous
        improvement practices.

        - Proficient in Microsoft Office, Windows Server, and Azure, with
        experience in networking, security, and VMware.
      - >-
        - Leadership roles in healthcare and pharmaceuticals with extensive
        experience in contract negotiation and site management.

        - Parexel Site Contract Leader: Lead global CSA strategy development,
        budget creation, and legal drafting, ensuring compliance with ICH-GCP
        and local regulations.

        - Drafts, reviews, and finalizes contracts with study sites, managing
        budgets and negotiations, and maintaining quality standards.

        - Expertise in financial risk assessment, budgeting, and regulatory
        compliance with a Bachelor's in Criminal Justice.

        - Proficient in Microsoft Excel, PowerPoint, and Salesforce.com, with a
        background in biotechnology, clinical trials, and patient safety.
      - >-
        - Financial professional with extensive experience in member service and
        trading, currently a Member Representative focusing on customer service
        and banking transactions.

        - Expertise in handling ATMs, deposit envelopes, and savings accounts;
        ensures member data security and efficient payment processing.

        - Proficient in making outbound calls for sales and inquiry, building
        relationships with customers and banking managers.

        - Experienced in trading major currencies and assets with a strong
        foundation in technical and fundamental analysis.

        - Holds a Masters in Business Administration and proficient in Microsoft
        Excel, Python, and social media platforms.

        - Fluent in English and French, with skills in communication, content
        writing, and project planning.
model-index:
  - name: SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: vac res matcher
          type: vac-res-matcher
        metrics:
          - type: cosine_accuracy@10
            value: 0.3696705426356589
            name: Cosine Accuracy@10
          - type: cosine_precision@10
            value: 0.07374031007751937
            name: Cosine Precision@10
          - type: cosine_recall@10
            value: 0.10916001476122919
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.1177083437550142
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.19530038759689897
            name: Cosine Mrr@10
          - type: cosine_map@10
            value: 0.07133683851447556
            name: Cosine Map@10

SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2

This is a sentence-transformers model finetuned from sentence-transformers/all-MiniLM-L6-v2. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: sentence-transformers/all-MiniLM-L6-v2
  • Maximum Sequence Length: 128 tokens
  • Output Dimensionality: 384 tokens
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("Daxtra/sbert-summaries-minilm-24-batch")
# Run inference
sentences = [
    "- Data Management Specialist (Junior/Intermediate), requiring a Bachelor's degree in computer science, information systems, or related field.\n- Must have a solid understanding of web- and app-based platforms.\n- Required experience with SQL, Data warehousing, and Tableau.\n- Ability to analyze, interpret, and organize large data sets.\n- Proficiency in Microsoft Suite, G-Suite, Slack, Zoom, ZenDesk, and Monday.com is essential.",
    "- Leadership roles in healthcare and pharmaceuticals with extensive experience in contract negotiation and site management.\n- Parexel Site Contract Leader: Lead global CSA strategy development, budget creation, and legal drafting, ensuring compliance with ICH-GCP and local regulations.\n- Drafts, reviews, and finalizes contracts with study sites, managing budgets and negotiations, and maintaining quality standards.\n- Expertise in financial risk assessment, budgeting, and regulatory compliance with a Bachelor's in Criminal Justice.\n- Proficient in Microsoft Excel, PowerPoint, and Salesforce.com, with a background in biotechnology, clinical trials, and patient safety.",
    "- Experienced Technical Project Manager with 12 years in IT infrastructure and project management.\n- Expertise in IT project planning, budgeting, resource allocation, and stakeholder engagement.\n- Led successful IT infrastructure upgrades, including EFT and Manta application migrations to AWS.\n- Managed cloud and on-premise infrastructure projects, ensuring compliance and minimal disruption.\n- Holds PMP certification and proficiency in Agile and Waterfall methodologies.\n- Master's in Business Administration with a focus on Marketing and User Experience.\n- Strong analytical skills, stakeholder management, and continuous improvement practices.\n- Proficient in Microsoft Office, Windows Server, and Azure, with experience in networking, security, and VMware.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@10 0.3697
cosine_precision@10 0.0737
cosine_recall@10 0.1092
cosine_ndcg@10 0.1177
cosine_mrr@10 0.1953
cosine_map@10 0.0713

Training Details

Training Dataset

Unnamed Dataset

  • Size: 149,352 training samples
  • Columns: sentence_0 and sentence_1
  • Approximate statistics based on the first 1000 samples:
    sentence_0 sentence_1
    type string string
    details
    • min: 53 tokens
    • mean: 115.57 tokens
    • max: 128 tokens
    • min: 51 tokens
    • mean: 119.82 tokens
    • max: 128 tokens
  • Samples:
    sentence_0 sentence_1
    - Solutions Architect/Snowflake position requiring 8+ years of experience in data management and implementation of Snowflake solutions.
    - Responsibilities include developing data models, optimizing data pipelines, and integrating with AWS/Azure PBAS.
    - Must possess deep knowledge of relational and NoSQL databases, SQL, and Unix Shell/Python Scripting.
    - Experience with data security, database optimization, and Snowflake's Resource Monitors.
    - Required: Bachelor's or Master's degree in Computer Science or related field.
    - Preferred: Management consulting experience, expertise in AI use cases, and experience with Cloud technologies.
    - Skills: Strong leadership, team management, and experience with globally distributed teams.
    - Senior IT Consultant with 30 years of experience, specializing in enterprise architecture and solution architecture.
    - Currently a Senior Consultant, with significant experience in architecture roles and delivering solutions for mission-critical projects.
    - Expertise in Enterprise Architecture, Enterprise Consortium management, and IT Governance.
    - Leads projects ranging from $100M to $900M, focusing on architecture for business, data, and applications.
    - Designed enterprise-wide architecture models and created strategies for product development based on product architecture.
    - Skilled in modern technologies like Spring Framework, Kafka, Spring Cloud, and Docker, and in containerization with Kubernetes.
    - Holds Masters Degrees in Computer Science and Law, with a Bachelor's in Computer Sciences.
    - Global Head of Business Development - Financial Education, with a focus on rapid adaptation and leadership in Hong Kong.
    - Responsible for identifying leads, building pipelines, and converting prospects into clients, and managing international teams.
    - Implement relationship-based sales practices, nurture industry relationships, and lead global expansion efforts.
    - Requires a proven track record in financial services business development and experience in sales with tech, media, or administration.
    - Strong networking and relationship-building skills, leadership abilities across cultures, and excellent communication and negotiation skills.
    - Must have the right to work in the United Kingdom.
    - Management Accountant with extensive experience in financial planning and analysis, specializing in the banking sector.
    - Leads financial planning and analysis roles, overseeing budget processes, regulatory reporting, and financial performance analysis.
    - Streamlined processes to reduce payment processing time by 15% and minimize penalties.
    - Led quarterly and year-end audits, ensuring accurate financial audits and implementing improvement suggestions.
    - Proficient in financial reporting, budgeting, forecasting, and variance analysis; adept at using Microsoft Excel and QuickBooks.
    - Strong communication and team management skills, with expertise in process improvement and strategic initiatives.
    - Customer Service Administrator position requiring experience in customer service or administration.
    - Key responsibilities include managing customer feedback, coordinating deliveries, processing refunds, and general office support.
    - Must possess strong communication, problem-solving, and attention to detail skills.
    - Proficiency in MS Office and CRM software required.
    - Ability to multitask and prioritize workload effectively.
    - Must have the right to work in the United Kingdom.
    - Experienced Technical Services Coordinator with a strong background in administration and team collaboration, seeking to advance in London.
    - Manages UK technical services, handling device delivery, fault reporting, and resource planning.
    - Expert in Microsoft Office, maintaining databases and customer service through communication and negotiation.
    - Holds NVQ Level 2 and 3, with skills in GDPR compliance and financial transactions.
    - Proven experience in collecting evidence for claims, negotiating settlements, and managing key performance indicators.
    - Strong communication, organizational, and interpersonal skills, with proficiency in using initiative and effective negotiation.
    - Holds GCSE qualifications and relevant experience in customer service roles.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 24
  • per_device_eval_batch_size: 24
  • num_train_epochs: 1
  • multi_dataset_batch_sampler: round_robin

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 24
  • per_device_eval_batch_size: 24
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1
  • num_train_epochs: 1
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.0
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • eval_use_gather_object: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: round_robin

Training Logs

Epoch Step Training Loss vac-res-matcher_cosine_map@10
0.0803 500 1.5369 -
0.1000 622 - 0.0697
0.1607 1000 1.2768 -
0.1999 1244 - 0.0692
0.2410 1500 1.2 -
0.2999 1866 - 0.0673
0.3214 2000 1.1463 -
0.3998 2488 - 0.0705
0.4017 2500 1.1206 -
0.4821 3000 1.1043 -
0.4998 3110 - 0.0683
0.5624 3500 1.0768 -
0.5997 3732 - 0.0700
0.6428 4000 1.0905 -
0.6997 4354 - 0.0705
0.7231 4500 1.0804 -
0.7996 4976 - 0.0699
0.8035 5000 1.0536 -
0.8838 5500 1.0352 -
0.8996 5598 - 0.0715
0.9642 6000 1.0292 -
0.9995 6220 - 0.0713
1.0 6223 - 0.0713

Framework Versions

  • Python: 3.10.12
  • Sentence Transformers: 3.2.1
  • Transformers: 4.44.2
  • PyTorch: 2.4.1+cu121
  • Accelerate: 0.34.2
  • Datasets: 3.0.1
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}