Add SetFit model

Browse files

Files changed (13) hide show

1_Pooling/config.json +7 -0
README.md +464 -3
config.json +29 -0
config_sentence_transformers.json +7 -0
config_setfit.json +7 -0
model_head.pkl +3 -0
modules.json +14 -0
pytorch_model.bin +3 -0
sentence_bert_config.json +4 -0
special_tokens_map.json +9 -0
tokenizer.json +0 -0
tokenizer_config.json +24 -0
vocab.txt +0 -0

1_Pooling/config.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "word_embedding_dimension": 768,
+  "pooling_mode_cls_token": false,
+  "pooling_mode_mean_tokens": true,
+  "pooling_mode_max_tokens": false,
+  "pooling_mode_mean_sqrt_len_tokens": false
+}

README.md CHANGED Viewed

@@ -1,3 +1,464 @@
----
-license: apache-2.0
----

+---
+base_model: jhgan/ko-sroberta-multitask
+library_name: setfit
+metrics:
+- accuracy
+pipeline_tag: text-classification
+tags:
+- setfit
+- sentence-transformers
+- text-classification
+- generated_from_setfit_trainer
+widget:
+- text: 기업이 파산 신청을 할 때 채무자의 주된 책임 범위는 어떠한가요?
+- text: 무선 전력 전송 기술을 이용한 스마트 가전기기를 설계 중이야. 이와 동일한 연구나 특허가 있는지 알아봐줘
+- text: 회사 합병 시 소액주주의 권리 보호 방안은 어떤 방식으로 이루어지나요?
+- text: 화재 예방 시스템 설계에 대한 연구를 수행하고 있어. 기존 연구에서 이와 관련된 유사한 시스템 설계도나 논문이 있는지 찾고 싶어
+- text: 블랙홀 정보 역설에 대해 설명한 논문의 핵심 포인트를 짧게 집약해 줄래?
+inference: true
+model-index:
+- name: SetFit with jhgan/ko-sroberta-multitask
+  results:
+  - task:
+      type: text-classification
+      name: Text Classification
+    dataset:
+      name: Unknown
+      type: unknown
+      split: test
+    metrics:
+    - type: accuracy
+      value: 0.9558823529411765
+      name: Accuracy
+---
+# SetFit with jhgan/ko-sroberta-multitask
+This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [jhgan/ko-sroberta-multitask](https://huggingface.co/jhgan/ko-sroberta-multitask) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
+The model has been trained using an efficient few-shot learning technique that involves:
+1. Fine-tuning a [Sentence Transformer](https://www.sbert.net) with contrastive learning.
+2. Training a classification head with features from the fine-tuned Sentence Transformer.
+## Model Details
+### Model Description
+- **Model Type:** SetFit
+- **Sentence Transformer body:** [jhgan/ko-sroberta-multitask](https://huggingface.co/jhgan/ko-sroberta-multitask)
+- **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
+- **Maximum Sequence Length:** 128 tokens
+- **Number of Classes:** 5 classes
+<!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
+<!-- - **Language:** Unknown -->
+<!-- - **License:** Unknown -->
+### Model Sources
+- **Repository:** [SetFit on GitHub](https://github.com/huggingface/setfit)
+- **Paper:** [Efficient Few-Shot Learning Without Prompts](https://arxiv.org/abs/2209.11055)
+- **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
+### Model Labels
+| Label      | Examples                                                                                                                                                                                                                                                          |
+|:-----------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| 오탈자 탐지     | <ul><li>'경영 보고서 내용에 대한 오탈자를 검토하고 수정해 드릴 수 있을까요?'</li><li>'경영 보고서에 포함된 오탈자를 잡아 줄 수 있나요?'</li><li>'경쟁사 분석 항목 내 문장 구성의 오류를 지적해주겠습니까?'</li></ul>                                                                                                                      |
+| 요약         | <ul><li>'(특정 특허번호)를 기반으로 한 발명의 전체적인 개념을 짧게 설명 부탁드립니다.'</li><li>'1장의 데이터 수집 기술에 대해 요약해주세요'</li><li>'2022년 경제 성장 동향에 관한 문서의 두 번째 챕터를 축약해 주세요.'</li></ul>                                                                                                            |
+| 유사문서       | <ul><li>'5G 통신 모듈 최적화에 관련된 프로젝트를 하고 있는데, 비슷한 내용의 프로젝트나 논문이 있는지 연결해서 말해줄래?'</li><li>'5nm 공정을 이용한 반도체 제조 방법에 대해 작성하고 있어. 이와 연관 있는 보고서나 논문이 있으면 찾아주세요'</li><li>'AI 기반 헬스케어 솔루션 개발에 관한 문헌 조사를 하고 있습니다. 와 같은 주제를 다룬 문서를 찾아줄 수 있을까요?'</li></ul>                         |
+| 중복성 검토     | <ul><li>'5G 네트워크 최적화 기술을 연구 중입니다. 기존 연구와 어떤 부분이 중복되는지, 중복의 이유를 명확히 설명해줄 수 있나요?'</li><li>'감정 노동자의 복지 증진 방안을 찾고 있어요. 이와 동일한 주제로 진행된 다른 프로젝트가 있었는지 알려주실래요? 그리고 그 이유도 설명해주세요.'</li><li>'건물의 내진 설계 강화 방안을 조사하고 있는데 이에 연관된 기존 프로젝트가 무엇이 있는지 그리고 왜 겹치는지 말해줄래?'</li></ul> |
+| 특화 지식정보 제공 | <ul><li>'3D 금속 배선 기술(HBM, TSV)의 도입으로 인한 전력 소비 감소 방안에는 어떤 것이 있는가요?'</li><li>'AI 워크로드를 처리하기 위한 반도체 아키텍처 설계에서는 어떤 전략들이 사용되나요?'</li><li>'B2B 마케팅에서 특히 효과적인 콘텐츠 형식이나 채널은 어떤 것이 많아요?'</li></ul>                                                                         |
+## Evaluation
+### Metrics
+| Label   | Accuracy |
+|:--------|:---------|
+| **all** | 0.9559   |
+## Uses
+### Direct Use for Inference
+First install the SetFit library:
+```bash
+pip install setfit
+```
+Then you can load this model and run inference.
+```python
+from setfit import SetFitModel
+# Download from the 🤗 Hub
+model = SetFitModel.from_pretrained("NTIS/kepri-embedding")
+# Run inference
+preds = model("기업이 파산 신청을 할 때 채무자의 주된 책임 범위는 어떠한가요?")
+```
+<!--
+### Downstream Use
+*List how someone could finetune this model on their own dataset.*
+-->
+<!--
+### Out-of-Scope Use
+*List how the model may foreseeably be misused and address what users ought not to do with the model.*
+-->
+<!--
+## Bias, Risks and Limitations
+*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
+-->
+<!--
+### Recommendations
+*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
+-->
+## Training Details
+### Training Set Metrics
+| Training set | Min | Median  | Max |
+|:-------------|:----|:--------|:----|
+| Word count   | 6   | 12.4219 | 27  |
+| Label   | Training Sample Count |
+|:--------|:----------------------|
+| rag     | 0                     |
+| general | 0                     |
+### Training Hyperparameters
+- batch_size: (64, 64)
+- num_epochs: (4, 4)
+- max_steps: -1
+- sampling_strategy: oversampling
+- body_learning_rate: (2e-05, 1e-05)
+- head_learning_rate: 0.01
+- loss: CosineSimilarityLoss
+- distance_metric: cosine_distance
+- margin: 0.25
+- end_to_end: False
+- use_amp: False
+- warmup_proportion: 0.1
+- seed: 42
+- eval_max_steps: -1
+- load_best_model_at_end: True
+### Training Results
+| Epoch   | Step      | Training Loss | Validation Loss |
+|:-------:|:---------:|:-------------:|:---------------:|
+| 0.0003  | 1         | 0.1889        | -               |
+| 0.0153  | 50        | 0.1818        | -               |
+| 0.0306  | 100       | 0.1421        | -               |
+| 0.0459  | 150       | 0.0582        | -               |
+| 0.0612  | 200       | 0.0299        | -               |
+| 0.0765  | 250       | 0.0093        | -               |
+| 0.0918  | 300       | 0.0036        | -               |
+| 0.1071  | 350       | 0.001         | -               |
+| 0.1224  | 400       | 0.0012        | -               |
+| 0.1377  | 450       | 0.0006        | -               |
+| 0.1530  | 500       | 0.0006        | -               |
+| 0.1683  | 550       | 0.0003        | -               |
+| 0.1836  | 600       | 0.0003        | -               |
+| 0.1989  | 650       | 0.0003        | -               |
+| 0.2142  | 700       | 0.0002        | -               |
+| 0.2295  | 750       | 0.0002        | -               |
+| 0.2448  | 800       | 0.0002        | -               |
+| 0.2601  | 850       | 0.0001        | -               |
+| 0.2754  | 900       | 0.0001        | -               |
+| 0.2907  | 950       | 0.0001        | -               |
+| 0.3060  | 1000      | 0.0001        | -               |
+| 0.3213  | 1050      | 0.0001        | -               |
+| 0.3366  | 1100      | 0.0001        | -               |
+| 0.3519  | 1150      | 0.0001        | -               |
+| 0.3672  | 1200      | 0.0001        | -               |
+| 0.3825  | 1250      | 0.0001        | -               |
+| 0.3978  | 1300      | 0.0001        | -               |
+| 0.4131  | 1350      | 0.0001        | -               |
+| 0.4284  | 1400      | 0.0001        | -               |
+| 0.4437  | 1450      | 0.0001        | -               |
+| 0.4590  | 1500      | 0.0           | -               |
+| 0.4743  | 1550      | 0.0001        | -               |
+| 0.4896  | 1600      | 0.0           | -               |
+| 0.5049  | 1650      | 0.0001        | -               |
+| 0.5202  | 1700      | 0.0           | -               |
+| 0.5355  | 1750      | 0.0           | -               |
+| 0.5508  | 1800      | 0.0           | -               |
+| 0.5661  | 1850      | 0.0           | -               |
+| 0.5814  | 1900      | 0.0           | -               |
+| 0.5967  | 1950      | 0.0           | -               |
+| 0.6120  | 2000      | 0.0           | -               |
+| 0.6273  | 2050      | 0.0           | -               |
+| 0.6426  | 2100      | 0.0           | -               |
+| 0.6579  | 2150      | 0.0           | -               |
+| 0.6732  | 2200      | 0.0           | -               |
+| 0.6885  | 2250      | 0.0           | -               |
+| 0.7038  | 2300      | 0.0           | -               |
+| 0.7191  | 2350      | 0.0           | -               |
+| 0.7344  | 2400      | 0.0           | -               |
+| 0.7497  | 2450      | 0.0           | -               |
+| 0.7650  | 2500      | 0.0           | -               |
+| 0.7803  | 2550      | 0.0           | -               |
+| 0.7956  | 2600      | 0.0           | -               |
+| 0.8109  | 2650      | 0.0           | -               |
+| 0.8262  | 2700      | 0.0           | -               |
+| 0.8415  | 2750      | 0.0           | -               |
+| 0.8568  | 2800      | 0.0           | -               |
+| 0.8721  | 2850      | 0.0           | -               |
+| 0.8874  | 2900      | 0.0           | -               |
+| 0.9027  | 2950      | 0.0           | -               |
+| 0.9180  | 3000      | 0.0           | -               |
+| 0.9333  | 3050      | 0.0           | -               |
+| 0.9486  | 3100      | 0.0           | -               |
+| 0.9639  | 3150      | 0.0           | -               |
+| 0.9792  | 3200      | 0.0           | -               |
+| 0.9945  | 3250      | 0.0           | -               |
+| 1.0     | 3268      | -             | 0.0497          |
+| 1.0098  | 3300      | 0.0           | -               |
+| 1.0251  | 3350      | 0.0           | -               |
+| 1.0404  | 3400      | 0.0           | -               |
+| 1.0557  | 3450      | 0.0           | -               |
+| 1.0710  | 3500      | 0.0           | -               |
+| 1.0863  | 3550      | 0.0           | -               |
+| 1.1016  | 3600      | 0.0           | -               |
+| 1.1169  | 3650      | 0.0           | -               |
+| 1.1322  | 3700      | 0.0           | -               |
+| 1.1475  | 3750      | 0.0           | -               |
+| 1.1628  | 3800      | 0.0           | -               |
+| 1.1781  | 3850      | 0.0           | -               |
+| 1.1934  | 3900      | 0.0           | -               |
+| 1.2087  | 3950      | 0.0           | -               |
+| 1.2240  | 4000      | 0.0           | -               |
+| 1.2393  | 4050      | 0.0           | -               |
+| 1.2546  | 4100      | 0.0           | -               |
+| 1.2699  | 4150      | 0.0           | -               |
+| 1.2852  | 4200      | 0.0           | -               |
+| 1.3005  | 4250      | 0.0           | -               |
+| 1.3158  | 4300      | 0.0           | -               |
+| 1.3311  | 4350      | 0.0           | -               |
+| 1.3464  | 4400      | 0.0           | -               |
+| 1.3617  | 4450      | 0.0           | -               |
+| 1.3770  | 4500      | 0.0           | -               |
+| 1.3923  | 4550      | 0.0           | -               |
+| 1.4076  | 4600      | 0.0           | -               |
+| 1.4229  | 4650      | 0.0           | -               |
+| 1.4382  | 4700      | 0.0           | -               |
+| 1.4535  | 4750      | 0.0           | -               |
+| 1.4688  | 4800      | 0.0           | -               |
+| 1.4841  | 4850      | 0.0           | -               |
+| 1.4994  | 4900      | 0.0           | -               |
+| 1.5147  | 4950      | 0.0           | -               |
+| 1.5300  | 5000      | 0.0           | -               |
+| 1.5453  | 5050      | 0.0           | -               |
+| 1.5606  | 5100      | 0.0           | -               |
+| 1.5759  | 5150      | 0.0           | -               |
+| 1.5912  | 5200      | 0.0           | -               |
+| 1.6065  | 5250      | 0.0           | -               |
+| 1.6218  | 5300      | 0.0           | -               |
+| 1.6371  | 5350      | 0.0           | -               |
+| 1.6524  | 5400      | 0.0           | -               |
+| 1.6677  | 5450      | 0.0           | -               |
+| 1.6830  | 5500      | 0.0           | -               |
+| 1.6983  | 5550      | 0.0           | -               |
+| 1.7136  | 5600      | 0.0           | -               |
+| 1.7289  | 5650      | 0.0           | -               |
+| 1.7442  | 5700      | 0.0           | -               |
+| 1.7595  | 5750      | 0.0           | -               |
+| 1.7748  | 5800      | 0.0           | -               |
+| 1.7901  | 5850      | 0.0           | -               |
+| 1.8054  | 5900      | 0.0           | -               |
+| 1.8207  | 5950      | 0.0           | -               |
+| 1.8360  | 6000      | 0.0           | -               |
+| 1.8513  | 6050      | 0.0           | -               |
+| 1.8666  | 6100      | 0.0           | -               |
+| 1.8819  | 6150      | 0.0           | -               |
+| 1.8972  | 6200      | 0.0           | -               |
+| 1.9125  | 6250      | 0.0           | -               |
+| 1.9278  | 6300      | 0.0           | -               |
+| 1.9431  | 6350      | 0.0           | -               |
+| 1.9584  | 6400      | 0.0           | -               |
+| 1.9737  | 6450      | 0.0           | -               |
+| 1.9890  | 6500      | 0.0           | -               |
+| 2.0     | 6536      | -             | 0.056           |
+| 2.0043  | 6550      | 0.0           | -               |
+| 2.0196  | 6600      | 0.0           | -               |
+| 2.0349  | 6650      | 0.0           | -               |
+| 2.0502  | 6700      | 0.0           | -               |
+| 2.0655  | 6750      | 0.0           | -               |
+| 2.0808  | 6800      | 0.0           | -               |
+| 2.0961  | 6850      | 0.0           | -               |
+| 2.1114  | 6900      | 0.0           | -               |
+| 2.1267  | 6950      | 0.0           | -               |
+| 2.1420  | 7000      | 0.0           | -               |
+| 2.1573  | 7050      | 0.0           | -               |
+| 2.1726  | 7100      | 0.0           | -               |
+| 2.1879  | 7150      | 0.0           | -               |
+| 2.2032  | 7200      | 0.0           | -               |
+| 2.2185  | 7250      | 0.0           | -               |
+| 2.2338  | 7300      | 0.0           | -               |
+| 2.2491  | 7350      | 0.0           | -               |
+| 2.2644  | 7400      | 0.0           | -               |
+| 2.2797  | 7450      | 0.0           | -               |
+| 2.2950  | 7500      | 0.0           | -               |
+| 2.3103  | 7550      | 0.0           | -               |
+| 2.3256  | 7600      | 0.0           | -               |
+| 2.3409  | 7650      | 0.0           | -               |
+| 2.3562  | 7700      | 0.0           | -               |
+| 2.3715  | 7750      | 0.0           | -               |
+| 2.3868  | 7800      | 0.0           | -               |
+| 2.4021  | 7850      | 0.0           | -               |
+| 2.4174  | 7900      | 0.0           | -               |
+| 2.4327  | 7950      | 0.0           | -               |
+| 2.4480  | 8000      | 0.0           | -               |
+| 2.4633  | 8050      | 0.0           | -               |
+| 2.4786  | 8100      | 0.0           | -               |
+| 2.4939  | 8150      | 0.0           | -               |
+| 2.5092  | 8200      | 0.0           | -               |
+| 2.5245  | 8250      | 0.0           | -               |
+| 2.5398  | 8300      | 0.0           | -               |
+| 2.5551  | 8350      | 0.0           | -               |
+| 2.5704  | 8400      | 0.0           | -               |
+| 2.5857  | 8450      | 0.0           | -               |
+| 2.6010  | 8500      | 0.0           | -               |
+| 2.6163  | 8550      | 0.0           | -               |
+| 2.6316  | 8600      | 0.0           | -               |
+| 2.6469  | 8650      | 0.0           | -               |
+| 2.6622  | 8700      | 0.0           | -               |
+| 2.6775  | 8750      | 0.0           | -               |
+| 2.6928  | 8800      | 0.0           | -               |
+| 2.7081  | 8850      | 0.0           | -               |
+| 2.7234  | 8900      | 0.0           | -               |
+| 2.7387  | 8950      | 0.0           | -               |
+| 2.7540  | 9000      | 0.0           | -               |
+| 2.7693  | 9050      | 0.0           | -               |
+| 2.7846  | 9100      | 0.0           | -               |
+| 2.7999  | 9150      | 0.0           | -               |
+| 2.8152  | 9200      | 0.0           | -               |
+| 2.8305  | 9250      | 0.0           | -               |
+| 2.8458  | 9300      | 0.0           | -               |
+| 2.8611  | 9350      | 0.0           | -               |
+| 2.8764  | 9400      | 0.0           | -               |
+| 2.8917  | 9450      | 0.0           | -               |
+| 2.9070  | 9500      | 0.0           | -               |
+| 2.9223  | 9550      | 0.0           | -               |
+| 2.9376  | 9600      | 0.0           | -               |
+| 2.9529  | 9650      | 0.0           | -               |
+| 2.9682  | 9700      | 0.0           | -               |
+| 2.9835  | 9750      | 0.0           | -               |
+| 2.9988  | 9800      | 0.0           | -               |
+| 3.0     | 9804      | -             | 0.061           |
+| 3.0141  | 9850      | 0.0           | -               |
+| 3.0294  | 9900      | 0.0           | -               |
+| 3.0447  | 9950      | 0.0           | -               |
+| 3.0600  | 10000     | 0.0           | -               |
+| 3.0753  | 10050     | 0.0           | -               |
+| 3.0906  | 10100     | 0.0           | -               |
+| 3.1059  | 10150     | 0.0           | -               |
+| 3.1212  | 10200     | 0.0           | -               |
+| 3.1365  | 10250     | 0.0           | -               |
+| 3.1518  | 10300     | 0.0           | -               |
+| 3.1671  | 10350     | 0.0           | -               |
+| 3.1824  | 10400     | 0.0           | -               |
+| 3.1977  | 10450     | 0.0           | -               |
+| 3.2130  | 10500     | 0.0           | -               |
+| 3.2283  | 10550     | 0.0           | -               |
+| 3.2436  | 10600     | 0.0           | -               |
+| 3.2589  | 10650     | 0.0           | -               |
+| 3.2742  | 10700     | 0.0           | -               |
+| 3.2895  | 10750     | 0.0           | -               |
+| 3.3048  | 10800     | 0.0           | -               |
+| 3.3201  | 10850     | 0.0           | -               |
+| 3.3354  | 10900     | 0.0           | -               |
+| 3.3507  | 10950     | 0.0           | -               |
+| 3.3660  | 11000     | 0.0           | -               |
+| 3.3813  | 11050     | 0.0           | -               |
+| 3.3966  | 11100     | 0.0           | -               |
+| 3.4119  | 11150     | 0.0           | -               |
+| 3.4272  | 11200     | 0.0001        | -               |
+| 3.4425  | 11250     | 0.0           | -               |
+| 3.4578  | 11300     | 0.0           | -               |
+| 3.4731  | 11350     | 0.0           | -               |
+| 3.4884  | 11400     | 0.0           | -               |
+| 3.5037  | 11450     | 0.0           | -               |
+| 3.5190  | 11500     | 0.0           | -               |
+| 3.5343  | 11550     | 0.0           | -               |
+| 3.5496  | 11600     | 0.0           | -               |
+| 3.5649  | 11650     | 0.0           | -               |
+| 3.5802  | 11700     | 0.0           | -               |
+| 3.5955  | 11750     | 0.0           | -               |
+| 3.6108  | 11800     | 0.0           | -               |
+| 3.6261  | 11850     | 0.0           | -               |
+| 3.6414  | 11900     | 0.0           | -               |
+| 3.6567  | 11950     | 0.0           | -               |
+| 3.6720  | 12000     | 0.0           | -               |
+| 3.6873  | 12050     | 0.0           | -               |
+| 3.7026  | 12100     | 0.0           | -               |
+| 3.7179  | 12150     | 0.0           | -               |
+| 3.7332  | 12200     | 0.0           | -               |
+| 3.7485  | 12250     | 0.0           | -               |
+| 3.7638  | 12300     | 0.0           | -               |
+| 3.7791  | 12350     | 0.0           | -               |
+| 3.7944  | 12400     | 0.0           | -               |
+| 3.8097  | 12450     | 0.0           | -               |
+| 3.8250  | 12500     | 0.0           | -               |
+| 3.8403  | 12550     | 0.0           | -               |
+| 3.8556  | 12600     | 0.0           | -               |
+| 3.8709  | 12650     | 0.0           | -               |
+| 3.8862  | 12700     | 0.0           | -               |
+| 3.9015  | 12750     | 0.0           | -               |
+| 3.9168  | 12800     | 0.0           | -               |
+| 3.9321  | 12850     | 0.0           | -               |
+| 3.9474  | 12900     | 0.0           | -               |
+| 3.9627  | 12950     | 0.0           | -               |
+| 3.9780  | 13000     | 0.0           | -               |
+| 3.9933  | 13050     | 0.0           | -               |
+| **4.0** | **13072** | **-**         | **0.0479**      |
+* The bold row denotes the saved checkpoint.
+### Framework Versions
+- Python: 3.9.18
+- SetFit: 1.0.3
+- Sentence Transformers: 2.2.1
+- Transformers: 4.32.1
+- PyTorch: 1.10.0
+- Datasets: 2.20.0
+- Tokenizers: 0.13.3
+## Citation
+### BibTeX
+```bibtex
+@article{https://doi.org/10.48550/arxiv.2209.11055,
+    doi = {10.48550/ARXIV.2209.11055},
+    url = {https://arxiv.org/abs/2209.11055},
+    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
+    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
+    title = {Efficient Few-Shot Learning Without Prompts},
+    publisher = {arXiv},
+    year = {2022},
+    copyright = {Creative Commons Attribution 4.0 International}
+}
+```
+<!--
+## Glossary
+*Clearly define terms in order to be accessible across audiences.*
+-->
+<!--
+## Model Card Authors
+*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
+-->
+<!--
+## Model Card Contact
+*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
+-->

config.json ADDED Viewed

	@@ -0,0 +1,29 @@

+{
+  "_name_or_path": "checkpoints/step_13072/",
+  "architectures": [
+    "RobertaModel"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "bos_token_id": 0,
+  "classifier_dropout": null,
+  "eos_token_id": 2,
+  "gradient_checkpointing": false,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 514,
+  "model_type": "roberta",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "position_embedding_type": "absolute",
+  "tokenizer_class": "BertTokenizer",
+  "torch_dtype": "float32",
+  "transformers_version": "4.32.1",
+  "type_vocab_size": 1,
+  "use_cache": true,
+  "vocab_size": 32000
+}

config_sentence_transformers.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "__version__": {
+    "sentence_transformers": "2.1.0",
+    "transformers": "4.13.0",
+    "pytorch": "1.7.0+cu110"
+  }
+}

config_setfit.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "normalize_embeddings": false,
+  "labels": [
+    "rag",
+    "general"
+  ]
+}

model_head.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b803e130af8176c6a5f09d4f0f4980d3b68fad01c59a87d6f32b1a4f959f7be7
+size 31807

modules.json ADDED Viewed

	@@ -0,0 +1,14 @@

+[
+  {
+    "idx": 0,
+    "name": "0",
+    "path": "",
+    "type": "sentence_transformers.models.Transformer"
+  },
+  {
+    "idx": 1,
+    "name": "1",
+    "path": "1_Pooling",
+    "type": "sentence_transformers.models.Pooling"
+  }
+]

pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1348e19b347964f5ae8a88b8622faca7925f98eb8f2e01802bd2246463e1fdf2
+size 442537395

sentence_bert_config.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "max_seq_length": 128,
+  "do_lower_case": false
+}

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "bos_token": "[CLS]",
+  "cls_token": "[CLS]",
+  "eos_token": "[SEP]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token": "[CLS]",
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "[CLS]",
+  "do_basic_tokenize": true,
+  "do_lower_case": false,
+  "eos_token": "[SEP]",
+  "mask_token": "[MASK]",
+  "max_length": 128,
+  "model_max_length": 512,
+  "never_split": null,
+  "pad_to_multiple_of": null,
+  "pad_token": "[PAD]",
+  "pad_token_type_id": 0,
+  "padding_side": "right",
+  "sep_token": "[SEP]",
+  "stride": 0,
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "truncation_side": "right",
+  "truncation_strategy": "longest_first",
+  "unk_token": "[UNK]"
+}

vocab.txt ADDED Viewed

The diff for this file is too large to render. See raw diff