codegood committed on
Commit 2402628 · 1 Parent(s): 4a12d89

Upload 11 files

README.md CHANGED
@@ -1,3 +1,219 @@
  ---
- license: bigscience-openrail-m
+ library_name: peft
+ base_model: filipealmeida/Mistral-7B-Instruct-v0.1-sharded
  ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+
+
+ - **Developed by:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** [More Information Needed]
+ - **Language(s) (NLP):** [More Information Needed]
+ - **License:** [More Information Needed]
+ - **Finetuned from model [optional]:** [More Information Needed]
+
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed]
+ - **Paper [optional]:** [More Information Needed]
+ - **Demo [optional]:** [More Information Needed]
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ [More Information Needed]
+
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ [More Information Needed]
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ [More Information Needed]
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ [More Information Needed]
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ [More Information Needed]
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ #### Preprocessing [optional]
+
+ [More Information Needed]
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+ #### Speeds, Sizes, Times [optional]
+
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+ [More Information Needed]
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Data Card if possible. -->
+
+ [More Information Needed]
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ [More Information Needed]
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ [More Information Needed]
+
+ ### Results
+
+ [More Information Needed]
+
+ #### Summary
+
+
+
+ ## Model Examination [optional]
+
+ <!-- Relevant interpretability work for the model goes here -->
+
+ [More Information Needed]
+
+ ## Environmental Impact
+
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+ - **Hardware Type:** [More Information Needed]
+ - **Hours used:** [More Information Needed]
+ - **Cloud Provider:** [More Information Needed]
+ - **Compute Region:** [More Information Needed]
+ - **Carbon Emitted:** [More Information Needed]
+
+ ## Technical Specifications [optional]
+
+ ### Model Architecture and Objective
+
+ [More Information Needed]
+
+ ### Compute Infrastructure
+
+ [More Information Needed]
+
+ #### Hardware
+
+ [More Information Needed]
+
+ #### Software
+
+ [More Information Needed]
+
+ ## Citation [optional]
+
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+ **BibTeX:**
+
+ [More Information Needed]
+
+ **APA:**
+
+ [More Information Needed]
+
+ ## Glossary [optional]
+
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+ [More Information Needed]
+
+ ## More Information [optional]
+
+ [More Information Needed]
+
+ ## Model Card Authors [optional]
+
+ [More Information Needed]
+
+ ## Model Card Contact
+
+ [More Information Needed]
+
+
+ ## Training procedure
+
+
+ The following `bitsandbytes` quantization config was used during training:
+ - quant_method: bitsandbytes
+ - load_in_8bit: False
+ - load_in_4bit: True
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: nf4
+ - bnb_4bit_use_double_quant: False
+ - bnb_4bit_compute_dtype: bfloat16
+
+ ### Framework versions
+
+
+ - PEFT 0.6.0.dev0
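
The quantization settings listed under "Training procedure" in the card map onto the keyword arguments of the `transformers` `BitsAndBytesConfig` class. The following is a minimal sketch of reloading the base model with those settings and attaching the adapter. It assumes `torch`, `transformers`, `peft`, `bitsandbytes`, and `accelerate` are installed. The names `QUANT_SETTINGS`, `build_quant_config`, `load_adapter`, and the `adapter_id` argument are hypothetical, since the card does not state the adapter's repository id or any loading code.

```python
# Values copied from the "Training procedure" list in the model card.
# "quant_method: bitsandbytes" is implied by using BitsAndBytesConfig.
QUANT_SETTINGS = {
    "load_in_8bit": False,
    "load_in_4bit": True,
    "llm_int8_threshold": 6.0,
    "llm_int8_skip_modules": None,
    "llm_int8_enable_fp32_cpu_offload": False,
    "llm_int8_has_fp16_weight": False,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": False,
    "bnb_4bit_compute_dtype": "bfloat16",
}

# Base model id taken from the card's front matter.
BASE_MODEL = "filipealmeida/Mistral-7B-Instruct-v0.1-sharded"


def build_quant_config():
    """Build a BitsAndBytesConfig from the settings above (imports are lazy)."""
    import torch
    from transformers import BitsAndBytesConfig

    settings = dict(QUANT_SETTINGS)
    # Convert the dtype name from the card into the torch dtype object.
    settings["bnb_4bit_compute_dtype"] = getattr(
        torch, settings["bnb_4bit_compute_dtype"]
    )
    return BitsAndBytesConfig(**settings)


def load_adapter(adapter_id):
    """Load the 4-bit base model and attach a PEFT adapter.

    adapter_id is a placeholder: pass the repo id or local path of the adapter.
    """
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL,
        quantization_config=build_quant_config(),
        device_map="auto",
    )
    return PeftModel.from_pretrained(base, adapter_id)
```

Calling `load_adapter` downloads the base model and needs a GPU supported by `bitsandbytes`. Nothing heavy runs at import time.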
Test_preds.csv ADDED
@@ -0,0 +1,130 @@
+ ,Question,Answer,Prediction,RLFscore,R2Fscore,R1Fscore,RLPrec,R2Prec,R1Prec,RLrecall,R2Recall,R1Recall
+ 0, When did Dr. Jochen Raimann start working at the Renal Research Institute? , Dr. Jochen Raimann became a postdoctoral research fellow at the Renal Research Institute in 2007. ," Dr. Jochen Raimann became a postdoctoral research fellow at the Renal Research Institute in 2007.
+ ",0.5925925925925926,0.4799999999999999,0.5925925925925926,0.5333333333333333,0.42857142857142855,0.5333333333333333,0.6666666666666666,0.5454545454545454,0.6666666666666666
+ 1, Where did Peter Nandori hold previous appointments? ," Peter Nandori held previous appointments at the University of Maryland, College Park (Brin fellow, 2015 - 2019) and New York University (assistant professor/Courant instructor 2013 - 2015). "," Peter Nandori held previous appointments at the University of Maryland, College Park (Brin fellow, 2015 - 2019) and New York University (assistant professor/Courant instructor 2013 - 2015).
+ ",0.24242424242424246,0.12903225806451613,0.24242424242424246,0.15384615384615385,0.08,0.15384615384615385,0.5714285714285714,0.3333333333333333,0.5714285714285714
+ 2, What is the focus of Abraham Oxilas' teaching at YU? ," Abraham Oxilas' teaching at YU focuses on Human Anatomy, Patient Evaluation, Clinical Skills and Procedures, and Clinical Medicine. "," Abraham Oxilas' teaching at YU focuses on Human Anatomy, Patient Evaluation, Clinical Skills and Procedures, and Clinical Medicine.
+ ",0.35714285714285715,0.30769230769230765,0.35714285714285715,0.2777777777777778,0.23529411764705882,0.2777777777777778,0.5,0.4444444444444444,0.5
+ 3, What degrees does Herman Weiss hold? , Herman Weiss holds an M.D. from Ohio State Medical Center and an MBA from George Washington University. ," Herman Weiss holds an M.D. from Ohio State Medical Center and an MBA from George Washington University.
+ ",0.25,0.1818181818181818,0.25,0.16666666666666666,0.11764705882352941,0.16666666666666666,0.5,0.4,0.5
+ 4, What is Dr. Elisabeth Mlawski's area of interest? ," Dr. Elisabeth Mlawski's areas of interest include normal/disordered language development, literacy, clinical training, assessment/diagnostics, and supervision. "," Dr. Elisabeth Mlawski's areas of interest include normal/disordered language development, literacy, clinical training, assessment/diagnostics, and supervision.
+ ",0.5,0.46153846153846156,0.5,0.3684210526315789,0.3333333333333333,0.3684210526315789,0.7777777777777778,0.75,0.7777777777777778
+ 5, What is the Biotech Fellows program? ," The Biotech Fellows program is a program where students receive industry mentorship, showcase and publish their research at Katz School's annual Symposium on Science, Technology and Health, and receive a scholarship reducing total tuition to $25K for their entire graduate degree. "," The Biotech Fellows program is a program where students receive industry mentorship, showcase and publish their research at Katz School's annual Symposium on Science, Technology and Health, and receive a scholarship reducing total tuition to $25K for their entire graduate degree.
+ ",0.16666666666666666,0.13043478260869565,0.20833333333333331,0.09523809523809523,0.07317073170731707,0.11904761904761904,0.6666666666666666,0.6,0.8333333333333334
+ 6, What is the main focus of David Imre's audit work? , The main focus of David Imre's audit work is on risks and controls involved in securing information technology and protecting information assets. ," The main focus of David Imre's audit work is on risks and controls involved in securing information technology and protecting information assets.
+ ",0.5294117647058824,0.5000000000000001,0.5882352941176471,0.391304347826087,0.36363636363636365,0.43478260869565216,0.8181818181818182,0.8,0.9090909090909091
+ 7, How long has Atif Ghauri been a cyber security expert? , Atif Ghauri has been a cyber security expert for 20 years. , Atif Ghauri has been a cyber security expert for 20 years. What is the focus of Atif Ghauri's expertise in cybersecurity? ,0.4375,0.33333333333333326,0.5000000000000001,0.3181818181818182,0.23809523809523808,0.36363636363636365,0.7,0.5555555555555556,0.8
+ 8, What subject does Gary Sztajnman teach? , Gary Sztajnman teaches Data Product Design. ," Gary Sztajnman teaches Data Product Design.
+ ",0.5,0.4000000000000001,0.5,0.5,0.4,0.5,0.5,0.4,0.5
+ 9, What is Tom Balga's role at the Katz School? , Tom Balga is a clinical assistant professor and a member of the PA program advisory committee at the Katz School. ," Tom Balga is a clinical assistant professor and a member of the PA program advisory committee at the Katz School.
+ ",0.4,0.2857142857142857,0.4666666666666667,0.3,0.21052631578947367,0.35,0.6,0.4444444444444444,0.7
+ 10, What is Joshua Moritz's professional background? ," Joshua Moritz has a professional background in marketing and entrepreneurship, with experience in consulting, founding companies, and holding management positions. "," Joshua Moritz has a professional background in marketing and entrepreneurship, with experience in consulting, founding companies, and holding management positions.
+ ",0.29629629629629634,0.16,0.29629629629629634,0.2,0.10526315789473684,0.2,0.5714285714285714,0.3333333333333333,0.5714285714285714
+ 11, What was Dr. Rana Khan's role at the University of Maryland University College (UMUC)? ," Dr. Rana Khan served as Vice Dean in the Graduate School at UMUC, providing leadership to a diverse offering of science and information technology-related programs. "," Dr. Rana Khan served as Vice Dean in the Graduate School at UMUC, providing leadership to a diverse offering of science and information technology-related programs.
+ ",0.2439024390243902,0.10256410256410256,0.3414634146341463,0.19230769230769232,0.08,0.2692307692307692,0.3333333333333333,0.14285714285714285,0.4666666666666667
+ 12, What was Dr. Kadiyala's research focus during his postdoctoral fellowship at Nanyang Technological University? , Dr. Kadiyala's research focus during his postdoctoral fellowship at Nanyang Technological University was on designing approximate architectures for low-power hearing aids in the School of Electrical Engineering. ," Dr. Kadiyala's research focus during his postdoctoral fellowship at Nanyang Technological University was on designing approximate architectures for low-power hearing aids in the School of Electrical Engineering.
+ ",0.5909090909090909,0.5714285714285714,0.6363636363636364,0.4482758620689655,0.42857142857142855,0.4827586206896552,0.8666666666666667,0.8571428571428571,0.9333333333333333
+ 13, What is Dr. Peter Nandori's area of expertise? ," Dr. Peter Nandori's area of expertise is ergodic theory and dynamical systems, especially statistical properties of hyperbolic systems; probability theory; and mathematical statistical physics. "," Dr. Peter Nandori's area of expertise is ergodic theory and dynamical systems, especially statistical properties of hyperbolic systems; probability theory; and mathematical statistical physics.
+ ",0.4117647058823529,0.375,0.47058823529411764,0.28,0.25,0.32,0.7777777777777778,0.75,0.8888888888888888
+ 14, What has Dr. Michaela Medved trained clinicians from around the world in? , Dr. Michaela Medved has trained clinicians from around the world in the use of Neuromuscular Electrical Stimulation (NMES) and Surface Electromyography (sEMG) for dysphagia intervention. ," Dr. Michaela Medved has trained clinicians from around the world in the use of Neuromuscular Electrical Stimulation (NMES) and Surface Electromyography (sEMG) for dysphagia intervention.
+ ",0.5405405405405406,0.4571428571428572,0.5945945945945945,0.4,0.3333333333333333,0.44,0.8333333333333334,0.7272727272727273,0.9166666666666666
+ 15, What is Dr. Marissa Barrera's area of expertise? ," Dr. Barrera's area of expertise is Multiple Sclerosis, dysphagia, motor speech disorders, cognition, and neurogenerative diseases. "," Dr. Barrera's area of expertise is Multiple Sclerosis, dysphagia, motor speech disorders, cognition, and neurogenerative diseases.
+ ",0.46153846153846156,0.3333333333333333,0.5384615384615384,0.35294117647058826,0.25,0.4117647058823529,0.6666666666666666,0.5,0.7777777777777778
+ 16, What is Dr. Rana Khan's contact information? ," Dr. Rana Khan's email is [email protected] and her office phone number is 646-592-Her office address is Beren Campus, 205 Lexington Avenue, "," Dr. Rana Khan's email is [email protected] and her office phone number is 646-592-Her office address is Beren Campus, 205 Lexington Avenue,
+ ",0.22857142857142856,0.18181818181818182,0.2857142857142857,0.14814814814814814,0.11538461538461539,0.18518518518518517,0.5,0.42857142857142855,0.625
+ 17, What courses does James Topor teach? ," James Topor teaches Analytics Programming, Data Science, and Data Acquisition Management. "," James Topor teaches Analytics Programming, Data Science, and Data Acquisition Management.
+ ",0.3529411764705882,0.26666666666666666,0.3529411764705882,0.2727272727272727,0.2,0.2727272727272727,0.5,0.4,0.5
+ 18, What is Gary Sztajnman's area of expertise? , Gary Sztajnman's area of expertise is data science and data product design. ," Gary Sztajnman's area of expertise is data science and data product design.
+ ",0.5714285714285714,0.5263157894736842,0.6666666666666667,0.46153846153846156,0.4166666666666667,0.5384615384615384,0.75,0.7142857142857143,0.875
+ 19, What are Antonella Marini's research interests? ," Antonella Marini's research interests include Geometric Analysis, Partial Differential Equations, and Mathematical Physics. "," Antonella Marini's research interests include Geometric Analysis, Partial Differential Equations, and Mathematical Physics.
+ ",0.4761904761904762,0.42105263157894735,0.4761904761904762,0.35714285714285715,0.3076923076923077,0.35714285714285715,0.7142857142857143,0.6666666666666666,0.7142857142857143
+ 20, What is the focus of Wenxiong Chen's research? ," The focus of Wenxiong Chen's research is on nonlinear elliptic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis. "," The focus of Wenxiong Chen's research is on nonlinear elliptic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis.
+ ",0.3888888888888889,0.3529411764705882,0.4444444444444444,0.25925925925925924,0.23076923076923078,0.2962962962962963,0.7777777777777778,0.75,0.8888888888888888
+ 21, What is the room number of Professor Buldyrev's office? , Room#1112 ," Room#1112
+ ",0.16666666666666669,0.0,0.16666666666666669,0.5,0.0,0.5,0.1,0.0,0.1
+ 22, What is Lea F. Santos' educational background? ," Lea F. Santos has a PhD in theoretical physics from the University of Sao Paulo, as well as an MS and BS from the same university. "," Lea F. Santos has a PhD in theoretical physics from the University of Sao Paulo, as well as an MS and BS from the same university.
+ ",0.18181818181818182,0.12903225806451613,0.18181818181818182,0.11538461538461539,0.08,0.11538461538461539,0.42857142857142855,0.3333333333333333,0.42857142857142855
+ 23, How many faculty members teach in the program? ," The program has multiple faculty members, each teaching to their own specialty. "," The program has multiple faculty members, each teaching to their own specialty.
+ ",0.3,0.2222222222222222,0.5,0.25,0.18181818181818182,0.4166666666666667,0.375,0.2857142857142857,0.625
+ 24, What is Marian Gidea's teaching experience? ," Marian Gidea has taught introductory and advanced undergraduate mathematics courses, as well as graduate mathematics courses. "," Marian Gidea has taught introductory and advanced undergraduate mathematics courses, as well as graduate mathematics courses.
+ ",0.17391304347826086,0.09523809523809522,0.17391304347826086,0.125,0.06666666666666667,0.125,0.2857142857142857,0.16666666666666666,0.2857142857142857
+ 25, What was Herman Weiss's role at Teva? , Herman Weiss was senior director of clinical development for women's health and global head of women's health medical affairs at Teva. ," Herman Weiss was senior director of clinical development for women's health and global head of women's health medical affairs at Teva.
+ ",0.3225806451612903,0.13793103448275862,0.3870967741935483,0.21739130434782608,0.09090909090909091,0.2608695652173913,0.625,0.2857142857142857,0.75
+ 26, What is Youshan Zhang's educational background? , Youshan Zhang holds a Ph.D. in computer science from Lehigh University and a master's degree in neuroimaging and informatics from USC. ," Youshan Zhang holds a Ph.D. in computer science from Lehigh University and a master's degree in neuroimaging and informatics from USC.
+ ",0.2,0.07142857142857144,0.2,0.13043478260869565,0.045454545454545456,0.13043478260869565,0.42857142857142855,0.16666666666666666,0.42857142857142855
+ 27, How many research grants has Wenxiong Chen received from the National Science Foundation? , Wenxiong Chen has received six research grants from the National Science Foundation. ," Wenxiong Chen has received six research grants from the National Science Foundation.
+ ",0.64,0.5217391304347826,0.8799999999999999,0.6666666666666666,0.5454545454545454,0.9166666666666666,0.6153846153846154,0.5,0.8461538461538461
+ 28, What courses does Jessica Rifkind teach? , Jessica Rifkind teaches Visual Design and Storytelling; and Data Driven Organizations. ," Jessica Rifkind teaches Visual Design and Storytelling; and Data Driven Organizations.
+ ",0.3529411764705882,0.26666666666666666,0.3529411764705882,0.2727272727272727,0.2,0.2727272727272727,0.5,0.4,0.5
+ 29, What is Jeff Nieman's current work? , Jeff Nieman's current work involves building systems engineering and model governance and support using open source and COTS software. ," Jeff Nieman's current work involves building systems engineering and model governance and support using open source and COTS software.
+ ",0.37037037037037035,0.32,0.37037037037037035,0.25,0.21052631578947367,0.25,0.7142857142857143,0.6666666666666666,0.7142857142857143
+ 30, What units does Brandon Chiazza direct in the technology strategy division? ," Brandon Chiazza directs employees in three units—Enterprise Data Services, Project Management Office, and Design and Quality Assurance—in the technology strategy division. "," Brandon Chiazza directs employees in three units—Enterprise Data Services, Project Management Office, and Design and Quality Assurance—in the technology strategy division.
+ ",0.4705882352941176,0.37499999999999994,0.5294117647058824,0.34782608695652173,0.2727272727272727,0.391304347826087,0.7272727272727273,0.6,0.8181818181818182
+ 31, What is PASSPort? , PASSPort is an end-to-end cloud-based procurement solution for the City of New York. ," PASSPort is an end-to-end cloud-based procurement solution for the City of New York.
+ ",0.10526315789473684,0.0,0.21052631578947367,0.0625,0.0,0.125,0.3333333333333333,0.0,0.6666666666666666
+ 32, What is Joseph Weber-Lopez's current position? , He is the medical director of the PA program at Yeshiva University. ," He is the medical director of the PA program at Yeshiva University.
+ ",0.1,0.0,0.1,0.08333333333333333,0.0,0.08333333333333333,0.125,0.0,0.125
+ 33, What is the name of the organization that awarded Troy Dargin the Margaret L. Anderson Graduate Teaching Award? , The Margaret L. Anderson Graduate Teaching Award was awarded to Troy Dargin by the University of Kansas Speech-Language-Hearing Association. ," The Margaret L. Anderson Graduate Teaching Award was awarded to Troy Dargin by the University of Kansas Speech-Language-Hearing Association.
+ ",0.358974358974359,0.37837837837837834,0.6153846153846153,0.3333333333333333,0.35,0.5714285714285714,0.3888888888888889,0.4117647058823529,0.6666666666666666
+ 34, What is Professor Buldyrev's email address? , [email protected] ," [email protected]
+ ",0.2,0.0,0.2,0.3333333333333333,0.0,0.3333333333333333,0.14285714285714285,0.0,0.14285714285714285
+ 35, What is IQVIA? , IQVIA is a global R&D leader where Marchand currently works. ," IQVIA is a global R&D leader where Marchand currently works.
+ ",0.14285714285714288,0.0,0.28571428571428575,0.09090909090909091,0.0,0.18181818181818182,0.3333333333333333,0.0,0.6666666666666666
+ 36, What did Dr. David Whipple build for a New York medical research facility? , Dr. David Whipple built a secure repository for proprietary research material with strict identification and access controls and multifactor employee authentication for a New York medical research facility. ," Dr. David Whipple built a secure repository for proprietary research material with strict identification and access controls and multifactor employee authentication for a New York medical research facility.
+ ",0.48780487804878053,0.41025641025641024,0.48780487804878053,0.35714285714285715,0.2962962962962963,0.35714285714285715,0.7692307692307693,0.6666666666666666,0.7692307692307693
+ 37, What is the significance of Abraham Oxilas returning home to Queens? , The text does not provide information on the significance of Abraham Oxilas returning home to Queens. ," The text does not provide information on the significance of Abraham Oxilas returning home to Queens.
+ ",0.6666666666666666,0.64,0.6666666666666666,0.5625,0.5333333333333333,0.5625,0.8181818181818182,0.8,0.8181818181818182
+ 38, What is Gary Sztajnman's previous work experience? ," Gary Sztajnman has worked for an A.I. startup where he developed an application that learns musical taste, and he currently works as a product data scientist at Verizon where he builds data products. "," Gary Sztajnman has worked for an A.I. startup where he developed an application that learns musical taste, and he currently works as a product data scientist at Verizon where he builds data products.
+ ",0.14285714285714285,0.05,0.14285714285714285,0.08823529411764706,0.030303030303030304,0.08823529411764706,0.375,0.14285714285714285,0.375
+ 39, What was the purpose of Joshua Moritz's first grant from New York University? , Joshua Moritz's first grant from New York University was to study the impact of marketing simulations on student outcomes. ," Joshua Moritz's first grant from New York University was to study the impact of marketing simulations on student outcomes.
+ ",0.5294117647058824,0.5,0.7058823529411764,0.45,0.42105263157894735,0.6,0.6428571428571429,0.6153846153846154,0.8571428571428571
+ 40, Where is the program located? , The program is located in the heart of New York City. , The program is located in the heart of New York City. What are some of the courses offered by YU's M.A. in Physics? ,0.19999999999999998,0.07142857142857142,0.26666666666666666,0.12,0.041666666666666664,0.16,0.6,0.25,0.8
+ 41, What is Fayrose Abodeshisha's area of expertise? ," Fayrose Abodeshisha's area of expertise includes PA Education, General Surgery, Trauma & Acute Care Surgery, and Ambulatory Surgery. "," Fayrose Abodeshisha's area of expertise includes PA Education, General Surgery, Trauma & Acute Care Surgery, and Ambulatory Surgery.
+ ",0.46153846153846156,0.4166666666666667,0.46153846153846156,0.3333333333333333,0.29411764705882354,0.3333333333333333,0.75,0.7142857142857143,0.75
+ 42, What is the focus of Dr. Michaela Medved's clinical practice? , The focus of Dr. Michaela Medved's clinical practice is evaluating and treating individuals with neurological and complex medical conditions. ," The focus of Dr. Michaela Medved's clinical practice is evaluating and treating individuals with neurological and complex medical conditions.
+ ",0.5806451612903226,0.5517241379310345,0.6451612903225806,0.45,0.42105263157894735,0.5,0.8181818181818182,0.8,0.9090909090909091
+ 43, What is the focus of Dr. Laura Sylvia's research? , The text does not provide information on the focus of Dr. Laura Sylvia's research. , The text does not provide information on the focus of Dr. Laura Sylvia's research. ]]>,0.64,0.6086956521739131,0.64,0.5333333333333333,0.5,0.5333333333333333,0.8,0.7777777777777778,0.8
+ 44, What is Dwayne A. Williams' research interest? , The text does not provide information on Dwayne A. Williams' research interest. ," The text does not provide information on Dwayne A. Williams' research interest.
+ ",0.5263157894736842,0.4705882352941177,0.5263157894736842,0.4166666666666667,0.36363636363636365,0.4166666666666667,0.7142857142857143,0.6666666666666666,0.7142857142857143
+ 45, What is the significance of Fayrose Abodeshisha's work in ambulatory surgery? ," Fayrose Abodeshisha's work in ambulatory surgery is important because it allows patients to receive surgical care in an outpatient setting, which can be more convenient and cost-effective. "," Fayrose Abodeshisha's work in ambulatory surgery is important because it allows patients to receive surgical care in an outpatient setting, which can be more convenient and cost-effective.
+ ",0.34146341463414637,0.30769230769230765,0.3902439024390244,0.2413793103448276,0.21428571428571427,0.27586206896551724,0.5833333333333334,0.5454545454545454,0.6666666666666666
+ 46, What are some of Dr. Marini's teaching interests? ," Dr. Marini's teaching interests include Ordinary and Partial Differential Equations, Calculus, Topology, Linear Algebra, Abstract Algebra, Real and Complex Variables, Functional Analysis, Geometric Analysis, Morse Theory, Differential Geometry, and Lie Groups. "," Dr. Marini's teaching interests include Ordinary and Partial Differential Equations, Calculus, Topology, Linear Algebra, Abstract Algebra, Real and Complex Variables, Functional Analysis, Geometric Analysis, Morse Theory, Differential Geometry, and Lie Groups.
+ ",0.24390243902439024,0.20512820512820512,0.24390243902439024,0.15625,0.12903225806451613,0.15625,0.5555555555555556,0.5,0.5555555555555556
+ 47, What topics has Dr. Barrera lectured on? ," Dr. Barrera has lectured on a wide range of clinical care topics, including dysphagia, NMES, motor speech disorders, cognition, and neurogenerative diseases. "," Dr. Barrera has lectured on a wide range of clinical care topics, including dysphagia, NMES, motor speech disorders, cognition, and neurogenerative diseases.
+ ",0.27586206896551724,0.14814814814814814,0.41379310344827586,0.18181818181818182,0.09523809523809523,0.2727272727272727,0.5714285714285714,0.3333333333333333,0.8571428571428571
+ 48, What types of companies has Elaine Zacharakis Loumbas counseled on regulatory issues? ," Elaine Zacharakis Loumbas has counseled pharmaceutical, medical device, and biotech companies on regulatory issues. "," Elaine Zacharakis Loumbas has counseled pharmaceutical, medical device, and biotech companies on regulatory issues.
+ ",0.5384615384615384,0.33333333333333337,0.6923076923076924,0.5,0.3076923076923077,0.6428571428571429,0.5833333333333334,0.36363636363636365,0.75
+ 49, What is the academic background of the author? , The author has a Ph.D. in Computer Science from Lehigh University. ," The author has a Ph.D. in Computer Science from Lehigh University.
+ ",0.2,0.1111111111111111,0.2,0.16666666666666666,0.09090909090909091,0.16666666666666666,0.25,0.14285714285714285,0.25
+ 50, In what year did Fayrose Abodeshisha earn her Master of Physician Assistant Studies? , Fayrose Abodeshisha earned her Master of Physician Assistant Studies in 2013. ," Fayrose Abodeshisha earned her Master of Physician Assistant Studies in 2013.
+ ",0.7500000000000001,0.7272727272727272,0.8333333333333333,0.8181818181818182,0.8,0.9090909090909091,0.6923076923076923,0.6666666666666666,0.7692307692307693
+ 51, What courses does Lawrence Fulton teach? ," Lawrence Fulton teaches Data Analytics, Machine Learning, and Structured Data Management. "," Lawrence Fulton teaches Data Analytics, Machine Learning, and Structured Data Management.
+ ",0.3529411764705882,0.26666666666666666,0.3529411764705882,0.2727272727272727,0.2,0.2727272727272727,0.5,0.4,0.5
+ 52, What courses does Dr. Sergey Fogelson teach? ," Dr. Sergey Fogelson teaches Computational Statistics, Linear Algebra, and Machine Learning. "," Dr. Sergey Fogelson teaches Computational Statistics, Linear Algebra, and Machine Learning.
104
+ ",0.4444444444444444,0.37499999999999994,0.4444444444444444,0.36363636363636365,0.3,0.36363636363636365,0.5714285714285714,0.5,0.5714285714285714
105
+ 53, What kind of staff did Nebahat Bayrakcioglu recruit and manage? , Nebahat Bayrakcioglu recruited and managed program office staff. ," Nebahat Bayrakcioglu recruited and managed program office staff.
106
+ ",0.5555555555555556,0.5,0.6666666666666665,0.625,0.5714285714285714,0.75,0.5,0.4444444444444444,0.6
107
+ 54, What is Sivan Tehila's rank in the Israel Defense Forces? , Sivan Tehila was honorably discharged from the Israel Defense Forces as a captain. ," Sivan Tehila was honorably discharged from the Israel Defense Forces as a captain.
108
+ ",0.4999999999999999,0.3636363636363636,0.4999999999999999,0.46153846153846156,0.3333333333333333,0.46153846153846156,0.5454545454545454,0.4,0.5454545454545454
109
+ 55," What is the median salary for jobs with a Ph.D. in mathematics, according to the U.S. Bureau of Labor Statistics? "," As of 2019, the median salaries for jobs with a Ph.D. in mathematics were: actuary ($108,350); postsecondary education administrator ($95,410); mathematician and statistician ($92,030); operations research analyst ($84,810); and postsecondary mathematics and science teacher ($73,690). "," As of 2019, the median salaries for jobs with a Ph.D. in mathematics were: actuary ($108,350); postsecondary education administrator ($95,410); mathematician and statistician ($92,030); operations research analyst ($84,810); and postsecondary mathematics and science teacher ($73,690).
110
+ ",0.34920634920634924,0.32786885245901637,0.3809523809523809,0.2682926829268293,0.25,0.2926829268292683,0.5,0.47619047619047616,0.5454545454545454
111
+ 56, When did Peter Nandori join Yeshiva University? , Peter Nandori joined Yeshiva University in 2019. , Peter Nandori joined Yeshiva University in 2019. What is the focus of his research? ,0.4761904761904762,0.42105263157894735,0.4761904761904762,0.35714285714285715,0.3076923076923077,0.35714285714285715,0.7142857142857143,0.6666666666666666,0.7142857142857143
112
+ 57, What is Dr. Wagner's professional background? , Dr. Wagner is an OTR/L and has worked primarily with older adults. ," Dr. Wagner is an OTR/L and has worked primarily with older adults.
113
+ ",0.2,0.1111111111111111,0.3,0.15384615384615385,0.08333333333333333,0.23076923076923078,0.2857142857142857,0.16666666666666666,0.42857142857142855
114
+ 58, What programming language does Jeff Wunderman code in? , Jeff Wunderman codes Python in Jupyter notebooks. ," Jeff Wunderman codes Python in Jupyter notebooks.
115
+ ",0.5333333333333333,0.30769230769230765,0.5333333333333333,0.5714285714285714,0.3333333333333333,0.5714285714285714,0.5,0.2857142857142857,0.5
116
+ 59, What is Sivan Tehila's involvement in promoting women in cybersecurity? , Sivan Tehila has dedicated herself to promoting women in cybersecurity and has developed a unique cybersecurity program for Manhattan High School for girls. ," Sivan Tehila has dedicated herself to promoting women in cybersecurity and has developed a unique cybersecurity program for Manhattan High School for girls.
117
+ ",0.3529411764705882,0.25000000000000006,0.3529411764705882,0.2608695652173913,0.18181818181818182,0.2608695652173913,0.5454545454545454,0.4,0.5454545454545454
118
+ 60, What accolades has Lea F. Santos received? ," Lea F. Santos has received numerous accolades, including the Simons Fellow in Theoretical Physics, NSF CAREER Award, Cottrell College Science Award, Outstanding Junior Faculty Member Award, and fellowships from the Brazilian and New Zealand governments. "," Lea F. Santos has received numerous accolades, including the Simons Fellow in Theoretical Physics, NSF CAREER Award, Cottrell College Science Award, Outstanding Junior Faculty Member Award, and fellowships from the Brazilian and New Zealand governments.
119
+ ",0.19047619047619044,0.1,0.2857142857142857,0.11428571428571428,0.058823529411764705,0.17142857142857143,0.5714285714285714,0.3333333333333333,0.8571428571428571
120
+ 61, What is Brittany Palmer's area of expertise? , Brittany Palmer's area of expertise is voice disorders. ," Brittany Palmer's area of expertise is voice disorders.
121
+ ",0.7058823529411765,0.6666666666666666,0.823529411764706,0.6666666666666666,0.625,0.7777777777777778,0.75,0.7142857142857143,0.875
122
+ 62, What modalities does Dr. Barrera use for speech and swallowing rehabilitation? ," Dr. Barrera uses Neuromuscular Electrical Stimulation (NMES), Surface Electromyography (sEMG), neuromuscular taping, and ultrasound for speech and swallowing rehabilitation. "," Dr. Barrera uses Neuromuscular Electrical Stimulation (NMES), Surface Electromyography (sEMG), neuromuscular taping, and ultrasound for speech and swallowing rehabilitation.
123
+ ",0.5333333333333333,0.42857142857142855,0.5333333333333333,0.42105263157894735,0.3333333333333333,0.42105263157894735,0.7272727272727273,0.6,0.7272727272727273
124
+ 63, Is Joseph Weber-Lopez a board-certified physician? ," Yes, he is board-certified in family medicine and sports medicine. "," Yes, he is board-certified in family medicine and sports medicine. What are some of the conditions that Dr. Wagner can treat with Shockwave therapy? ",0.18181818181818182,0.06451612903225806,0.18181818181818182,0.12,0.041666666666666664,0.12,0.375,0.14285714285714285,0.375
125
+ 64, What kind of software does Pablo Roldan like to write? ," Pablo Roldan likes to write his own software, including special-purpose numerical algorithms for dynamical systems and, in some cases, High-Performance Computing programs. "," Pablo Roldan likes to write his own software, including special-purpose numerical algorithms for dynamical systems and, in some cases, High-Performance Computing programs.
126
+ ",0.29411764705882354,0.25,0.35294117647058826,0.20833333333333334,0.17391304347826086,0.25,0.5,0.4444444444444444,0.6
127
+ 65, Where do PAs practice? , PAs practice nationwide. ," PAs practice nationwide.
128
+ ",0.5714285714285715,0.4,0.5714285714285715,0.6666666666666666,0.5,0.6666666666666666,0.5,0.3333333333333333,0.5
129
+ 66, What social media platforms can students follow the Katz School on? ," Students can follow the Katz School on LinkedIn, Instagram, Facebook, and their blog. "," Students can follow the Katz School on LinkedIn, Instagram, Facebook, and their blog.
130
+ ",0.4999999999999999,0.3636363636363636,0.5833333333333334,0.46153846153846156,0.3333333333333333,0.5384615384615384,0.5454545454545454,0.4,0.6363636363636364
Train_preds.csv ADDED
@@ -0,0 +1,129 @@
1
+ ,Question,Answer,Prediction,RLFscore,R2Fscore,R1Fscore,RLPrec,R2Prec,R1Prec,RLrecall,R2Recall,R1Recall
2
+ 0, What are Dr. Rana Khan's teaching interests? ," Dr. Rana Khan's teaching interests include courses in molecular biology, commercialization and business of biotechnology, regulatory environment in biotechnology, and lab methods in recombinant DNA technology. "," Dr. Rana Khan's teaching interests include courses in molecular biology, commercialization and business of biotechnology, regulatory environment in biotechnology, and lab methods in recombinant DNA technology.
3
+ ",0.34285714285714286,0.30303030303030304,0.34285714285714286,0.2222222222222222,0.19230769230769232,0.2222222222222222,0.75,0.7142857142857143,0.75
4
+ 1, What did Thom Kennon lead at Wunderman? , Thom Kennon led a global insights generation practice grounded in applied listening at Wunderman. ," Thom Kennon led a global insights generation practice grounded in applied listening at Wunderman.
5
+ ",0.38095238095238093,0.21052631578947367,0.38095238095238093,0.2857142857142857,0.15384615384615385,0.2857142857142857,0.5714285714285714,0.3333333333333333,0.5714285714285714
6
+ 2, What courses is Dr. Drori teaching at the Stern College for Women? , Organic Chemistry and Biochemistry. ," Organic Chemistry and Biochemistry.
7
+ ",0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
8
+ 3, What is Samuel Strum's area of expertise? ," Samuel Strum's area of expertise includes data management, data analytics, and data modeling. "," Samuel Strum's area of expertise includes data management, data analytics, and data modeling.
9
+ ",0.5454545454545454,0.5,0.5454545454545454,0.42857142857142855,0.38461538461538464,0.42857142857142855,0.75,0.7142857142857143,0.75
10
+ 4, What is the focus of Dr. David Whipple's academic research and publications? ," Dr. David Whipple's academic research and publications center on the relationships between risk management, information security and organizational culture. "," Dr. David Whipple's academic research and publications center on the relationships between risk management, information security and organizational culture.
11
+ ",0.4848484848484849,0.4516129032258065,0.5454545454545455,0.4,0.3684210526315789,0.45,0.6153846153846154,0.5833333333333334,0.6923076923076923
12
+ 5, What is Robin Brizzi's area of expertise? ," Robin Brizzi's area of expertise includes PA Education, PA Admissions, Oral and Written Communications, and Health Care Insurance Industry. "," Robin Brizzi's area of expertise includes PA Education, PA Admissions, Oral and Written Communications, and Health Care Insurance Industry.
13
+ ",0.4285714285714285,0.3846153846153846,0.4285714285714285,0.3,0.2631578947368421,0.3,0.75,0.7142857142857143,0.75
14
+ 6, What kind of mobile application does Esteban Wesson handle? , Esteban Wesson handles a mobile application that delivers life-saving notifications and information. ," Esteban Wesson handles a mobile application that delivers life-saving notifications and information.
15
+ ",0.27272727272727276,0.3,0.4545454545454546,0.23076923076923078,0.25,0.38461538461538464,0.3333333333333333,0.375,0.5555555555555556
16
+ 7, What is Dr. Jochen Raimann's role in Easy Water for Everyone? , Dr. Jochen Raimann is a member of the board of directors of Easy Water for Everyone and supports the organization's research efforts. ," Dr. Jochen Raimann is a member of the board of directors of Easy Water for Everyone and supports the organization's research efforts.
17
+ ",0.4,0.30303030303030304,0.5142857142857143,0.30434782608695654,0.22727272727272727,0.391304347826087,0.5833333333333334,0.45454545454545453,0.75
18
+ 8, What are Wenxiong Chen's research interests? ," Wenxiong Chen's research interests include nonlinear elliptic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis. "," Wenxiong Chen's research interests include nonlinear elliptic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis.
19
+ ",0.3225806451612903,0.27586206896551724,0.3225806451612903,0.20833333333333334,0.17391304347826086,0.20833333333333334,0.7142857142857143,0.6666666666666666,0.7142857142857143
20
+ 9, What is Herman Weiss's current position? , Herman Weiss is currently the vice president of medical affairs and clinical development at Juniper Pharmaceuticals in Boston. ," Herman Weiss is currently the vice president of medical affairs and clinical development at Juniper Pharmaceuticals in Boston.
21
+ ",0.24,0.08695652173913045,0.32,0.16666666666666666,0.058823529411764705,0.2222222222222222,0.42857142857142855,0.16666666666666666,0.5714285714285714
22
+ 10, What is Burdin Hickok's experience in Iraq? , Burdin Hickok served as the economic development section head and senior banking and finance advisor to the government of Iraq for the Kirkuk Provincial Reconstruction Team (PRT) for three years. ," Burdin Hickok served as the economic development section head and senior banking and finance advisor to the government of Iraq for the Kirkuk Provincial Reconstruction Team (PRT) for three years.
23
+ ",0.15789473684210528,0.05555555555555555,0.15789473684210528,0.1,0.034482758620689655,0.1,0.375,0.14285714285714285,0.375
24
+ 11, Where does Brian Malkin work? ," Brian Malkin is a partner at the law firm of McDermott Will & Emery in Washington, D.C. "," Brian Malkin is a partner at the law firm of McDermott Will & Emery in Washington, D.C.
25
+ ",0.1818181818181818,0.1,0.1818181818181818,0.11764705882352941,0.0625,0.11764705882352941,0.4,0.25,0.4
26
+ 12, How many books has Joanne Tombrakos authored? , Joanne Tombrakos has authored three books. , Joanne Tombrakos has authored three books. What is the focus of her third book? ,0.2857142857142857,0.10526315789473684,0.4761904761904762,0.21428571428571427,0.07692307692307693,0.35714285714285715,0.42857142857142855,0.16666666666666666,0.7142857142857143
27
+ 13, How many grants has Wenxiong Chen received from the Simons Foundation? , Wenxiong Chen has received two grants from the Simons Foundation. , Wenxiong Chen has received two grants from the Simons Foundation. What is Dr. Lea F. Santos' area of expertise? ,0.4666666666666667,0.2857142857142857,0.6,0.3684210526315789,0.2222222222222222,0.47368421052631576,0.6363636363636364,0.4,0.8181818181818182
28
+ 14, What areas does the program impact society in? ," The program impacts society in the areas of food, fuel, and health. "," The program impacts society in the areas of food, fuel, and health.
29
+ ",0.5,0.4444444444444444,0.6,0.4166666666666667,0.36363636363636365,0.5,0.625,0.5714285714285714,0.75
30
+ 15, What age group does Dr. Laura Sylvia work with? , Dr. Laura Sylvia works with children with hearing loss. ," Dr. Laura Sylvia works with children with hearing loss.
31
+ ",0.5555555555555556,0.5,0.5555555555555556,0.5555555555555556,0.5,0.5555555555555556,0.5555555555555556,0.5,0.5555555555555556
32
+ 16, What is auditory-verbal therapy? , Auditory-verbal therapy is a type of therapy that helps children with hearing loss learn to listen and speak using their residual hearing and technology such as hearing aids and cochlear implants. ," Auditory-verbal therapy is a type of therapy that helps children with hearing loss learn to listen and speak using their residual hearing and technology such as hearing aids and cochlear implants.
33
+ ",0.16216216216216214,0.1142857142857143,0.21621621621621623,0.09375,0.06451612903225806,0.125,0.6,0.5,0.8
34
+ 17, What is the significance of Partial Differential Equations in mathematics? ," Partial Differential Equations are important in modeling physical phenomena in fields such as physics, engineering, and finance. "," Partial Differential Equations are important in modeling physical phenomena in fields such as physics, engineering, and finance.
35
+ ",0.29629629629629634,0.16,0.29629629629629634,0.23529411764705882,0.125,0.23529411764705882,0.4,0.2222222222222222,0.4
36
+ 18, What is Alexandra Laghezza's educational background? ," Alexandra Laghezza has a PhD in Health and Rehabilitation Science, an Advanced Certificate in Disability Studies, and an M.S. in Occupational Therapy. "," Alexandra Laghezza has a PhD in Health and Rehabilitation Science, an Advanced Certificate in Disability Studies, and an M.S. in Occupational Therapy.
37
+ ",0.2,0.07142857142857144,0.2,0.13043478260869565,0.045454545454545456,0.13043478260869565,0.42857142857142855,0.16666666666666666,0.42857142857142855
38
+ 19, What is the name of the university where Dr. Sergey Buldyrev obtained his Ph.D.? ," Dr. Sergey Buldyrev obtained his Ph.D. from Saint-Petersburg State University, Russia. "," Dr. Sergey Buldyrev obtained his Ph.D. from Saint-Petersburg State University, Russia.
39
+ ",0.5,0.4615384615384615,0.5714285714285715,0.5384615384615384,0.5,0.6153846153846154,0.4666666666666667,0.42857142857142855,0.5333333333333333
40
+ 20, What is Patricia Berry's focus in marketing? ," Patricia Berry's focus in marketing is on integrated brand and customer experience strategy, communications strategy, and omnichannel marketing. "," Patricia Berry's focus in marketing is on integrated brand and customer experience strategy, communications strategy, and omnichannel marketing.
41
+ ",0.44444444444444436,0.4,0.5185185185185185,0.3157894736842105,0.2777777777777778,0.3684210526315789,0.75,0.7142857142857143,0.875
42
+ 21, What is the name of Joseph Weber-Lopez's practice? , His practice is called ProPeak Sports Medicine. ," His practice is called ProPeak Sports Medicine.
43
+ ",0.11764705882352941,0.0,0.23529411764705882,0.14285714285714285,0.0,0.2857142857142857,0.1,0.0,0.2
44
+ 22, What is manifold learning? , Manifold learning is a technique used in machine learning to analyze high-dimensional data and reduce it to a lower-dimensional space. ," Manifold learning is a technique used in machine learning to analyze high-dimensional data and reduce it to a lower-dimensional space.
45
+ ",0.15384615384615385,0.08333333333333333,0.23076923076923075,0.09090909090909091,0.047619047619047616,0.13636363636363635,0.5,0.3333333333333333,0.75
46
+ 23, What is Dr. Rana Khan's role at Yeshiva University? , Dr. Rana Khan is the founding director of the Biotechnology Management and Entrepreneurship program at Yeshiva University. ," Dr. Rana Khan is the founding director of the Biotechnology Management and Entrepreneurship program at Yeshiva University.
47
+ ",0.4444444444444445,0.32,0.5185185185185185,0.35294117647058826,0.25,0.4117647058823529,0.6,0.4444444444444444,0.7
48
+ 24, What kind of solutions does Dr. Yuri Katz's team provide? , Dr. Yuri Katz's team provides innovative solutions based on methods developed in the fundamental sciences and cutting-edge machine learning technologies. ," Dr. Yuri Katz's team provides innovative solutions based on methods developed in the fundamental sciences and cutting-edge machine learning technologies.
49
+ ",0.3636363636363636,0.3225806451612903,0.4242424242424242,0.2727272727272727,0.23809523809523808,0.3181818181818182,0.5454545454545454,0.5,0.6363636363636364
50
+ 25, Where was Tom Balga an adjunct professor? , Tom Balga was an adjunct professor in Yale University's PA program. , Tom Balga was an adjunct professor in Yale University's PA program. What is the focus of Dr. Michaela Medved's doctorate degree? ,0.3333333333333333,0.21428571428571427,0.4,0.21739130434782608,0.13636363636363635,0.2608695652173913,0.7142857142857143,0.5,0.8571428571428571
51
+ 26, What is Dr. Laura Sylvia's area of expertise? ," Hearing loss in children, language development, central auditory processing and auditory verbal therapy. "," Hearing loss in children, language development, central auditory processing and auditory verbal therapy.
52
+ ",0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
53
+ 27, How does Andrew Christler use his business expertise? , Andrew Christler uses his business expertise to establish new FEES programs across the country and the globe through SEC Medical. ," Andrew Christler uses his business expertise to establish new FEES programs across the country and the globe through SEC Medical.
54
+ ",0.4285714285714285,0.3846153846153846,0.4285714285714285,0.3,0.2631578947368421,0.3,0.75,0.7142857142857143,0.75
55
+ 28, What is Free Radicals? , Free Radicals is a strategic consultancy founded by Thom Kennon. ," Free Radicals is a strategic consultancy founded by Thom Kennon.
56
+ ",0.28571428571428575,0.16666666666666666,0.4285714285714285,0.2,0.1111111111111111,0.3,0.5,0.3333333333333333,0.75
57
+ 29, What is the role of a PA in the surgical arena? ," PAs play a crucial role in the surgical arena by assisting surgeons during procedures, providing post-operative care to patients, and performing other tasks as needed. "," PAs play a crucial role in the surgical arena by assisting surgeons during procedures, providing post-operative care to patients, and performing other tasks as needed.
58
+ ",0.27027027027027023,0.17142857142857143,0.3243243243243243,0.19230769230769232,0.12,0.23076923076923078,0.45454545454545453,0.3,0.5454545454545454
59
+ 30, What is the author's opinion on interdisciplinary research? , The author is excited about interdisciplinary research and collaborating with researchers from related fields. ," The author is excited about interdisciplinary research and collaborating with researchers from related fields.
60
+ ",0.34782608695652173,0.1904761904761905,0.43478260869565216,0.2857142857142857,0.15384615384615385,0.35714285714285715,0.4444444444444444,0.25,0.5555555555555556
61
+ 31, What is the Business Relationship Management Institute? ," The Business Relationship Management Institute is a go-to source of professional learning and a community catalyst for more than 20,000 BRM professionals in 40 countries around the world, created and co-founded by Aleksandr Zhuk. "," The Business Relationship Management Institute is a go-to source of professional learning and a community catalyst for more than 20,000 BRM professionals in 40 countries around the world, created and co-founded by Aleksandr Zhuk.
62
+ ",0.22727272727272727,0.1904761904761905,0.27272727272727276,0.13513513513513514,0.1111111111111111,0.16216216216216217,0.7142857142857143,0.6666666666666666,0.8571428571428571
63
+ 32, What is Robert Friedman's area of expertise? ," Robert Friedman's area of expertise is Business Strategy and Operations for Biotech, Biopharma, and Medical Technologies. "," Robert Friedman's area of expertise is Business Strategy and Operations for Biotech, Biopharma, and Medical Technologies.
64
+ ",0.48,0.43478260869565216,0.56,0.35294117647058826,0.3125,0.4117647058823529,0.75,0.7142857142857143,0.875
65
+ 33, What type of journals have published Dr. Kemeny's discoveries? , Dr. Kemeny's discoveries have been published in scientific journals. ," Dr. Kemeny's discoveries have been published in scientific journals.
66
+ ",0.4000000000000001,0.3333333333333333,0.7,0.4,0.3333333333333333,0.7,0.4,0.3333333333333333,0.7
67
+ 34, What is Robert Friedman's area of expertise? ," Robert Friedman's area of expertise is Business Strategy and Operations for Biotech, Biopharma, and Medical Technologies. "," Robert Friedman's area of expertise is Business Strategy and Operations for Biotech, Biopharma, and Medical Technologies.
68
+ ",0.48,0.43478260869565216,0.56,0.35294117647058826,0.3125,0.4117647058823529,0.75,0.7142857142857143,0.875
69
+ 35, What degrees does William Wedo hold? , William Wedo holds an M.A. in journalism from Temple University and an MBA from Westchester University of Pennsylvania. ," William Wedo holds an M.A. in journalism from Temple University and an MBA from Westchester University of Pennsylvania.
70
+ ",0.23999999999999996,0.1739130434782609,0.23999999999999996,0.15789473684210525,0.1111111111111111,0.15789473684210525,0.5,0.4,0.5
71
+ 36, What are some recent publications by Marian Gidea? ," Some recent publications by Marian Gidea include ""Diffusion along chains of normally hyperbolic cylinders"", ""Melnikov method for non-conservative perturbations of the three-body problem"", and ""Surface gravity of rotating dumbbell shapes"". "," Some recent publications by Marian Gidea include ""Diffusion along chains of normally hyperbolic cylinders"", ""Melnikov method for non-conservative perturbations of the three-body problem"", and ""Surface gravity of rotating dumbbell shapes"".
72
+ ",0.3,0.2631578947368421,0.3,0.1875,0.16129032258064516,0.1875,0.75,0.7142857142857143,0.75
73
+ 37, What is Robert Friedman's role at Katz? , Robert Friedman is the professor of Biotechnology Management at Katz. ," Robert Friedman is the professor of Biotechnology Management at Katz.
74
+ ",0.4444444444444445,0.25,0.5555555555555556,0.4,0.2222222222222222,0.5,0.5,0.2857142857142857,0.625
75
+ 38, What academic societies is Dr. Michael Dalezman a member of? , Dr. Michael Dalezman is a member of Sigma Xi and Pi Mu Epsilon. ," Dr. Michael Dalezman is a member of Sigma Xi and Pi Mu Epsilon.
76
+ ",0.5217391304347826,0.380952380952381,0.608695652173913,0.46153846153846156,0.3333333333333333,0.5384615384615384,0.6,0.4444444444444444,0.7
77
+ 39, What is the focus of Dr. Michaela Medved's doctoral degree? , The focus of Dr. Michaela Medved ," The focus of Dr. Michaela Medved
78
+ ",0.7058823529411764,0.6666666666666666,0.7058823529411764,1.0,1.0,1.0,0.5454545454545454,0.5,0.5454545454545454
79
+ 40, What approach does Sharon Verity take to education? , Sharon Verity focuses on a patient-centered approach to education. ," Sharon Verity focuses on a patient-centered approach to education.
80
+ ",0.4444444444444445,0.25,0.5555555555555556,0.4,0.2222222222222222,0.5,0.5,0.2857142857142857,0.625
81
+ 41, What is Brittany Palmer's professional background? ," Brittany Palmer is a clinical assistant professor, clinical supervisor, and CCC-SLP. "," Brittany Palmer is a clinical assistant professor, clinical supervisor, and CCC-SLP.
82
+ ",0.2105263157894737,0.11764705882352942,0.3157894736842105,0.16666666666666666,0.09090909090909091,0.25,0.2857142857142857,0.16666666666666666,0.42857142857142855
83
+ 42, What is Lawrence Fulton's previous work focused on? ," Lawrence Fulton's previous work focused on the application of artificial intelligence and neural networks in specialized applications, the use of Python and R applications to health data science problems, and the application of hierarchical forecasting models using both imagery and time series data simultaneously. "," Lawrence Fulton's previous work focused on the application of artificial intelligence and neural networks in specialized applications, the use of Python and R applications to health data science problems, and the application of hierarchical forecasting models using both imagery and time series data simultaneously.
84
+ ",0.25925925925925924,0.23076923076923075,0.25925925925925924,0.15555555555555556,0.13636363636363635,0.15555555555555556,0.7777777777777778,0.75,0.7777777777777778
85
+ 43, What type of medicine does Sharon Verity practice at Yale New Haven Hospital? , Sharon Verity practices emergency medicine at Yale New Haven Hospital. ," Sharon Verity practices emergency medicine at Yale New Haven Hospital.
86
+ ",0.6956521739130435,0.5714285714285715,0.7826086956521738,0.8,0.6666666666666666,0.9,0.6153846153846154,0.5,0.6923076923076923
87
+ 44, How many research abstracts has Dr. Barrera published? , Dr. Barrera has published over 80 research abstracts. , Dr. Barrera has published over 80 research abstracts. What is the focus of her publications? ,0.26086956521739135,0.19047619047619047,0.5217391304347827,0.2,0.14285714285714285,0.4,0.375,0.2857142857142857,0.75
88
+ 45, Is Sharon Verity a medical doctor? ," No, Sharon Verity is not a medical doctor. She is a PA-C. "," No, Sharon Verity is not a medical doctor. She is a PA-C.
89
+ ",0.5263157894736842,0.35294117647058826,0.631578947368421,0.38461538461538464,0.25,0.46153846153846156,0.8333333333333334,0.6,1.0
90
+ 46, How many years of experience does Julian Jenkins have in the life science and pharmaceutical industry? , Julian Jenkins has almost 30 years of experience in the life science and pharmaceutical industry. ," Julian Jenkins has almost 30 years of experience in the life science and pharmaceutical industry.
91
+ ",0.6451612903225806,0.6206896551724138,0.7741935483870969,0.6666666666666666,0.6428571428571429,0.8,0.625,0.6,0.75
92
+ 47, What is Tia Hopkins' current position? , Tia Hopkins is the vice president of global sales engineering with eSentire. ," Tia Hopkins is the vice president of global sales engineering with eSentire.
93
+ ",0.2222222222222222,0.12500000000000003,0.3333333333333333,0.16666666666666666,0.09090909090909091,0.25,0.3333333333333333,0.2,0.5
94
+ 48, What is eSentire? , eSentire is a leading provider of managed detection and response (MDR) services. ," eSentire is a leading provider of managed detection and response (MDR) services.
95
+ ",0.13333333333333333,0.0,0.26666666666666666,0.08333333333333333,0.0,0.16666666666666666,0.3333333333333333,0.0,0.6666666666666666
96
+ 49, What are the different phases that proteins can self-assemble into? ," Proteins can self-assemble into a variety of phases, including crystals, dense liquids, aggregates, and gels. "," Proteins can self-assemble into a variety of phases, including crystals, dense liquids, aggregates, and gels.
97
+ ",0.3703703703703703,0.32,0.4444444444444444,0.3125,0.26666666666666666,0.375,0.45454545454545453,0.4,0.5454545454545454
98
+ 50, What is the name of the organization for which Dr. Lea F. Santos has worked for awards? , American Physical Society (APS). ," American Physical Society (APS).
99
+ ",0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
100
+ 51, What is Alexandra Laghezza's educational background? ," Alexandra Laghezza has a PhD in Health and Rehabilitation Science, an Advanced Certificate in Disability Studies, and an M.S. in Occupational Therapy. "," Alexandra Laghezza has a PhD in Health and Rehabilitation Science, an Advanced Certificate in Disability Studies, and an M.S. in Occupational Therapy.
101
+ ",0.2,0.07142857142857144,0.2,0.13043478260869565,0.045454545454545456,0.13043478260869565,0.42857142857142855,0.16666666666666666,0.42857142857142855
102
+ 52, What are Wenxiong Chen's research interests? ," Wenxiong Chen's research interests include nonlinear elliptic and parabolic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis. "," Wenxiong Chen's research interests include nonlinear elliptic and parabolic partial differential equations, fractional Laplacians and other non-local operators, nonlinear functional analysis, and geometric analysis.
103
+ ",0.30303030303030304,0.25806451612903225,0.30303030303030304,0.19230769230769232,0.16,0.19230769230769232,0.7142857142857143,0.6666666666666666,0.7142857142857143
104
+ 53, What is Dr. Bodie's profession? , Dr. Bodie is an occupational therapist. ," Dr. Bodie is an occupational therapist.
105
+ ",0.3333333333333333,0.20000000000000004,0.5,0.3333333333333333,0.2,0.5,0.3333333333333333,0.2,0.5
106
+ 54, Who was Emil Prodan's PhD advisor? , Emil Prodan's PhD advisor was Peter Nordlander. ," Emil Prodan's PhD advisor was Peter Nordlander.
107
+ ",0.6666666666666666,0.6153846153846153,0.7999999999999999,0.625,0.5714285714285714,0.75,0.7142857142857143,0.6666666666666666,0.8571428571428571
+ 55, What is the focus of Dr. Marini's research? , The text does not provide information on the focus of Dr. Marini's research. ," The text does not provide information on the focus of Dr. Marini's research.
+ ",0.6086956521739131,0.5714285714285714,0.6086956521739131,0.5,0.46153846153846156,0.5,0.7777777777777778,0.75,0.7777777777777778
+ 56, What is a CRN? ," A CRN is a course registration number, which is unique to each course. "," A CRN is a course registration number, which is unique to each course.
+ ",0.23529411764705882,0.26666666666666666,0.3529411764705882,0.15384615384615385,0.16666666666666666,0.23076923076923078,0.5,0.6666666666666666,0.75
+ 57, How many proceedings volumes has Dr. Edward Belbruno edited? , Dr. Edward Belbruno has edited eight proceedings volumes. , Dr. Edward Belbruno has edited eight proceedings volumes. What is the focus of Dr. Edward A. Belbruno's research in mathematics and physics? ,0.3125,0.19999999999999998,0.43750000000000006,0.21739130434782608,0.13636363636363635,0.30434782608695654,0.5555555555555556,0.375,0.7777777777777778
+ 58, Where did Dr. Troy Dargin earn his juris doctor degree? , Yeshiva University's Cardozo School of Law. , Yeshiva University's Cardozo School of Law. What is the focus of Dr. Troy Dargin's research in intellectual property law? ,0.19354838709677416,0.13793103448275865,0.19354838709677416,0.14285714285714285,0.1,0.14285714285714285,0.3,0.2222222222222222,0.3
+ 59, What are the admissions criteria for the B.A./M.S. Pathways option? , Students must have completed a minimum of 84 credits in any YC/SCW undergraduate major with a minimum GPA of 3.0 and a minimum grade of B in the prerequisite coursework. ," Students must have completed a minimum of 84 credits in any YC/SCW undergraduate major with a minimum GPA of 3.0 and a minimum grade of B in the prerequisite coursework.
+ ",0.044444444444444446,0.0,0.13333333333333333,0.03125,0.0,0.09375,0.07692307692307693,0.0,0.23076923076923078
+ 60, What is the title of Emil Prodan's monograph? ," Emil Prodan's monograph is titled ""Bulk and Boundary Invariants for Complex Topological Insulators "," Emil Prodan's monograph is titled ""Bulk and Boundary Invariants for Complex Topological Insulators ].
+ ",0.34782608695652173,0.2857142857142857,0.5217391304347826,0.2857142857142857,0.23076923076923078,0.42857142857142855,0.4444444444444444,0.375,0.6666666666666666
+ 61, What is Lev Feldman's educational background? , Lev Feldman holds an MBA in global management from the University of Phoenix and an M.S. in information systems from the Stevens Institute of Technology. ," Lev Feldman holds an MBA in global management from the University of Phoenix and an M.S. in information systems from the Stevens Institute of Technology.
+ ",0.18181818181818182,0.06451612903225806,0.18181818181818182,0.11538461538461539,0.04,0.11538461538461539,0.42857142857142855,0.16666666666666666,0.42857142857142855
+ 62, What is Dr. Garfinkel's certification? , Dr. Garfinkel is certified as an assistive technology professional (ATP) through the Rehabilitation and Engineering Society of North America (RESNA). ," Dr. Garfinkel is certified as an assistive technology professional (ATP) through the Rehabilitation and Engineering Society of North America (RESNA).
+ ",0.15384615384615383,0.08333333333333333,0.23076923076923075,0.1,0.05263157894736842,0.15,0.3333333333333333,0.2,0.5
+ 63, Where is Michael Dalezman's office located? ," Michael Dalezman's office is located in Stanton Hall (245 Lexington) on the Beren campus, in room #304. "," Michael Dalezman's office is located in Stanton Hall (245 Lexington) on the Beren campus, in room #304.
+ ",0.4,0.2608695652173913,0.48,0.2777777777777778,0.17647058823529413,0.3333333333333333,0.7142857142857143,0.5,0.8571428571428571
+ 64, What kind of companies has Dr. Roldan consulted for? ," Dr. Roldan has consulted on machine learning for companies that work in medical devices, molecular design, and treasury management. "," Dr. Roldan has consulted on machine learning for companies that work in medical devices, molecular design, and treasury management.
+ ",0.2857142857142857,0.07692307692307691,0.42857142857142855,0.21052631578947367,0.05555555555555555,0.3157894736842105,0.4444444444444444,0.125,0.6666666666666666
+ 65, What is the focus of Brian Malkin's practice? , Brian Malkin's practice is at the intersection of FDA-regulated products and patent law. ," Brian Malkin's practice is at the intersection of FDA-regulated products and patent law.
+ ",0.33333333333333337,0.2727272727272727,0.5833333333333334,0.26666666666666666,0.21428571428571427,0.4666666666666667,0.4444444444444444,0.375,0.7777777777777778
+ 66, What is the focus of Julia Agosto's doctorate degree? , Julia Agosto's doctorate degree focuses on healthcare and education leadership. ," Julia Agosto's doctorate degree focuses on healthcare and education leadership.
+ ",0.47619047619047616,0.4210526315789474,0.47619047619047616,0.45454545454545453,0.4,0.45454545454545453,0.5,0.4444444444444444,0.5
adapter_config.json ADDED
@@ -0,0 +1,28 @@
+ {
+   "alpha_pattern": {},
+   "auto_mapping": null,
+   "base_model_name_or_path": "filipealmeida/Mistral-7B-Instruct-v0.1-sharded",
+   "bias": "none",
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "lora_alpha": 32,
+   "lora_dropout": 0.05,
+   "modules_to_save": null,
+   "peft_type": "LORA",
+   "r": 32,
+   "rank_pattern": {},
+   "revision": null,
+   "target_modules": [
+     "o_proj",
+     "down_proj",
+     "q_proj",
+     "gate_proj",
+     "up_proj",
+     "v_proj",
+     "k_proj"
+   ],
+   "task_type": "CAUSAL_LM"
+ }
adapter_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8b4d5ca4d884fdfad390b51ddac14afececb80050ad2ae2572a7be81b88a8443
+ size 335706186
config.json ADDED
@@ -0,0 +1,40 @@
+ {
+   "_name_or_path": "filipealmeida/Mistral-7B-Instruct-v0.1-sharded",
+   "architectures": [
+     "LlamaForCausalLM"
+   ],
+   "attention_bias": false,
+   "bos_token_id": 1,
+   "eos_token_id": 2,
+   "hidden_act": "silu",
+   "hidden_size": 4096,
+   "initializer_range": 0.02,
+   "intermediate_size": 14336,
+   "max_position_embeddings": 32768,
+   "model_type": "llama",
+   "num_attention_heads": 32,
+   "num_hidden_layers": 32,
+   "num_key_value_heads": 8,
+   "pretraining_tp": 1,
+   "quantization_config": {
+     "bnb_4bit_compute_dtype": "bfloat16",
+     "bnb_4bit_quant_type": "nf4",
+     "bnb_4bit_use_double_quant": false,
+     "llm_int8_enable_fp32_cpu_offload": false,
+     "llm_int8_has_fp16_weight": false,
+     "llm_int8_skip_modules": null,
+     "llm_int8_threshold": 6.0,
+     "load_in_4bit": true,
+     "load_in_8bit": false,
+     "quant_method": "bitsandbytes"
+   },
+   "rms_norm_eps": 1e-05,
+   "rope_scaling": null,
+   "rope_theta": 10000.0,
+   "sliding_window": 4096,
+   "tie_word_embeddings": false,
+   "torch_dtype": "float16",
+   "transformers_version": "4.35.0.dev0",
+   "use_cache": false,
+   "vocab_size": 32000
+ }
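The values in `adapter_config.json` and `config.json` are enough to estimate how many LoRA parameters this adapter holds. The sketch below is only a sanity check, not part of the commit: it assumes the usual grouped-query attention layout (head dimension = `hidden_size / num_attention_heads`, key/value width = head dimension × `num_key_value_heads`) and that each targeted projection gets one `A` matrix of shape `r × in` and one `B` matrix of shape `out × r`. The names `TARGET_SHAPES` and `lora_param_count` are illustrative.

```python
# Values copied from adapter_config.json and config.json in this commit.
LORA_RANK = 32
LORA_ALPHA = 32
HIDDEN_SIZE = 4096
INTERMEDIATE_SIZE = 14336
NUM_ATTENTION_HEADS = 32
NUM_KEY_VALUE_HEADS = 8
NUM_HIDDEN_LAYERS = 32
ADAPTER_FILE_BYTES = 335706186  # "size" in the adapter_model.bin LFS pointer

# Assumption: standard grouped-query attention shapes.
HEAD_DIM = HIDDEN_SIZE // NUM_ATTENTION_HEADS
KV_DIM = HEAD_DIM * NUM_KEY_VALUE_HEADS

# (input features, output features) for every module in "target_modules".
TARGET_SHAPES = {
    "q_proj": (HIDDEN_SIZE, HIDDEN_SIZE),
    "k_proj": (HIDDEN_SIZE, KV_DIM),
    "v_proj": (HIDDEN_SIZE, KV_DIM),
    "o_proj": (HIDDEN_SIZE, HIDDEN_SIZE),
    "gate_proj": (HIDDEN_SIZE, INTERMEDIATE_SIZE),
    "up_proj": (HIDDEN_SIZE, INTERMEDIATE_SIZE),
    "down_proj": (INTERMEDIATE_SIZE, HIDDEN_SIZE),
}


def lora_param_count(shapes, rank, num_layers):
    """Count LoRA weights: each module adds rank * (fan_in + fan_out) per layer."""
    per_layer = sum(rank * (fan_in + fan_out) for fan_in, fan_out in shapes.values())
    return per_layer * num_layers


total_params = lora_param_count(TARGET_SHAPES, LORA_RANK, NUM_HIDDEN_LAYERS)
fp32_bytes = total_params * 4          # size if every weight is a 32-bit float
scaling = LORA_ALPHA / LORA_RANK       # factor applied to the low-rank update

print(f"LoRA parameters: {total_params:,}")
print(f"Estimated fp32 size: {fp32_bytes:,} bytes")
print(f"Unaccounted bytes in adapter_model.bin: {ADAPTER_FILE_BYTES - fp32_bytes:,}")
print(f"LoRA scaling (lora_alpha / r): {scaling}")
```

Under these assumptions the estimate lands within about 0.05% of the `adapter_model.bin` size recorded in the LFS pointer, which is consistent with the adapter weights being stored as 32-bit floats plus serialization overhead. That is an inference from the file size, not something the commit states.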
pytorch_model-001.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:df2ef28a231870bcc3b252dbfb613944bed9f208d2f59fddca8ffd28161bfb8d
+ size 3371838230
special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "</s>",
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render.
tokenizer_config.json ADDED
@@ -0,0 +1,42 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "</s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "additional_special_tokens": [],
+   "bos_token": "<s>",
+   "chat_template": "{{ bos_token }}{% for message in messages %}{% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %}{{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }}{% endif %}{% if message['role'] == 'user' %}{{ '[INST] ' + message['content'] + ' [/INST]' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token + ' ' }}{% else %}{{ raise_exception('Only user and assistant roles are supported!') }}{% endif %}{% endfor %}",
+   "clean_up_tokenization_spaces": false,
+   "eos_token": "</s>",
+   "legacy": true,
+   "model_max_length": 1000000000000000019884624838656,
+   "pad_token": "</s>",
+   "sp_model_kwargs": {},
+   "spaces_between_special_tokens": false,
+   "tokenizer_class": "LlamaTokenizer",
+   "torch_dtype": "auto",
+   "unk_token": "<unk>",
+   "use_default_system_prompt": true
+ }
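The `chat_template` entry above is a Jinja template, which can be hard to read on one line. As a rough illustration of the prompt string it produces, here is a plain-Python re-implementation of the same logic. The function name `render_chat` is hypothetical, and the sketch only mirrors the template text shown in `tokenizer_config.json`; it does not call the tokenizer.

```python
# Special tokens as declared in tokenizer_config.json.
BOS_TOKEN = "<s>"
EOS_TOKEN = "</s>"


def render_chat(messages):
    """Mirror the chat_template: alternate user/assistant turns after the BOS token."""
    parts = [BOS_TOKEN]
    for index, message in enumerate(messages):
        role = message["role"]
        # Even positions must be user turns, odd positions must not be.
        if (role == "user") != (index % 2 == 0):
            raise ValueError(
                "Conversation roles must alternate user/assistant/user/assistant/..."
            )
        if role == "user":
            parts.append("[INST] " + message["content"] + " [/INST]")
        elif role == "assistant":
            parts.append(message["content"] + EOS_TOKEN + " ")
        else:
            raise ValueError("Only user and assistant roles are supported!")
    return "".join(parts)


if __name__ == "__main__":
    conversation = [{"role": "user", "content": "What is a CRN?"}]
    print(render_chat(conversation))
```

When the `transformers` library is available, `tokenizer.apply_chat_template(messages, tokenize=False)` is the usual way to apply the stored template directly; the sketch above is only meant to make the format visible.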
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2bf19fbd377ee8a71c481e66bf314daf736444ebd20d366bc8c57dbf796cfdba
+ size 4536