AjayP13 commited on
Commit
b78f98f
·
verified ·
1 Parent(s): 15915f4

Upload tokenizer

Browse files
Files changed (1) hide show
  1. README.md +191 -14
README.md CHANGED
@@ -1,7 +1,5 @@
1
-
2
  ---
3
  base_model: BAAI/bge-m3
4
-
5
  tags:
6
  - datadreamer
7
  - datadreamer-0.46.0
@@ -12,18 +10,197 @@ tags:
12
  library_name: sentence-transformers
13
  pipeline_tag: sentence-similarity
14
  widget:
15
- - example_title: "Example 1"
16
- source_sentence: "Tammy Tran, an undergraduate student in the Preston Lab, received the Rapaport-King Thesis Scholarship from the College of Liberal Arts. The Rapaport-King Scholarship is awarded to Honors Program students in the College of Liberal Arts who are conducting research and writing a senior thesis. Tammy also received an Undergraduate Research Fellowship awarded to students for [\u2026]\nDr. Ila Fiete recently published her paper entitled \u201cFundamental limits on persistent activity in networks of noisy neurons\u201d in PNAS. The research investigates memory, diffusion and information-diffusion inequality in the brain. Y. Burak and I. R. Fiete. (2012). Fundamental limits on persistent activity in networks of noisy neurons. PNAS Early Edition (Oct. 9).\nDr. Hiroshi Nishiyama received a R01 grant from the NINDS for his project entitled \u201cCNS Mechanisms of developmental synapse elimination\u201d. This project investigates how the precision of synaptic circuitry is created in the developing mammalian brain by observing the process in the intact, live animals.\nJohn Widloski, graduate student in the Fiete Lab, received the Burroughs-Welcome Fund Award to attend the Methods in Computational Neuroscience summer course in Woods Hole, MA.\nAkram Bakkor, graduate student in the Poldrack lab, received a National Defense Science & Engineering Graduate Fellowship. This fellowship supports Akram\u2019s research project investigating the neural mechanisms underlying how learned behaviors are changed. Using behavioral testing, modeling and fMRI analyses on human subjects the project will shed light on why habits can be so difficult [\u2026]\nDr. Boris Zemelman is the recipient of a Human Frontiers in Science Program Grant. This grant entitled \u201cIn vivo functional imaging and high-resolution manipulations of hippocampal memory circuits\u201d is a collaborative project that will investigate how the brain encodes and processes spatial memory. Dr. Zemelman will use genetic tools for activation and silencing neurons and in [\u2026]"
17
- sentences:
18
- - "A document that provides information about academic achievements, research funding, and scholarly publications of students and faculty members in a specific institution or department, such as the Preston Lab, Fiete Lab, or Poldrack lab, would be relevant. The document should contain specific details about the awards, scholarships, or grants received by the individuals, including the name of the award, the recipient, and the purpose or focus of the research project, to allow for a determination of the areas of study and research interests. This could include announcements, press releases, or news articles from academic institutions, research organizations, or scientific journals, and should provide a clear explanation of the research projects, enabling a reader to understand the objectives, methods, and significance of the studies. Additionally, the document would include information about the research topics, such as memory, diffusion, and information-diffusion inequality in the brain, or the neural mechanisms underlying learned behaviors, and would discuss the methodologies and techniques used, such as behavioral testing, modeling, and fMRI analyses. The document should also provide information about the funding sources, such as the Rapaport-King Thesis Scholarship, the National Defense Science & Engineering Graduate Fellowship, or the Human Frontiers in Science Program Grant, and explain the criteria or selection process for these awards. Furthermore, the document would describe the collaborative projects, such as the investigation of how the brain encodes and processes spatial memory, and the use of genetic tools for activation and silencing neurons, and would discuss the potential impact or contributions of the research to the field. Overall, a document that provides a comprehensive and detailed account of the academic achievements, research funding, and scholarly publications of students and faculty members would be able to provide an answer to the question."
19
- - example_title: "Example 2"
20
- source_sentence: "Question: Yang, could you tell about yourself?\nYang: I was born in Nanjing, now I live in the capital of China - Beijing. When I was 8, my father brought me to a chess center in Nanjing. There were three kinds of chess: Chinese chess, chess and I-go. We decided to choose chess: despite the popularity of Chinese chess in our country, they are not popular abroad.\nNow I study in Tsinghua University University, which is one of our best, at economics and management faculty. I am the second-year student.\nQ: Will you choose economics or chess as your main profession?\nYang: I used to be a professional chessplayer, but now I spend some time for studying. I will make the final decision after my graduation. If I can improve my level, I will go on playing chess.\nQ: How do you divide your time between chess and other things?\nYang: I spend half of my time on chess and half on study.\nQ: What are you interested in?\nYang: I like to read, listen to the music and write stories. When I was in my childhood, I wrote some cartoons, flesh-stories. Now I write novels.\nQ: What are your preferences in the literature and music?\nYang: Light and classic music. About literature: usually I prefer Chinese books. Recently I got very interested in environment subjects. I learn some materials and environment issues.\nQ: Do you read some chess literature?\nYang: Very few.\nQ: Do you take any sports activities?\nYang: I do a little yoga. I like swimming, but I cannot swim often.\nQ: You travel a lot - which country do you like most?\nYang: I like all the countries I have visited. Every place has its beauty, its own unique culture and rich history. Human history.\nQ: Do you collect any information about the new country before your visit?\nYang: Yes, sometimes when I check it in the Internet.\nQ: Do you have some goals for the nearest future?\nYang: My main goal is connected with chess: I have some problems in my career. I always blunder in good positions. It lasts the last several years. My goal is to cover it."
21
- sentences:
22
- - "A document that provides a personal and introspective account of an individual's life, interests, and goals, particularly focusing on their background, education, and passions, would be suitable. The document should contain detailed information about the individual's birthplace, current residence, and educational institution, as well as their field of study and faculty, and should discuss their early introduction to chess and their decision to pursue it despite its relatively low popularity abroad. It should also delve into the individual's profession, including their experience as a professional chess player and their current balance between studying and playing chess, as well as their future plans and aspirations. Additionally, the document should explore the individual's hobbies and interests, including reading, listening to music, and writing stories, and should provide insight into their preferences in literature and music, including their fondness for light and classic music and Chinese books. The document should also touch on the individual's sports activities, such as yoga and swimming, and their travel experiences, including their approach to learning about new countries before visiting them. Furthermore, the document should discuss the individual's goals and challenges, particularly in relation to their chess career, including their struggles with blundering in good positions and their desire to improve. The document would offer a comprehensive and personal portrait of the individual, including their thoughts, feelings, and experiences, and would provide a unique perspective on their life and aspirations. Additionally, the document would be written in a conversational style, with a question-and-answer format, making it an engaging and relatable read. Overall, the document should provide a nuanced and detailed understanding of the individual's life, interests, and goals, allowing readers to gain insight into their thoughts, feelings, and experiences."
23
- - example_title: "Example 3"
24
- source_sentence: "A document that provides guidance on the self-moderation of the adventurous activity permit scheme within Scouting in the UK, would be relevant, and should include detailed information on the moderation process, the roles and responsibilities of Managers of the Activity Permit Scheme (MAPS) and County Commissioners, and the importance of ensuring the scheme's effectiveness and robustness. This document should offer a comprehensive overview of the moderation scheme, including its design, purpose, and benefits, and would cover the key aspects of the scheme, such as the minimum standards and good practice areas that Counties must adhere to, as well as the process for identifying and addressing areas for improvement. The document should also provide information on the County Self Moderation form, its structure, and how it is used to record and track progress, including the ability to record action plans for areas not met, and would explain the requirements for implementing action plans, particularly for minimum standards that are not met. Additionally, the document would discuss the role of the UK Activities Team in providing support to Counties that are not meeting one or more standards, and the process for requesting and receiving support, including the development of action plans and the provision of guidance and resources. Furthermore, the document should cover the sampling process, where a selection of self-moderations are reviewed each year, and would explain the purpose of this process, which is to identify trends, document the operation of the permit scheme, and demonstrate The Scout Association's ability to manage the provision of adventurous activities internally. Overall, the document should be a detailed and informative guide for MAPS, County Commissioners, and other stakeholders, providing a clear understanding of the self-moderation process and its importance in ensuring the safe and effective delivery of adventurous activities within Scouting."
25
- sentences:
26
- - "A document that provides information on data parsing and extraction methods, specifically focusing on the efficient handling of <fi>description of the input data</fi> to obtain <fi>specific information or value</fi>, and discusses the use of <fi>programming language or tool</fi> for this purpose, would be suitable. This document should include examples or representations of input data, such as <fi>representation of the input data</fi>, and clearly outline the expected output or result, like `<fi>expected output or result</fi>`, to guide the extraction process. It may come from various domains, including but not limited to, computer science, data analysis, and software development, and could be in the form of a web page, article, book, or essay, as long as it offers detailed insights into efficient data parsing techniques and the application of specific programming languages or tools to achieve the desired outcome. Furthermore, the document should cover potential challenges or considerations in the parsing and extraction process, ensuring that the reader can adapt the methods to different scenarios involving <fi>description of the input data</fi> and <fi>programming language or tool</fi>. The document must also demonstrate how to work with the specified <fi>input data</fi> to produce the intended `<fi>expected output or result</fi>`, serving as a comprehensive resource for individuals seeking to efficiently parse and extract specific information using <fi>programming language or tool</fi>. Additionally, it should be able to discuss the relevance of efficiently parsing <fi>description of the input data</fi> and the benefits of using <fi>programming language or tool</fi> for the extraction of <fi>specific information or value</fi>, providing a well-rounded understanding of the topic. Overall, a suitable document would be one that not only provides technical guidance but also contextual understanding and practical applications of data parsing and extraction techniques."
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ---
28
  # Model Card
29
 
 
 
1
  ---
2
  base_model: BAAI/bge-m3
 
3
  tags:
4
  - datadreamer
5
  - datadreamer-0.46.0
 
10
  library_name: sentence-transformers
11
  pipeline_tag: sentence-similarity
12
  widget:
13
+ - example_title: Example 1
14
+ source_sentence: 'Tammy Tran, an undergraduate student in the Preston Lab, received
15
+ the Rapaport-King Thesis Scholarship from the College of Liberal Arts. The Rapaport-King
16
+ Scholarship is awarded to Honors Program students in the College of Liberal Arts
17
+ who are conducting research and writing a senior thesis. Tammy also received an
18
+ Undergraduate Research Fellowship awarded to students for […]
19
+
20
+ Dr. Ila Fiete recently published her paper entitled “Fundamental limits on persistent
21
+ activity in networks of noisy neurons” in PNAS. The research investigates memory,
22
+ diffusion and information-diffusion inequality in the brain. Y. Burak and I. R.
23
+ Fiete. (2012). Fundamental limits on persistent activity in networks of noisy
24
+ neurons. PNAS Early Edition (Oct. 9).
25
+
26
+ Dr. Hiroshi Nishiyama received a R01 grant from the NINDS for his project entitled
27
+ “CNS Mechanisms of developmental synapse elimination”. This project investigates
28
+ how the precision of synaptic circuitry is created in the developing mammalian
29
+ brain by observing the process in the intact, live animals.
30
+
31
+ John Widloski, graduate student in the Fiete Lab, received the Burroughs-Welcome
32
+ Fund Award to attend the Methods in Computational Neuroscience summer course in
33
+ Woods Hole, MA.
34
+
35
+ Akram Bakkor, graduate student in the Poldrack lab, received a National Defense
36
+ Science & Engineering Graduate Fellowship. This fellowship supports Akram’s research
37
+ project investigating the neural mechanisms underlying how learned behaviors are
38
+ changed. Using behavioral testing, modeling and fMRI analyses on human subjects
39
+ the project will shed light on why habits can be so difficult […]
40
+
41
+ Dr. Boris Zemelman is the recipient of a Human Frontiers in Science Program Grant.
42
+ This grant entitled “In vivo functional imaging and high-resolution manipulations
43
+ of hippocampal memory circuits” is a collaborative project that will investigate
44
+ how the brain encodes and processes spatial memory. Dr. Zemelman will use genetic
45
+ tools for activation and silencing neurons and in […]'
46
+ sentences:
47
+ - A document that provides information about academic achievements, research funding,
48
+ and scholarly publications of students and faculty members in a specific institution
49
+ or department, such as the Preston Lab, Fiete Lab, or Poldrack lab, would be relevant.
50
+ The document should contain specific details about the awards, scholarships, or
51
+ grants received by the individuals, including the name of the award, the recipient,
52
+ and the purpose or focus of the research project, to allow for a determination
53
+ of the areas of study and research interests. This could include announcements,
54
+ press releases, or news articles from academic institutions, research organizations,
55
+ or scientific journals, and should provide a clear explanation of the research
56
+ projects, enabling a reader to understand the objectives, methods, and significance
57
+ of the studies. Additionally, the document would include information about the
58
+ research topics, such as memory, diffusion, and information-diffusion inequality
59
+ in the brain, or the neural mechanisms underlying learned behaviors, and would
60
+ discuss the methodologies and techniques used, such as behavioral testing, modeling,
61
+ and fMRI analyses. The document should also provide information about the funding
62
+ sources, such as the Rapaport-King Thesis Scholarship, the National Defense Science
63
+ & Engineering Graduate Fellowship, or the Human Frontiers in Science Program Grant,
64
+ and explain the criteria or selection process for these awards. Furthermore, the
65
+ document would describe the collaborative projects, such as the investigation
66
+ of how the brain encodes and processes spatial memory, and the use of genetic
67
+ tools for activation and silencing neurons, and would discuss the potential impact
68
+ or contributions of the research to the field. Overall, a document that provides
69
+ a comprehensive and detailed account of the academic achievements, research funding,
70
+ and scholarly publications of students and faculty members would be able to provide
71
+ an answer to the question.
72
+ - example_title: Example 2
73
+ source_sentence: 'Question: Yang, could you tell about yourself?
74
+
75
+ Yang: I was born in Nanjing, now I live in the capital of China - Beijing. When
76
+ I was 8, my father brought me to a chess center in Nanjing. There were three kinds
77
+ of chess: Chinese chess, chess and I-go. We decided to choose chess: despite the
78
+ popularity of Chinese chess in our country, they are not popular abroad.
79
+
80
+ Now I study in Tsinghua University University, which is one of our best, at economics
81
+ and management faculty. I am the second-year student.
82
+
83
+ Q: Will you choose economics or chess as your main profession?
84
+
85
+ Yang: I used to be a professional chessplayer, but now I spend some time for studying.
86
+ I will make the final decision after my graduation. If I can improve my level,
87
+ I will go on playing chess.
88
+
89
+ Q: How do you divide your time between chess and other things?
90
+
91
+ Yang: I spend half of my time on chess and half on study.
92
+
93
+ Q: What are you interested in?
94
+
95
+ Yang: I like to read, listen to the music and write stories. When I was in my
96
+ childhood, I wrote some cartoons, flesh-stories. Now I write novels.
97
+
98
+ Q: What are your preferences in the literature and music?
99
+
100
+ Yang: Light and classic music. About literature: usually I prefer Chinese books.
101
+ Recently I got very interested in environment subjects. I learn some materials
102
+ and environment issues.
103
+
104
+ Q: Do you read some chess literature?
105
+
106
+ Yang: Very few.
107
+
108
+ Q: Do you take any sports activities?
109
+
110
+ Yang: I do a little yoga. I like swimming, but I cannot swim often.
111
+
112
+ Q: You travel a lot - which country do you like most?
113
+
114
+ Yang: I like all the countries I have visited. Every place has its beauty, its
115
+ own unique culture and rich history. Human history.
116
+
117
+ Q: Do you collect any information about the new country before your visit?
118
+
119
+ Yang: Yes, sometimes when I check it in the Internet.
120
+
121
+ Q: Do you have some goals for the nearest future?
122
+
123
+ Yang: My main goal is connected with chess: I have some problems in my career.
124
+ I always blunder in good positions. It lasts the last several years. My goal is
125
+ to cover it.'
126
+ sentences:
127
+ - A document that provides a personal and introspective account of an individual's
128
+ life, interests, and goals, particularly focusing on their background, education,
129
+ and passions, would be suitable. The document should contain detailed information
130
+ about the individual's birthplace, current residence, and educational institution,
131
+ as well as their field of study and faculty, and should discuss their early introduction
132
+ to chess and their decision to pursue it despite its relatively low popularity
133
+ abroad. It should also delve into the individual's profession, including their
134
+ experience as a professional chess player and their current balance between studying
135
+ and playing chess, as well as their future plans and aspirations. Additionally,
136
+ the document should explore the individual's hobbies and interests, including
137
+ reading, listening to music, and writing stories, and should provide insight into
138
+ their preferences in literature and music, including their fondness for light
139
+ and classic music and Chinese books. The document should also touch on the individual's
140
+ sports activities, such as yoga and swimming, and their travel experiences, including
141
+ their approach to learning about new countries before visiting them. Furthermore,
142
+ the document should discuss the individual's goals and challenges, particularly
143
+ in relation to their chess career, including their struggles with blundering in
144
+ good positions and their desire to improve. The document would offer a comprehensive
145
+ and personal portrait of the individual, including their thoughts, feelings, and
146
+ experiences, and would provide a unique perspective on their life and aspirations.
147
+ Additionally, the document would be written in a conversational style, with a
148
+ question-and-answer format, making it an engaging and relatable read. Overall,
149
+ the document should provide a nuanced and detailed understanding of the individual's
150
+ life, interests, and goals, allowing readers to gain insight into their thoughts,
151
+ feelings, and experiences.
152
+ - example_title: Example 3
153
+ source_sentence: A document that provides guidance on the self-moderation of the
154
+ adventurous activity permit scheme within Scouting in the UK, would be relevant,
155
+ and should include detailed information on the moderation process, the roles and
156
+ responsibilities of Managers of the Activity Permit Scheme (MAPS) and County Commissioners,
157
+ and the importance of ensuring the scheme's effectiveness and robustness. This
158
+ document should offer a comprehensive overview of the moderation scheme, including
159
+ its design, purpose, and benefits, and would cover the key aspects of the scheme,
160
+ such as the minimum standards and good practice areas that Counties must adhere
161
+ to, as well as the process for identifying and addressing areas for improvement.
162
+ The document should also provide information on the County Self Moderation form,
163
+ its structure, and how it is used to record and track progress, including the
164
+ ability to record action plans for areas not met, and would explain the requirements
165
+ for implementing action plans, particularly for minimum standards that are not
166
+ met. Additionally, the document would discuss the role of the UK Activities Team
167
+ in providing support to Counties that are not meeting one or more standards, and
168
+ the process for requesting and receiving support, including the development of
169
+ action plans and the provision of guidance and resources. Furthermore, the document
170
+ should cover the sampling process, where a selection of self-moderations are reviewed
171
+ each year, and would explain the purpose of this process, which is to identify
172
+ trends, document the operation of the permit scheme, and demonstrate The Scout
173
+ Association's ability to manage the provision of adventurous activities internally.
174
+ Overall, the document should be a detailed and informative guide for MAPS, County
175
+ Commissioners, and other stakeholders, providing a clear understanding of the
176
+ self-moderation process and its importance in ensuring the safe and effective
177
+ delivery of adventurous activities within Scouting.
178
+ sentences:
179
+ - A document that provides information on data parsing and extraction methods, specifically
180
+ focusing on the efficient handling of <fi>description of the input data</fi> to
181
+ obtain <fi>specific information or value</fi>, and discusses the use of <fi>programming
182
+ language or tool</fi> for this purpose, would be suitable. This document should
183
+ include examples or representations of input data, such as <fi>representation
184
+ of the input data</fi>, and clearly outline the expected output or result, like
185
+ `<fi>expected output or result</fi>`, to guide the extraction process. It may
186
+ come from various domains, including but not limited to, computer science, data
187
+ analysis, and software development, and could be in the form of a web page, article,
188
+ book, or essay, as long as it offers detailed insights into efficient data parsing
189
+ techniques and the application of specific programming languages or tools to achieve
190
+ the desired outcome. Furthermore, the document should cover potential challenges
191
+ or considerations in the parsing and extraction process, ensuring that the reader
192
+ can adapt the methods to different scenarios involving <fi>description of the
193
+ input data</fi> and <fi>programming language or tool</fi>. The document must also
194
+ demonstrate how to work with the specified <fi>input data</fi> to produce the
195
+ intended `<fi>expected output or result</fi>`, serving as a comprehensive resource
196
+ for individuals seeking to efficiently parse and extract specific information
197
+ using <fi>programming language or tool</fi>. Additionally, it should be able to
198
+ discuss the relevance of efficiently parsing <fi>description of the input data</fi>
199
+ and the benefits of using <fi>programming language or tool</fi> for the extraction
200
+ of <fi>specific information or value</fi>, providing a well-rounded understanding
201
+ of the topic. Overall, a suitable document would be one that not only provides
202
+ technical guidance but also contextual understanding and practical applications
203
+ of data parsing and extraction techniques.
204
  ---
205
  # Model Card
206