Upload tokenizer
Browse files
README.md
CHANGED
@@ -1,7 +1,5 @@
|
|
1 |
-
|
2 |
---
|
3 |
base_model: BAAI/bge-m3
|
4 |
-
|
5 |
tags:
|
6 |
- datadreamer
|
7 |
- datadreamer-0.46.0
|
@@ -12,18 +10,197 @@ tags:
|
|
12 |
library_name: sentence-transformers
|
13 |
pipeline_tag: sentence-similarity
|
14 |
widget:
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
|
22 |
-
|
23 |
-
|
24 |
-
|
25 |
-
|
26 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
---
|
28 |
# Model Card
|
29 |
|
|
|
|
|
1 |
---
|
2 |
base_model: BAAI/bge-m3
|
|
|
3 |
tags:
|
4 |
- datadreamer
|
5 |
- datadreamer-0.46.0
|
|
|
10 |
library_name: sentence-transformers
|
11 |
pipeline_tag: sentence-similarity
|
12 |
widget:
|
13 |
+
- example_title: Example 1
|
14 |
+
source_sentence: 'Tammy Tran, an undergraduate student in the Preston Lab, received
|
15 |
+
the Rapaport-King Thesis Scholarship from the College of Liberal Arts. The Rapaport-King
|
16 |
+
Scholarship is awarded to Honors Program students in the College of Liberal Arts
|
17 |
+
who are conducting research and writing a senior thesis. Tammy also received an
|
18 |
+
Undergraduate Research Fellowship awarded to students for […]
|
19 |
+
|
20 |
+
Dr. Ila Fiete recently published her paper entitled “Fundamental limits on persistent
|
21 |
+
activity in networks of noisy neurons” in PNAS. The research investigates memory,
|
22 |
+
diffusion and information-diffusion inequality in the brain. Y. Burak and I. R.
|
23 |
+
Fiete. (2012). Fundamental limits on persistent activity in networks of noisy
|
24 |
+
neurons. PNAS Early Edition (Oct. 9).
|
25 |
+
|
26 |
+
Dr. Hiroshi Nishiyama received a R01 grant from the NINDS for his project entitled
|
27 |
+
“CNS Mechanisms of developmental synapse elimination”. This project investigates
|
28 |
+
how the precision of synaptic circuitry is created in the developing mammalian
|
29 |
+
brain by observing the process in the intact, live animals.
|
30 |
+
|
31 |
+
John Widloski, graduate student in the Fiete Lab, received the Burroughs-Welcome
|
32 |
+
Fund Award to attend the Methods in Computational Neuroscience summer course in
|
33 |
+
Woods Hole, MA.
|
34 |
+
|
35 |
+
Akram Bakkor, graduate student in the Poldrack lab, received a National Defense
|
36 |
+
Science & Engineering Graduate Fellowship. This fellowship supports Akram’s research
|
37 |
+
project investigating the neural mechanisms underlying how learned behaviors are
|
38 |
+
changed. Using behavioral testing, modeling and fMRI analyses on human subjects
|
39 |
+
the project will shed light on why habits can be so difficult […]
|
40 |
+
|
41 |
+
Dr. Boris Zemelman is the recipient of a Human Frontiers in Science Program Grant.
|
42 |
+
This grant entitled “In vivo functional imaging and high-resolution manipulations
|
43 |
+
of hippocampal memory circuits” is a collaborative project that will investigate
|
44 |
+
how the brain encodes and processes spatial memory. Dr. Zemelman will use genetic
|
45 |
+
tools for activation and silencing neurons and in […]'
|
46 |
+
sentences:
|
47 |
+
- A document that provides information about academic achievements, research funding,
|
48 |
+
and scholarly publications of students and faculty members in a specific institution
|
49 |
+
or department, such as the Preston Lab, Fiete Lab, or Poldrack lab, would be relevant.
|
50 |
+
The document should contain specific details about the awards, scholarships, or
|
51 |
+
grants received by the individuals, including the name of the award, the recipient,
|
52 |
+
and the purpose or focus of the research project, to allow for a determination
|
53 |
+
of the areas of study and research interests. This could include announcements,
|
54 |
+
press releases, or news articles from academic institutions, research organizations,
|
55 |
+
or scientific journals, and should provide a clear explanation of the research
|
56 |
+
projects, enabling a reader to understand the objectives, methods, and significance
|
57 |
+
of the studies. Additionally, the document would include information about the
|
58 |
+
research topics, such as memory, diffusion, and information-diffusion inequality
|
59 |
+
in the brain, or the neural mechanisms underlying learned behaviors, and would
|
60 |
+
discuss the methodologies and techniques used, such as behavioral testing, modeling,
|
61 |
+
and fMRI analyses. The document should also provide information about the funding
|
62 |
+
sources, such as the Rapaport-King Thesis Scholarship, the National Defense Science
|
63 |
+
& Engineering Graduate Fellowship, or the Human Frontiers in Science Program Grant,
|
64 |
+
and explain the criteria or selection process for these awards. Furthermore, the
|
65 |
+
document would describe the collaborative projects, such as the investigation
|
66 |
+
of how the brain encodes and processes spatial memory, and the use of genetic
|
67 |
+
tools for activation and silencing neurons, and would discuss the potential impact
|
68 |
+
or contributions of the research to the field. Overall, a document that provides
|
69 |
+
a comprehensive and detailed account of the academic achievements, research funding,
|
70 |
+
and scholarly publications of students and faculty members would be able to provide
|
71 |
+
an answer to the question.
|
72 |
+
- example_title: Example 2
|
73 |
+
source_sentence: 'Question: Yang, could you tell about yourself?
|
74 |
+
|
75 |
+
Yang: I was born in Nanjing, now I live in the capital of China - Beijing. When
|
76 |
+
I was 8, my father brought me to a chess center in Nanjing. There were three kinds
|
77 |
+
of chess: Chinese chess, chess and I-go. We decided to choose chess: despite the
|
78 |
+
popularity of Chinese chess in our country, they are not popular abroad.
|
79 |
+
|
80 |
+
Now I study in Tsinghua University University, which is one of our best, at economics
|
81 |
+
and management faculty. I am the second-year student.
|
82 |
+
|
83 |
+
Q: Will you choose economics or chess as your main profession?
|
84 |
+
|
85 |
+
Yang: I used to be a professional chessplayer, but now I spend some time for studying.
|
86 |
+
I will make the final decision after my graduation. If I can improve my level,
|
87 |
+
I will go on playing chess.
|
88 |
+
|
89 |
+
Q: How do you divide your time between chess and other things?
|
90 |
+
|
91 |
+
Yang: I spend half of my time on chess and half on study.
|
92 |
+
|
93 |
+
Q: What are you interested in?
|
94 |
+
|
95 |
+
Yang: I like to read, listen to the music and write stories. When I was in my
|
96 |
+
childhood, I wrote some cartoons, flesh-stories. Now I write novels.
|
97 |
+
|
98 |
+
Q: What are your preferences in the literature and music?
|
99 |
+
|
100 |
+
Yang: Light and classic music. About literature: usually I prefer Chinese books.
|
101 |
+
Recently I got very interested in environment subjects. I learn some materials
|
102 |
+
and environment issues.
|
103 |
+
|
104 |
+
Q: Do you read some chess literature?
|
105 |
+
|
106 |
+
Yang: Very few.
|
107 |
+
|
108 |
+
Q: Do you take any sports activities?
|
109 |
+
|
110 |
+
Yang: I do a little yoga. I like swimming, but I cannot swim often.
|
111 |
+
|
112 |
+
Q: You travel a lot - which country do you like most?
|
113 |
+
|
114 |
+
Yang: I like all the countries I have visited. Every place has its beauty, its
|
115 |
+
own unique culture and rich history. Human history.
|
116 |
+
|
117 |
+
Q: Do you collect any information about the new country before your visit?
|
118 |
+
|
119 |
+
Yang: Yes, sometimes when I check it in the Internet.
|
120 |
+
|
121 |
+
Q: Do you have some goals for the nearest future?
|
122 |
+
|
123 |
+
Yang: My main goal is connected with chess: I have some problems in my career.
|
124 |
+
I always blunder in good positions. It lasts the last several years. My goal is
|
125 |
+
to cover it.'
|
126 |
+
sentences:
|
127 |
+
- A document that provides a personal and introspective account of an individual's
|
128 |
+
life, interests, and goals, particularly focusing on their background, education,
|
129 |
+
and passions, would be suitable. The document should contain detailed information
|
130 |
+
about the individual's birthplace, current residence, and educational institution,
|
131 |
+
as well as their field of study and faculty, and should discuss their early introduction
|
132 |
+
to chess and their decision to pursue it despite its relatively low popularity
|
133 |
+
abroad. It should also delve into the individual's profession, including their
|
134 |
+
experience as a professional chess player and their current balance between studying
|
135 |
+
and playing chess, as well as their future plans and aspirations. Additionally,
|
136 |
+
the document should explore the individual's hobbies and interests, including
|
137 |
+
reading, listening to music, and writing stories, and should provide insight into
|
138 |
+
their preferences in literature and music, including their fondness for light
|
139 |
+
and classic music and Chinese books. The document should also touch on the individual's
|
140 |
+
sports activities, such as yoga and swimming, and their travel experiences, including
|
141 |
+
their approach to learning about new countries before visiting them. Furthermore,
|
142 |
+
the document should discuss the individual's goals and challenges, particularly
|
143 |
+
in relation to their chess career, including their struggles with blundering in
|
144 |
+
good positions and their desire to improve. The document would offer a comprehensive
|
145 |
+
and personal portrait of the individual, including their thoughts, feelings, and
|
146 |
+
experiences, and would provide a unique perspective on their life and aspirations.
|
147 |
+
Additionally, the document would be written in a conversational style, with a
|
148 |
+
question-and-answer format, making it an engaging and relatable read. Overall,
|
149 |
+
the document should provide a nuanced and detailed understanding of the individual's
|
150 |
+
life, interests, and goals, allowing readers to gain insight into their thoughts,
|
151 |
+
feelings, and experiences.
|
152 |
+
- example_title: Example 3
|
153 |
+
source_sentence: A document that provides guidance on the self-moderation of the
|
154 |
+
adventurous activity permit scheme within Scouting in the UK, would be relevant,
|
155 |
+
and should include detailed information on the moderation process, the roles and
|
156 |
+
responsibilities of Managers of the Activity Permit Scheme (MAPS) and County Commissioners,
|
157 |
+
and the importance of ensuring the scheme's effectiveness and robustness. This
|
158 |
+
document should offer a comprehensive overview of the moderation scheme, including
|
159 |
+
its design, purpose, and benefits, and would cover the key aspects of the scheme,
|
160 |
+
such as the minimum standards and good practice areas that Counties must adhere
|
161 |
+
to, as well as the process for identifying and addressing areas for improvement.
|
162 |
+
The document should also provide information on the County Self Moderation form,
|
163 |
+
its structure, and how it is used to record and track progress, including the
|
164 |
+
ability to record action plans for areas not met, and would explain the requirements
|
165 |
+
for implementing action plans, particularly for minimum standards that are not
|
166 |
+
met. Additionally, the document would discuss the role of the UK Activities Team
|
167 |
+
in providing support to Counties that are not meeting one or more standards, and
|
168 |
+
the process for requesting and receiving support, including the development of
|
169 |
+
action plans and the provision of guidance and resources. Furthermore, the document
|
170 |
+
should cover the sampling process, where a selection of self-moderations are reviewed
|
171 |
+
each year, and would explain the purpose of this process, which is to identify
|
172 |
+
trends, document the operation of the permit scheme, and demonstrate The Scout
|
173 |
+
Association's ability to manage the provision of adventurous activities internally.
|
174 |
+
Overall, the document should be a detailed and informative guide for MAPS, County
|
175 |
+
Commissioners, and other stakeholders, providing a clear understanding of the
|
176 |
+
self-moderation process and its importance in ensuring the safe and effective
|
177 |
+
delivery of adventurous activities within Scouting.
|
178 |
+
sentences:
|
179 |
+
- A document that provides information on data parsing and extraction methods, specifically
|
180 |
+
focusing on the efficient handling of <fi>description of the input data</fi> to
|
181 |
+
obtain <fi>specific information or value</fi>, and discusses the use of <fi>programming
|
182 |
+
language or tool</fi> for this purpose, would be suitable. This document should
|
183 |
+
include examples or representations of input data, such as <fi>representation
|
184 |
+
of the input data</fi>, and clearly outline the expected output or result, like
|
185 |
+
`<fi>expected output or result</fi>`, to guide the extraction process. It may
|
186 |
+
come from various domains, including but not limited to, computer science, data
|
187 |
+
analysis, and software development, and could be in the form of a web page, article,
|
188 |
+
book, or essay, as long as it offers detailed insights into efficient data parsing
|
189 |
+
techniques and the application of specific programming languages or tools to achieve
|
190 |
+
the desired outcome. Furthermore, the document should cover potential challenges
|
191 |
+
or considerations in the parsing and extraction process, ensuring that the reader
|
192 |
+
can adapt the methods to different scenarios involving <fi>description of the
|
193 |
+
input data</fi> and <fi>programming language or tool</fi>. The document must also
|
194 |
+
demonstrate how to work with the specified <fi>input data</fi> to produce the
|
195 |
+
intended `<fi>expected output or result</fi>`, serving as a comprehensive resource
|
196 |
+
for individuals seeking to efficiently parse and extract specific information
|
197 |
+
using <fi>programming language or tool</fi>. Additionally, it should be able to
|
198 |
+
discuss the relevance of efficiently parsing <fi>description of the input data</fi>
|
199 |
+
and the benefits of using <fi>programming language or tool</fi> for the extraction
|
200 |
+
of <fi>specific information or value</fi>, providing a well-rounded understanding
|
201 |
+
of the topic. Overall, a suitable document would be one that not only provides
|
202 |
+
technical guidance but also contextual understanding and practical applications
|
203 |
+
of data parsing and extraction techniques.
|
204 |
---
|
205 |
# Model Card
|
206 |
|