Spaces:
Sleeping
Sleeping
johaunh
commited on
Commit
·
cb1bfc1
1
Parent(s):
a823047
Complete pipeline
Browse files- README.md +37 -1
- data/openstax/bio2e/OS_bio_4_2.txt +66 -0
- output/2023-08-28_triplets_Direct.json +526 -0
- output/2023-08-28_triplets_LogicBased.json +473 -0
- output/2023-08-28_triplets_Traditional.json +532 -0
- pipeline.py +87 -0
- recipes.json +37 -0
- text2kg.py +69 -0
README.md
CHANGED
@@ -1,3 +1,39 @@
|
|
1 |
# AutoKG
|
2 |
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# AutoKG
|
2 |
|
3 |
+
Using large language models (ChatGPT) to automatically construct a knowledge graph from unstructured plain text.
|
4 |
+
|
5 |
+
## Usage
|
6 |
+
|
7 |
+
```
|
8 |
+
python text2kg.py <infile> [-h] [--output OUTPUT] [--cookbook COOKBOOK] [--recipe RECIPE] [--thoughts]
|
9 |
+
```
|
10 |
+
|
11 |
+
**Parameters**
|
12 |
+
|
13 |
+
```
|
14 |
+
infile
|
15 |
+
path to input text file (assumed to be in plain text form)
|
16 |
+
--output
|
17 |
+
directory to save results to (default: ./output)
|
18 |
+
--cookbook
|
19 |
+
path to JSON file containing GPT prompts (default: ./recipes.json)
|
20 |
+
--recipe
|
21 |
+
recipe to execute, must be a valid name from cookbook (see above)
|
22 |
+
--thoughts
|
23 |
+
optional flag to save intermediary GPT prompts/replies
|
24 |
+
```
|
25 |
+
|
26 |
+
## File structure
|
27 |
+
|
28 |
+
### [`data`](./data/)
|
29 |
+
|
30 |
+
Name | Description | Source
|
31 |
+
--- | --- | ---
|
32 |
+
`data/openstax/OS_bio_4_2.txt` | Chapter 4.2 of the OpenStax Biology 2nd edition textbook. |
|
33 |
+
`data/Seq2KG/...` | Processed test datasets from Seq2KG paper. | [GitHub](https://github.com/Michael-Stewart-Webdev/Seq2KG/tree/master)
|
34 |
+
|
35 |
+
## References
|
36 |
+
|
37 |
+
1. A case study in bootstrapping ontology graphs from textbooks (V. K. Chaudhri et al., 2021)
|
38 |
+
2. Seq2KG: an end-to-end neural model for domain agnostic knowledge graph (not text graph) construction from text (M. Stewart & W. Liu, 2020)
|
39 |
+
3. Language models are open knowledge graphs (C. Wang et al., 2020)
|
data/openstax/bio2e/OS_bio_4_2.txt
ADDED
@@ -0,0 +1,66 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
A cell is the smallest unit of a living thing.
|
2 |
+
Whether comprised of one cell (like bacteria) or many cells (like a human), we call it an organism.
|
3 |
+
Thus, cells are the basic building blocks of all organisms.
|
4 |
+
Several cells of one kind that interconnect with each other and perform a shared function form tissues.
|
5 |
+
These tissues combine to form an organ (your stomach, heart, or brain), and several organs comprise an organ system (such as the digestive system, circulatory system, or nervous system).
|
6 |
+
Several systems that function together form an organism (like a human being).
|
7 |
+
Here, we will examine the structure and function of cells.
|
8 |
+
There are many types of cells, which scientists group into one of two broad categories: prokaryotic and eukaryotic.
|
9 |
+
For example, we classify both animal and plant cells as eukaryotic cells; whereas, we classify bacterial cells as prokaryotic.
|
10 |
+
Before discussing the criteria for determining whether a cell is prokaryotic or eukaryotic, we will first examine how biologists study cells.
|
11 |
+
Cells vary in size.
|
12 |
+
With few exceptions, we cannot see individual cells with the naked eye, so scientists use microscopes (micro- = "small"; -scope = "to look at") to study them.
|
13 |
+
A microscope is an instrument that magnifies an object.
|
14 |
+
We photograph most cells with a microscope, so we can call these images micrographs.
|
15 |
+
The optics of a microscope's lenses change the image orientation that the user sees.
|
16 |
+
A specimen that is right-side up and facing right on the microscope slide will appear upside-down and facing left when one views through a microscope, and vice versa.
|
17 |
+
Similarly, if one moves the slide left while looking through the microscope, it will appear to move right, and if one moves it down, it will seem to move up.
|
18 |
+
This occurs because microscopes use two sets of lenses to magnify the image.
|
19 |
+
Because of the manner by which light travels through the lenses, this two lens system produces an inverted image (binocular, or dissecting microscopes, work in a similar manner, but include an additional magnification system that makes the final image appear to be upright).
|
20 |
+
To give you a sense of cell size, a typical human red blood cell is about eight millionths of a meter or eight micrometers (abbreviated as eight μm) in diameter.
|
21 |
+
A pin head is about two thousandths of a meter (two mm) in diameter.
|
22 |
+
That means about 250 red blood cells could fit on a pinhead.
|
23 |
+
Most student microscopes are light microscopes (Figure 4.2a).
|
24 |
+
Visible light passes and bends through the lens system to enable the user to see the specimen.
|
25 |
+
Light microscopes are advantageous for viewing living organisms, but since individual cells are generally transparent, their components are not distinguishable unless they are colored with special stains.
|
26 |
+
Staining, however, usually kills the cells.
|
27 |
+
Light microscopes that undergraduates commonly use in the laboratory magnify up to approximately 400 times.
|
28 |
+
Two parameters that are important in microscopy are magnification and resolving power.
|
29 |
+
Magnification is the process of enlarging an object in appearance.
|
30 |
+
Resolving power is the microscope's ability to distinguish two adjacent structures as separate: the higher the resolution, the better the image's clarity and detail.
|
31 |
+
When one uses oil immersion lenses to study small objects, magnification usually increases to 1,000 times.
|
32 |
+
In order to gain a better understanding of cellular structure and function, scientists typically use electron microscopes.
|
33 |
+
a) Most light microscopes in a college biology lab can magnify cells up to approximately 400 times and have a resolution of about 200 nanometers.
|
34 |
+
b) Electron microscopes provide a much higher magnification, 100,000x, and a have a resolution of 50 picometers.
|
35 |
+
use a beam of electrons instead of a beam of light.
|
36 |
+
Not only does this allow for higher magnification and, thus, more detail (Figure 4.3), it also provides higher resolving power.
|
37 |
+
The method to prepare the specimen for viewing with an electron microscope kills the specimen.
|
38 |
+
Electrons have short wavelengths (shorter than photons) that move best in a vacuum, so we cannot view living cells with an electron microscope.
|
39 |
+
In a scanning electron microscope, a beam of electrons moves back and forth across a cell's surface, creating details of cell surface characteristics.
|
40 |
+
In a transmission electron microscope, the electron beam penetrates the cell and provides details of a cell's internal structures.
|
41 |
+
As you might imagine, electron microscopes are significantly more bulky and expensive than light microscopes.
|
42 |
+
a) These Salmonella bacteria appear as tiny purple dots when viewed with a light microscope.
|
43 |
+
b) This scanning electron microscope micrograph shows Salmonella bacteria (in red) invading human cells (yellow).
|
44 |
+
Even though subfigure (b) shows a different Salmonella specimen than subfigure (a), you can still observe the comparative increase in magnification and detail.
|
45 |
+
For another perspective on cell size, try the HowBig interactive at this site.
|
46 |
+
The microscopes we use today are far more complex than those that Dutch shopkeeper Antony van Leeuwenhoek, used in the 1600s.
|
47 |
+
Skilled in crafting lenses, van Leeuwenhoek observed the movements of single-celled organisms, which he collectively termed "animalcules."
|
48 |
+
In the 1665 publication Micrographia, experimental scientist Robert Hooke coined the term "cell" for the box-like structures he observed when viewing cork tissue through a lens.
|
49 |
+
In the 1670s, van Leeuwenhoek discovered bacteria and protozoa.
|
50 |
+
Later advances in lenses, microscope construction, and staining techniques enabled other scientists to see some components inside cells.
|
51 |
+
By the late 1830s, botanist Matthias Schleiden and zoologist Theodor Schwann were studying tissues and proposed the unified cell theory, which states that one or more cells comprise all living things, the cell is the basic unit of life, and new cells arise from existing cells.
|
52 |
+
Rudolf Virchow later made important contributions to this theory.
|
53 |
+
Have you ever heard of a medical test called a Pap smear (Figure 4.4)?
|
54 |
+
In this test, a doctor takes a small sample of cells from the patient's uterine cervix and sends it to a medical lab where a cytotechnologist stains the cells and examines them for any changes that could indicate cervical cancer or a microbial infection.
|
55 |
+
Cytotechnologists (cyto- = "cell") are professionals who study cells via microscopic examinations and other laboratory tests.
|
56 |
+
They are trained to determine which cellular changes are within normal limits and which are abnormal.
|
57 |
+
Their focus is not limited to cervical cells.
|
58 |
+
They study cellular specimens that come from all organs.
|
59 |
+
When they notice abnormalities, they consult a pathologist, a medical doctor who interprets and diagnoses changes that disease in body tissue and fluids cause.
|
60 |
+
Cytotechnologists play a vital role in saving people's lives.
|
61 |
+
When doctors discover abnormalities early, a patient's treatment can begin sooner, which usually increases the chances of a successful outcome.
|
62 |
+
These uterine cervix cells, viewed through a light microscope, are from a Pap smear.
|
63 |
+
Normal cells are on the left.
|
64 |
+
The cells on the right are infected with human papillomavirus (HPV).
|
65 |
+
Notice that the infected cells are larger.
|
66 |
+
Also, two of these cells each have two nuclei instead of one, the normal number.
|
output/2023-08-28_triplets_Direct.json
ADDED
@@ -0,0 +1,526 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
[
|
2 |
+
{
|
3 |
+
"sentence": "A cell is the smallest unit of a living thing.",
|
4 |
+
"triplets": [
|
5 |
+
"cell::is the smallest unit of::a living thing"
|
6 |
+
]
|
7 |
+
},
|
8 |
+
{
|
9 |
+
"sentence": "Whether comprised of one cell (like bacteria) or many cells (like a human), we call it an organism.",
|
10 |
+
"triplets": [
|
11 |
+
"organism::comprised of::one cell",
|
12 |
+
"organism::comprised of::many cells",
|
13 |
+
"bacteria::comprised of::one cell",
|
14 |
+
"human::comprised of::many cells"
|
15 |
+
]
|
16 |
+
},
|
17 |
+
{
|
18 |
+
"sentence": "Thus, cells are the basic building blocks of all organisms.",
|
19 |
+
"triplets": [
|
20 |
+
"cells::are the basic building blocks of::all organisms"
|
21 |
+
]
|
22 |
+
},
|
23 |
+
{
|
24 |
+
"sentence": "Several cells of one kind that interconnect with each other and perform a shared function form tissues.",
|
25 |
+
"triplets": [
|
26 |
+
"cells::interconnect with::each other",
|
27 |
+
"cells::perform::shared function",
|
28 |
+
"cells::form::tissues"
|
29 |
+
]
|
30 |
+
},
|
31 |
+
{
|
32 |
+
"sentence": "These tissues combine to form an organ (your stomach, heart, or brain), and several organs comprise an organ system (such as the digestive system, circulatory system, or nervous system).",
|
33 |
+
"triplets": [
|
34 |
+
"tissues::combine to form::organ",
|
35 |
+
"organ::comprise::organ system",
|
36 |
+
"organ system::such as::digestive system",
|
37 |
+
"organ system::such as::circulatory system",
|
38 |
+
"organ system::such as::nervous system"
|
39 |
+
]
|
40 |
+
},
|
41 |
+
{
|
42 |
+
"sentence": "Several systems that function together form an organism (like a human being).",
|
43 |
+
"triplets": [
|
44 |
+
"systems::form::organism",
|
45 |
+
"systems::function together::organism",
|
46 |
+
"systems::form::human being"
|
47 |
+
]
|
48 |
+
},
|
49 |
+
{
|
50 |
+
"sentence": "Here, we will examine the structure and function of cells.",
|
51 |
+
"triplets": [
|
52 |
+
"Here::examine::structure and function of cells"
|
53 |
+
]
|
54 |
+
},
|
55 |
+
{
|
56 |
+
"sentence": "There are many types of cells, which scientists group into one of two broad categories: prokaryotic and eukaryotic.",
|
57 |
+
"triplets": [
|
58 |
+
"cells::have types::many",
|
59 |
+
"scientists::group into::categories",
|
60 |
+
"types::group into::categories",
|
61 |
+
"prokaryotic cells::belong to::categories",
|
62 |
+
"eukaryotic cells::belong to::categories"
|
63 |
+
]
|
64 |
+
},
|
65 |
+
{
|
66 |
+
"sentence": "For example, we classify both animal and plant cells as eukaryotic cells; whereas, we classify bacterial cells as prokaryotic.",
|
67 |
+
"triplets": [
|
68 |
+
"animal cells::classified as::eukaryotic cells",
|
69 |
+
"plant cells::classified as::eukaryotic cells",
|
70 |
+
"bacterial cells::classified as::prokaryotic"
|
71 |
+
]
|
72 |
+
},
|
73 |
+
{
|
74 |
+
"sentence": "Before discussing the criteria for determining whether a cell is prokaryotic or eukaryotic, we will first examine how biologists study cells.",
|
75 |
+
"triplets": [
|
76 |
+
"biologists::study::cells",
|
77 |
+
"cell::is::prokaryotic",
|
78 |
+
"cell::is::eukaryotic",
|
79 |
+
"criteria::determining::cell",
|
80 |
+
"cell::is::prokaryotic",
|
81 |
+
"cell::is::eukaryotic"
|
82 |
+
]
|
83 |
+
},
|
84 |
+
{
|
85 |
+
"sentence": "Cells vary in size.",
|
86 |
+
"triplets": [
|
87 |
+
"Cells::vary in::size"
|
88 |
+
]
|
89 |
+
},
|
90 |
+
{
|
91 |
+
"sentence": "With few exceptions, we cannot see individual cells with the naked eye, so scientists use microscopes (micro- = \"small\"; -scope = \"to look at\") to study them.",
|
92 |
+
"triplets": [
|
93 |
+
"scientists::use::microscopes",
|
94 |
+
"microscopes::study::cells",
|
95 |
+
"cells::cannot see with::naked eye",
|
96 |
+
"scientists::use::microscopes",
|
97 |
+
"microscopes::study::them"
|
98 |
+
]
|
99 |
+
},
|
100 |
+
{
|
101 |
+
"sentence": "A microscope is an instrument that magnifies an object.",
|
102 |
+
"triplets": []
|
103 |
+
},
|
104 |
+
{
|
105 |
+
"sentence": "We photograph most cells with a microscope, so we can call these images micrographs.",
|
106 |
+
"triplets": [
|
107 |
+
"We::photograph::most cells",
|
108 |
+
"We::call::these images"
|
109 |
+
]
|
110 |
+
},
|
111 |
+
{
|
112 |
+
"sentence": "The optics of a microscope's lenses change the image orientation that the user sees.",
|
113 |
+
"triplets": [
|
114 |
+
"microscope's lenses::change::image orientation",
|
115 |
+
"optics::change::image orientation",
|
116 |
+
"user::sees::image orientation"
|
117 |
+
]
|
118 |
+
},
|
119 |
+
{
|
120 |
+
"sentence": "A specimen that is right-side up and facing right on the microscope slide will appear upside-down and facing left when one views through a microscope, and vice versa.",
|
121 |
+
"triplets": [
|
122 |
+
"specimen::is right-side up and facing right on::microscope slide",
|
123 |
+
"specimen::will appear upside-down and facing left when one views through::microscope",
|
124 |
+
"microscope slide::is viewed through::microscope",
|
125 |
+
"microscope::views through::microscope slide"
|
126 |
+
]
|
127 |
+
},
|
128 |
+
{
|
129 |
+
"sentence": "Similarly, if one moves the slide left while looking through the microscope, it will appear to move right, and if one moves it down, it will seem to move up.",
|
130 |
+
"triplets": [
|
131 |
+
"one::moves::slide",
|
132 |
+
"slide::left::microscope",
|
133 |
+
"one::moves::slide",
|
134 |
+
"slide::right::appear",
|
135 |
+
"one::moves::slide",
|
136 |
+
"slide::down::microscope",
|
137 |
+
"one::moves::slide",
|
138 |
+
"slide::up::appear"
|
139 |
+
]
|
140 |
+
},
|
141 |
+
{
|
142 |
+
"sentence": "This occurs because microscopes use two sets of lenses to magnify the image.",
|
143 |
+
"triplets": [
|
144 |
+
"This occurs::because::microscopes",
|
145 |
+
"microscopes::use::two sets of lenses",
|
146 |
+
"two sets of lenses::magnify::the image"
|
147 |
+
]
|
148 |
+
},
|
149 |
+
{
|
150 |
+
"sentence": "Because of the manner by which light travels through the lenses, this two lens system produces an inverted image (binocular, or dissecting microscopes, work in a similar manner, but include an additional magnification system that makes the final image appear to be upright).",
|
151 |
+
"triplets": [
|
152 |
+
"lenses::travels through::light",
|
153 |
+
"two lens system::produces::inverted image",
|
154 |
+
"dissecting microscopes::work in::similar manner",
|
155 |
+
"dissecting microscopes::include::additional magnification system",
|
156 |
+
"additional magnification system::makes appear::final image",
|
157 |
+
"final image::appear to be::upright"
|
158 |
+
]
|
159 |
+
},
|
160 |
+
{
|
161 |
+
"sentence": "To give you a sense of cell size, a typical human red blood cell is about eight millionths of a meter or eight micrometers (abbreviated as eight \u03bcm) in diameter.",
|
162 |
+
"triplets": [
|
163 |
+
"human red blood cell::has size::eight millionths of a meter",
|
164 |
+
"human red blood cell::has size::eight micrometers",
|
165 |
+
"eight micrometers::abbreviated as::eight \u03bcm",
|
166 |
+
"eight \u03bcm::is a unit of::diameter"
|
167 |
+
]
|
168 |
+
},
|
169 |
+
{
|
170 |
+
"sentence": "A pin head is about two thousandths of a meter (two mm) in diameter.",
|
171 |
+
"triplets": []
|
172 |
+
},
|
173 |
+
{
|
174 |
+
"sentence": "That means about 250 red blood cells could fit on a pinhead.",
|
175 |
+
"triplets": [
|
176 |
+
"red blood cells::fit on::pinhead",
|
177 |
+
"250::fit on::pinhead",
|
178 |
+
"red blood cells::could::fit on",
|
179 |
+
"250::could::fit on",
|
180 |
+
"That::means::about"
|
181 |
+
]
|
182 |
+
},
|
183 |
+
{
|
184 |
+
"sentence": "Most student microscopes are light microscopes (Figure 4.2a).",
|
185 |
+
"triplets": [
|
186 |
+
"student microscopes::are::light microscopes",
|
187 |
+
"Figure 4.2a::is::light microscopes"
|
188 |
+
]
|
189 |
+
},
|
190 |
+
{
|
191 |
+
"sentence": "Visible light passes and bends through the lens system to enable the user to see the specimen.",
|
192 |
+
"triplets": [
|
193 |
+
"Visible light::passes through::lens system",
|
194 |
+
"Visible light::bends through::lens system",
|
195 |
+
"lens system::enables::user",
|
196 |
+
"user::sees::specimen"
|
197 |
+
]
|
198 |
+
},
|
199 |
+
{
|
200 |
+
"sentence": "Light microscopes are advantageous for viewing living organisms, but since individual cells are generally transparent, their components are not distinguishable unless they are colored with special stains.",
|
201 |
+
"triplets": [
|
202 |
+
"Light microscopes::are advantageous for::viewing living organisms",
|
203 |
+
"individual cells::are generally transparent::not distinguishable",
|
204 |
+
"individual cells::are generally transparent::colored with special stains"
|
205 |
+
]
|
206 |
+
},
|
207 |
+
{
|
208 |
+
"sentence": "Staining, however, usually kills the cells.",
|
209 |
+
"triplets": [
|
210 |
+
"Staining::kills::cells"
|
211 |
+
]
|
212 |
+
},
|
213 |
+
{
|
214 |
+
"sentence": "Light microscopes that undergraduates commonly use in the laboratory magnify up to approximately 400 times.",
|
215 |
+
"triplets": [
|
216 |
+
"Light microscopes::use::undergraduates",
|
217 |
+
"undergraduates::use::laboratory",
|
218 |
+
"Light microscopes::magnify::up to approximately 400 times"
|
219 |
+
]
|
220 |
+
},
|
221 |
+
{
|
222 |
+
"sentence": "Two parameters that are important in microscopy are magnification and resolving power.",
|
223 |
+
"triplets": [
|
224 |
+
"microscopy::has parameter::magnification",
|
225 |
+
"microscopy::has parameter::resolving power"
|
226 |
+
]
|
227 |
+
},
|
228 |
+
{
|
229 |
+
"sentence": "Magnification is the process of enlarging an object in appearance.",
|
230 |
+
"triplets": [
|
231 |
+
"Magnification::is the process of::enlarging an object in appearance"
|
232 |
+
]
|
233 |
+
},
|
234 |
+
{
|
235 |
+
"sentence": "Resolving power is the microscope's ability to distinguish two adjacent structures as separate: the higher the resolution, the better the image's clarity and detail.",
|
236 |
+
"triplets": [
|
237 |
+
"Resolving power::is::microscope's ability",
|
238 |
+
"microscope's ability::to distinguish::two adjacent structures",
|
239 |
+
"two adjacent structures::as separate::the higher the resolution",
|
240 |
+
"the higher the resolution::the better::image's clarity and detail"
|
241 |
+
]
|
242 |
+
},
|
243 |
+
{
|
244 |
+
"sentence": "When one uses oil immersion lenses to study small objects, magnification usually increases to 1,000 times.",
|
245 |
+
"triplets": [
|
246 |
+
"oil immersion lenses::used to study::small objects",
|
247 |
+
"magnification::increases to::1"
|
248 |
+
]
|
249 |
+
},
|
250 |
+
{
|
251 |
+
"sentence": "In order to gain a better understanding of cellular structure and function, scientists typically use electron microscopes.",
|
252 |
+
"triplets": [
|
253 |
+
"scientists::use::electron microscopes",
|
254 |
+
"scientists::gain::better understanding",
|
255 |
+
"cellular structure and function::understanding of::scientists",
|
256 |
+
"electron microscopes::used by::scientists"
|
257 |
+
]
|
258 |
+
},
|
259 |
+
{
|
260 |
+
"sentence": "a) Most light microscopes in a college biology lab can magnify cells up to approximately 400 times and have a resolution of about 200 nanometers.",
|
261 |
+
"triplets": [
|
262 |
+
"light microscopes::in::college biology lab",
|
263 |
+
"light microscopes::magnify::cells",
|
264 |
+
"light microscopes::up to::approximately 400 times",
|
265 |
+
"light microscopes::have::resolution",
|
266 |
+
"light microscopes::of::about 200 nanometers"
|
267 |
+
]
|
268 |
+
},
|
269 |
+
{
|
270 |
+
"sentence": "b) Electron microscopes provide a much higher magnification, 100,000x, and a have a resolution of 50 picometers.",
|
271 |
+
"triplets": [
|
272 |
+
"Electron microscopes::provide::higher magnification",
|
273 |
+
"Electron microscopes::have::resolution",
|
274 |
+
"Electron microscopes::have::50 picometers"
|
275 |
+
]
|
276 |
+
},
|
277 |
+
{
|
278 |
+
"sentence": "use a beam of electrons instead of a beam of light.",
|
279 |
+
"triplets": [
|
280 |
+
"beam::use::electrons",
|
281 |
+
"beam::instead of::light"
|
282 |
+
]
|
283 |
+
},
|
284 |
+
{
|
285 |
+
"sentence": "Not only does this allow for higher magnification and, thus, more detail (Figure 4.3), it also provides higher resolving power.",
|
286 |
+
"triplets": [
|
287 |
+
"higher magnification::allow for::more detail",
|
288 |
+
"Figure 4.3::provides::higher resolving power"
|
289 |
+
]
|
290 |
+
},
|
291 |
+
{
|
292 |
+
"sentence": "The method to prepare the specimen for viewing with an electron microscope kills the specimen.",
|
293 |
+
"triplets": [
|
294 |
+
"The method::to prepare::the specimen",
|
295 |
+
"The specimen::for viewing with::an electron microscope",
|
296 |
+
"The method::kills::the specimen"
|
297 |
+
]
|
298 |
+
},
|
299 |
+
{
|
300 |
+
"sentence": "Electrons have short wavelengths (shorter than photons) that move best in a vacuum, so we cannot view living cells with an electron microscope.",
|
301 |
+
"triplets": [
|
302 |
+
"Electrons::have::short wavelengths",
|
303 |
+
"Electrons::move best in::a vacuum",
|
304 |
+
"we::cannot view::living cells",
|
305 |
+
"an electron microscope::view::living cells"
|
306 |
+
]
|
307 |
+
},
|
308 |
+
{
|
309 |
+
"sentence": "In a scanning electron microscope, a beam of electrons moves back and forth across a cell's surface, creating details of cell surface characteristics.",
|
310 |
+
"triplets": [
|
311 |
+
"scanning electron microscope::has::beam of electrons",
|
312 |
+
"beam of electrons::moves back and forth across::cell's surface",
|
313 |
+
"beam of electrons::creates::details of cell surface characteristics"
|
314 |
+
]
|
315 |
+
},
|
316 |
+
{
|
317 |
+
"sentence": "In a transmission electron microscope, the electron beam penetrates the cell and provides details of a cell's internal structures.",
|
318 |
+
"triplets": [
|
319 |
+
"transmission electron microscope::has::electron beam",
|
320 |
+
"electron beam::penetrates::cell",
|
321 |
+
"electron beam::provides details of::cell's internal structures"
|
322 |
+
]
|
323 |
+
},
|
324 |
+
{
|
325 |
+
"sentence": "As you might imagine, electron microscopes are significantly more bulky and expensive than light microscopes.",
|
326 |
+
"triplets": [
|
327 |
+
"electron microscopes::are significantly more bulky and expensive than::light microscopes"
|
328 |
+
]
|
329 |
+
},
|
330 |
+
{
|
331 |
+
"sentence": "a) These Salmonella bacteria appear as tiny purple dots when viewed with a light microscope.",
|
332 |
+
"triplets": [
|
333 |
+
"Salmonella bacteria::appear as::tiny purple dots",
|
334 |
+
"Salmonella bacteria::viewed with::light microscope"
|
335 |
+
]
|
336 |
+
},
|
337 |
+
{
|
338 |
+
"sentence": "b) This scanning electron microscope micrograph shows Salmonella bacteria (in red) invading human cells (yellow).",
|
339 |
+
"triplets": [
|
340 |
+
"Salmonella bacteria::invading::human cells"
|
341 |
+
]
|
342 |
+
},
|
343 |
+
{
|
344 |
+
"sentence": "Even though subfigure (b) shows a different Salmonella specimen than subfigure (a), you can still observe the comparative increase in magnification and detail.",
|
345 |
+
"triplets": [
|
346 |
+
"you::observe::the comparative increase in magnification and detail"
|
347 |
+
]
|
348 |
+
},
|
349 |
+
{
|
350 |
+
"sentence": "For another perspective on cell size, try the HowBig interactive at this site.",
|
351 |
+
"triplets": [
|
352 |
+
"HowBig interactive::is related to::cell size"
|
353 |
+
]
|
354 |
+
},
|
355 |
+
{
|
356 |
+
"sentence": "The microscopes we use today are far more complex than those that Dutch shopkeeper Antony van Leeuwenhoek, used in the 1600s.",
|
357 |
+
"triplets": [
|
358 |
+
"Antony van Leeuwenhoek::used::microscopes",
|
359 |
+
"microscopes::are::complex",
|
360 |
+
"microscopes::use::today",
|
361 |
+
"Antony van Leeuwenhoek::used::those",
|
362 |
+
"Antony van Leeuwenhoek::used::microscopes",
|
363 |
+
"microscopes::are::complex",
|
364 |
+
"microscopes::use::today"
|
365 |
+
]
|
366 |
+
},
|
367 |
+
{
|
368 |
+
"sentence": "Skilled in crafting lenses, van Leeuwenhoek observed the movements of single-celled organisms, which he collectively termed \"animalcules.\"",
|
369 |
+
"triplets": [
|
370 |
+
"van Leeuwenhoek::observed::movements of single-celled organisms",
|
371 |
+
"van Leeuwenhoek::crafting::lenses"
|
372 |
+
]
|
373 |
+
},
|
374 |
+
{
|
375 |
+
"sentence": "In the 1665 publication Micrographia, experimental scientist Robert Hooke coined the term \"cell\" for the box-like structures he observed when viewing cork tissue through a lens.",
|
376 |
+
"triplets": [
|
377 |
+
"Robert Hooke::coined::term",
|
378 |
+
"Robert Hooke::observed::box-like structures",
|
379 |
+
"Robert Hooke::viewing::cork tissue",
|
380 |
+
"Micrographia::publication::1665",
|
381 |
+
"Robert Hooke::viewing::cork tissue through a lens"
|
382 |
+
]
|
383 |
+
},
|
384 |
+
{
|
385 |
+
"sentence": "In the 1670s, van Leeuwenhoek discovered bacteria and protozoa.",
|
386 |
+
"triplets": [
|
387 |
+
"van Leeuwenhoek::discovered::bacteria",
|
388 |
+
"van Leeuwenhoek::discovered::protozoa"
|
389 |
+
]
|
390 |
+
},
|
391 |
+
{
|
392 |
+
"sentence": "Later advances in lenses, microscope construction, and staining techniques enabled other scientists to see some components inside cells.",
|
393 |
+
"triplets": [
|
394 |
+
"Later advances::in::lenses",
|
395 |
+
"Later advances::in::microscope construction",
|
396 |
+
"Later advances::in::staining techniques",
|
397 |
+
"scientists::enabled::other scientists",
|
398 |
+
"scientists::to see::components inside cells"
|
399 |
+
]
|
400 |
+
},
|
401 |
+
{
|
402 |
+
"sentence": "By the late 1830s, botanist Matthias Schleiden and zoologist Theodor Schwann were studying tissues and proposed the unified cell theory, which states that one or more cells comprise all living things, the cell is the basic unit of life, and new cells arise from existing cells.",
|
403 |
+
"triplets": [
|
404 |
+
"Matthias Schleiden::studying::tissues",
|
405 |
+
"Matthias Schleiden::proposed::unified cell theory",
|
406 |
+
"unified cell theory::states::one or more cells comprise all living things",
|
407 |
+
"unified cell theory::states::the cell is the basic unit of life",
|
408 |
+
"unified cell theory::states::new cells arise from existing cells",
|
409 |
+
"Theodor Schwann::studying::tissues",
|
410 |
+
"Theodor Schwann::proposed::unified cell theory",
|
411 |
+
"unified cell theory::states::one or more cells comprise all living things",
|
412 |
+
"unified cell theory::states::the cell is the basic unit of life",
|
413 |
+
"unified cell theory::states::new cells arise from existing cells"
|
414 |
+
]
|
415 |
+
},
|
416 |
+
{
|
417 |
+
"sentence": "Rudolf Virchow later made important contributions to this theory.",
|
418 |
+
"triplets": [
|
419 |
+
"Rudolf Virchow::made contributions to::theory"
|
420 |
+
]
|
421 |
+
},
|
422 |
+
{
|
423 |
+
"sentence": "Have you ever heard of a medical test called a Pap smear (Figure 4.4)?",
|
424 |
+
"triplets": [
|
425 |
+
"you::heard of::medical test",
|
426 |
+
"Pap smear::called::a medical test"
|
427 |
+
]
|
428 |
+
},
|
429 |
+
{
|
430 |
+
"sentence": "In this test, a doctor takes a small sample of cells from the patient's uterine cervix and sends it to a medical lab where a cytotechnologist stains the cells and examines them for any changes that could indicate cervical cancer or a microbial infection.",
|
431 |
+
"triplets": [
|
432 |
+
"doctor::takes::small sample of cells",
|
433 |
+
"doctor::sends::sample of cells to medical lab",
|
434 |
+
"cytotechnologist::stains::cells",
|
435 |
+
"cytotechnologist::examines::cells",
|
436 |
+
"cells::indicate::cervical cancer",
|
437 |
+
"cells::indicate::microbial infection"
|
438 |
+
]
|
439 |
+
},
|
440 |
+
{
|
441 |
+
"sentence": "Cytotechnologists (cyto- = \"cell\") are professionals who study cells via microscopic examinations and other laboratory tests.",
|
442 |
+
"triplets": [
|
443 |
+
"Cytotechnologists::study::cells",
|
444 |
+
"Cytotechnologists::use::microscopic examinations",
|
445 |
+
"Cytotechnologists::use::other laboratory tests"
|
446 |
+
]
|
447 |
+
},
|
448 |
+
{
|
449 |
+
"sentence": "They are trained to determine which cellular changes are within normal limits and which are abnormal.",
|
450 |
+
"triplets": [
|
451 |
+
"They::are trained to determine::which cellular changes are within normal limits",
|
452 |
+
"They::are trained to determine::which cellular changes are abnormal"
|
453 |
+
]
|
454 |
+
},
|
455 |
+
{
|
456 |
+
"sentence": "Their focus is not limited to cervical cells.",
|
457 |
+
"triplets": [
|
458 |
+
"Their::focus::not limited to cervical cells"
|
459 |
+
]
|
460 |
+
},
|
461 |
+
{
|
462 |
+
"sentence": "They study cellular specimens that come from all organs.",
|
463 |
+
"triplets": [
|
464 |
+
"cellular specimens::come from::all organs"
|
465 |
+
]
|
466 |
+
},
|
467 |
+
{
|
468 |
+
"sentence": "When they notice abnormalities, they consult a pathologist, a medical doctor who interprets and diagnoses changes that disease in body tissue and fluids cause.",
|
469 |
+
"triplets": [
|
470 |
+
"they::notice::abnormalities",
|
471 |
+
"they::consult::pathologist",
|
472 |
+
"pathologist::interpret::changes",
|
473 |
+
"pathologist::diagnose::disease",
|
474 |
+
"disease::cause::changes",
|
475 |
+
"disease::in::body tissue",
|
476 |
+
"disease::in::fluids"
|
477 |
+
]
|
478 |
+
},
|
479 |
+
{
|
480 |
+
"sentence": "Cytotechnologists play a vital role in saving people's lives.",
|
481 |
+
"triplets": [
|
482 |
+
"Cytotechnologists::play a role in::saving people's lives"
|
483 |
+
]
|
484 |
+
},
|
485 |
+
{
|
486 |
+
"sentence": "When doctors discover abnormalities early, a patient's treatment can begin sooner, which usually increases the chances of a successful outcome.",
|
487 |
+
"triplets": [
|
488 |
+
"doctors::discover::abnormalities",
|
489 |
+
"abnormalities::early::treatment",
|
490 |
+
"patient's treatment::begin::sooner",
|
491 |
+
"treatment::increase::chances"
|
492 |
+
]
|
493 |
+
},
|
494 |
+
{
|
495 |
+
"sentence": "These uterine cervix cells, viewed through a light microscope, are from a Pap smear.",
|
496 |
+
"triplets": [
|
497 |
+
"uterine cervix cells::viewed through::light microscope",
|
498 |
+
"uterine cervix cells::from::Pap smear"
|
499 |
+
]
|
500 |
+
},
|
501 |
+
{
|
502 |
+
"sentence": "Normal cells are on the left.",
|
503 |
+
"triplets": [
|
504 |
+
"Normal cells::are on::the left"
|
505 |
+
]
|
506 |
+
},
|
507 |
+
{
|
508 |
+
"sentence": "The cells on the right are infected with human papillomavirus (HPV).",
|
509 |
+
"triplets": [
|
510 |
+
"cells::infected with::human papillomavirus"
|
511 |
+
]
|
512 |
+
},
|
513 |
+
{
|
514 |
+
"sentence": "Notice that the infected cells are larger.",
|
515 |
+
"triplets": []
|
516 |
+
},
|
517 |
+
{
|
518 |
+
"sentence": "Also, two of these cells each have two nuclei instead of one, the normal number.",
|
519 |
+
"triplets": [
|
520 |
+
"cells::have::two nuclei",
|
521 |
+
"two of these cells::have::two nuclei",
|
522 |
+
"two of these cells::each have::two nuclei",
|
523 |
+
"two of these cells::have::the normal number"
|
524 |
+
]
|
525 |
+
}
|
526 |
+
]
|
output/2023-08-28_triplets_LogicBased.json
ADDED
@@ -0,0 +1,473 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
[
|
2 |
+
{
|
3 |
+
"sentence": "A cell is the smallest unit of a living thing.",
|
4 |
+
"triplets": [
|
5 |
+
"cell::is the smallest unit of::living thing"
|
6 |
+
]
|
7 |
+
},
|
8 |
+
{
|
9 |
+
"sentence": "Whether comprised of one cell (like bacteria) or many cells (like a human), we call it an organism.",
|
10 |
+
"triplets": [
|
11 |
+
"organism::comprised of::one cell",
|
12 |
+
"organism::comprised of::many cells"
|
13 |
+
]
|
14 |
+
},
|
15 |
+
{
|
16 |
+
"sentence": "Thus, cells are the basic building blocks of all organisms.",
|
17 |
+
"triplets": [
|
18 |
+
"Cells::are the basic building blocks of::all organisms"
|
19 |
+
]
|
20 |
+
},
|
21 |
+
{
|
22 |
+
"sentence": "Several cells of one kind that interconnect with each other and perform a shared function form tissues.",
|
23 |
+
"triplets": [
|
24 |
+
"cells::interconnect with::each other",
|
25 |
+
"interconnected cells::perform::shared function",
|
26 |
+
"interconnected cells::form::tissues"
|
27 |
+
]
|
28 |
+
},
|
29 |
+
{
|
30 |
+
"sentence": "These tissues combine to form an organ (your stomach, heart, or brain), and several organs comprise an organ system (such as the digestive system, circulatory system, or nervous system).",
|
31 |
+
"triplets": [
|
32 |
+
"tissues::combine to form::organ",
|
33 |
+
"organ::comprise::organ system",
|
34 |
+
"stomach::is an example of::organ",
|
35 |
+
"heart::is an example of::organ",
|
36 |
+
"brain::is an example of::organ",
|
37 |
+
"digestive system::is an example of::organ system",
|
38 |
+
"circulatory system::is an example of::organ system",
|
39 |
+
"nervous system::is an example of::organ system"
|
40 |
+
]
|
41 |
+
},
|
42 |
+
{
|
43 |
+
"sentence": "Several systems that function together form an organism (like a human being).",
|
44 |
+
"triplets": [
|
45 |
+
"systems::form::organism",
|
46 |
+
"systems::function together::NULL",
|
47 |
+
"organism::like::human being"
|
48 |
+
]
|
49 |
+
},
|
50 |
+
{
|
51 |
+
"sentence": "Here, we will examine the structure and function of cells.",
|
52 |
+
"triplets": [
|
53 |
+
"text::examines::structure and function of cells"
|
54 |
+
]
|
55 |
+
},
|
56 |
+
{
|
57 |
+
"sentence": "There are many types of cells, which scientists group into one of two broad categories: prokaryotic and eukaryotic.",
|
58 |
+
"triplets": [
|
59 |
+
"scientists::group::cells",
|
60 |
+
"cells::group into::prokaryotic and eukaryotic"
|
61 |
+
]
|
62 |
+
},
|
63 |
+
{
|
64 |
+
"sentence": "For example, we classify both animal and plant cells as eukaryotic cells; whereas, we classify bacterial cells as prokaryotic.",
|
65 |
+
"triplets": [
|
66 |
+
"Animal cells::are classified as::eukaryotic cells",
|
67 |
+
"Plant cells::are classified as::eukaryotic cells",
|
68 |
+
"Bacterial cells::are classified as::prokaryotic"
|
69 |
+
]
|
70 |
+
},
|
71 |
+
{
|
72 |
+
"sentence": "Before discussing the criteria for determining whether a cell is prokaryotic or eukaryotic, we will first examine how biologists study cells.",
|
73 |
+
"triplets": [
|
74 |
+
"Biologists::study::cells",
|
75 |
+
"criteria::determine::cell is prokaryotic or eukaryotic"
|
76 |
+
]
|
77 |
+
},
|
78 |
+
{
|
79 |
+
"sentence": "Cells vary in size.",
|
80 |
+
"triplets": [
|
81 |
+
"Cells::vary in size::size"
|
82 |
+
]
|
83 |
+
},
|
84 |
+
{
|
85 |
+
"sentence": "With few exceptions, we cannot see individual cells with the naked eye, so scientists use microscopes (micro- = \"small\"; -scope = \"to look at\") to study them.",
|
86 |
+
"triplets": [
|
87 |
+
"individual cells::cannot see with::naked eye",
|
88 |
+
"scientists::use::microscopes",
|
89 |
+
"microscopes::study::cells"
|
90 |
+
]
|
91 |
+
},
|
92 |
+
{
|
93 |
+
"sentence": "A microscope is an instrument that magnifies an object.",
|
94 |
+
"triplets": [
|
95 |
+
"microscope::magnifies::object"
|
96 |
+
]
|
97 |
+
},
|
98 |
+
{
|
99 |
+
"sentence": "We photograph most cells with a microscope, so we can call these images micrographs.",
|
100 |
+
"triplets": [
|
101 |
+
"We::photograph::most cells with a microscope",
|
102 |
+
"We::call::these images"
|
103 |
+
]
|
104 |
+
},
|
105 |
+
{
|
106 |
+
"sentence": "The optics of a microscope's lenses change the image orientation that the user sees.",
|
107 |
+
"triplets": [
|
108 |
+
"optics::change::image orientation",
|
109 |
+
"microscope's lenses::change::image orientation",
|
110 |
+
"user::see::image"
|
111 |
+
]
|
112 |
+
},
|
113 |
+
{
|
114 |
+
"sentence": "A specimen that is right-side up and facing right on the microscope slide will appear upside-down and facing left when one views through a microscope, and vice versa.",
|
115 |
+
"triplets": [
|
116 |
+
"specimen::is positioned::right-side up and facing right on the microscope slide",
|
117 |
+
"specimen::appears::upside-down and facing left when viewed through a microscope",
|
118 |
+
"one::views through::a microscope",
|
119 |
+
"specimen::appears::upside-down and facing left",
|
120 |
+
"one::views through::a microscope"
|
121 |
+
]
|
122 |
+
},
|
123 |
+
{
|
124 |
+
"sentence": "Similarly, if one moves the slide left while looking through the microscope, it will appear to move right, and if one moves it down, it will seem to move up.",
|
125 |
+
"triplets": [
|
126 |
+
"slide::move::left",
|
127 |
+
"slide::appear to move::right",
|
128 |
+
"slide::move::down",
|
129 |
+
"slide::seem to move::up"
|
130 |
+
]
|
131 |
+
},
|
132 |
+
{
|
133 |
+
"sentence": "This occurs because microscopes use two sets of lenses to magnify the image.",
|
134 |
+
"triplets": [
|
135 |
+
"Microscopes::use::two sets of lenses",
|
136 |
+
"two sets of lenses::magnify::the image"
|
137 |
+
]
|
138 |
+
},
|
139 |
+
{
|
140 |
+
"sentence": "Because of the manner by which light travels through the lenses, this two lens system produces an inverted image (binocular, or dissecting microscopes, work in a similar manner, but include an additional magnification system that makes the final image appear to be upright).",
|
141 |
+
"triplets": [
|
142 |
+
"two lens system::produces::inverted image",
|
143 |
+
"binocular or dissecting microscopes::work in::similar manner",
|
144 |
+
"binocular or dissecting microscopes::include::additional magnification system",
|
145 |
+
"additional magnification system::makes::final image appear to be upright"
|
146 |
+
]
|
147 |
+
},
|
148 |
+
{
|
149 |
+
"sentence": "To give you a sense of cell size, a typical human red blood cell is about eight millionths of a meter or eight micrometers (abbreviated as eight \u03bcm) in diameter.",
|
150 |
+
"triplets": [
|
151 |
+
"human red blood cell::has diameter::eight millionths of a meter or eight micrometers"
|
152 |
+
]
|
153 |
+
},
|
154 |
+
{
|
155 |
+
"sentence": "A pin head is about two thousandths of a meter (two mm) in diameter.",
|
156 |
+
"triplets": [
|
157 |
+
"pin head::has diameter::two thousandths of a meter"
|
158 |
+
]
|
159 |
+
},
|
160 |
+
{
|
161 |
+
"sentence": "That means about 250 red blood cells could fit on a pinhead.",
|
162 |
+
"triplets": [
|
163 |
+
"red blood cells::fit on::a pinhead"
|
164 |
+
]
|
165 |
+
},
|
166 |
+
{
|
167 |
+
"sentence": "Most student microscopes are light microscopes (Figure 4.2a).",
|
168 |
+
"triplets": [
|
169 |
+
"Most student microscopes::are::light microscopes"
|
170 |
+
]
|
171 |
+
},
|
172 |
+
{
|
173 |
+
"sentence": "Visible light passes and bends through the lens system to enable the user to see the specimen.",
|
174 |
+
"triplets": [
|
175 |
+
"Visible light::passes through::the lens system",
|
176 |
+
"Visible light::bends through::the lens system",
|
177 |
+
"the lens system::enables::the user to see the specimen"
|
178 |
+
]
|
179 |
+
},
|
180 |
+
{
|
181 |
+
"sentence": "Light microscopes are advantageous for viewing living organisms, but since individual cells are generally transparent, their components are not distinguishable unless they are colored with special stains.",
|
182 |
+
"triplets": [
|
183 |
+
"Light microscopes::are advantageous for viewing::living organisms",
|
184 |
+
"Components of cells::are not distinguishable unless colored with::special stains"
|
185 |
+
]
|
186 |
+
},
|
187 |
+
{
|
188 |
+
"sentence": "Staining, however, usually kills the cells.",
|
189 |
+
"triplets": [
|
190 |
+
"Staining::kills::cells"
|
191 |
+
]
|
192 |
+
},
|
193 |
+
{
|
194 |
+
"sentence": "Light microscopes that undergraduates commonly use in the laboratory magnify up to approximately 400 times.",
|
195 |
+
"triplets": [
|
196 |
+
"Light microscopes::commonly used by::undergraduates in the laboratory",
|
197 |
+
"Light microscopes::magnify::up to approximately 400 times"
|
198 |
+
]
|
199 |
+
},
|
200 |
+
{
|
201 |
+
"sentence": "Two parameters that are important in microscopy are magnification and resolving power.",
|
202 |
+
"triplets": [
|
203 |
+
"microscopy::has_parameter::magnification",
|
204 |
+
"microscopy::has_parameter::resolving power"
|
205 |
+
]
|
206 |
+
},
|
207 |
+
{
|
208 |
+
"sentence": "Magnification is the process of enlarging an object in appearance.",
|
209 |
+
"triplets": [
|
210 |
+
"Magnification::is the process of::enlarging an object in appearance"
|
211 |
+
]
|
212 |
+
},
|
213 |
+
{
|
214 |
+
"sentence": "Resolving power is the microscope's ability to distinguish two adjacent structures as separate: the higher the resolution, the better the image's clarity and detail.",
|
215 |
+
"triplets": [
|
216 |
+
"Resolving power::is the ability to distinguish::two adjacent structures as separate",
|
217 |
+
"resolution::is higher in::better image's clarity and detail"
|
218 |
+
]
|
219 |
+
},
|
220 |
+
{
|
221 |
+
"sentence": "When one uses oil immersion lenses to study small objects, magnification usually increases to 1,000 times.",
|
222 |
+
"triplets": [
|
223 |
+
"oil immersion lenses::used to study::small objects",
|
224 |
+
"oil immersion lenses::increases magnification to::1"
|
225 |
+
]
|
226 |
+
},
|
227 |
+
{
|
228 |
+
"sentence": "In order to gain a better understanding of cellular structure and function, scientists typically use electron microscopes.",
|
229 |
+
"triplets": [
|
230 |
+
"scientists::use::electron microscopes",
|
231 |
+
"scientists::gain understanding of::cellular structure and function"
|
232 |
+
]
|
233 |
+
},
|
234 |
+
{
|
235 |
+
"sentence": "a) Most light microscopes in a college biology lab can magnify cells up to approximately 400 times and have a resolution of about 200 nanometers.",
|
236 |
+
"triplets": [
|
237 |
+
"Most light microscopes in a college biology lab::can magnify::cells up to approximately 400 times",
|
238 |
+
"Most light microscopes in a college biology lab::have::a resolution of about 200 nanometers"
|
239 |
+
]
|
240 |
+
},
|
241 |
+
{
|
242 |
+
"sentence": "b) Electron microscopes provide a much higher magnification, 100,000x, and a have a resolution of 50 picometers.",
|
243 |
+
"triplets": [
|
244 |
+
"Electron microscopes::provide::a much higher magnification",
|
245 |
+
"Electron microscopes::have::a resolution of 50 picometers"
|
246 |
+
]
|
247 |
+
},
|
248 |
+
{
|
249 |
+
"sentence": "use a beam of electrons instead of a beam of light.",
|
250 |
+
"triplets": [
|
251 |
+
"beam of electrons::can be used instead of::beam of light"
|
252 |
+
]
|
253 |
+
},
|
254 |
+
{
|
255 |
+
"sentence": "Not only does this allow for higher magnification and, thus, more detail (Figure 4.3), it also provides higher resolving power.",
|
256 |
+
"triplets": [
|
257 |
+
"this::allow for::higher magnification and more detail",
|
258 |
+
"this::provide::higher resolving power"
|
259 |
+
]
|
260 |
+
},
|
261 |
+
{
|
262 |
+
"sentence": "The method to prepare the specimen for viewing with an electron microscope kills the specimen.",
|
263 |
+
"triplets": [
|
264 |
+
"method::prepares::specimen",
|
265 |
+
"method::kills::specimen"
|
266 |
+
]
|
267 |
+
},
|
268 |
+
{
|
269 |
+
"sentence": "Electrons have short wavelengths (shorter than photons) that move best in a vacuum, so we cannot view living cells with an electron microscope.",
|
270 |
+
"triplets": [
|
271 |
+
"Electrons::have::short wavelengths",
|
272 |
+
"Electrons::move best in::a vacuum",
|
273 |
+
"We::cannot view::living cells with an electron microscope"
|
274 |
+
]
|
275 |
+
},
|
276 |
+
{
|
277 |
+
"sentence": "In a scanning electron microscope, a beam of electrons moves back and forth across a cell's surface, creating details of cell surface characteristics.",
|
278 |
+
"triplets": [
|
279 |
+
"scanning electron microscope::uses::beam of electrons",
|
280 |
+
"beam of electrons::moves across::cell's surface",
|
281 |
+
"movement of beam::creates::details of cell surface characteristics"
|
282 |
+
]
|
283 |
+
},
|
284 |
+
{
|
285 |
+
"sentence": "In a transmission electron microscope, the electron beam penetrates the cell and provides details of a cell's internal structures.",
|
286 |
+
"triplets": [
|
287 |
+
"transmission electron microscope::has electron beam::penetrates cell",
|
288 |
+
"electron beam::provides details of::cell's internal structures"
|
289 |
+
]
|
290 |
+
},
|
291 |
+
{
|
292 |
+
"sentence": "As you might imagine, electron microscopes are significantly more bulky and expensive than light microscopes.",
|
293 |
+
"triplets": [
|
294 |
+
"electron microscopes::are significantly more bulky and expensive than::light microscopes"
|
295 |
+
]
|
296 |
+
},
|
297 |
+
{
|
298 |
+
"sentence": "a) These Salmonella bacteria appear as tiny purple dots when viewed with a light microscope.",
|
299 |
+
"triplets": [
|
300 |
+
"Salmonella bacteria::appear as::tiny purple dots"
|
301 |
+
]
|
302 |
+
},
|
303 |
+
{
|
304 |
+
"sentence": "b) This scanning electron microscope micrograph shows Salmonella bacteria (in red) invading human cells (yellow).",
|
305 |
+
"triplets": [
|
306 |
+
"Salmonella bacteria::invade::human cells",
|
307 |
+
"scanning electron microscope micrograph::show::Salmonella bacteria invading human cells"
|
308 |
+
]
|
309 |
+
},
|
310 |
+
{
|
311 |
+
"sentence": "Even though subfigure (b) shows a different Salmonella specimen than subfigure (a), you can still observe the comparative increase in magnification and detail.",
|
312 |
+
"triplets": [
|
313 |
+
"you::can observe::the comparative increase in magnification and detail"
|
314 |
+
]
|
315 |
+
},
|
316 |
+
{
|
317 |
+
"sentence": "For another perspective on cell size, try the HowBig interactive at this site.",
|
318 |
+
"triplets": [
|
319 |
+
"HowBig interactive::provides perspective on::cell size"
|
320 |
+
]
|
321 |
+
},
|
322 |
+
{
|
323 |
+
"sentence": "The microscopes we use today are far more complex than those that Dutch shopkeeper Antony van Leeuwenhoek, used in the 1600s.",
|
324 |
+
"triplets": [
|
325 |
+
"The microscopes::are more complex than::those that Dutch shopkeeper Antony van Leeuwenhoek used in the 1600s"
|
326 |
+
]
|
327 |
+
},
|
328 |
+
{
|
329 |
+
"sentence": "Skilled in crafting lenses, van Leeuwenhoek observed the movements of single-celled organisms, which he collectively termed \"animalcules.\"",
|
330 |
+
"triplets": [
|
331 |
+
"van Leeuwenhoek::skilled in::crafting lenses",
|
332 |
+
"van Leeuwenhoek::observed::movements of single-celled organisms"
|
333 |
+
]
|
334 |
+
},
|
335 |
+
{
|
336 |
+
"sentence": "In the 1665 publication Micrographia, experimental scientist Robert Hooke coined the term \"cell\" for the box-like structures he observed when viewing cork tissue through a lens.",
|
337 |
+
"triplets": [
|
338 |
+
"Robert Hooke::coined::term",
|
339 |
+
"Robert Hooke::observed::box-like structures"
|
340 |
+
]
|
341 |
+
},
|
342 |
+
{
|
343 |
+
"sentence": "In the 1670s, van Leeuwenhoek discovered bacteria and protozoa.",
|
344 |
+
"triplets": [
|
345 |
+
"van Leeuwenhoek::discovered::bacteria",
|
346 |
+
"van Leeuwenhoek::discovered::protozoa"
|
347 |
+
]
|
348 |
+
},
|
349 |
+
{
|
350 |
+
"sentence": "Later advances in lenses, microscope construction, and staining techniques enabled other scientists to see some components inside cells.",
|
351 |
+
"triplets": [
|
352 |
+
"Later advances in lenses::enabled scientists to see::some components inside cells",
|
353 |
+
"Microscope construction::enabled scientists to see::some components inside cells",
|
354 |
+
"Staining techniques::enabled scientists to see::some components inside cells"
|
355 |
+
]
|
356 |
+
},
|
357 |
+
{
|
358 |
+
"sentence": "By the late 1830s, botanist Matthias Schleiden and zoologist Theodor Schwann were studying tissues and proposed the unified cell theory, which states that one or more cells comprise all living things, the cell is the basic unit of life, and new cells arise from existing cells.",
|
359 |
+
"triplets": [
|
360 |
+
"Matthias Schleiden::studied::tissues",
|
361 |
+
"Theodor Schwann::studied::tissues",
|
362 |
+
"Matthias Schleiden::proposed::unified cell theory",
|
363 |
+
"Theodor Schwann::proposed::unified cell theory",
|
364 |
+
"unified cell theory::states::one or more cells comprise all living things",
|
365 |
+
"unified cell theory::states::the cell is the basic unit of life",
|
366 |
+
"unified cell theory::states::new cells arise from existing cells"
|
367 |
+
]
|
368 |
+
},
|
369 |
+
{
|
370 |
+
"sentence": "Rudolf Virchow later made important contributions to this theory.",
|
371 |
+
"triplets": [
|
372 |
+
"Rudolf Virchow::made contributions to::this theory"
|
373 |
+
]
|
374 |
+
},
|
375 |
+
{
|
376 |
+
"sentence": "Have you ever heard of a medical test called a Pap smear (Figure 4.4)?",
|
377 |
+
"triplets": [
|
378 |
+
"Pap smear::is a medical test::exists"
|
379 |
+
]
|
380 |
+
},
|
381 |
+
{
|
382 |
+
"sentence": "In this test, a doctor takes a small sample of cells from the patient's uterine cervix and sends it to a medical lab where a cytotechnologist stains the cells and examines them for any changes that could indicate cervical cancer or a microbial infection.",
|
383 |
+
"triplets": [
|
384 |
+
"doctor::takes::small sample of cells from patient's uterine cervix",
|
385 |
+
"doctor::sends::sample to medical lab",
|
386 |
+
"cytotechnologist::stains::cells",
|
387 |
+
"cytotechnologist::examines::stained cells for changes",
|
388 |
+
"changes in cells::indicate::cervical cancer or microbial infection"
|
389 |
+
]
|
390 |
+
},
|
391 |
+
{
|
392 |
+
"sentence": "Cytotechnologists (cyto- = \"cell\") are professionals who study cells via microscopic examinations and other laboratory tests.",
|
393 |
+
"triplets": [
|
394 |
+
"Cytotechnologists::study::cells",
|
395 |
+
"Cytotechnologists::are::professionals"
|
396 |
+
]
|
397 |
+
},
|
398 |
+
{
|
399 |
+
"sentence": "They are trained to determine which cellular changes are within normal limits and which are abnormal.",
|
400 |
+
"triplets": [
|
401 |
+
"They::are trained to determine::which cellular changes are within normal limits",
|
402 |
+
"They::are trained to determine::which cellular changes are abnormal"
|
403 |
+
]
|
404 |
+
},
|
405 |
+
{
|
406 |
+
"sentence": "Their focus is not limited to cervical cells.",
|
407 |
+
"triplets": [
|
408 |
+
"Their::focus::not limited to cervical cells"
|
409 |
+
]
|
410 |
+
},
|
411 |
+
{
|
412 |
+
"sentence": "They study cellular specimens that come from all organs.",
|
413 |
+
"triplets": [
|
414 |
+
"They::study::cellular specimens",
|
415 |
+
"cellular specimens::come from::all organs"
|
416 |
+
]
|
417 |
+
},
|
418 |
+
{
|
419 |
+
"sentence": "When they notice abnormalities, they consult a pathologist, a medical doctor who interprets and diagnoses changes that disease in body tissue and fluids cause.",
|
420 |
+
"triplets": [
|
421 |
+
"they::consult::pathologist",
|
422 |
+
"pathologist::is a::medical doctor",
|
423 |
+
"pathologist::interprets and diagnoses::changes that disease in body tissue and fluids cause"
|
424 |
+
]
|
425 |
+
},
|
426 |
+
{
|
427 |
+
"sentence": "Cytotechnologists play a vital role in saving people's lives.",
|
428 |
+
"triplets": [
|
429 |
+
"Cytotechnologists::play a vital role in::saving people's lives",
|
430 |
+
"Cytotechnologists::save::people's lives"
|
431 |
+
]
|
432 |
+
},
|
433 |
+
{
|
434 |
+
"sentence": "When doctors discover abnormalities early, a patient's treatment can begin sooner, which usually increases the chances of a successful outcome.",
|
435 |
+
"triplets": [
|
436 |
+
"doctors::discover::abnormalities early",
|
437 |
+
"early discovery of abnormalities::allow for::earlier treatment",
|
438 |
+
"earlier treatment::increase chances of::successful outcome"
|
439 |
+
]
|
440 |
+
},
|
441 |
+
{
|
442 |
+
"sentence": "These uterine cervix cells, viewed through a light microscope, are from a Pap smear.",
|
443 |
+
"triplets": [
|
444 |
+
"uterine cervix cells::are from::Pap smear",
|
445 |
+
"cells::are::uterine cervix cells",
|
446 |
+
"cells::were viewed through::light microscope"
|
447 |
+
]
|
448 |
+
},
|
449 |
+
{
|
450 |
+
"sentence": "Normal cells are on the left.",
|
451 |
+
"triplets": [
|
452 |
+
"Normal cells::are on::the left"
|
453 |
+
]
|
454 |
+
},
|
455 |
+
{
|
456 |
+
"sentence": "The cells on the right are infected with human papillomavirus (HPV).",
|
457 |
+
"triplets": [
|
458 |
+
"cells on the right::are infected with::human papillomavirus"
|
459 |
+
]
|
460 |
+
},
|
461 |
+
{
|
462 |
+
"sentence": "Notice that the infected cells are larger.",
|
463 |
+
"triplets": []
|
464 |
+
},
|
465 |
+
{
|
466 |
+
"sentence": "Also, two of these cells each have two nuclei instead of one, the normal number.",
|
467 |
+
"triplets": [
|
468 |
+
"cells::have nuclei::two",
|
469 |
+
"cells::have nuclei::one",
|
470 |
+
"cells::have number::normal"
|
471 |
+
]
|
472 |
+
}
|
473 |
+
]
|
output/2023-08-28_triplets_Traditional.json
ADDED
@@ -0,0 +1,532 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
[
|
2 |
+
{
|
3 |
+
"sentence": "A cell is the smallest unit of a living thing.",
|
4 |
+
"triplets": [
|
5 |
+
"cell::is the smallest unit of::living thing"
|
6 |
+
]
|
7 |
+
},
|
8 |
+
{
|
9 |
+
"sentence": "Whether comprised of one cell (like bacteria) or many cells (like a human), we call it an organism.",
|
10 |
+
"triplets": [
|
11 |
+
"one cell::comprised of::bacteria",
|
12 |
+
"many cells::comprised of::human"
|
13 |
+
]
|
14 |
+
},
|
15 |
+
{
|
16 |
+
"sentence": "Thus, cells are the basic building blocks of all organisms.",
|
17 |
+
"triplets": [
|
18 |
+
"cells::are the basic building blocks of::organisms"
|
19 |
+
]
|
20 |
+
},
|
21 |
+
{
|
22 |
+
"sentence": "Several cells of one kind that interconnect with each other and perform a shared function form tissues.",
|
23 |
+
"triplets": [
|
24 |
+
"cells::form::tissues"
|
25 |
+
]
|
26 |
+
},
|
27 |
+
{
|
28 |
+
"sentence": "These tissues combine to form an organ (your stomach, heart, or brain), and several organs comprise an organ system (such as the digestive system, circulatory system, or nervous system).",
|
29 |
+
"triplets": [
|
30 |
+
"tissues::combine to form::organ",
|
31 |
+
"organ::comprise::organ system",
|
32 |
+
"organ system::comprise::N",
|
33 |
+
"organ system::such as::digestive system",
|
34 |
+
"organ system::such as::circulatory system",
|
35 |
+
"organ system::such as::nervous system"
|
36 |
+
]
|
37 |
+
},
|
38 |
+
{
|
39 |
+
"sentence": "Several systems that function together form an organism (like a human being).",
|
40 |
+
"triplets": [
|
41 |
+
"systems::form::organism",
|
42 |
+
"systems::function together::organism",
|
43 |
+
"systems::form::human being",
|
44 |
+
"systems::function together::human being",
|
45 |
+
"organism::like::human being"
|
46 |
+
]
|
47 |
+
},
|
48 |
+
{
|
49 |
+
"sentence": "Here, we will examine the structure and function of cells.",
|
50 |
+
"triplets": [
|
51 |
+
"structure::is related to::cells",
|
52 |
+
"function::is related to::cells"
|
53 |
+
]
|
54 |
+
},
|
55 |
+
{
|
56 |
+
"sentence": "There are many types of cells, which scientists group into one of two broad categories: prokaryotic and eukaryotic.",
|
57 |
+
"triplets": [
|
58 |
+
"types of cells::grouped into::categories",
|
59 |
+
"scientists::group into::categories",
|
60 |
+
"categories::have::prokaryotic",
|
61 |
+
"categories::have::eukaryotic"
|
62 |
+
]
|
63 |
+
},
|
64 |
+
{
|
65 |
+
"sentence": "For example, we classify both animal and plant cells as eukaryotic cells; whereas, we classify bacterial cells as prokaryotic.",
|
66 |
+
"triplets": [
|
67 |
+
"animal cells::classified as::eukaryotic cells",
|
68 |
+
"plant cells::classified as::eukaryotic cells",
|
69 |
+
"bacterial cells::classified as::prokaryotic"
|
70 |
+
]
|
71 |
+
},
|
72 |
+
{
|
73 |
+
"sentence": "Before discussing the criteria for determining whether a cell is prokaryotic or eukaryotic, we will first examine how biologists study cells.",
|
74 |
+
"triplets": [
|
75 |
+
"criteria::determines::cell",
|
76 |
+
"cell::is::prokaryotic",
|
77 |
+
"cell::is::eukaryotic",
|
78 |
+
"biologists::study::cells"
|
79 |
+
]
|
80 |
+
},
|
81 |
+
{
|
82 |
+
"sentence": "Cells vary in size.",
|
83 |
+
"triplets": [
|
84 |
+
"Cells::vary in::size"
|
85 |
+
]
|
86 |
+
},
|
87 |
+
{
|
88 |
+
"sentence": "With few exceptions, we cannot see individual cells with the naked eye, so scientists use microscopes (micro- = \"small\"; -scope = \"to look at\") to study them.",
|
89 |
+
"triplets": []
|
90 |
+
},
|
91 |
+
{
|
92 |
+
"sentence": "A microscope is an instrument that magnifies an object.",
|
93 |
+
"triplets": [
|
94 |
+
"microscope::is an instrument::object",
|
95 |
+
"microscope::magnifies::object"
|
96 |
+
]
|
97 |
+
},
|
98 |
+
{
|
99 |
+
"sentence": "We photograph most cells with a microscope, so we can call these images micrographs.",
|
100 |
+
"triplets": [
|
101 |
+
"cells::photograph::microscope",
|
102 |
+
"cells::call::images",
|
103 |
+
"images::be::micrographs"
|
104 |
+
]
|
105 |
+
},
|
106 |
+
{
|
107 |
+
"sentence": "The optics of a microscope's lenses change the image orientation that the user sees.",
|
108 |
+
"triplets": [
|
109 |
+
"optics::change::image orientation",
|
110 |
+
"microscope's lenses::change::image orientation",
|
111 |
+
"image orientation::seen by::user"
|
112 |
+
]
|
113 |
+
},
|
114 |
+
{
|
115 |
+
"sentence": "A specimen that is right-side up and facing right on the microscope slide will appear upside-down and facing left when one views through a microscope, and vice versa.",
|
116 |
+
"triplets": [
|
117 |
+
"Specimen::is placed on::Microscope slide",
|
118 |
+
"Specimen::appears as::Upside-down",
|
119 |
+
"Specimen::appears as::Facing left",
|
120 |
+
"Microscope slide::is viewed through::Microscope",
|
121 |
+
"Microscope::causes::Specimen to appear upside-down",
|
122 |
+
"Microscope::causes::Specimen to appear facing left"
|
123 |
+
]
|
124 |
+
},
|
125 |
+
{
|
126 |
+
"sentence": "Similarly, if one moves the slide left while looking through the microscope, it will appear to move right, and if one moves it down, it will seem to move up.",
|
127 |
+
"triplets": []
|
128 |
+
},
|
129 |
+
{
|
130 |
+
"sentence": "This occurs because microscopes use two sets of lenses to magnify the image.",
|
131 |
+
"triplets": [
|
132 |
+
"microscopes::use::two sets of lenses",
|
133 |
+
"microscopes::magnify::image",
|
134 |
+
"sets of lenses::magnify::image"
|
135 |
+
]
|
136 |
+
},
|
137 |
+
{
|
138 |
+
"sentence": "Because of the manner by which light travels through the lenses, this two lens system produces an inverted image (binocular, or dissecting microscopes, work in a similar manner, but include an additional magnification system that makes the final image appear to be upright).",
|
139 |
+
"triplets": [
|
140 |
+
"lenses::travels through::light",
|
141 |
+
"two lens system::produces::inverted image",
|
142 |
+
"binocular::work in a similar manner as::dissecting microscopes",
|
143 |
+
"dissecting microscopes::include::additional magnification system",
|
144 |
+
"additional magnification system::makes the final image appear to be::upright"
|
145 |
+
]
|
146 |
+
},
|
147 |
+
{
|
148 |
+
"sentence": "To give you a sense of cell size, a typical human red blood cell is about eight millionths of a meter or eight micrometers (abbreviated as eight \u03bcm) in diameter.",
|
149 |
+
"triplets": [
|
150 |
+
"cell size::is about::eight millionths of a meter",
|
151 |
+
"cell size::is about::eight micrometers",
|
152 |
+
"human red blood cell::is::eight millionths of a meter",
|
153 |
+
"human red blood cell::is::eight micrometers",
|
154 |
+
"eight millionths of a meter::is in diameter::human red blood cell",
|
155 |
+
"eight micrometers::is in diameter::human red blood cell"
|
156 |
+
]
|
157 |
+
},
|
158 |
+
{
|
159 |
+
"sentence": "A pin head is about two thousandths of a meter (two mm) in diameter.",
|
160 |
+
"triplets": [
|
161 |
+
"pin head::has diameter::two thousandths of a meter",
|
162 |
+
"two thousandths of a meter::is measured in::mm"
|
163 |
+
]
|
164 |
+
},
|
165 |
+
{
|
166 |
+
"sentence": "That means about 250 red blood cells could fit on a pinhead.",
|
167 |
+
"triplets": [
|
168 |
+
"250 red blood cells::could fit on::pinhead"
|
169 |
+
]
|
170 |
+
},
|
171 |
+
{
|
172 |
+
"sentence": "Most student microscopes are light microscopes (Figure 4.2a).",
|
173 |
+
"triplets": [
|
174 |
+
"student microscopes::are::light microscopes",
|
175 |
+
"student microscopes::are::N",
|
176 |
+
"student microscopes::are::Figure 4.2a",
|
177 |
+
"light microscopes::are::student microscopes",
|
178 |
+
"light microscopes::are::N",
|
179 |
+
"light microscopes::are::Figure 4.2a",
|
180 |
+
"Figure 4.2a::are::student microscopes",
|
181 |
+
"Figure 4.2a::are::light microscopes",
|
182 |
+
"Figure 4.2a::are::N"
|
183 |
+
]
|
184 |
+
},
|
185 |
+
{
|
186 |
+
"sentence": "Visible light passes and bends through the lens system to enable the user to see the specimen.",
|
187 |
+
"triplets": [
|
188 |
+
"Visible light::passes through::lens system",
|
189 |
+
"Visible light::bends through::lens system",
|
190 |
+
"lens system::enables::user",
|
191 |
+
"user::sees::specimen"
|
192 |
+
]
|
193 |
+
},
|
194 |
+
{
|
195 |
+
"sentence": "Light microscopes are advantageous for viewing living organisms, but since individual cells are generally transparent, their components are not distinguishable unless they are colored with special stains.",
|
196 |
+
"triplets": [
|
197 |
+
"Light microscopes::are advantageous for::viewing living organisms",
|
198 |
+
"individual cells::are not distinguishable unless colored with::special stains",
|
199 |
+
"components::are colored with::special stains"
|
200 |
+
]
|
201 |
+
},
|
202 |
+
{
|
203 |
+
"sentence": "Staining, however, usually kills the cells.",
|
204 |
+
"triplets": [
|
205 |
+
"Staining::kills::cells"
|
206 |
+
]
|
207 |
+
},
|
208 |
+
{
|
209 |
+
"sentence": "Light microscopes that undergraduates commonly use in the laboratory magnify up to approximately 400 times.",
|
210 |
+
"triplets": [
|
211 |
+
"Light microscopes::commonly use in::laboratory",
|
212 |
+
"undergraduates::commonly use in::laboratory",
|
213 |
+
"Light microscopes::magnify up to::approximately 400 times"
|
214 |
+
]
|
215 |
+
},
|
216 |
+
{
|
217 |
+
"sentence": "Two parameters that are important in microscopy are magnification and resolving power.",
|
218 |
+
"triplets": [
|
219 |
+
"parameters::are important in::microscopy",
|
220 |
+
"microscopy::are important in::parameters",
|
221 |
+
"parameters::are important in::magnification",
|
222 |
+
"magnification::are important in::parameters",
|
223 |
+
"parameters::are important in::resolving power",
|
224 |
+
"resolving power::are important in::parameters"
|
225 |
+
]
|
226 |
+
},
|
227 |
+
{
|
228 |
+
"sentence": "Magnification is the process of enlarging an object in appearance.",
|
229 |
+
"triplets": [
|
230 |
+
"Magnification::is the process of::enlarging an object in appearance",
|
231 |
+
"process::enlarging::an object in appearance",
|
232 |
+
"object::in::appearance"
|
233 |
+
]
|
234 |
+
},
|
235 |
+
{
|
236 |
+
"sentence": "Resolving power is the microscope's ability to distinguish two adjacent structures as separate: the higher the resolution, the better the image's clarity and detail.",
|
237 |
+
"triplets": [
|
238 |
+
"Resolving power::is the ability of::microscope's ability",
|
239 |
+
"Resolving power::distinguish::two adjacent structures",
|
240 |
+
"resolution::is higher with::better image's clarity",
|
241 |
+
"resolution::is higher with::better detail",
|
242 |
+
"image's clarity::is better with::higher resolution",
|
243 |
+
"detail::is better with::higher resolution"
|
244 |
+
]
|
245 |
+
},
|
246 |
+
{
|
247 |
+
"sentence": "When one uses oil immersion lenses to study small objects, magnification usually increases to 1,000 times.",
|
248 |
+
"triplets": [
|
249 |
+
"oil immersion lenses::used to study::small objects",
|
250 |
+
"oil immersion lenses::increases magnification to::1",
|
251 |
+
"small objects::studied using::oil immersion lenses",
|
252 |
+
"magnification::increases to::1"
|
253 |
+
]
|
254 |
+
},
|
255 |
+
{
|
256 |
+
"sentence": "In order to gain a better understanding of cellular structure and function, scientists typically use electron microscopes.",
|
257 |
+
"triplets": [
|
258 |
+
"cellular structure::is used for::understanding",
|
259 |
+
"cellular structure::is used for::function",
|
260 |
+
"scientists::use::electron microscopes"
|
261 |
+
]
|
262 |
+
},
|
263 |
+
{
|
264 |
+
"sentence": "a) Most light microscopes in a college biology lab can magnify cells up to approximately 400 times and have a resolution of about 200 nanometers.",
|
265 |
+
"triplets": [
|
266 |
+
"light microscopes::magnify::cells",
|
267 |
+
"light microscopes::have::resolution",
|
268 |
+
"light microscopes::have::nanometers",
|
269 |
+
"college biology lab::contain::light microscopes",
|
270 |
+
"cells::magnified by::light microscopes",
|
271 |
+
"resolution::measured in::nanometers"
|
272 |
+
]
|
273 |
+
},
|
274 |
+
{
|
275 |
+
"sentence": "b) Electron microscopes provide a much higher magnification, 100,000x, and a have a resolution of 50 picometers.",
|
276 |
+
"triplets": [
|
277 |
+
"Electron microscopes::provide::magnification",
|
278 |
+
"Electron microscopes::have::resolution",
|
279 |
+
"resolution::is of::picometers"
|
280 |
+
]
|
281 |
+
},
|
282 |
+
{
|
283 |
+
"sentence": "use a beam of electrons instead of a beam of light.",
|
284 |
+
"triplets": [
|
285 |
+
"beam of electrons::instead of::beam of light"
|
286 |
+
]
|
287 |
+
},
|
288 |
+
{
|
289 |
+
"sentence": "Not only does this allow for higher magnification and, thus, more detail (Figure 4.3), it also provides higher resolving power.",
|
290 |
+
"triplets": [
|
291 |
+
"higher magnification::allows for::more detail",
|
292 |
+
"higher magnification::provides::higher resolving power"
|
293 |
+
]
|
294 |
+
},
|
295 |
+
{
|
296 |
+
"sentence": "The method to prepare the specimen for viewing with an electron microscope kills the specimen.",
|
297 |
+
"triplets": [
|
298 |
+
"method::prepares::specimen",
|
299 |
+
"method::kills::specimen",
|
300 |
+
"specimen::is prepared for::viewing",
|
301 |
+
"specimen::is viewed with::electron microscope"
|
302 |
+
]
|
303 |
+
},
|
304 |
+
{
|
305 |
+
"sentence": "Electrons have short wavelengths (shorter than photons) that move best in a vacuum, so we cannot view living cells with an electron microscope.",
|
306 |
+
"triplets": [
|
307 |
+
"Electrons::have::short wavelengths",
|
308 |
+
"Electrons::move best in::vacuum",
|
309 |
+
"we::cannot view::living cells",
|
310 |
+
"we::cannot view::electron microscope",
|
311 |
+
"short wavelengths::are shorter than::photons",
|
312 |
+
"vacuum::is best for::electrons",
|
313 |
+
"living cells::cannot be viewed with::electron microscope"
|
314 |
+
]
|
315 |
+
},
|
316 |
+
{
|
317 |
+
"sentence": "In a scanning electron microscope, a beam of electrons moves back and forth across a cell's surface, creating details of cell surface characteristics.",
|
318 |
+
"triplets": [
|
319 |
+
"scanning electron microscope::moves across::beam of electrons",
|
320 |
+
"beam of electrons::creates::details",
|
321 |
+
"beam of electrons::moves across::cell's surface",
|
322 |
+
"details::of::cell surface characteristics"
|
323 |
+
]
|
324 |
+
},
|
325 |
+
{
|
326 |
+
"sentence": "In a transmission electron microscope, the electron beam penetrates the cell and provides details of a cell's internal structures.",
|
327 |
+
"triplets": [
|
328 |
+
"transmission electron microscope::penetrates::cell",
|
329 |
+
"electron beam::provides details of::cell's internal structures"
|
330 |
+
]
|
331 |
+
},
|
332 |
+
{
|
333 |
+
"sentence": "As you might imagine, electron microscopes are significantly more bulky and expensive than light microscopes.",
|
334 |
+
"triplets": [
|
335 |
+
"electron microscopes::are significantly more bulky and expensive than::light microscopes"
|
336 |
+
]
|
337 |
+
},
|
338 |
+
{
|
339 |
+
"sentence": "a) These Salmonella bacteria appear as tiny purple dots when viewed with a light microscope.",
|
340 |
+
"triplets": [
|
341 |
+
"Salmonella bacteria::appear as::tiny purple dots",
|
342 |
+
"tiny purple dots::viewed with::light microscope"
|
343 |
+
]
|
344 |
+
},
|
345 |
+
{
|
346 |
+
"sentence": "b) This scanning electron microscope micrograph shows Salmonella bacteria (in red) invading human cells (yellow).",
|
347 |
+
"triplets": [
|
348 |
+
"scanning electron microscope::shows::Salmonella bacteria",
|
349 |
+
"scanning electron microscope::shows::human cells",
|
350 |
+
"micrograph::shows::Salmonella bacteria",
|
351 |
+
"micrograph::shows::human cells",
|
352 |
+
"Salmonella bacteria::invading::human cells"
|
353 |
+
]
|
354 |
+
},
|
355 |
+
{
|
356 |
+
"sentence": "Even though subfigure (b) shows a different Salmonella specimen than subfigure (a), you can still observe the comparative increase in magnification and detail.",
|
357 |
+
"triplets": []
|
358 |
+
},
|
359 |
+
{
|
360 |
+
"sentence": "For another perspective on cell size, try the HowBig interactive at this site.",
|
361 |
+
"triplets": [
|
362 |
+
"perspective::on::cell size",
|
363 |
+
"perspective::at::site",
|
364 |
+
"HowBig interactive::at::site"
|
365 |
+
]
|
366 |
+
},
|
367 |
+
{
|
368 |
+
"sentence": "The microscopes we use today are far more complex than those that Dutch shopkeeper Antony van Leeuwenhoek, used in the 1600s.",
|
369 |
+
"triplets": [
|
370 |
+
"microscopes::are more complex than::Dutch shopkeeper Antony van Leeuwenhoek"
|
371 |
+
]
|
372 |
+
},
|
373 |
+
{
|
374 |
+
"sentence": "Skilled in crafting lenses, van Leeuwenhoek observed the movements of single-celled organisms, which he collectively termed \"animalcules.\"",
|
375 |
+
"triplets": [
|
376 |
+
"van Leeuwenhoek::observed::movements",
|
377 |
+
"van Leeuwenhoek::observed::single-celled organisms",
|
378 |
+
"van Leeuwenhoek::termed::animalcules",
|
379 |
+
"lenses::crafted by::van Leeuwenhoek"
|
380 |
+
]
|
381 |
+
},
|
382 |
+
{
|
383 |
+
"sentence": "In the 1665 publication Micrographia, experimental scientist Robert Hooke coined the term \"cell\" for the box-like structures he observed when viewing cork tissue through a lens.",
|
384 |
+
"triplets": [
|
385 |
+
"Micrographia::publication::1665",
|
386 |
+
"Robert Hooke::coined::term",
|
387 |
+
"Robert Hooke::observed::box-like structures",
|
388 |
+
"Robert Hooke::viewing::cork tissue",
|
389 |
+
"Robert Hooke::viewing::lens",
|
390 |
+
"cork tissue::viewed through::lens"
|
391 |
+
]
|
392 |
+
},
|
393 |
+
{
|
394 |
+
"sentence": "In the 1670s, van Leeuwenhoek discovered bacteria and protozoa.",
|
395 |
+
"triplets": [
|
396 |
+
"van Leeuwenhoek::discovered::bacteria",
|
397 |
+
"van Leeuwenhoek::discovered::protozoa"
|
398 |
+
]
|
399 |
+
},
|
400 |
+
{
|
401 |
+
"sentence": "Later advances in lenses, microscope construction, and staining techniques enabled other scientists to see some components inside cells.",
|
402 |
+
"triplets": [
|
403 |
+
"advances::in::lenses",
|
404 |
+
"advances::in::microscope construction",
|
405 |
+
"advances::in::staining techniques",
|
406 |
+
"lenses::enabled::scientists",
|
407 |
+
"microscope construction::enabled::scientists",
|
408 |
+
"staining techniques::enabled::scientists",
|
409 |
+
"scientists::see::components",
|
410 |
+
"components::inside::cells"
|
411 |
+
]
|
412 |
+
},
|
413 |
+
{
|
414 |
+
"sentence": "By the late 1830s, botanist Matthias Schleiden and zoologist Theodor Schwann were studying tissues and proposed the unified cell theory, which states that one or more cells comprise all living things, the cell is the basic unit of life, and new cells arise from existing cells.",
|
415 |
+
"triplets": [
|
416 |
+
"botanist Matthias Schleiden::studied::tissues",
|
417 |
+
"zoologist Theodor Schwann::studied::tissues",
|
418 |
+
"botanist Matthias Schleiden::proposed::unified cell theory",
|
419 |
+
"zoologist Theodor Schwann::proposed::unified cell theory",
|
420 |
+
"unified cell theory::states that::one or more cells comprise all living things",
|
421 |
+
"unified cell theory::states that::the cell is the basic unit of life",
|
422 |
+
"unified cell theory::states that::new cells arise from existing cells"
|
423 |
+
]
|
424 |
+
},
|
425 |
+
{
|
426 |
+
"sentence": "Rudolf Virchow later made important contributions to this theory.",
|
427 |
+
"triplets": [
|
428 |
+
"Rudolf Virchow::made contributions to::theory"
|
429 |
+
]
|
430 |
+
},
|
431 |
+
{
|
432 |
+
"sentence": "Have you ever heard of a medical test called a Pap smear (Figure 4.4)?",
|
433 |
+
"triplets": [
|
434 |
+
"medical test::called::Pap smear"
|
435 |
+
]
|
436 |
+
},
|
437 |
+
{
|
438 |
+
"sentence": "In this test, a doctor takes a small sample of cells from the patient's uterine cervix and sends it to a medical lab where a cytotechnologist stains the cells and examines them for any changes that could indicate cervical cancer or a microbial infection.",
|
439 |
+
"triplets": [
|
440 |
+
"doctor::takes::small sample",
|
441 |
+
"doctor::sends::cells",
|
442 |
+
"cells::stains::cytotechnologist",
|
443 |
+
"cells::examines::changes",
|
444 |
+
"changes::indicate::cervical cancer",
|
445 |
+
"changes::indicate::microbial infection"
|
446 |
+
]
|
447 |
+
},
|
448 |
+
{
|
449 |
+
"sentence": "Cytotechnologists (cyto- = \"cell\") are professionals who study cells via microscopic examinations and other laboratory tests.",
|
450 |
+
"triplets": [
|
451 |
+
"Cytotechnologists::are professionals::N",
|
452 |
+
"Cytotechnologists::study::cells",
|
453 |
+
"professionals::study::cells",
|
454 |
+
"cells::via::microscopic examinations",
|
455 |
+
"cells::via::other laboratory tests"
|
456 |
+
]
|
457 |
+
},
|
458 |
+
{
|
459 |
+
"sentence": "They are trained to determine which cellular changes are within normal limits and which are abnormal.",
|
460 |
+
"triplets": []
|
461 |
+
},
|
462 |
+
{
|
463 |
+
"sentence": "Their focus is not limited to cervical cells.",
|
464 |
+
"triplets": [
|
465 |
+
"Their::has_focus::focus",
|
466 |
+
"focus::has_focus_on::cervical cells"
|
467 |
+
]
|
468 |
+
},
|
469 |
+
{
|
470 |
+
"sentence": "They study cellular specimens that come from all organs.",
|
471 |
+
"triplets": [
|
472 |
+
"cellular specimens::come from::organs"
|
473 |
+
]
|
474 |
+
},
|
475 |
+
{
|
476 |
+
"sentence": "When they notice abnormalities, they consult a pathologist, a medical doctor who interprets and diagnoses changes that disease in body tissue and fluids cause.",
|
477 |
+
"triplets": [
|
478 |
+
"abnormalities::notice::they",
|
479 |
+
"they::consult::pathologist",
|
480 |
+
"pathologist::interprets and diagnoses::changes",
|
481 |
+
"pathologist::interprets and diagnoses::disease",
|
482 |
+
"changes::cause::disease",
|
483 |
+
"disease::cause::body tissue",
|
484 |
+
"disease::cause::fluids"
|
485 |
+
]
|
486 |
+
},
|
487 |
+
{
|
488 |
+
"sentence": "Cytotechnologists play a vital role in saving people's lives.",
|
489 |
+
"triplets": [
|
490 |
+
"Cytotechnologists::play a vital role in::people's lives"
|
491 |
+
]
|
492 |
+
},
|
493 |
+
{
|
494 |
+
"sentence": "When doctors discover abnormalities early, a patient's treatment can begin sooner, which usually increases the chances of a successful outcome.",
|
495 |
+
"triplets": [
|
496 |
+
"doctors::discover::abnormalities",
|
497 |
+
"doctors::begin::treatment",
|
498 |
+
"treatment::increase::chances",
|
499 |
+
"treatment::lead to::successful outcome"
|
500 |
+
]
|
501 |
+
},
|
502 |
+
{
|
503 |
+
"sentence": "These uterine cervix cells, viewed through a light microscope, are from a Pap smear.",
|
504 |
+
"triplets": [
|
505 |
+
"uterine cervix cells::viewed through::light microscope",
|
506 |
+
"uterine cervix cells::from::Pap smear"
|
507 |
+
]
|
508 |
+
},
|
509 |
+
{
|
510 |
+
"sentence": "Normal cells are on the left.",
|
511 |
+
"triplets": [
|
512 |
+
"Normal cells::are on::left"
|
513 |
+
]
|
514 |
+
},
|
515 |
+
{
|
516 |
+
"sentence": "The cells on the right are infected with human papillomavirus (HPV).",
|
517 |
+
"triplets": [
|
518 |
+
"cells::infected with::human papillomavirus"
|
519 |
+
]
|
520 |
+
},
|
521 |
+
{
|
522 |
+
"sentence": "Notice that the infected cells are larger.",
|
523 |
+
"triplets": []
|
524 |
+
},
|
525 |
+
{
|
526 |
+
"sentence": "Also, two of these cells each have two nuclei instead of one, the normal number.",
|
527 |
+
"triplets": [
|
528 |
+
"cells::have::two nuclei",
|
529 |
+
"cells::have::normal number"
|
530 |
+
]
|
531 |
+
}
|
532 |
+
]
|
pipeline.py
ADDED
@@ -0,0 +1,87 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
import time
|
2 |
+
from my_lib.chatbot import ChatGPT
|
3 |
+
import re
|
4 |
+
|
5 |
+
|
6 |
+
class BasePipeline:
|
7 |
+
|
8 |
+
chat = ChatGPT(model="gpt-3.5-turbo")
|
9 |
+
|
10 |
+
def add_task(self, task, **inputs):
|
11 |
+
args = task["args"]
|
12 |
+
args_supplied = {arg: arg in inputs for arg in args}
|
13 |
+
|
14 |
+
if all(args_supplied.values()):
|
15 |
+
|
16 |
+
formatted_task = f"[TASK]\n{task['description']}"
|
17 |
+
|
18 |
+
if len(args):
|
19 |
+
formatted_task += f"\n\n[INPUT]\n" + '\n'.join([f'{arg}: {inputs[arg]}' for arg in args])
|
20 |
+
|
21 |
+
self.pipe.append(formatted_task)
|
22 |
+
else:
|
23 |
+
raise ValueError(
|
24 |
+
f"Missing required argument(s): {[arg for arg in args if not args_supplied[arg]]}"
|
25 |
+
)
|
26 |
+
|
27 |
+
|
28 |
+
def compile_tasks(self, inputs_by_idx):
|
29 |
+
self.pipe = []
|
30 |
+
|
31 |
+
for idx, task in enumerate(self.prompt):
|
32 |
+
self.add_task(task, **inputs_by_idx.get(idx, {}))
|
33 |
+
|
34 |
+
|
35 |
+
def forward(self):
|
36 |
+
self.history = []
|
37 |
+
|
38 |
+
for task in self.pipe:
|
39 |
+
output = self.chat(task)
|
40 |
+
time.sleep(0.2)
|
41 |
+
|
42 |
+
self.history.append(self.chat.history)
|
43 |
+
self.chat.clear_history()
|
44 |
+
|
45 |
+
return self.postprocess(output)
|
46 |
+
|
47 |
+
|
48 |
+
def postprocess(self):
|
49 |
+
raise NotImplementedError
|
50 |
+
|
51 |
+
|
52 |
+
class Text2KG(BasePipeline):
|
53 |
+
|
54 |
+
|
55 |
+
def __init__(self, recipe: dict):
|
56 |
+
|
57 |
+
self.name = recipe["name"]
|
58 |
+
self.prompt = recipe["prompt"]
|
59 |
+
|
60 |
+
self.chat = ChatGPT(
|
61 |
+
model="gpt-3.5-turbo",
|
62 |
+
init=(
|
63 |
+
"You are a sentence parsing agent helping to construct a knowledge graph."
|
64 |
+
),
|
65 |
+
temperature=0.3
|
66 |
+
)
|
67 |
+
|
68 |
+
|
69 |
+
def __call__(self, text):
|
70 |
+
self.compile_tasks(
|
71 |
+
inputs_by_idx={0: {"text": text}}
|
72 |
+
)
|
73 |
+
|
74 |
+
return self.forward()
|
75 |
+
|
76 |
+
|
77 |
+
def __repr__(self):
|
78 |
+
return f"Text2KG(recipe={self.name})"
|
79 |
+
|
80 |
+
|
81 |
+
def postprocess(self, output: str):
|
82 |
+
|
83 |
+
word_pattern = r"'?\w+(?:[ |.'-]\w+)*'?"
|
84 |
+
triplet_pattern = f'({word_pattern}::{word_pattern}::{word_pattern})'
|
85 |
+
processed = re.findall(triplet_pattern, re.sub(r'[<>]', '', output))
|
86 |
+
|
87 |
+
return processed
|
recipes.json
ADDED
@@ -0,0 +1,37 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
[
|
2 |
+
{
|
3 |
+
"name": "LogicBased",
|
4 |
+
"prompt": [
|
5 |
+
{
|
6 |
+
"description": "Given the text, extract a list of the premises embedded within it. Focus on identifying declarative sentences that convey factual information.",
|
7 |
+
"args": ["text"]
|
8 |
+
},
|
9 |
+
{
|
10 |
+
"description": "Convert each premise into a knowledge graph triplet consisting of a subject, predicate, and object. The subject should be the main entity mentioned, the predicate should represent the relationship or attribute, and the object should be another entity or value associated with the subject. Return a list of these knowledge graph triplets, where each triplet has the form: <subject>::<relation>::<object>",
|
11 |
+
"args": []
|
12 |
+
}
|
13 |
+
]
|
14 |
+
},
|
15 |
+
{
|
16 |
+
"name": "Traditional",
|
17 |
+
"prompt": [
|
18 |
+
{
|
19 |
+
"description": "Extract the entities from the text. Focus on descriptive noun chunks and noun phrases.",
|
20 |
+
"args": ["text"]
|
21 |
+
},
|
22 |
+
{
|
23 |
+
"description": "Label all pairs of entities with the appropriate relation between them (or N/A if none). The result will be a list of triplets of the form: <subject>::<relation>::<object>",
|
24 |
+
"args": []
|
25 |
+
}
|
26 |
+
]
|
27 |
+
},
|
28 |
+
{
|
29 |
+
"name": "Direct",
|
30 |
+
"prompt": [
|
31 |
+
{
|
32 |
+
"description": "Extract a list of knowledge graph triplets from the text. Triplets should have the form: <subject>::<relation>::<object>.",
|
33 |
+
"args": ["text"]
|
34 |
+
}
|
35 |
+
]
|
36 |
+
}
|
37 |
+
]
|
text2kg.py
ADDED
@@ -0,0 +1,69 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
from argparse import ArgumentParser
|
2 |
+
from datetime import date
|
3 |
+
import json
|
4 |
+
import os
|
5 |
+
|
6 |
+
from nltk.tokenize import sent_tokenize
|
7 |
+
from tqdm import tqdm
|
8 |
+
|
9 |
+
from pipeline import Text2KG
|
10 |
+
|
11 |
+
|
12 |
+
def parse_args():
|
13 |
+
parser = ArgumentParser()
|
14 |
+
parser.add_argument("infile", type=str)
|
15 |
+
parser.add_argument("--output", type=str, default="./output")
|
16 |
+
parser.add_argument("--cookbook", type=str, default="./recipes.json",
|
17 |
+
help="path to cookbook")
|
18 |
+
parser.add_argument("--recipe", type=str, default=None,
|
19 |
+
help="name of recipe to use"),
|
20 |
+
parser.add_argument("--thoughts", action="store_true")
|
21 |
+
|
22 |
+
return parser.parse_args()
|
23 |
+
|
24 |
+
|
25 |
+
def save(name, item, args):
|
26 |
+
|
27 |
+
os.makedirs(args.output, exist_ok=True)
|
28 |
+
|
29 |
+
today = date.today()
|
30 |
+
filename = f"{today}_{name}_{args.recipe}.json"
|
31 |
+
filepath = os.path.join(args.output, filename)
|
32 |
+
|
33 |
+
with open(filepath, 'w') as f:
|
34 |
+
json.dump(item, f)
|
35 |
+
|
36 |
+
|
37 |
+
def main(args):
|
38 |
+
with open(args.cookbook) as f:
|
39 |
+
cookbook = json.load(f)
|
40 |
+
|
41 |
+
recipe = None
|
42 |
+
for item in cookbook:
|
43 |
+
if item["name"] == args.recipe:
|
44 |
+
recipe = item
|
45 |
+
if recipe is None:
|
46 |
+
raise ValueError(f"Recipe '{args.recipe}' does not exist in cookbook.")
|
47 |
+
|
48 |
+
pipe = Text2KG(recipe)
|
49 |
+
|
50 |
+
with open(args.infile) as f:
|
51 |
+
text = f.read()
|
52 |
+
|
53 |
+
sentences = sent_tokenize(text.replace('\n', ' '))
|
54 |
+
|
55 |
+
triplets = [pipe(s) for s in tqdm(sentences)]
|
56 |
+
|
57 |
+
output = [{"sentence": s, "triplets": t} for s, t in zip(sentences, triplets)]
|
58 |
+
|
59 |
+
save("triplets", output, args)
|
60 |
+
|
61 |
+
if args.thoughts:
|
62 |
+
save("thoughts", pipe.history, args)
|
63 |
+
|
64 |
+
return output
|
65 |
+
|
66 |
+
|
67 |
+
if __name__ == "__main__":
|
68 |
+
args = parse_args()
|
69 |
+
main(args)
|