pearsonkyle commited on
Commit
131556f
·
1 Parent(s): 04914c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -6
README.md CHANGED
@@ -11,22 +11,47 @@ should probably proofread and complete it, then remove this comment. -->
11
 
12
  # gpt2-arxiv
13
 
14
- This model was trained from scratch on an unknown dataset.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  ## Model description
17
 
18
- More information needed
19
 
20
  ## Intended uses & limitations
21
 
22
- More information needed
 
 
23
 
24
- ## Training and evaluation data
25
 
26
- More information needed
27
 
28
  ## Training procedure
29
 
 
 
30
  ### Training hyperparameters
31
 
32
  The following hyperparameters were used during training:
@@ -36,7 +61,7 @@ The following hyperparameters were used during training:
36
  - seed: 42
37
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
38
  - lr_scheduler_type: linear
39
- - num_epochs: 50
40
 
41
  ### Framework versions
42
 
 
11
 
12
  # gpt2-arxiv
13
 
14
+ A [gpt2](https://huggingface.co/gpt2) powered predictive keyboard trained on ~1.6M manuscript abstracts from the ArXiv.
15
+ This model uses https://www.kaggle.com/datasets/Cornell-University/arxiv
16
+
17
+ ```python
18
+ from transformers import pipeline
19
+ from transformers import GPT2TokenizerFast
20
+
21
+ tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
22
+ llm = pipeline('text-generation',model='pearsonkyle/gpt2-arxiv', tokenizer=tokenizer)
23
+
24
+ texts = llm("Directly imaged exoplanets probe",
25
+ max_length=50, do_sample=True, num_return_sequences=5,
26
+ penalty_alpha=0.65, top_k=4, repetition_penalty=1.2)
27
+
28
+ for i in range(5):
29
+ print(texts[i]['generated_text']+'\n')
30
+ ```
31
+
32
+ - *The reflectance of Earth's vegetation suggests* `that large, deciduous forest fires are composed of mostly dry, unprocessed material that is distributed in a nearly patchy fashion. The distributions of these fires are correlated with temperature, and also with vegetation...`
33
+ - *Directly imaged exoplanets probe* the atmospheres of giant planets. The detection of such planets requires high-quality imaging with high contrast and angular resolution, as well as
34
+ - *We can remotely sense an atmosphere by observing its reflected, transmitted, or emitted light in varying geometries. This light will contain information on* the planetary conditions including atmospheric temperature and cloud properties, which is essential for understanding how the planet interacts with the atmosphere and how it affects the climate. The primary science objective of this paper is to develop a methodology that can be applied to any kind of observation and measurement data, and to provide a framework that enables the detection and characterisation of the atmospheres of ex
35
+
36
 
37
  ## Model description
38
 
39
+ [GPT-2](https://huggingface.co/transformers/v2.2.0/pretrained_models.html): 12-layer, 768-hidden, 12-heads, 117M parameters
40
 
41
  ## Intended uses & limitations
42
 
43
+ Coming soon...
44
+ - Predictive Keyboard using text generation
45
+ - Realtime reference recommendations using nearest neighbors of embeddings
46
 
47
+ Be careful when generating a lot of text...The things it says are not that truthful, maybe more training is needed
48
 
49
+ - The surface of Mars is composed of a thin layer of water ice, that was discovered by the Cassini spacecraft after its impact on the Earth's surface.
50
 
51
  ## Training procedure
52
 
53
+ ~49 hours on a 3090 training for 1.25M iterations
54
+
55
  ### Training hyperparameters
56
 
57
  The following hyperparameters were used during training:
 
61
  - seed: 42
62
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
  - lr_scheduler_type: linear
64
+ - num_epochs: 10
65
 
66
  ### Framework versions
67