drugilsberg commited on
Commit
9c92b60
1 Parent(s): 1634315

feat: updating model card.

Browse files

Signed-off-by: Matteo Manica <[email protected]>

Files changed (1) hide show
  1. model_cards/article.md +9 -10
model_cards/article.md CHANGED
@@ -10,7 +10,7 @@
10
 
11
  # Model card -- PolymerBlocks
12
 
13
- **Model Details**: *PolymerBlocks* is a sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers). The model relies on a Variational Autoencoder architecture as described in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376)
14
 
15
  **Developers**: Matteo Manica and colleagues from IBM Research.
16
 
@@ -18,21 +18,19 @@
18
 
19
  **Model date**: Not yet published.
20
 
21
- **Model version**: Only initial model version.
22
 
23
  **Model type**: A sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers).
24
 
25
- **Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**:
26
- N.A.
27
 
28
- **Paper or other resource for more information**:
29
- TBD
30
 
31
  **License**: MIT
32
 
33
  **Where to send questions or comments about the model**: Open an issue on [GT4SD repository](https://github.com/GT4SD/gt4sd-core).
34
 
35
- **Intended Use. Use cases that were envisioned during development**: Chemical research, in particular drug discovery.
36
 
37
  **Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes.
38
 
@@ -40,7 +38,7 @@ TBD
40
 
41
  **Metrics**: N.A.
42
 
43
- **Datasets**: N.A.
44
 
45
  **Ethical Considerations**: Unclear, please consult with original authors in case of questions.
46
 
@@ -49,7 +47,7 @@ TBD
49
  Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs)
50
 
51
  ## Citation
52
- TBD, temporarily please cite:
53
  ```bib
54
  @article{manica2022gt4sd,
55
  title={GT4SD: Generative Toolkit for Scientific Discovery},
@@ -57,4 +55,5 @@ TBD, temporarily please cite:
57
  journal={arXiv preprint arXiv:2207.03928},
58
  year={2022}
59
  }
60
- ```
 
 
10
 
11
  # Model card -- PolymerBlocks
12
 
13
+ **Model Details**: *PolymerBlocks* is a sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers). The model relies on a Variational Autoencoder architecture as described in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376).
14
 
15
  **Developers**: Matteo Manica and colleagues from IBM Research.
16
 
 
18
 
19
  **Model date**: Not yet published.
20
 
21
+ **Model version**: Only initial model version. The model has been pre-trained on 500K compounds from PubChem and further fine-tuned on the SMILES representing monomers and catalysts collected in the database presented in [Park et al. (2022)](https://doi.org/10.26434/chemrxiv-2022-811rl).
22
 
23
  **Model type**: A sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers).
24
 
25
+ **Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**: the sequence-based model is a standard GRU-based VAE trained to reconstruct SMILES representation of molecules. Given the nature of the pre-training and fine-tuning data, the model is biased to create molecules that resemble catalysts and monomers employed in ring-opening polymerization.
 
26
 
27
+ **Paper or other resource for more information**: Details on the model used and code can be found in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376).
 
28
 
29
  **License**: MIT
30
 
31
  **Where to send questions or comments about the model**: Open an issue on [GT4SD repository](https://github.com/GT4SD/gt4sd-core).
32
 
33
+ **Intended Use. Use cases that were envisioned during development**: Chemical research, in particular discovery and catalysts for polymerization.
34
 
35
  **Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes.
36
 
 
38
 
39
  **Metrics**: N.A.
40
 
41
+ **Datasets**: See description in the model versions.
42
 
43
  **Ethical Considerations**: Unclear, please consult with original authors in case of questions.
44
 
 
47
  Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs)
48
 
49
  ## Citation
50
+
51
  ```bib
52
  @article{manica2022gt4sd,
53
  title={GT4SD: Generative Toolkit for Scientific Discovery},
 
55
  journal={arXiv preprint arXiv:2207.03928},
56
  year={2022}
57
  }
58
+ ```
59
+