drugilsberg committed
Commit 9c92b60 • 1 Parent(s): 1634315
feat: updating model card.
Signed-off-by: Matteo Manica <[email protected]>
- model_cards/article.md +9 -10
model_cards/article.md CHANGED
@@ -10,7 +10,7 @@

 # Model card -- PolymerBlocks

-**Model Details**: *PolymerBlocks* is a sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers). The model relies on a Variational Autoencoder architecture as described in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376)
+**Model Details**: *PolymerBlocks* is a sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers). The model relies on a Variational Autoencoder architecture as described in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376).

 **Developers**: Matteo Manica and colleagues from IBM Research.

@@ -18,21 +18,19 @@

 **Model date**: Not yet published.

-**Model version**: Only initial model version.
+**Model version**: Only initial model version. The model has been pre-trained on 500K compounds from PubChem and further fine-tuned on the SMILES representing monomers and catalysts collected in the database presented in [Park et al. (2022)](https://doi.org/10.26434/chemrxiv-2022-811rl).

 **Model type**: A sequence-based molecular generator tuned to generate blocks of polymers (e.g., catalysts and monomers).

-**Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**:
-N.A.
+**Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**: the sequence-based model is a standard GRU-based VAE trained to reconstruct SMILES representation of molecules. Given the nature of the pre-training and fine-tuning data, the model is biased to create molecules that resemble catalysts and monomers employed in ring-opening polymerization.

-**Paper or other resource for more information**:
-TBD
+**Paper or other resource for more information**: Details on the model used and code can be found in [Born et al. (2021; *iScience*)](https://www.sciencedirect.com/science/article/pii/S2589004221002376).

 **License**: MIT

 **Where to send questions or comments about the model**: Open an issue on [GT4SD repository](https://github.com/GT4SD/gt4sd-core).

-**Intended Use. Use cases that were envisioned during development**: Chemical research, in particular
+**Intended Use. Use cases that were envisioned during development**: Chemical research, in particular discovery and catalysts for polymerization.

 **Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes.

@@ -40,7 +38,7 @@ TBD

 **Metrics**: N.A.

-**Datasets**:
+**Datasets**: See description in the model versions.

 **Ethical Considerations**: Unclear, please consult with original authors in case of questions.

@@ -49,7 +47,7 @@ TBD
 Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs)

 ## Citation
-
+
 ```bib
 @article{manica2022gt4sd,
 title={GT4SD: Generative Toolkit for Scientific Discovery},
@@ -57,4 +55,5 @@ TBD, temporarily please cite:
 journal={arXiv preprint arXiv:2207.03928},
 year={2022}
 }
-```
+```
+