Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,9 @@ datasets:
|
|
5 |
- medalpaca/medical_meadow_mediqa
|
6 |
- andersonbcdefg/chemistry
|
7 |
---
|
8 |
-
2024-08-
|
|
|
|
|
9 |
|
10 |
2024-08-12: The medalpaca/medical_meadow_mediqa data set was also used, but the model converged on this in less than one epoch, only 1400 steps of training was concluded, in future versions and editions I might elect to exclude this data set, but it is included in this version.
|
11 |
|
|
|
5 |
- medalpaca/medical_meadow_mediqa
|
6 |
- andersonbcdefg/chemistry
|
7 |
---
|
8 |
+
2024-08-15: Splitting model today. This model will be the base ChemWiz Model. The first vintage that I will create today will be the RDKit coder, using my custom data set. Once I have this model, I will create a dev critic out of that model. Will then start a set of tests with Microsoft Autogen testing to test if the addition of a coding critic would improve the results. Still toying with the idea of creating a ChemWiz critic to see if it improves the outcomes and reduces halicinations. But lets see.
|
9 |
+
|
10 |
+
2024-08-13: Taking the model through second round of AI4Chem/ChemData700K, I am amazed how the model seem to converge and the suddenly it does not. I suspect in nextfew days that it will converge. I am quite keen to see this happen. The results of chemichal smiles are very low at this point.
|
11 |
|
12 |
2024-08-12: The medalpaca/medical_meadow_mediqa data set was also used, but the model converged on this in less than one epoch, only 1400 steps of training was concluded, in future versions and editions I might elect to exclude this data set, but it is included in this version.
|
13 |
|