sagawa committed on
Commit
489be70
1 Parent(s): fc64e8a

Update README.md

Files changed (1)
  1. README.md +12 -11
README.md CHANGED
@@ -52,26 +52,27 @@ output # 'CCN(CC)CCN=C=S.Cc1cnc2c(c1)CCCC2N'
  ### Training Procedure
 
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
- We used the Open Reaction Database (ORD) dataset for model training.
+ We used the [USPTO_50k dataset](https://yzhang.hpc.nyu.edu/T5Chem/index.html) for model finetuning.
  The command used for training is the following. For more information, please refer to the paper and GitHub repository.
 
  ```python
- python train_without_duplicates.py \
- --model='t5' \
- --epochs=80 \
- --lr=2e-4 \
+ cd task_retrosynthesis
+ python finetune.py \
+ --output_dir='t5' \
+ --epochs=20 \
+ --lr=2e-5 \
  --batch_size=32 \
- --input_max_len=100 \
+ --input_max_len=150 \
  --target_max_len=150 \
  --weight_decay=0.01 \
  --evaluation_strategy='epoch' \
  --save_strategy='epoch' \
  --logging_strategy='epoch' \
- --train_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_train.csv' \
- --valid_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_valid.csv' \
- --test_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_test.csv' \
- --USPTO_test_data_path='/home/acf15718oa/ReactionT5_neword/data/USPTO_50k/test.csv' \
- --pretrained_model_name_or_path='sagawa/CompoundT5'
+ --save_total_limit=10 \
+ --train_data_path='../data/USPTO_50k/train.csv' \
+ --valid_data_path='../data/USPTO_50k/val.csv' \
+ --disable_tqdm \
+ --model_name_or_path='sagawa/ReactionT5v2-retrosynthesis'
  ```
 
  ### Results