sani903 committed on
Commit
7235454
·
1 Parent(s): afbf93f

Update README.md

Files changed (1)
  1. README.md +2 -12
README.md CHANGED
@@ -1,13 +1,3 @@
  # Identifier-Renaming
- Generating higher quality identifier names by using context and following conventions with RLHF <br>
- 0.0 is used to generate the csv dataset from the .java files after 4.0 <br>
- 1.0 is code for the classifier to predict the number of mask tokens to insert in for the variable name <br>
- 2.2-all is used for finetuning the GraphCodeBert model on variable names <br>
- 4.0 is used to create the dataset. Repositories are cloned and this code file iterates over all the files and preprocesses them for creation of dataset.<br>
- procTest.ipynb processes the text generated while evaluating the trained model and generates relevant graphs<br>
- class_eval.py is used to evaluate the performance of the classifer <br>
- identifier_scoring.py uses two non-fine-tuned models GraphCodeBERT and CodeBERT for the metric<br>
- stat_sampling.py evaluates the use of random sampling technique to predict number of mask tokens<br>
- model_eval.py is the code to evaluate the trained model<br>
- model_test.csv is the subset of data used to evaluate the model<br>
- test.csv is the dataset used for the evaluation of the readability metric and the fine-tuned model <br>
+ Generating higher quality identifier names by using context and following conventions <br>
+