iterateai
/

Interplay-AppCoder

Text Generation

text-generation-inference

Model card Files Files and versions

iterateai commited on Oct 31, 2023

Commit

29c0b27

·

1 Parent(s): ffdcf97

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -116,7 +116,7 @@ The ICE methodology provides metrics for Usefulness and Functional Correctness a
 * Functional Correctness: An LLM that has complex reasoning capabilities is utilized to conduct unit tests while considering the given question and the reference code.
-We utilized GPT4 to measure the above metrics and provide a score from 0-4. This is the test dataset[https://drive.google.com/file/d/1R6DDyBhcR6TSUYFTgUosJxrvibkR1BHC/view] and Jupyter notebook [https://colab.research.google.com/drive/1USuNLFxLex-C5tLHYET_nQfpM4ALCbc5?usp=sharing#scrollTo=lNCZTBj1nBsJ] we used to perform the benchmark.
 You can read more about the ICE methodology in this paper.

 * Functional Correctness: An LLM that has complex reasoning capabilities is utilized to conduct unit tests while considering the given question and the reference code.
+We utilized GPT4 to measure the above metrics and provide a score from 0-4. This is the [test dataset](https://drive.google.com/file/d/1R6DDyBhcR6TSUYFTgUosJxrvibkR1BHC/view) and [Jupyter notebook] (https://colab.research.google.com/drive/1USuNLFxLex-C5tLHYET_nQfpM4ALCbc5?usp=sharing#scrollTo=lNCZTBj1nBsJ) we used to perform the benchmark.
 You can read more about the ICE methodology in this paper.