ganler commited on
Commit
c80d7e3
β€’
1 Parent(s): 96fa852

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -7,13 +7,24 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- # EvalPlus: Rigorous Evaluation of LLMs for Code Generation
 
 
 
 
 
 
 
 
 
11
 
12
  * πŸ’» **GitHub Repo**: [evalplus/evalplus](https://github.com/evalplus/evalplus)
13
  * πŸ† **Leader Board**: [evalplus.github.io](https://evalplus.github.io/leaderboard.html)
14
  * πŸ“œ **NeurIPS Paper**: [OpenReview](https://openreview.net/pdf?id=1qvx610Cu7)
15
  * 🐍 **Python Package**: [PyPI](https://pypi.org/project/evalplus/)
16
 
 
 
17
  ```bibtex
18
  @inproceedings{evalplus,
19
  title = {Is Your Code Generated by Chat{GPT} Really Correct? Rigorous Evaluation of Large Language Models for Code Generation},
@@ -22,4 +33,12 @@ pinned: false
22
  year = {2023},
23
  url = {https://openreview.net/forum?id=1qvx610Cu7},
24
  }
 
 
 
 
 
 
 
 
25
  ```
 
7
  pinned: false
8
  ---
9
 
10
+ # EvalPlus: Rigorous Evaluation of LLMs for Code Generation
11
+
12
+ ## About
13
+
14
+ EvalPlus evaluates LLM-generated code on:
15
+
16
+ * Code Correctness: HumanEval+ and MBPP+
17
+ * Code Efficiency: EvalPerf
18
+
19
+ ## Resources
20
 
21
  * πŸ’» **GitHub Repo**: [evalplus/evalplus](https://github.com/evalplus/evalplus)
22
  * πŸ† **Leader Board**: [evalplus.github.io](https://evalplus.github.io/leaderboard.html)
23
  * πŸ“œ **NeurIPS Paper**: [OpenReview](https://openreview.net/pdf?id=1qvx610Cu7)
24
  * 🐍 **Python Package**: [PyPI](https://pypi.org/project/evalplus/)
25
 
26
+ ## Citations
27
+
28
  ```bibtex
29
  @inproceedings{evalplus,
30
  title = {Is Your Code Generated by Chat{GPT} Really Correct? Rigorous Evaluation of Large Language Models for Code Generation},
 
33
  year = {2023},
34
  url = {https://openreview.net/forum?id=1qvx610Cu7},
35
  }
36
+
37
+ @inproceedings{evalperf,
38
+ title = {Evaluating Language Models for Efficient Code Generation},
39
+ author = {Liu, Jiawei and Xie, Songrun and Wang, Junhao and Wei, Yuxiang and Ding, Yifeng and Zhang, Lingming},
40
+ booktitle = {First Conference on Language Modeling},
41
+ year = {2024},
42
+ url = {https://openreview.net/forum?id=IBCBMeAhmC},
43
+ }
44
  ```