Spaces:
Runtime error
Runtime error
DarrenChensformer
commited on
Commit
β’
07926c5
1
Parent(s):
ca4743e
Add metric description
Browse files
README.md
CHANGED
@@ -14,16 +14,11 @@ pinned: false
|
|
14 |
|
15 |
# Metric Card for action_generation
|
16 |
|
17 |
-
***Module Card Instructions:*** *Fill out the following subsections. Feel free to take a look at existing metric cards if you'd like examples.*
|
18 |
-
|
19 |
## Metric Description
|
20 |
-
|
|
|
21 |
|
22 |
## How to Use
|
23 |
-
*Give general statement of how to use the metric*
|
24 |
-
|
25 |
-
*Provide simplest possible example for using the metric*
|
26 |
-
|
27 |
```python
|
28 |
import evaluate
|
29 |
valid_labels = [
|
@@ -64,8 +59,6 @@ print(result)
|
|
64 |
|
65 |
*State the range of possible values that the metric's output can take, as well as what in that range is considered good. For example: "This metric can take on any value between 0 and 100, inclusive. Higher scores are better."*
|
66 |
|
67 |
-
#### Values from Popular Papers
|
68 |
-
*Give examples, preferrably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.*
|
69 |
|
70 |
### Examples
|
71 |
*Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
|
|
|
14 |
|
15 |
# Metric Card for action_generation
|
16 |
|
|
|
|
|
17 |
## Metric Description
|
18 |
+
Evaluate the result of action generation task.
|
19 |
+
Consider the output format `/class/phrase`. Compute the scores for both `/class` and `phrase` separately, and then perform a weighted sum of these scores.
|
20 |
|
21 |
## How to Use
|
|
|
|
|
|
|
|
|
22 |
```python
|
23 |
import evaluate
|
24 |
valid_labels = [
|
|
|
59 |
|
60 |
*State the range of possible values that the metric's output can take, as well as what in that range is considered good. For example: "This metric can take on any value between 0 and 100, inclusive. Higher scores are better."*
|
61 |
|
|
|
|
|
62 |
|
63 |
### Examples
|
64 |
*Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
|