Commit
·
62bc2ef
1
Parent(s):
bd8ed50
Update README.md
README.md
CHANGED
@@ -7,45 +7,32 @@ sdk: static
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
The End-to-end Speech Challenge (ESC) is a benchmark for assessing a single ASR system on a collection of eight different speech recognition datasets. The ESC datasets are sourced from different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). The only constraint on systems is that they must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing, assuming the data distributions are a-priori unknown. The goal of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
|
11 |
-
|
12 |
-
ESC consists of:
|
13 |
-
|
14 |
-
- A [Hugging Face dataset](https://huggingface.co/datasets/esc-bench/esc-datasets) to download and use pre-prepared ESC audio-text data
|
15 |
-
|
16 |
-
- [Checkpoints and scripts](https://huggingface.co/models?other=esc) to reproduce runs for the five official baseline systems
|
17 |
-
|
18 |
-
- An [ESC leaderboard](https://huggingface.co/spaces/esc-bench/ESC) for ranking systems according to overall performance on the benchmark
|
19 |
-
|
20 |
-
ESC was proposed in _ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition_ by ... For more information, see the official submission on [OpenReview.net](https://openreview.net/forum?id=9OL2fIfDLK).
|
21 |
-
|
22 |
<div class="grid lg:grid-cols-3 gap-x-4 gap-y-7">
|
23 |
<p class="lg:col-span-3">
|
24 |
-
The End-to-end Speech Challenge (ESC) is a benchmark for assessing
|
25 |
-
The ESC datasets are sourced from different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). The only constraint on systems is that they must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing, assuming the data distributions are a-priori unknown. The goal of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
|
26 |
</p>
|
27 |
-
<a href="https://
|
28 |
<div
|
29 |
class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
|
30 |
>
|
31 |
<img alt="" src="/front/assets/promo/spacy_logo.png" class="w-40" />
|
32 |
</div>
|
33 |
-
<div class="underline">
|
34 |
</a>
|
35 |
<a
|
36 |
-
href="https://
|
37 |
class="block overflow-hidden"
|
38 |
>
|
39 |
<div class="flex items-center h-40 bg-[#ECFAFF] rounded-lg px-4 mb-2">
|
40 |
<pre
|
41 |
class="break-words leading-1 whitespace-pre-line text-xs text-gray-800">
|
42 |
-
|
43 |
</pre>
|
44 |
</div>
|
45 |
-
<div class="underline">
|
46 |
</a>
|
47 |
<a
|
48 |
-
href="https://huggingface.co/
|
49 |
class="block overflow-hidden group"
|
50 |
>
|
51 |
<div
|
@@ -57,33 +44,9 @@ ESC was proposed in _ESC: A Benchmark For Multi-Domain End-to-End Speech Recogni
|
|
57 |
class="w-full h-40 object-cover overflow-hidden"
|
58 |
/>
|
59 |
</div>
|
60 |
-
<div class="underline">
|
61 |
</a>
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
href="https://github.com/explosion/spacy-huggingface-hub"
|
66 |
-
>spacy-huggingface-hub</a
|
67 |
-
>
|
68 |
-
library.
|
69 |
-
</p>
|
70 |
-
<div
|
71 |
-
class="p-4 bg-gradient-to-b from-gray-50-to-white border border-gray-100 rounded-lg relative mb-4"
|
72 |
-
>
|
73 |
-
<pre
|
74 |
-
class="break-words leading-1 whitespace-pre-line text-xs md:text-sm text-gray-800">
|
75 |
-
pip install spacy-huggingface-hub
|
76 |
-
huggingface-cli login
|
77 |
-
python -m spacy package ./en_ner_fashion ./output --build wheel
|
78 |
-
cd ./output/en_ner_fashion-0.0.0/dist
|
79 |
-
python -m spacy huggingface-hub push en_ner_fashion-0.0.0-py3-none-any.whl
|
80 |
-
</pre>
|
81 |
-
</div>
|
82 |
-
<p>
|
83 |
-
More information: <a
|
84 |
-
href="https://spacy.io/usage/v3-1#huggingface-hub"
|
85 |
-
class="underline">spaCy 3.1 release post</a
|
86 |
-
>
|
87 |
-
</p>
|
88 |
-
</div>
|
89 |
</div>
|
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
<div class="grid lg:grid-cols-3 gap-x-4 gap-y-7">
|
11 |
<p class="lg:col-span-3">
|
12 |
+
The End-to-end Speech Challenge (ESC) is a benchmark for assessing ASR systems on a collection of eight different speech recognition datasets. The ESC datasets are sourced from 11 different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). There is no restriction on architecture or training data. The only constraint is that systems must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing. The objective of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
|
|
|
13 |
</p>
|
14 |
+
<a href="https://huggingface.co/datasets/esc-bench/esc-datasets" class="block overflow-hidden group">
|
15 |
<div
|
16 |
class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
|
17 |
>
|
18 |
<img alt="" src="/front/assets/promo/spacy_logo.png" class="w-40" />
|
19 |
</div>
|
20 |
+
<div class="underline">ESC Datasets</div>
|
21 |
</a>
|
22 |
<a
|
23 |
+
href="https://huggingface.co/models?other=esc"
|
24 |
class="block overflow-hidden"
|
25 |
>
|
26 |
<div class="flex items-center h-40 bg-[#ECFAFF] rounded-lg px-4 mb-2">
|
27 |
<pre
|
28 |
class="break-words leading-1 whitespace-pre-line text-xs text-gray-800">
|
29 |
+
some stuff
|
30 |
</pre>
|
31 |
</div>
|
32 |
+
<div class="underline">Official Checkpoints</div>
|
33 |
</a>
|
34 |
<a
|
35 |
+
href="https://huggingface.co/spaces/esc-bench/ESC"
|
36 |
class="block overflow-hidden group"
|
37 |
>
|
38 |
<div
|
|
|
44 |
class="w-full h-40 object-cover overflow-hidden"
|
45 |
/>
|
46 |
</div>
|
47 |
+
<div class="underline">ESC Leaderboard</div>
|
48 |
</a>
|
49 |
+
<p class="lg:col-span-3">
|
50 |
+
ESC was proposed in ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition by ... For more information, see the official submission on <a href="https://openreview.net/forum?id=9OL2fIfDLK" class="underline">OpenReview.net</a> or the blog post at <a href="https://openreview.net/forum?id=9OL2fIfDLK" class="underline">ESC Benchmark (TODO)</a>.
|
51 |
+
</p>
|
52 |
</div>
|