sanchit-gandhi commited on
Commit
62bc2ef
·
1 Parent(s): bd8ed50

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -48
README.md CHANGED
@@ -7,45 +7,32 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- The End-to-end Speech Challenge (ESC) is a benchmark for assessing a single ASR system on a collection of eight different speech recognition datasets. The ESC datasets are sourced from different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). The only constraint on systems is that they must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing, assuming the data distributions are a-priori unknown. The goal of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
11
-
12
- ESC consists of:
13
-
14
- - A [Hugging Face dataset](https://huggingface.co/datasets/esc-bench/esc-datasets) to download and use pre-prepared ESC audio-text data
15
-
16
- - [Checkpoints and scripts](https://huggingface.co/models?other=esc) to reproduce runs for the five official baseline systems
17
-
18
- - An [ESC leaderboard](https://huggingface.co/spaces/esc-bench/ESC) for ranking systems according to overall performance on the benchmark
19
-
20
- ESC was proposed in _ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition_ by ... For more information, see the official submission on [OpenReview.net](https://openreview.net/forum?id=9OL2fIfDLK).
21
-
22
  <div class="grid lg:grid-cols-3 gap-x-4 gap-y-7">
23
  <p class="lg:col-span-3">
24
- The End-to-end Speech Challenge (ESC) is a benchmark for assessing a single ASR system on a collection of eight different speech recognition datasets.<br />
25
- The ESC datasets are sourced from different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). The only constraint on systems is that they must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing, assuming the data distributions are a-priori unknown. The goal of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
26
  </p>
27
- <a href="https://spacy.io/" class="block overflow-hidden group">
28
  <div
29
  class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
30
  >
31
  <img alt="" src="/front/assets/promo/spacy_logo.png" class="w-40" />
32
  </div>
33
- <div class="underline">spaCy website</div>
34
  </a>
35
  <a
36
- href="https://github.com/explosion/spacy-huggingface-hub"
37
  class="block overflow-hidden"
38
  >
39
  <div class="flex items-center h-40 bg-[#ECFAFF] rounded-lg px-4 mb-2">
40
  <pre
41
  class="break-words leading-1 whitespace-pre-line text-xs text-gray-800">
42
- python -m spacy huggingface-hub push [whl_path] [--org] [--msg] [--local-repo] [--verbose]
43
  </pre>
44
  </div>
45
- <div class="underline">Upload packages on the hub</div>
46
  </a>
47
  <a
48
- href="https://huggingface.co/models?filter=spacy"
49
  class="block overflow-hidden group"
50
  >
51
  <div
@@ -57,33 +44,9 @@ ESC was proposed in _ESC: A Benchmark For Multi-Domain End-to-End Speech Recogni
57
  class="w-full h-40 object-cover overflow-hidden"
58
  />
59
  </div>
60
- <div class="underline">Find all spaCy models in the Hub</div>
61
  </a>
62
- <div class="lg:col-span-3">
63
- <p class="mb-4">
64
- To upload your spaCy pipelines to the Hub, you can use the <a
65
- href="https://github.com/explosion/spacy-huggingface-hub"
66
- >spacy-huggingface-hub</a
67
- >
68
- library.
69
- </p>
70
- <div
71
- class="p-4 bg-gradient-to-b from-gray-50-to-white border border-gray-100 rounded-lg relative mb-4"
72
- >
73
- <pre
74
- class="break-words leading-1 whitespace-pre-line text-xs md:text-sm text-gray-800">
75
- pip install spacy-huggingface-hub
76
- huggingface-cli login
77
- python -m spacy package ./en_ner_fashion ./output --build wheel
78
- cd ./output/en_ner_fashion-0.0.0/dist
79
- python -m spacy huggingface-hub push en_ner_fashion-0.0.0-py3-none-any.whl
80
- </pre>
81
- </div>
82
- <p>
83
- More information: <a
84
- href="https://spacy.io/usage/v3-1#huggingface-hub"
85
- class="underline">spaCy 3.1 release post</a
86
- >
87
- </p>
88
- </div>
89
  </div>
 
7
  pinned: false
8
  ---
9
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  <div class="grid lg:grid-cols-3 gap-x-4 gap-y-7">
11
  <p class="lg:col-span-3">
12
+ The End-to-end Speech Challenge (ESC) is a benchmark for assessing ASR systems on a collection of eight different speech recognition datasets. The ESC datasets are sourced from 11 different domains and cover a range of audio and text distributions (speaking styles, background noise, transcription requirements). There is no restriction on architecture or training data. The only constraint is that systems must use the same training and evaluation algorithms across datasets and may not use any dataset-specific pre- or post-processing. The objective of ESC is to promote generalisable ASR systems that can be applied in a multi-domain setting.
 
13
  </p>
14
+ <a href="https://huggingface.co/datasets/esc-bench/esc-datasets" class="block overflow-hidden group">
15
  <div
16
  class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
17
  >
18
  <img alt="" src="/front/assets/promo/spacy_logo.png" class="w-40" />
19
  </div>
20
+ <div class="underline">ESC Datasets</div>
21
  </a>
22
  <a
23
+ href="https://huggingface.co/models?other=esc"
24
  class="block overflow-hidden"
25
  >
26
  <div class="flex items-center h-40 bg-[#ECFAFF] rounded-lg px-4 mb-2">
27
  <pre
28
  class="break-words leading-1 whitespace-pre-line text-xs text-gray-800">
29
+ some stuff
30
  </pre>
31
  </div>
32
+ <div class="underline">Official Checkpoints</div>
33
  </a>
34
  <a
35
+ href="https://huggingface.co/spaces/esc-bench/ESC"
36
  class="block overflow-hidden group"
37
  >
38
  <div
 
44
  class="w-full h-40 object-cover overflow-hidden"
45
  />
46
  </div>
47
+ <div class="underline">ESC Leaderboard</div>
48
  </a>
49
+ <p class="lg:col-span-3">
50
+ ESC was proposed in ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition by ... For more information, see the official submission on <a href="https://openreview.net/forum?id=9OL2fIfDLK" class="underline">OpenReview.net</a> or the blog post at <a href="https://openreview.net/forum?id=9OL2fIfDLK" class="underline">ESC Benchmark (TODO)</a>.
51
+ </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
52
  </div>