yangwang825 commited on
Commit
0623bf4
·
verified ·
1 Parent(s): 1cdf151

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -1
README.md CHANGED
@@ -7,4 +7,53 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  pinned: false
8
  ---
9
 
10
+ # Welcome to ConFit on Huggingface Hub
11
+
12
+ ## About Us
13
+
14
+ ConFit is a pioneering organisation dedicated to advancing the fields of speech and language processing, audio and sound processing, and natural language processing (NLP). Our team is committed to developing state-of-the-art technologies and tools that empower researchers and developers in the audio and language domains. We provide a rich collection of audio datasets specifically designed for various machine learning applications. These datasets are perfect for training models on tasks such as audio embedding, speech recognition, and more. Our datasets are compatible with popular frameworks and can be seamlessly integrated into your projects.
15
+
16
+ ## Datasets
17
+
18
+ Audio classification:
19
+
20
+ | Dataset | Classes | Task | Samples | Duration |
21
+ | :---: | :---: | :---: | :---: | :---: |
22
+ | WMMS | 32 | Multi-class | | |
23
+ | MSWC (English) | 271 | Multi-class | | |
24
+ | MSWC (Spanish) | 146 | Multi-class | | |
25
+ | MSWC (Indian) | 14 | Multi-class | | |
26
+ | ESC50 | 50 | Multi-class | | |
27
+ | AudioSet | 527 | Multi-label | | |
28
+ | Pianos | 8 | Multi-class | | |
29
+ | FSD-Kaggle-2019 | 80 | Multi-label | | |
30
+ | GTZAN | 10 | Multi-class | | |
31
+ | Nsynth (instrument) | 11 | Multi-class | Multi-class | |
32
+ | Nsynth (pitch) | 112 | Multi-class | | |
33
+ | CREMA-D | 6 | Multi-class | | |
34
+ | IEMOCAP | 4 | Multi-class | | |
35
+ | EmoDB | 7 | Multi-class | | |
36
+ | EMOVO | 7 | Multi-class | | |
37
+ | IRMAS | 11 | Multi-label | | |
38
+ | RAVDESS | 8 | Multi-class | | |
39
+ | TIMIT | 630 | Multi-class | | |
40
+ | LibriSpeech | 2484 | Multi-class | | |
41
+
42
+ Automated audio captioning:
43
+
44
+ | Dataset | Samples | Duration |
45
+ | :---: | :---: | :---: |
46
+ | Music4All | | |
47
+
48
+ Music, speech, and noise:
49
+
50
+ | Dataset | Samples | Duration |
51
+ | :---: | :---: | :---: |
52
+ | MUSAN | | |
53
+ | RIR-Noise | | |
54
+ | ARCA23K | | |
55
+
56
+ ## Contact Us
57
+
58
+ If you have any questions or would like more information about our projects, please feel free to reach out to us.
59
+