wcfr commited on
Commit
a8b229a
1 Parent(s): 94cebf8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -1
README.md CHANGED
@@ -1,3 +1,35 @@
1
  ---
2
- license: cc-by-nc-4.0
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
+ language:
4
+ - yue
5
+ library_name: transformers
6
  ---
7
+
8
+ # Cantonese Wav2Vec2-Conformer-Base with Relative Position Embeddings
9
+
10
+ wav2vec 2.0 Conformer with relative position embeddings, pretrained on
11
+ 2.8K hours of Cantonese spontaneous speech data sampled at 16kHz.
12
+
13
+ Note: This model has not been fine-tuned on labeled text data.
14
+
15
+
16
+ ## Alternative Version
17
+
18
+ An alternative version of the model which was pre-trained on the same dataset but
19
+ sets `layer_norm_first` to `false` is available [here](https://drive.google.com/file/d/1rbP-6pZfR5ieqAwd5_X2KzipLuKpXSsQ/view?usp=sharing)
20
+ as a fairseq checkpoint and may give better downstream results.
21
+
22
+
23
+ ## Citation
24
+
25
+ Please cite the following paper if you use the model.
26
+
27
+ ```
28
+ @inproceedings{rcfhuang23_interspeech,
29
+ author = {Ranzo C. F. Huang and Brian Mak},
30
+ year = {2023},
31
+ title = {{wav2vec 2.0 ASR} for {Cantonese}-Speaking Older Adults in a Clinical Setting},
32
+ booktitle = {Interspeech 2023},
33
+ pubstate = {forthcoming},
34
+ }
35
+ ```