IlayMalinyak commited on
Commit
5bf830b
·
1 Parent(s): e34f3c2
Files changed (1) hide show
  1. README.md +15 -28
README.md CHANGED
@@ -8,46 +8,39 @@ pinned: false
8
  ---
9
 
10
 
11
- # Random Baseline Model for Climate Disinformation Classification
12
 
13
  ## Model Description
14
 
15
- This is a random baseline model for the Frugal AI Challenge 2024, specifically for the text classification task of identifying climate disinformation. The model serves as a performance floor, randomly assigning labels to text inputs without any learning.
16
-
17
  ### Intended Use
18
 
19
- - **Primary intended uses**: Baseline comparison for climate disinformation classification models
20
- - **Primary intended users**: Researchers and developers participating in the Frugal AI Challenge
21
- - **Out-of-scope use cases**: Not intended for production use or real-world classification tasks
22
 
23
  ## Training Data
24
 
25
- The model uses the QuotaClimat/frugalaichallenge-text-train dataset:
26
- - Size: ~6000 examples
27
- - Split: 80% train, 20% test
28
- - 8 categories of climate disinformation claims
29
 
30
  ### Labels
31
- 0. No relevant claim detected
32
- 1. Global warming is not happening
33
- 2. Not caused by humans
34
- 3. Not bad or beneficial
35
- 4. Solutions harmful/unnecessary
36
- 5. Science is unreliable
37
- 6. Proponents are biased
38
- 7. Fossil fuels are needed
39
 
40
  ## Performance
 
41
 
42
  ### Metrics
43
- - **Accuracy**: ~12.5% (random chance with 8 classes)
44
  - **Environmental Impact**:
45
  - Emissions tracked in gCO2eq
46
  - Energy consumption tracked in Wh
47
 
48
  ### Model Architecture
49
- The model implements a random choice between the 8 possible labels, serving as the simplest possible baseline.
50
-
 
51
  ## Environmental Impact
52
 
53
  Environmental impact is tracked using CodeCarbon, measuring:
@@ -57,15 +50,9 @@ Environmental impact is tracked using CodeCarbon, measuring:
57
  This tracking helps establish a baseline for the environmental impact of model deployment and inference.
58
 
59
  ## Limitations
60
- - Makes completely random predictions
61
- - No learning or pattern recognition
62
- - No consideration of input text
63
- - Serves only as a baseline reference
64
- - Not suitable for any real-world applications
65
 
66
  ## Ethical Considerations
67
 
68
- - Dataset contains sensitive topics related to climate disinformation
69
- - Model makes random predictions and should not be used for actual classification
70
  - Environmental impact is tracked to promote awareness of AI's carbon footprint
71
  ```
 
8
  ---
9
 
10
 
11
+ # Conformer model
12
 
13
  ## Model Description
14
 
15
+ This is a CNN followed by Conformer encoder
 
16
  ### Intended Use
17
 
18
+ - baseline for audio predictions
 
 
19
 
20
  ## Training Data
21
 
22
+ The model uses the rfcx audio dataset:
23
+ - Size: ~35000 examples
24
+ - Split: 80% train, 20% validation
25
+ - Binary classification
26
 
27
  ### Labels
28
+ 0. Chain Saw in audio
29
+ 1. no Chain Saw in audio
 
 
 
 
 
 
30
 
31
  ## Performance
32
+ 90% accuracy on validation
33
 
34
  ### Metrics
35
+ - **Accuracy**: 90% on validation
36
  - **Environmental Impact**:
37
  - Emissions tracked in gCO2eq
38
  - Energy consumption tracked in Wh
39
 
40
  ### Model Architecture
41
+ CNN and Conformer. Conformer is a mixture between
42
+ transformer (MHSA with RoPE
43
+ positional encoding), and CNN blocks.
44
  ## Environmental Impact
45
 
46
  Environmental impact is tracked using CodeCarbon, measuring:
 
50
  This tracking helps establish a baseline for the environmental impact of model deployment and inference.
51
 
52
  ## Limitations
53
+ - simple
 
 
 
 
54
 
55
  ## Ethical Considerations
56
 
 
 
57
  - Environmental impact is tracked to promote awareness of AI's carbon footprint
58
  ```