Update README.md
README.md
CHANGED
@@ -6,66 +6,41 @@ colorTo: green
|
|
6 |
sdk: docker
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
|
11 |
-
# Random Baseline Model for Climate Disinformation Classification
|
12 |
-
|
13 |
-
## Model Description
|
14 |
-
|
15 |
-
This is a random baseline model for the Frugal AI Challenge 2024, specifically for the text classification task of identifying climate disinformation. The model serves as a performance floor, randomly assigning labels to text inputs without any learning.
|
16 |
-
|
17 |
-
### Intended Use
|
18 |
-
|
19 |
-
- **Primary intended uses**: Baseline comparison for climate disinformation classification models
|
20 |
-
- **Primary intended users**: Researchers and developers participating in the Frugal AI Challenge
|
21 |
-
- **Out-of-scope use cases**: Not intended for production use or real-world classification tasks
|
22 |
-
|
23 |
-
## Training Data
|
24 |
-
|
25 |
-
The model uses the QuotaClimat/frugalaichallenge-text-train dataset:
|
26 |
-
- Size: ~6000 examples
|
27 |
-
- Split: 80% train, 20% test
|
28 |
-
- 8 categories of climate disinformation claims
|
29 |
-
|
30 |
-
### Labels
|
31 |
-
0. No relevant claim detected
|
32 |
-
1. Global warming is not happening
|
33 |
-
2. Not caused by humans
|
34 |
-
3. Not bad or beneficial
|
35 |
-
4. Solutions harmful/unnecessary
|
36 |
-
5. Science is unreliable
|
37 |
-
6. Proponents are biased
|
38 |
-
7. Fossil fuels are needed
|
39 |
-
|
40 |
-
## Performance
|
41 |
-
|
42 |
-
### Metrics
|
43 |
-
- **Accuracy**: ~12.5% (random chance with 8 classes)
|
44 |
-
- **Environmental Impact**:
|
45 |
-
- Emissions tracked in gCO2eq
|
46 |
-
- Energy consumption tracked in Wh
|
47 |
-
|
48 |
-
### Model Architecture
|
49 |
-
The model implements a random choice between the 8 possible labels, serving as the simplest possible baseline.
|
50 |
-
|
51 |
-
## Environmental Impact
|
52 |
-
|
53 |
-
Environmental impact is tracked using CodeCarbon, measuring:
|
54 |
-
- Carbon emissions during inference
|
55 |
-
- Energy consumption during inference
|
56 |
-
|
57 |
-
This tracking helps establish a baseline for the environmental impact of model deployment and inference.
|
58 |
-
|
59 |
-
## Limitations
|
60 |
-
- Makes completely random predictions
|
61 |
-
- No learning or pattern recognition
|
62 |
-
- No consideration of input text
|
63 |
-
- Serves only as a baseline reference
|
64 |
-
- Not suitable for any real-world applications
|
65 |
-
|
66 |
-
## Ethical Considerations
|
67 |
-
|
68 |
-
- Dataset contains sensitive topics related to climate disinformation
|
69 |
-
- Model makes random predictions and should not be used for actual classification
|
70 |
-
- Environmental impact is tracked to promote awareness of AI's carbon footprint
|
71 |
-
|
|
|
6 |
sdk: docker
|
7 |
pinned: false
|
8 |
---
|
9 |
+
# Wildfire Detection Task for the Frugal AI 2025 Challenge
|
10 |
+
## Task Overview
|
11 |
+
As part of the Frugal AI 2025 Challenge, I’m working on the wildfire detection task using the PyroNear/pyro-sdis dataset available on Hugging Face (https://huggingface.co/datasets/pyronear/pyro-sdis). This task aims to develop a model capable of detecting wildfires in images efficiently, contributing to early detection and mitigation of wildfire damage while minimizing environmental costs.
|
12 |
+
|
13 |
+
## Dataset Overview
|
14 |
+
This dataset is specifically designed for wildfire detection, containing labeled images with wildfire-related and non-wildfire-related scenes.
|
15 |
+
- Key Features:
|
16 |
+
Labels: Binary classification — wildfire present or not.
|
17 |
+
Images: Captured under real-world conditions, including diverse environments and challenging scenarios like smoke, clouds, and varying lighting.
|
18 |
+
Size: ~33,000 labeled images, well suited for training and validating computer vision models.
|
19 |
+
- 28,103 images with smoke
|
20 |
+
- 31,975 smoke instances
|
21 |
+
This dataset is formatted to be compatible with the Ultralytics YOLO framework, enabling efficient training of object detection models.
|
22 |
+
Usage: Ideal for fine-tuning state-of-the-art models for wildfire detection tasks.
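Since the dataset follows the Ultralytics YOLO format, each image should come with a text label file in which every line reads `class x_center y_center width height`, with coordinates normalized to the image size. The sketch below shows one way to turn such a label into pixel boxes and into the image-level binary label described above. The names `Box`, `parse_yolo_labels`, and `has_smoke` are hypothetical helpers, not part of the dataset or of Ultralytics, and treating "at least one box" as "wildfire present" is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Box:
    class_id: int
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def parse_yolo_labels(label_text: str, img_w: int, img_h: int) -> list[Box]:
    """Convert YOLO label lines (normalized cx, cy, w, h) into pixel boxes."""
    boxes = []
    for line in label_text.splitlines():
        parts = line.split()
        if len(parts) != 5:
            continue  # skip blank or malformed lines
        class_id = int(parts[0])
        cx, cy, w, h = (float(p) for p in parts[1:])
        boxes.append(
            Box(
                class_id,
                (cx - w / 2) * img_w,
                (cy - h / 2) * img_h,
                (cx + w / 2) * img_w,
                (cy + h / 2) * img_h,
            )
        )
    return boxes


def has_smoke(label_text: str) -> bool:
    """Image-level label (assumption): True if at least one box is annotated."""
    return len(parse_yolo_labels(label_text, 1, 1)) > 0
```

An image whose label file is empty would then count as a negative example, which is how background images are usually represented in this format.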
|
23 |
+
|
24 |
+
## Model Development Plan
|
25 |
+
Model Choice: YOLOv11s
|
26 |
+
Why YOLOv11s?
|
27 |
+
Efficiency: YOLO (You Only Look Once) models are known for their high-speed performance and accuracy, ideal for real-time applications.
|
28 |
+
Versatility: YOLOv11s builds upon prior versions, improving object detection, handling small objects, and performing well under challenging visual conditions.
|
29 |
+
Frugality: Optimized for computational efficiency, aligning with the sustainability goals of the Frugal AI Challenge.
|
30 |
+
|
31 |
+
## Data Preprocessing:
|
32 |
+
|
33 |
+
|
34 |
+
Fine-Tuning YOLOv11:
|
35 |
+
Load a pre-trained YOLOv11s model as a starting point (transfer learning).
|
36 |
+
Replace the output layer to align with the binary classification task (wildfire vs. no wildfire).
|
37 |
+
Train the model using the PyroNear/pyro-sdis dataset.
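The fine-tuning steps above could look roughly like the following sketch, assuming the `ultralytics` package is installed and the dataset has been exported locally in YOLO layout. The directory names `images/train` and `images/val`, the single class name `smoke`, and the default hyperparameters are assumptions; `build_data_yaml` and `fine_tune` are hypothetical helper names. As far as I understand, Ultralytics resizes the detection head to the number of classes in the dataset config, so the output layer does not need to be replaced by hand.

```python
from pathlib import Path


def build_data_yaml(dataset_root: str) -> str:
    """Return an Ultralytics-style dataset config with one 'smoke' class."""
    return (
        f"path: {dataset_root}\n"
        "train: images/train\n"  # assumed layout, relative to `path`
        "val: images/val\n"      # assumed layout, relative to `path`
        "names:\n"
        "  0: smoke\n"
    )


def fine_tune(dataset_root: str, epochs: int = 50, imgsz: int = 640):
    """Fine-tune pretrained YOLO11s weights on the dataset (needs `ultralytics`)."""
    # Imported lazily so build_data_yaml stays usable without the package.
    from ultralytics import YOLO

    config_path = Path(dataset_root) / "data.yaml"
    config_path.write_text(build_data_yaml(dataset_root))

    model = YOLO("yolo11s.pt")  # pretrained checkpoint as the starting point
    return model.train(data=str(config_path), epochs=epochs, imgsz=imgsz)
```

Calling `fine_tune("/path/to/pyro-sdis")` would write the config next to the data and start training; the path is a placeholder.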
|
38 |
+
|
39 |
+
## Evaluation:
|
40 |
+
|
41 |
+
Metrics:
|
42 |
+
Precision and recall to assess detection accuracy.
|
43 |
+
Inference time to evaluate real-time feasibility.
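As a rough illustration of these metrics, the sketch below computes precision and recall for image-level binary predictions and measures mean inference time per input. It is a minimal sketch, not the challenge's official evaluation code; `precision_recall` and `mean_inference_time` are hypothetical helper names, and labels are assumed to be 1 for "wildfire present" and 0 otherwise.

```python
import time
from typing import Callable, Sequence


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> tuple[float, float]:
    """Precision and recall for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def mean_inference_time(predict: Callable, inputs: Sequence) -> float:
    """Average wall-clock seconds per input for a prediction function."""
    if not inputs:
        raise ValueError("inputs must not be empty")
    start = time.perf_counter()
    for x in inputs:
        predict(x)
    return (time.perf_counter() - start) / len(inputs)
```

For an early-warning system, recall is arguably the more critical of the two, since a missed fire costs more than a false alarm, while precision keeps the alert volume manageable.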
|
44 |
+
|
45 |
+
CodeCarbon will track the model's carbon footprint and energy consumption during inference. This information will help ensure the model aligns with the sustainability objectives of the Frugal AI Challenge.
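A minimal sketch of how this tracking might be wired up, assuming the `codecarbon` package is installed. `track_inference` and `to_challenge_units` are hypothetical helper names; reporting in gCO2eq and Wh is an assumption about the desired units, and reading energy from `final_emissions_data` reflects my understanding of the CodeCarbon tracker rather than a documented guarantee.

```python
def to_challenge_units(emissions_kg: float, energy_kwh: float) -> dict:
    """Convert CodeCarbon's kg CO2eq and kWh into gCO2eq and Wh."""
    return {
        "emissions_gco2eq": emissions_kg * 1000,
        "energy_consumed_wh": energy_kwh * 1000,
    }


def track_inference(run_inference):
    """Run `run_inference()` under a CodeCarbon tracker and report its footprint."""
    # Imported lazily so the conversion helper works without the package.
    from codecarbon import EmissionsTracker

    tracker = EmissionsTracker(save_to_file=False)
    tracker.start()
    try:
        result = run_inference()
    finally:
        emissions_kg = tracker.stop()  # kg CO2eq; may be None if tracking failed

    # Assumption: the tracker exposes the final measurement after stop().
    energy_kwh = tracker.final_emissions_data.energy_consumed
    return result, to_challenge_units(emissions_kg or 0.0, energy_kwh)
```

Wrapping only the inference call keeps model loading and data download out of the measurement, so runs stay comparable across submissions.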
|
46 |