Spaces:
Runtime error
Runtime error
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -1,12 +1,81 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
|
7 |
-
|
8 |
-
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Peripheral Blood Cell Classification with Vision Language Models
|
2 |
+
|
3 |
+
## Overview
|
4 |
+
This application uses state-of-the-art Vision Language Models to classify and describe peripheral blood cells. The system combines advanced computer vision with natural language processing to provide detailed analysis of blood cell images, making it a valuable tool for hematological research and diagnosis.
|
5 |
+
|
6 |
+
## Models
|
7 |
+
The application integrates three powerful Vision Language Models:
|
8 |
+
|
9 |
+
1. **Blood Cell Classifier with Llama-3.2**
|
10 |
+
- Based on Llama architecture
|
11 |
+
- Fine-tuned specifically for blood cell classification
|
12 |
+
- Model: laurru01/Llama-3.2-11B-Vision-Instruct-ft-PeripherallBloodCells
|
13 |
+
|
14 |
+
2. **Blood Cell Classifier with Qwen2-VL**
|
15 |
+
- Leverages Qwen2's vision-language capabilities
|
16 |
+
- Optimized for medical image analysis
|
17 |
+
- Model: laurru01/Qwen2-VL-2B-Instruct-ft-bloodcells-big
|
18 |
+
|
19 |
+
3. **Blood Cell Classifier with SmolVLM**
|
20 |
+
- Lightweight yet powerful vision-language model
|
21 |
+
- Efficient processing with maintained accuracy
|
22 |
+
- Model: laurru01/SmolVLM-Instruct-ft-PeripherallBloodCells
|
23 |
+
|
24 |
+
## Features
|
25 |
+
- Multi-model analysis for comparative results
|
26 |
+
- Detailed cell type classification
|
27 |
+
- Comprehensive morphological descriptions
|
28 |
+
- Support for common image formats
|
29 |
+
- Real-time processing and analysis
|
30 |
+
- User-friendly interface
|
31 |
+
|
32 |
+
## Cell Types Detected
|
33 |
+
- Neutrophils
|
34 |
+
- Lymphocytes
|
35 |
+
- Monocytes
|
36 |
+
- Eosinophils
|
37 |
+
- Basophils
|
38 |
+
|
39 |
+
## Technical Details
|
40 |
+
- Built with Gradio for the interface
|
41 |
+
- Powered by PyTorch and Transformers
|
42 |
+
- Optimized for GPU processing
|
43 |
+
- Uses 4-bit quantization for efficient model loading
|
44 |
+
- Implements advanced memory management techniques
|
45 |
+
|
46 |
+
## Usage
|
47 |
+
1. Select a model from the dropdown menu
|
48 |
+
2. Upload an image of a blood cell
|
49 |
+
3. Wait for the analysis to complete
|
50 |
+
4. Review the classification and description
|
51 |
+
|
52 |
+
## Performance Notes
|
53 |
+
- Processing time varies by model
|
54 |
+
- GPU recommended for optimal performance
|
55 |
+
- Image quality affects accuracy
|
56 |
+
|
57 |
+
## Limitations
|
58 |
+
- Processes one cell at a time
|
59 |
+
- Requires clear, focused images
|
60 |
+
- May have varying response times based on server load
|
61 |
+
|
62 |
+
## Future Improvements
|
63 |
+
- Support for batch processing
|
64 |
+
- Additional model integrations
|
65 |
+
- Enhanced visualization options
|
66 |
+
- Performance optimizations
|
67 |
+
|
68 |
+
## Citations
|
69 |
+
If you use this application in your research, please cite:
|
70 |
+
@software{blood_cell_classifier,
|
71 |
+
author = {Laurru},
|
72 |
+
title = {Peripheral Blood Cell Classification with Vision Language Models},
|
73 |
+
year = {2024},
|
74 |
+
publisher = {Hugging Face},
|
75 |
+
url = {https://huggingface.co/spaces/laurru/blood-cell-classifier}
|
76 |
+
}
|
77 |
+
## Contact
|
78 |
+
For questions or suggestions, please open an issue in the repository or contact through Hugging Face.
|
79 |
+
|
80 |
+
## License
|
81 |
+
This project is licensed under the MIT License - see the LICENSE file for details.
|