import streamlit as st # Custom CSS for better styling st.markdown(""" """, unsafe_allow_html=True) # Main Title st.markdown('
Swin Transformer is a cutting-edge image classification model introduced in the paper "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" by Liu et al. The model image_classifier_swin_base_patch4_window7_224 is a Swin model, adapted from Hugging Face and curated for scalability and production-readiness using Spark NLP.
Image Classification is a computer vision task where an algorithm is trained to recognize and classify objects within images. This process involves assigning a label or category to an image based on its visual content.
Image classification typically involves the following steps:
Image classification can automate and streamline many tasks, such as:
Applications of image classification span across various industries:
Image classification is crucial because it enables machines to interpret visual data, which is essential for creating intelligent systems capable of understanding and interacting with the world in a more human-like manner.
The Swin Transformer model used in this example is a state-of-the-art approach for image classification, offering advanced performance and scalability. It utilizes a hierarchical transformer architecture to capture intricate patterns and relationships within images, enhancing classification accuracy and efficiency.
Image Name | Result |
---|---|
dog.JPEG | [whippet] |
cat.JPEG | [Siamese] |
bird.JPEG | [peacock] |
Attribute | Description |
---|---|
Model Name | image_classifier_swin_base_patch4_window7_224 |
Compatibility | Spark NLP 4.3.0+ |
License | Open Source |
Edition | Official |
Input Labels | [image_assembler] |
Output Labels | [class] |
Language | en |
Size | 108.0 MB |