Backbone |
ResNet-50 with FPN (Feature Pyramid Network) |
Pretrained Weights |
Trained on ImageNet for feature extraction. |
RPN (Region Proposal Network) |
Generates region proposals based on extracted features from the backbone. |
ROI Align |
Aligns region proposals to a fixed size for consistent feature extraction. |
Box Head |
Fully connected layers for refining bounding boxes and classifying objects. |
Box Predictor |
Replaced with a custom predictor: FastRCNNPredictor for handling custom classes. |
Number of Classes |
Configurable (including background). |
Loss Function |
Combines classification and regression losses for multi-task optimization. |
Optimizer |
Stochastic Gradient Descent (SGD) with momentum for optimization. |
Learning Rate Scheduler |
StepLR to decay learning rate every few epochs for better convergence. |
Batch Normalization |
Applied within the backbone for stable training. |
Data Format |
Input: Tensor of shape (Batch Size, Channels, Height, Width) in PyTorch's NCHW format. |
Output |
- Class probabilities for each region proposal. |
|
- Refined bounding box coordinates for each detected object. |