Spaces:
Runtime error
Training OneFormer with Custom Datasets
OneFormer advocates the usage of panoptic annotations along with its task-conditioned joint training strategy. However, if panoptic annotations are not available, then also OneFormer can be trained using only the instance or semantic annotations on custom datasets. We provide some guidelines for training with custom datasets.
Register your New Dataset
OneFormer uses the information (class names, thing classes, etc.) stored in a dataset's metadata while preparing a dataset dictionary using a
dataset_mapper
.Use Custom Datasets gives a deeper dive into registering a new custom dataset.
Training with Available Panoptic Annotations
To prepare the dataset dictionary for each iteration during training, OneFormer uses a
dataset_mapper
class.Originally, we provide two
dataset_mapper
classes which support task-conditioned joint training using the panoptic annotations:COCOUnifiedNewBaselineDatasetMapper
: Specifically designed for COCO annotation format.OneFormerUnifiedDatasetMapper
: General annotation format.
If you have panoptic annotations for your custom dataset, you may use these dataset_mapper classes directly after registering your dataset. You may also tune the task sampling probabilities in the corresponding config file.
If you want to train using only the instance or semantic annotation, please follow the next section on preparing a custom dataset mapper class.
Write a Custom Dataset Mapper Class
If you want to train using only instance or semantic annotations, write your custom dataset mapper class and add it to the
build_train_loader
method.We provide a few templates for custom dataset mappers:
InstanceCOCOCustomNewBaselineDatasetMapper
: Specifically designed for COCO instance annotation format.InstanceOneFormerCustomDatasetMapper
: General instance annotation format.SemanticOneFormerCustomDatasetMapper
: General semantic annotation format.
Remember to register your custom dataset before training.
Now you are all set to train OneFormer using your custom dataset!