Context Encoding for Semantic Segmentation
Introduction
@InProceedings{Zhang_2018_CVPR,
  author    = {Zhang, Hang and Dana, Kristin and Shi, Jianping and Zhang, Zhongyue and Wang, Xiaogang and Tyagi, Ambrish and Agrawal, Amit},
  title     = {Context Encoding for Semantic Segmentation},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month     = {June},
  year      = {2018}
}
Results and models
Cityscapes
| Method | Backbone | Crop Size | Lr schd (iters) | Mem (GB) | Inf speed (fps) | mIoU | mIoU(ms+flip) | download |
| ------ | -------- | --------- | --------------: | -------: | --------------: | ----: | ------------: | ----------- |
| encnet | R-50-D8  | 512x1024  | 40000 | 8.6  | 4.58 | 75.67 | 77.08 | model / log |
| encnet | R-101-D8 | 512x1024  | 40000 | 12.1 | 2.66 | 75.81 | 77.21 | model / log |
| encnet | R-50-D8  | 769x769   | 40000 | 9.8  | 1.82 | 76.24 | 77.85 | model / log |
| encnet | R-101-D8 | 769x769   | 40000 | 13.7 | 1.26 | 74.25 | 76.25 | model / log |
| encnet | R-50-D8  | 512x1024  | 80000 | -    | -    | 77.94 | 79.13 | model / log |
| encnet | R-101-D8 | 512x1024  | 80000 | -    | -    | 78.55 | 79.47 | model / log |
| encnet | R-50-D8  | 769x769   | 80000 | -    | -    | 77.44 | 78.72 | model / log |
| encnet | R-101-D8 | 769x769   | 80000 | -    | -    | 76.10 | 76.97 | model / log |
ADE20K
| Method | Backbone | Crop Size | Lr schd (iters) | Mem (GB) | Inf speed (fps) | mIoU | mIoU(ms+flip) | download |
| ------ | -------- | --------- | --------------: | -------: | --------------: | ----: | ------------: | ----------- |
| encnet | R-50-D8  | 512x512   | 80000  | 10.1 | 22.81 | 39.53 | 41.17 | model / log |
| encnet | R-101-D8 | 512x512   | 80000  | 13.6 | 14.87 | 42.11 | 43.61 | model / log |
| encnet | R-50-D8  | 512x512   | 160000 | -    | -     | 40.10 | 41.71 | model / log |
| encnet | R-101-D8 | 512x512   | 160000 | -    | -     | 42.61 | 44.01 | model / log |
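
The mIoU columns above report mean intersection-over-union as a percentage. For readers new to the metric, the following is a minimal sketch of how it can be computed from flattened label maps. It is not the evaluation code that produced the numbers in these tables: the function name `mean_iou`, the toy labels, and the `ignore_index=255` default are assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Return (mIoU in percent, per-class IoU list) for two flat label sequences.

    Hypothetical helper for illustration; pixels whose target equals
    ignore_index (assumed to be 255 here) are skipped entirely.
    """
    intersect = [0] * num_classes    # pixels where prediction and target agree
    pred_area = [0] * num_classes    # pixels predicted as each class
    target_area = [0] * num_classes  # pixels labelled as each class

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        pred_area[p] += 1
        target_area[t] += 1
        if p == t:
            intersect[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_area[c] + target_area[c] - intersect[c]
        # A class absent from both prediction and target has no defined IoU.
        ious.append(intersect[c] / union if union else None)

    valid = [iou for iou in ious if iou is not None]
    miou = 100.0 * sum(valid) / len(valid) if valid else float("nan")
    return miou, ious


# Toy example: six pixels, three classes, last pixel ignored.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 255]
miou, per_class = mean_iou(pred, target, num_classes=3)
print(round(miou, 2))  # 72.22
```

In the toy example the per-class IoUs are 1/2, 2/3 and 1, so their mean is about 72.22%. The mIoU(ms+flip) column uses the same metric, applied to predictions obtained with multi-scale and flipped test-time augmentation.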