MtSCCD:面向深度学习的土地利用场景分类与变化检测数据集
MtSCCD: Land-use scene classification and change-detection dataset for deep learning
- 2024年28卷第2期 页码:321-333
纸质出版日期: 2024-02-07
DOI: 10.11834/jrs.20243210
移动端阅览
浏览全部资源
扫码关注微信
纸质出版日期: 2024-02-07 ,
移动端阅览
周维勋,刘京雷,彭代锋,管海燕,邵振峰.2024.MtSCCD:面向深度学习的土地利用场景分类与变化检测数据集.遥感学报,28(2): 321-333
Zhou W X,Liu J L,Peng D F, Guan H Y and Shao Z F. 2024. MtSCCD: Land-use scene classification and change-detection dataset for deep learning. National Remote Sensing Bulletin, 28(2):321-333
利用遥感影像识别土地利用类型及监测其变化情况在城市规划和土地利用优化等领域发挥着重要作用。当前,相关数据集存在样本量少、类别划分不合理、数据不开源等局限,难以满足样本驱动的深度学习遥感信息提取范式的需求。本文构建了一个面向深度学习的大规模场景分类与变化检测数据集MtSCCD(Multi-temporal Scene Classification and Change Detection)。该数据集包括MtSCCD_LUSC(MtSCCD Land Use Scene Classification)和MtSCCD_LUCD(MtSCCD Land Use Change Detection)两个子数据集,分别用于土地利用场景分类与变化检测任务。该数据集具有以下特点:(1)MtSCCD是目前规模最大的公开的土地利用类型识别与检测数据集,包含10种土地利用类型共65548幅图像,并且样本覆盖中国5个城市的中心区域;(2)由于MtSCCD数据集根据城市划分训练集、验证集以及测试集,对于新增的城市土地利用数据,可以根据需求划分为训练集与验证集或测试集,因此可扩展性较高;(3)MtSCCD数据集中测试集与训练集的样本来自不同的城市,因此符合实际业务需求,且能够验证模型的泛化性能。基于MtSCCD_LUSC和MtSCCD_LUCD两个子数据集,本文评估了多个深度学习网络的分类与变化检测效果,为后续的相关研究提供了参考。
Land-Use Scene Classification and change Detection (LUSCD) aim to recognize land-use types and monitor their changes by using Remote-Sensing (RS) images
which play an important role in urban planning and land-use optimization. In the era of RS big data
conventional hand-crafted feature-based methods are infeasible for LUSCD because the extracted features are not sufficiently discriminative for RS images with high complexity. As a novel data-driven paradigm for information extraction from RS images
deep learning provides a new solution for LUSCD. However
the existing publicly available datasets have limited samples and is thus unable to train a successful deep-learning model. Therefore
it has great significance in constructing an open and large-scale LUSCD benchmark.
To advance the progress of LUSCD using deep-learning methods
this paper releases a large-scale scene classification and change-detection dataset termed Multi-temporal Scene Classification and Change Detection (MtSCCD). The RGB images in MtSCCD are cropped from large-size high-resolution RS images captured from the central areas of five China cities
namely
Hangzhou
Shanghai
Wuhan
Nanjing
and Hefei. The size of the cropped images is 300×300 pixels with the spatial resolution of around 1 m. MtSCCD has 10 land use classes
which are residential land
public service and commercial land
educational land
industrial land
transportation land
agricultural land
water body
green space
woodland
and woodland. Based on the cropped land-use images in MtSCCD
this paper constructs two sub-datasets termed MtSCCD_LUSC (MtSCCD Land Use Scene Classification) and MtSCCD_LUCD (MtSCCD Land Use Change Detection) for land-use scene classification (LUSC) and land-use change detection (LUCD)
respectively. MtSCCD dataset has the following characteristics. (1) It is currently the largest publicly available LUSCD dataset
and both of the two sub-datasets (i.e.
MtSCCD_LUSC and MtSCCD_LUCD) have 65548 images in total. (2) The images in MtSCCD are split into training set
validation set
and testing set according to the five cities. For example
images from three of the five cities are randomly split into training and validation set
whereas the rest remain to be the testing set. Therefore
MtSCCD has high extensibility
i.e.
it can be easily extended to be a larger dataset. (3) For a deep-learning model
the training set and testing set are categorized from different cities
so it is beneficial to demonstrate the model’s generalization ability. (4) MtSCCD has high intra-class diversity
making it a challenging dataset.
Based on MtSCCD_LUSC and MtSCCD_LUCD
this paper evaluates several deep-learning feature-based methods for LUSC and LUCD. Specifically
AlexNet
VGG networks (i.e.
VGG16 and VGG19)
GoogLeNet
and ResNet networks (i.e.
ResNet18
ResNet50
and ResNet101) are selected to extract deep-learning features that are then fed into SVM for LUSC. We also evaluate DenseNet
EfficientNet
SENet
ViT
and SwinT for LUSC. Two kinds of LUCD approaches including conventional classification-based methods and current similarity-based methods have been evaluated. Experimental results show that the highest overall accuracy of MtSCCD_LUSC dataset is around 76%
indicating much room for improvement. Regarding LUCD
similarity-based methods particularly similarity learning-based ones outperform classification-based methods by a significant margin
providing a promising research direction for LUCD.
This paper presents the currently largest scene classification and change-detection dataset MtSCCD based on high-resolution RS images of the central area of five China cities. MtSCCD contains two subsets MtSCCD_LUSC and MtSCCD_LUCD. Both had 10 land-use types and 65548 images in total. Based on the two sub-datasets
this paper evaluates the performance of several deep networks for scene classification and change detection
expecting to provide baseline results for related researchers. We hope that the MtSCCD dataset can promote this progress in land-use type recognition and monitoring.
土地利用场景分类变化检测数据集信息提取特征提取深度学习卷积神经网络
land usescene classificationchange detectiondatasetinformation extractionfeature extractiondeep learningconvolutional neural network
Bai K, Mu X D, Chen X B, Zhu Y Q and You X A. 2022. Unsupervised remote sensing image scene classification based on semi-supervised learning. Acta Geodaetica et Cartographica Sinica, 51(5): 691-702
白坤, 慕晓冬, 陈雪冰, 朱永清, 尤轩昂. 2022. 融合半监督学习的无监督遥感影像场景分类. 测绘学报, 51(5): 691-702 [DOI: 10.11947/j.AGCS.2022.20210270http://dx.doi.org/10.11947/j.AGCS.2022.20210270]
Cheng G, Han J W and Lu X Q. 2017. Remote sensing image scene classification: benchmark and state of the art. Proceedings of the IEEE, 105(10): 1865-1883 [DOI: 10.1109/JPROC.2017.2675998http://dx.doi.org/10.1109/JPROC.2017.2675998]
Cheng G, Wang G X and Han J W. 2022. ISNet: towards improving separability for remote sensing image change detection. IEEE Transactions on Geoscience and Remote Sensing, 60: 5623811 [DOI: 10.1109/TGRS.2022.3174276http://dx.doi.org/10.1109/TGRS.2022.3174276]
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X H, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreit J and Houlsby N. 2021. An image is worth 16x16 words: transformers for image recognition at scale. arXiv:2010.11929
Feng Q L, Chen B A, Li G Q, Yao X C, Gao B B and Zhang L C. 2022. A review for sample datasets of remote sensing imagery. National Remote Sensing Bulletin, 26(4): 589-605
冯权泷, 陈泊安, 李国庆, 姚晓闯, 高秉博, 张连翀. 2022. 遥感影像样本数据集研究综述. 遥感学报, 26(4): 589-605 [DOI: 10.11834/jrs.20221162http://dx.doi.org/10.11834/jrs.20221162]
He K M, Zhang X Y, Ren S Q and Sun J. 2016. Deep residual learning for image recognition//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE: 770-778 [DOI: 10.1109/CVPR.2016.90http://dx.doi.org/10.1109/CVPR.2016.90]
Hu J, Shen L and Sun G. 2018. Squeeze-and-excitation networks//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE: 7132-7141 [DOI: 10.1109/CVPR.2018.00745http://dx.doi.org/10.1109/CVPR.2018.00745]
Huang G, Liu Z, Van Der Maaten L and Weinberger K Q. 2017. Densely connected convolutional networks//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE: 2261-2269 [DOI: 10.1109/CVPR.2017.243http://dx.doi.org/10.1109/CVPR.2017.243]
Huang Y H and Zhou W X. 2022. Similarity method for high-resolution remote sensing scene change detection. Bulletin of Surveying and Mapping, (8): 48-53
黄宇鸿, 周维勋. 2022. 高分辨率遥感影像场景变化检测的相似度方法. 测绘通报, (8): 48-53 [DOI: 10.13474/j.cnki.11-2246.2022.0231http://dx.doi.org/10.13474/j.cnki.11-2246.2022.0231]
Krizhevsky A, Sutskever I and Hinton G E. 2012. ImageNet classification with deep convolutional neural networks//Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe: Curran Associates Inc.: 1097-1105
LeCun Y, Bengio Y and Hinton G. 2015. Deep learning. Nature, 521(7553): 436-444 [DOI: 10.1038/nature14539http://dx.doi.org/10.1038/nature14539]
Li H F, Dou X, Tao C, Wu Z X, Chen J, Peng J, Deng M and Zhao L. 2020. RSI-CB: a large-scale remote sensing image classification benchmark using crowdsourced data. Sensors, 20(6): 1594 [DOI: 10.3390/s20061594http://dx.doi.org/10.3390/s20061594]
Liu K, Zhou Z, Li S Y, Liu Y F, Wan X, Liu Z W, Tan H and Zhang W F. 2020. Scene classification dataset using the Tiangong-1 hyperspectral remote sensing imagery and its applications. Journal of Remote Sensing (Chinese), 24(9): 1077-1087
刘康, 周壮, 李盛阳, 刘云飞, 万雪, 刘志文, 谭洪, 张万峰. 2020. 天宫一号高光谱遥感场景分类数据集及应用. 遥感学报, 24(9): 1077-1087 [DOI: 10.11834/jrs.20209323http://dx.doi.org/10.11834/jrs.20209323]
Liu Z, Lin Y T, Cao Y, Hu H, Wei Y X, Zhang Z, Lin S and Guo B N. 2021. Swin transformer: hierarchical vision transformer using shifted windows//Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Montreal: IEEE: 9992-10002 [DOI: 10.1109/ICCV48922.2021.00986http://dx.doi.org/10.1109/ICCV48922.2021.00986]
Ma L, Liu Y, Zhang X L, Ye Y X, Yin G F and Johnson B A. 2019. Deep learning in remote sensing applications: a meta-analysis and review. ISPRS Journal of Photogrammetry and Remote Sensing, 152: 166-177 [DOI: 10.1016/j.isprsjprs.2019.04.015http://dx.doi.org/10.1016/j.isprsjprs.2019.04.015]
Qian X L, Li J, Cheng G, Yao X W, Zhao S N, Chen Y B and Jiang L Y. 2018. Evaluation of the effect of feature extraction strategy on the performance of high-resolution remote sensing image scene classification. Journal of Remote Sensing, 22(5): 758-776
钱晓亮, 李佳, 程塨, 姚西文, 赵素娜, 陈宜滨, 姜利英. 2018. 特征提取策略对高分辨率遥感图像场景分类性能影响的评估. 遥感学报, 22(5): 758-776 [DOI: 10.11834/jrs.20188015http://dx.doi.org/10.11834/jrs.20188015]
Simonyan K and Zisserman A. 2015. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 [DOI: 10.48550/arXiv.1409.1556http://dx.doi.org/10.48550/arXiv.1409.1556]
Sui H G, Feng W Q, Li W Z, Sun K M and Xu C. 2018. Review of change detection methods for multi-temporal remote sensing imagery. Geomatics and Information Science of Wuhan University, 43(12): 1885-1898
眭海刚, 冯文卿, 李文卓, 孙开敏, 徐川. 2018. 多时相遥感影像变化检测方法综述. 武汉大学学报(信息科学版), 43(12): 1885-1898 [DOI: 10.13203/j.whugis20180251http://dx.doi.org/10.13203/j.whugis20180251]
Szegedy C, Liu W, Jia Y Q, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V and Rabinovich A. 2015. Going deeper with convolutions//2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE: 1-9 [DOI: 10.1109/cvpr.2015.7298594http://dx.doi.org/10.1109/cvpr.2015.7298594]
Tan M X and Le Q V. 2019. EfficientNet: rethinking model scaling for convolutional neural networks//Proceedings of the 36th International Conference on Machine Learning. Long Beach: [s.n.]: 6105-6114
Wu C, Zhang L F and Zhang L P. 2016. A scene change detection framework for multi-temporal very high resolution remote sensing images. Signal Processing, 124: 184-197 [DOI: 10.1016/j.sigpro.2015.09.020http://dx.doi.org/10.1016/j.sigpro.2015.09.020]
Wu C, Zhang L P and Du B. 2017. Kernel slow feature analysis for scene change detection. IEEE Transactions on Geoscience and Remote Sensing, 55(4): 2367-2384 [DOI: 10.1109/tgrs.2016.2642125http://dx.doi.org/10.1109/tgrs.2016.2642125]
Wu F, Zhang H, Wang C, Li L, Li J J, Chen W R and Zhang B. 2022. SARBuD1.0: a SAR building dataset based on GF-3 FSII imageries for built-up area extraction with deep learning method. National Remote Sensing Bulletin, 26(4): 620-631
吴樊, 张红, 王超, 李璐, 李娟娟, 陈卫荣, 张波. 2022. SARBuD 1.0: 面向深度学习的GF-3 精细模式SAR建筑数据集. 遥感学报, 26(4): 620-631 [DOI: 10.11834/jrs.20220296http://dx.doi.org/10.11834/jrs.20220296]
Xia G S, Hu J W, Hu F, Shi B G, Bai X, Zhong Y F, Zhang L P and Lu X Q. 2017. AID: a benchmark data set for performance evaluation of aerial scene classification. IEEE Transactions on Geoscience and Remote Sensing, 55(7): 3965-3981 [DOI: 10.1109/tgrs.2017.2685945http://dx.doi.org/10.1109/tgrs.2017.2685945]
Xia G S, Yang W, Delon J, Gousseau Y, Sun H and Maître H. 2010. Structural high-resolution satellite image indexing//ISPRS TC VII Symposium - 100 Years ISPRS. 298-303
Yang B S, Han X and Dong Z. 2021. Point cloud benchmark dataset WHU-TLS and WHU-MLS for deep learning. National Remote Sensing Bulletin, 25(1): 231-240
杨必胜, 韩旭, 董震. 2021. 点云深度学习基准数据集. 遥感学报, 25(1): 231-240 [DOI: 10.11834/jrs.20210542http://dx.doi.org/10.11834/jrs.20210542]
Yang Y and Newsam S. 2010. Bag-of-visual-words and spatial extensions for land-use classification//Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems. San Jose: ACM: 270-279 [DOI: 10.1145/1869790.1869829http://dx.doi.org/10.1145/1869790.1869829]
Yu W Q, Cheng G, Wang M J, Yao Y Q, Xie X X, Yao X W and Han J W. 2022. MAR20: a benchmark for military aircraft recognition in remote sensing images. National Remote Sensing Bulletin, 1-11
禹文奇, 程塨, 王美君, 姚艳清, 谢星星, 姚西文, 韩军伟. 2022. MAR20: 遥感图像军用飞机目标识别数据集. 遥感学报: 1-11 [DOI: 10.11834/jrs.20222139http://dx.doi.org/10.11834/jrs.20222139]
Yuan J W, Ru L X, Wang S G and Wu C. 2022. WH-MAVS: a novel dataset and deep learning benchmark for multiple land use and land cover applications. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15: 1575-1590 [DOI: 10.1109/jstars.2022.3142898http://dx.doi.org/10.1109/jstars.2022.3142898]
Yuan J W, Wu C, Du B, Zhang L P and Wang S G. 2020. Analysis of landscape pattern on urban land use based on GF-5 hyperspectral data. Journal of Remote Sensing (Chinese), 24(4): 465-478
袁静文, 武辰, 杜博, 张良培, 王树根. 2020. 高分五号高光谱遥感影像的城市土地利用景观格局分析. 遥感学报, 24(4): 465-478 [DOI: 10.11834/jrs.20209252http://dx.doi.org/10.11834/jrs.20209252]
Zhang L P and Wu C. 2017. Advance and future development of change detection for multi-temporal remote sensing imagery. Acta Geodaetica et Cartographica Sinica, 46(10): 1447-1459
张良培, 武辰. 2017. 多时相遥感影像变化检测的现状与展望. 测绘学报, 46(10): 1447-1459 [DOI: 10.11947/j.AGCS.2017.20170340http://dx.doi.org/10.11947/j.AGCS.2017.20170340]
Zhang L P, Zhang L F and Du B. 2016. Deep learning for remote sensing data: a technical tutorial on the state of the art. IEEE Geoscience and Remote Sensing Magazine, 4(2): 22-40 [DOI: 10.1109/mgrs.2016.2540798http://dx.doi.org/10.1109/mgrs.2016.2540798]
Zhou W X, Newsam S, Li C M and Shao Z F. 2018. PatternNet: a benchmark dataset for performance evaluation of remote sensing image retrieval. ISPRS Journal of Photogrammetry and Remote Sensing, 145: 197-209 [DOI: 10.1016/j.isprsjprs.2018.01.004http://dx.doi.org/10.1016/j.isprsjprs.2018.01.004]
Zhu X X, Tuia D, Mou L C, Xia G S, Zhang L P, Xu F and Fraundorfer F. 2017. Deep learning in remote sensing: a comprehensive review and list of resources. IEEE Geoscience and Remote Sensing Magazine, 5(4): 8-36 [DOI: 10.1109/mgrs.2017.2762307http://dx.doi.org/10.1109/mgrs.2017.2762307]
Zou Q, Ni L H, Zhang T and Wang Q. 2015. Deep learning based feature selection for remote sensing scene classification. IEEE Geoscience and Remote Sensing Letters, 12(11): 2321-2325 [DOI: 10.1109/lgrs.2015.2475299http://dx.doi.org/10.1109/lgrs.2015.2475299]
相关作者
相关机构