Skip Navigation
Skip to contents

GEO DATA : GEO DATA

OPEN ACCESS
SEARCH
Search

Articles

Page Path
HOME > GEO DATA > Volume 6(4); 2024 > Article
Data Article
GeoAI Dataset for Urbanized Area Segmentation from Landsat 8/9 Satellite Imagery and GEMS
Sung-Hyun Gong1,2orcid, Hyung-Sup Jung3,4,*orcid, Geun-Han Kim5orcid, Geun-Hyouk Han6orcid, Il-Hoon Choi7orcid, Jin-Sung Hong8
GEO DATA 2024;6(4):478-486.
DOI: https://doi.org/10.22761/GD.2024.0053
Published online: December 31, 2024

1Integrated Master and PhD Student, Department of Geoinformatics, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, 02504 Seoul, South Korea

2Integrated Master and PhD Student, Department of Smart Cities, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, 02504 Seoul, South Korea

3Professor, Department of Geoinformatics, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, 02504 Seoul, South Korea

4Professor, Department of Smart Cities, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, 02504 Seoul, South Korea

5Research Specialist, Division for Environmental Planning, Water and Land Research Group, Korea Environment Institute, 370 Sicheong-daero, 30147 Sejong, South Korea

6Director, Neighbor System, 135 Jungdae-ro, Songpa-gu, 05717 Seoul, South Korea

7Managing Director, Neighbor System, 135 Jungdae-ro, Songpa-gu, 05717 Seoul, South Korea

8Senior Manager, E-terra, 51-17 Yangcheon-ro, Gangseo-gu, 07532 Seoul, South Korea

Corresponding Author Hyung-Sup Jung Tel: +82-2-6490-2892 E-mail: hsjung@uos.ac.kr
• Received: November 20, 2024   • Revised: December 8, 2024   • Accepted: December 17, 2024

Copyright © 2024 GeoAI Data Society

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

prev next
  • 163 Views
  • 14 Download
  • In South Korea, air pollution has emerged as a pressing social issue, necessitating data-driven approaches to monitor sources of air pollutants. This study constructed a GEO AI dataset for detecting air pollution sources in urbanized areas, utilizing Landsat 8/9 satellite imagery, Geostationary Environment Monitoring Spectrometer geostationary satellite data, and air quality monitoring network data. The dataset is optimized for semantic segmentation tasks, including labeled data for urban area segmentation, and is designed to enable precise detection of pollution sources within urban regions by integrating satellite imagery and air quality information. Using this dataset, we applied a modified U-Net model to classify pollutant sources in urbanized areas, achieving high performance with an mIoU of 0.8592 and pixel accuracy of 0.9433. These results demonstrate the effectiveness of the GEO AI dataset as a tool for identifying and managing major pollution sources, providing foundational data for air quality monitoring and policy development across South Korea and East Asia. With further integration of additional air pollution data, this dataset is expected to contribute to long-term air quality management and the mitigation of health impacts associated with pollution.
Air pollution has emerged as a pressing global issue, with pollutants accelerating global warming and posing serious threats to human health (Brauer et al., 2016). In South Korea, elevated levels of air pollutants, such as particulate matter and ozone, arise both from domestic sources and from pollutants transported via westerly winds from China, leading to significant environmental and health impacts. Monitoring air quality in neighboring regions, particularly in China, has thus become crucial for anticipating changes in South Korea’s air quality and for formulating effective response strategies (Choi et al., 2019; Jang and Yeo, 2015). Industrialized urban areas in China, which emit substantial quantities of pollutants due to industrial activities and vehicle traffic, demand consistent and efficient monitoring. However, China’s vast area of 9,596,960 km2-approximately 95 times larger than South Korea’s 100,210 km2-poses notable challenges in detecting and monitoring sources of pollution.
Satellite imagery-based remote sensing techniques offer valuable tools for acquiring information about inaccessible regions and large-scale phenomena. Due to their broad observational range, satellite data are particularly suitable for detecting extensive pollution sources, such as those in urbanized regions. In recent years, artificial intelligence (AI) techniques have increasingly been applied to satellite imagery analysis, with methods such as object detection, change detection, classification, and semantic segmentation advancing through deep learning approaches that frequently surpass traditional algorithms in performance (LeCun et al., 2015).
This study developed an AI dataset for urban area segmentation to detect sources of air pollution, utilizing 30-m resolution Landsat 8/9 satellite imagery from 2023 and 2024, as well as data from the Geostationary Environment Monitoring Spectrometer (GEMS) and air quality monitoring network. In addition to optical imagery, GEMS and air quality monitoring data, which are used for detecting areas with high concentrations of air pollutants, were incorporated into the input data to construct a more effective dataset for air pollution source detection. A U-Net model, recognized for its high efficacy in semantic segmentation, was then applied to analyze the dataset, with an assessment conducted on its potential utility as a foundational tool for detecting and monitoring urban air pollution sources.
2.1 Study data
To construct an AI dataset for large-scale urban area segmentation, this study employed imagery from Landsat 8/9, the GEMS, and air quality monitoring network data. The Landsat program, operated by the United States Geological Survey, provides essential information on terrestrial characteristics, including land cover, vegetation, and water resources. Landsat imagery has been widely applied in diverse fields such as land use and land cover change detection, natural resource management, forestry, and disaster response. Launched in February 2013 and September 2021, Landsat 8 and Landsat 9 continue to supply high-resolution multispectral imagery across 11 spectral bands with spatial resolutions of 15 m, 30 m, and 100 m. In this study, Level 2 Landsat data were utilized, which include radiometric, geometric, and atmospheric corrections, with the blue, green, red, and near infrared bands (bands 2, 3, 4, and 5) serving as input data (Table 1).
The GEMS, launched in 2020, is the first geostationary satellite dedicated to the continuous observation of climate-forcing and air pollutant substances across East Asia, including the Korean Peninsula (Choi et al., 2018). GEMS provides critical data on atmospheric constituents essential for environmental monitoring, including aerosols, ozone (O3), surface reflectance, cloud properties, and nitrogen dioxide (NO2). This study utilized the Level 4 GEMS dataset, specifically the monthly average estimates of ground-level NO2 concentrations.
In addition to Landsat and GEMS satellite imagery, air quality monitoring network data were incorporated to refine urban area segmentation. Notably, the concentrations of sulfur dioxide (SO2), carbon monoxide (CO), and NO2 provided by the air quality monitoring network are closely linked to urban air pollution sources, making them valuable for identifying key pollutant emission hotspots (Wei et al., 2023). For South Korea, air quality monitoring network data were sourced from AirKorea, which enables data selection based on time period, region, monitoring network, and station locations. Monthly mean concentrations of SO2, CO, and NO2 were employed for this study. For Chinese data, station locations were verified using the UNEP Air Quality Monitoring Platform, and corresponding pollutant measurements were obtained from the Air Pollution in World database.
Fig. 1 illustrates the Coverage area of Landsat 8/9 imagery utilized in this study. The imagery collection area for constructing the AI dataset for urban area segmentation includes regions in South Korea and key urban and industrial zones in China, such as Beijing, Tianjin, Hebei, Shandong, Shanghai, Zhejiang, and Jiangsu. These regions in China were selected due to their severe air pollution problems, which stem from high population density, extensive industrial complexes, and increased traffic volumes (Jeon and Kim, 2015). A total of 55 Landsat 8/9 images were acquired between March 2023 and June 2024, comprising 10 images from South Korea and 45 from China. Details of the acquisition regions, dates, and corresponding satellite imagery are provided in Table 2. All imagery was manually inspected to obtain raw data without applying additional cloudcover filtering options during the search period.
2.2 AI dataset construction
The AI dataset for semantic segmentation of satellite imagery consists of paired input and label data. In this study, the input data include satellite imagery from Landsat 8/9, GEMS, and air quality monitoring network data, while the label data comprise ground truth classifications specifically developed for urban area segmentation. Each satellite image in the input data is paired with a corresponding label that provides the correct segmentation class for the urban region.
Label data construction for urban area segmentation was conducted using the open-source QGIS software, following a rigorous data annotation process. A clear and precise set of criteria was established to delineate urban areas, documented in a comprehensive data construction guideline. Urban areas were defined as regions where buildings serve residential, commercial, or industrial functions, encompassing structures such as residential buildings, commercial and industrial facilities, and transportation infrastructure. Non-urban regions, including forest areas, water bodies, and agricultural land, were excluded from the urban class; trees and grasslands within urbanized zones were also excluded.
To enhance objectivity and consistency in label generation, reference data were incorporated to guide urban area delineation, acknowledging limitations inherent in using only Landsat 8/9 imagery. For South Korea, mid-level land cover maps served as reference data, while for China, open-source geographic data from OpenStreetMap and ESA WorldCover 2021 were utilized. South Korea’s Ministry of Environment provides a scientifically classified land cover map that categorizes surface features into 22 classes, supporting a systematic approach to delineating urban areas.
Following these guidelines, urban segmentation labels were initially constructed in vector format. These vector labels were then converted into raster format (TIFF) through a rasterization process to ensure compatibility with the input data. The Landsat 8/9 satellite data and rasterized labels were partitioned into 512×512 pixel patches with a 25% overlap, enhancing spatial detail for model training. Additionally, GEMS and air quality monitoring network data were segmented into 64×64 pixel patches, enabling consistent alignment across different data sources. Fig. 2 illustrates the AI dataset construction process of this study.
Through the dataset construction methods described above, a total of 5,000 samples were generated for training. Table 3 summarizes the patch size, data format, and quantity for each component of the dataset. The input data include imagery from Landsat 8/9, GEMS, and air quality monitoring network data, while the label data serve as ground truth for semantic segmentation. Fig. 3 presents representative examples from the constructed AI dataset: Fig. 3A-C show sample inputs from Landsat 8/9 imagery, GEMS imagery, and air quality monitoring network data, respectively, while Fig. 3D illustrates the corresponding label data used as ground truth for these inputs.
To validate the efficacy of the constructed AI dataset for urban area segmentation, a U-Net-based model was employed due to its proven effectiveness in semantic segmentation tasks (Du et al., 2020; Long et al., 2015). The conventional U-Net architecture, which employs a single encoder-decoder structure, was modified to adopt a multi-encoder design, enabling the integration of the three distinct input data types-Landsat 8/9, GEMS, and air quality monitoring network-within a unified framework.
The dataset was divided into training, validation, and test subsets with an 80%, 10%, and 10% split, respectively. The training dataset was used to optimize the model, while the validation dataset facilitated hyperparameter tuning and overfitting assessment. The test dataset, held independent from the training and validation stages, was utilized to objectively evaluate the model’s final performance. Additionally, data augmentation techniques were applied to the training dataset to enhance the model’s generalization ability (Baek et al., 2022).
The model achieved a mean Intersection over Union (mIoU) of 0.8592 and a pixel accuracy of 0.9433. For urban area classification specifically, the model demonstrated a precision of 0.9107, recall of 0.9383, and F1 score of 0.9243, reflecting strong segmentation performance. Fig. 4 illustrates example outputs from the trained modified U-Net model, showcasing predictions for different test samples. Each row in Fig. 4 corresponds to a distinct test image, with columns representing the Landsat 8/9 input imagery, label data, and the model’s predicted segmentation result.
A qualitative assessment of the results revealed that while occasional false negatives and false positives were present, the model generally achieved accurate segmentation for urban areas. For cases with well-defined boundaries, such as those depicted in Fig. 4A, B, the model’s predictions closely aligned with the label data, demonstrating high efficacy in distinguishing between urban and non-urban areas. Conversely, Fig. 4C represents a more complex scenario with ambiguous boundaries between urban and non-urban regions, resulting in minor discrepancies between the predicted segmentation and the label data.
This study developed a GEO AI dataset tailored for urban area segmentation to facilitate the detection of air pollution sources, integrating Landsat 8/9 satellite imagery, GEMS satellite imagery, and air quality monitoring network data. The dataset construction process was meticulously documented, ensuring that the resulting dataset is well-suited for semantic segmentation tasks in urban area segmentation. Notably, the constructed AI dataset incorporates not only optical imagery but also diverse data types, underscoring its significance in urban area segmentation for identifying air pollution sources. Furthermore, the dataset demonstrates substantial potential as a foundational resource for detecting and monitoring major sources of air pollutants.
In experiments utilizing a modified U-Net model, the dataset yielded robust results, achieving a mIoU of 0.8592 and pixel accuracy of 0.9433, thus verifying its practical applicability. The dataset can be further refined by incorporating data from additional time periods and by including diverse types of air pollutants, thereby enabling more detailed analysis of pollution sources and affected regions. The spatial information of air pollution sources provided by the dataset plays a pivotal role in understanding the movement and distribution of air pollutants. As a result, this GEO AI dataset is expected to play a crucial role in air quality monitoring and pollution source detection systems across South Korea and East Asia. In the long term, it holds promise as a key resource for policy development aimed at air pollution mitigation and environmental management, contributing to sustained improvements in air quality.

Conflict of Interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Funding Information

This work was supported by the 2023 sabbatical year research grant of the University of Seoul.

Data Availability Statement

The dataset supporting the findings of this study is currently under embargo. It is scheduled for public release on AI Hub in April 2025, at which time it will be assigned a DOI and made fully accessible. This approach ensures compliance with data-sharing requirements and facilitates reproducibility and further research.

Fig. 1.
Study area. Coverage area of Landsat 8/9 imagery in South Korea and China.
GD-2024-0053f1.jpg
Fig. 2.
Workflow of artificial intelligence (AI) dataset construction.
GD-2024-0053f2.jpg
Fig. 3.
Examples of constructed AI dataset for urbanized area segmentation. (A) Landsat 8/9. (B) GEMS. (C) Air quality monitoring network data. (D) Label. AI, artificial intelligence; GEMS, Geostationary Environment Monitoring Spectrometer.
GD-2024-0053f3.jpg
Fig. 4.
Examples of constructed artificial intelligence dataset for urbanized area segmentation. (A, B) The model demonstrates accurate segmentation in urban areas with well-defined boundaries, closely matching the label data. (C) In areas with ambiguous boundaries, minor discrepancies are observed between predictions and label data.
GD-2024-0053f4.jpg
Table 1.
Spectral bands of Landsat 8/9
Band Wavelength (μm) Resolution (m)
Band 1 - Coastal aerosol 0.43-0.45 30
Band 2 - Blue 0.45-0.51 30
Band 3 - Green 0.53-0.59 30
Band 4 - Red 0.64-0.67 30
Band 5 - NIR 0.895-0.88 30
Band 6 - SWIR 1 1.57-1.65 30
Band 7 - SWIR 2 2.11-2.29 30
Band 8 - Panchromatic 0.50-0.68 15
Band 9 - Cirrus 1.36-1.38 30
Band 10 - TIRS 1 10.6-11.19 100
Band 10 - TIRS 2 11.50-12.51 100

NIR, near infrared; SWIR, shortwave infrared; TIRS, thermal infrared.

Table 2.
Details of Landsat 8/9 image acquisition for study areas
No. Region Satellite Acquisition date
1 South Korea Landsat 8 2024.04.07.
2 South Korea Landsat 8 2024.05.09.
3 South Korea Landsat 9 2023.05.08.
4 South Korea Landsat 9 2023.05.15.
5 South Korea Landsat 9 2023.06.16.
6 South Korea Landsat 9 2023.06.16.
7 South Korea Landsat 9 2023.08.19.
8 South Korea Landsat 9 2023.10.22.
9 South Korea Landsat 9 2023.11.07.
10 South Korea Landsat 9 2024.03.14.
11 China Landsat 8 2023.03.12.
12 China Landsat 8 2023.03.30.
13 China Landsat 8 2023.04.10.
14 China Landsat 8 2023.04.10.
15 China Landsat 8 2023.04.10.
16 China Landsat 8 2023.04.17.
17 China Landsat 8 2023.04.24.
18 China Landsat 8 2023.05.08.
19 China Landsat 8 2023.05.12.
20 China Landsat 8 2023.05.28.
21 China Landsat 8 2023.05.28.
22 China Landsat 8 2023.06.09.
23 China Landsat 8 2023.08.23.
24 China Landsat 8 2023.09.01.
25 China Landsat 8 2023.09.01.
26 China Landsat 8 2023.09.06.
27 China Landsat 8 2023.10.01.
28 China Landsat 8 2023.10.01.
29 China Landsat 8 2023.10.01.
30 China Landsat 8 2023.10.15.
31 China Landsat 8 2023.10.22.
32 China Landsat 8 2023.10.24.
33 China Landsat 8 2023.10.26.
34 China Landsat 8 2023.11.18.
35 China Landsat 8 2023.11.20.
36 China Landsat 8 2023.11.22.
37 China Landsat 8 2023.11.27.
38 China Landsat 8 2023.11.29.
39 China Landsat 8 2023.11.29.
40 China Landsat 8 2024.05.14.
41 China Landsat 9 2023.04.02.
42 China Landsat 9 2023.04.09.
43 China Landsat 9 2023.04.27.
44 China Landsat 9 2023.04.30.
45 China Landsat 9 2023.05.29.
46 China Landsat 9 2023.06.01.
47 China Landsat 9 2023.06.12.
48 China Landsat 9 2023.07.19.
49 China Landsat 9 2023.11.19.
50 China Landsat 9 2023.11.21.
51 China Landsat 9 2024.04.11.
52 China Landsat 9 2024.05.02.
53 China Landsat 9 2024.05.13.
54 China Landsat 9 2024.05.15.
55 China Landsat 9 2024.06.03.
Table 3.
Constructed AI dataset for urbanized area segmentation
Data type Patch size Format Quantity (number of samples)
Input data
 Landsat 8/9 256×256 TIFF 5,000
 GEMS 64×64 TIFF 5,000
 Air quality monitoring network data 64×64 TIFF 5,000
Label data 256×256 TIFF 5,000

AI, artificial intelligence; GEMS, Geostationary Environment Monitoring Spectrometer.

  • Baek WK, Lee MJ, Jung HS (2022) The performance improvement of U-net model for landcover semantic segmentation through data augmentation. Korean J Remote Sens 38(6):1663–1676
  • Brauer M, Freedman G, Frostad J, et al (2016) Ambient air pollution exposure estimation for the global burden of disease 2013. Environ Sci Technol 50(1):79–88PubMed
  • Choi J, Park RJ, Lee HM, et al (2019) Impacts of local vs. transboundary emissions from different sectors on PM2.5 exposure in South Korea during the KORUS-AQ campaign. Atmos Environ 203:196–205
  • Choi WJ, Moon KJ, Yoon J, et al (2018) Introducing the geostationary environment monitoring spectrometer. J Appl Remote Sens 12(4):044005Article
  • Du G, Cao X, Liang J, Chen X, Zhan Y (2020) Medical image segmentation based on U-Net: a review. J Imaging Sci Technol 64(2):020508Article
  • Jang KS, Yeo JH (2015) The effects of Korean and Chinese economic growth on particulate matter in Korea: time series cointegration analysis. J Environ Policy Adm 23(1):97–117Article
  • Jeon SH, Kim YP (2015) A study on the smog reduction strategies in China. Part Aerosol Res 11(3):63–75
  • LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444ArticlePubMedPDF
  • Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 Jun 2015, pp 3431-3440
  • Wei J, Li Z, Wang J, Li C, Gupta P, Cribb M (2023) Ground-level gaseous pollutants (, SO2, and CO) in China: Daily seamless mapping and spatiotemporal variations. Atmos Chem Phys 23(2):1511–1532Article
Meta Data for Dataset
Essential
Field Sub-Category
Title of Dataset GeoAI Dataset for Urbanized Area Segmentation from Landsat 8/9 satellite imagery and GEMS
DOI The dataset supporting the findings of this study is currently under embargo. It is scheduled for public release on AI Hub in April 2025, at which time it will be assigned a DOI and made fully accessible
Category Environment
Temporal Coverage 2023.03.01.-2024.06.30.
Spatial Coverage Address South Korea, China
WGS84 Coordinates WGS84
[Latitude] N 26°-43°
[Longitude] E112°-130°
Personnel Name Geun-Hyouk Han
Affiliation Neighbor System
E-mail hyouk@neighbor21.co.kr
CC License CC BY-NC
Optional
Field Sub-Category
Summary of Dataset GeoAI Dataset for Urbanized area Segmentation
Project #33. Air pollution source space distribution data
Instrument QGIS 3.16.8

Figure & Data

References

    Citations

    Citations to this article as recorded by  

      Figure
      • 0
      • 1
      • 2
      • 3
      Related articles
      GeoAI Dataset for Urbanized Area Segmentation from Landsat 8/9 Satellite Imagery and GEMS
      Image Image Image Image
      Fig. 1. Study area. Coverage area of Landsat 8/9 imagery in South Korea and China.
      Fig. 2. Workflow of artificial intelligence (AI) dataset construction.
      Fig. 3. Examples of constructed AI dataset for urbanized area segmentation. (A) Landsat 8/9. (B) GEMS. (C) Air quality monitoring network data. (D) Label. AI, artificial intelligence; GEMS, Geostationary Environment Monitoring Spectrometer.
      Fig. 4. Examples of constructed artificial intelligence dataset for urbanized area segmentation. (A, B) The model demonstrates accurate segmentation in urban areas with well-defined boundaries, closely matching the label data. (C) In areas with ambiguous boundaries, minor discrepancies are observed between predictions and label data.
      GeoAI Dataset for Urbanized Area Segmentation from Landsat 8/9 Satellite Imagery and GEMS
      Band Wavelength (μm) Resolution (m)
      Band 1 - Coastal aerosol 0.43-0.45 30
      Band 2 - Blue 0.45-0.51 30
      Band 3 - Green 0.53-0.59 30
      Band 4 - Red 0.64-0.67 30
      Band 5 - NIR 0.895-0.88 30
      Band 6 - SWIR 1 1.57-1.65 30
      Band 7 - SWIR 2 2.11-2.29 30
      Band 8 - Panchromatic 0.50-0.68 15
      Band 9 - Cirrus 1.36-1.38 30
      Band 10 - TIRS 1 10.6-11.19 100
      Band 10 - TIRS 2 11.50-12.51 100
      No. Region Satellite Acquisition date
      1 South Korea Landsat 8 2024.04.07.
      2 South Korea Landsat 8 2024.05.09.
      3 South Korea Landsat 9 2023.05.08.
      4 South Korea Landsat 9 2023.05.15.
      5 South Korea Landsat 9 2023.06.16.
      6 South Korea Landsat 9 2023.06.16.
      7 South Korea Landsat 9 2023.08.19.
      8 South Korea Landsat 9 2023.10.22.
      9 South Korea Landsat 9 2023.11.07.
      10 South Korea Landsat 9 2024.03.14.
      11 China Landsat 8 2023.03.12.
      12 China Landsat 8 2023.03.30.
      13 China Landsat 8 2023.04.10.
      14 China Landsat 8 2023.04.10.
      15 China Landsat 8 2023.04.10.
      16 China Landsat 8 2023.04.17.
      17 China Landsat 8 2023.04.24.
      18 China Landsat 8 2023.05.08.
      19 China Landsat 8 2023.05.12.
      20 China Landsat 8 2023.05.28.
      21 China Landsat 8 2023.05.28.
      22 China Landsat 8 2023.06.09.
      23 China Landsat 8 2023.08.23.
      24 China Landsat 8 2023.09.01.
      25 China Landsat 8 2023.09.01.
      26 China Landsat 8 2023.09.06.
      27 China Landsat 8 2023.10.01.
      28 China Landsat 8 2023.10.01.
      29 China Landsat 8 2023.10.01.
      30 China Landsat 8 2023.10.15.
      31 China Landsat 8 2023.10.22.
      32 China Landsat 8 2023.10.24.
      33 China Landsat 8 2023.10.26.
      34 China Landsat 8 2023.11.18.
      35 China Landsat 8 2023.11.20.
      36 China Landsat 8 2023.11.22.
      37 China Landsat 8 2023.11.27.
      38 China Landsat 8 2023.11.29.
      39 China Landsat 8 2023.11.29.
      40 China Landsat 8 2024.05.14.
      41 China Landsat 9 2023.04.02.
      42 China Landsat 9 2023.04.09.
      43 China Landsat 9 2023.04.27.
      44 China Landsat 9 2023.04.30.
      45 China Landsat 9 2023.05.29.
      46 China Landsat 9 2023.06.01.
      47 China Landsat 9 2023.06.12.
      48 China Landsat 9 2023.07.19.
      49 China Landsat 9 2023.11.19.
      50 China Landsat 9 2023.11.21.
      51 China Landsat 9 2024.04.11.
      52 China Landsat 9 2024.05.02.
      53 China Landsat 9 2024.05.13.
      54 China Landsat 9 2024.05.15.
      55 China Landsat 9 2024.06.03.
      Data type Patch size Format Quantity (number of samples)
      Input data
       Landsat 8/9 256×256 TIFF 5,000
       GEMS 64×64 TIFF 5,000
       Air quality monitoring network data 64×64 TIFF 5,000
      Label data 256×256 TIFF 5,000
      Essential
      Field Sub-Category
      Title of Dataset GeoAI Dataset for Urbanized Area Segmentation from Landsat 8/9 satellite imagery and GEMS
      DOI The dataset supporting the findings of this study is currently under embargo. It is scheduled for public release on AI Hub in April 2025, at which time it will be assigned a DOI and made fully accessible
      Category Environment
      Temporal Coverage 2023.03.01.-2024.06.30.
      Spatial Coverage Address South Korea, China
      WGS84 Coordinates WGS84
      [Latitude] N 26°-43°
      [Longitude] E112°-130°
      Personnel Name Geun-Hyouk Han
      Affiliation Neighbor System
      E-mail hyouk@neighbor21.co.kr
      CC License CC BY-NC
      Optional
      Field Sub-Category
      Summary of Dataset GeoAI Dataset for Urbanized area Segmentation
      Project #33. Air pollution source space distribution data
      Instrument QGIS 3.16.8
      Table 1. Spectral bands of Landsat 8/9

      NIR, near infrared; SWIR, shortwave infrared; TIRS, thermal infrared.

      Table 2. Details of Landsat 8/9 image acquisition for study areas

      Table 3. Constructed AI dataset for urbanized area segmentation

      AI, artificial intelligence; GEMS, Geostationary Environment Monitoring Spectrometer.


      GEO DATA : GEO DATA
      TOP