Block 3
Block 3
TECHNIQUES IN REMOTE
Indira Gandhi National Open University
School of Sciences SENSING AND DIGITAL
IMAGE PROCESSING
Volume
2
THEMATIC INFORMATION EXTRACTION AND
INTRODUCTION TO R PROGRAMMING
BLOCK 3
IMAGE CLASSIFICATION AND CHANGE DETECTION TECHNIQUES 7
BLOCK 4
INTRODUCTION TO R PROGRAMMING 113
MGY-005: TECHNIQUES IN REMOTE SENSING AND DIGITAL
IMAGE PROCESSING
Programme Design Committee
Prof. Sujatha Verma Dr. I. M. Bahuguna Mr. Manish Parmar
Former Director Deputy Director (Rtd.) Scientist
School of Sciences Space Applications Centre Space Applications Centre
IGNOU, New Delhi Indian Space Research Organisation (ISRO) Ahmedabad, Gujarat
Dr. Shailesh Nayak (ISRO), Ahmedabad, Gujarat Dr. Akella V.S. Aswani
Director Prof. Shamita Kumar ESRI India Technologies Pvt.
National Institute of Advanced Studies Institute of Environment Education and Ltd.
Bangaluru, Karnataka Research Hyderabad, Telangana
Dr. P.S. Acharya Bharati Vidyapeeth University Dr. O.M. Murali
Head, NRDMS, NSDI Division Pune, Maharashtra GIS Consultant
Department of Science and Technology Ms. Asima Misra Chennai, Tamil Nadu
Ministry of Science & Technology Associate Director
New Delhi Prof. Manish Trivedi
ES & e-Governance Group School of Sciences
Dr. Debapriya Dutta Centre for Development of Advanced
Scientist ‘G’ & Associate Head IGNOU, New Delhi
Computing (C-DAC)
National Geospatial Programme Ministry of Electronics and Information Dr. Rajesh Kaliraman
Department of Science and Technology Technology (MeitY) School of Sciences
Ministry of Science & Technology Pune, Maharashtra IGNOU, New Delhi
New Delhi Dr. V. Venkat Ramanan
Dr. Sameer Saran
Dr. L.K. Sinha Head School of Inter-Disciplinary and
Former Director Geoinformatics Department Trans-Disciplinary Studies
Defence Terrain Research Lab. (DTRL), Indian Institute of Remote Sensing IGNOU, New Delhi
Delhi & Defence Geoinformatics Dehradun, U.K.
Research Establishment (DGRE) Faculty of Geology Discipline
Defence R&D Organisation (DRDO) Prof. Daljeet Singh School of Sciences, IGNOU
Chandigarh Department of Geography Prof. Meenal Mishra
Prof. P.K. Garg Swami Shraddhanand College Prof. Benidhar Deshmukh
Civil Engineering Department University of Delhi, New Delhi Prof. R. Baskar
IIT Roorkee, Roorkee, U.K. Dr. D. R. Rajak Dr. M. Prashanth
Prof. P.K. Verma Scientist Dr. Kakoli Gogoi
School of Studies in Earth Science Space Applications Centre (ISRO)
Ahmedabad, Gujarat Dr. Omkar Verma
Vikram University
Ujjain, M.P.
Course Design Committee
Prof. Shamita Kumar Dr. Dharmendra G. Shah Dr. Neha Garg
Institute of Environment Education and Department of Botany School of Sciences
Research MS University of Baroda IGNOU, New Delhi
Bharati Vidyapeeth University Vadodara, Gujarat Dr. Rajesh Kaliraman
Pune, Maharashtra Dr. Sadhana Jain School of Sciences
Prof. R. Jaishanker Regional Remote Sensing Centre IGNOU, New Delhi
CV Raman Laboratory of Ecological (RRSC), ISRO Dr. V. Venkat Ramanan
Informatics Nagpur, Maharashtra School of Inter-Disciplinary and
Digital University Kerala Dr. Neeti Trans-Disciplinary Studies
(formerly IIITM-K) Centre for Climate Change and IGNOU, New Delhi
Thiruvananthapuram, Kerala Sustainability, Azim Premji University
Ms. Asima Misra Faculty of Geology Discipline
Bengaluru, Karnataka
ES & e-Governance Group School of Sciences, IGNOU
Prof. P. V. K. Sasidhar
Centre for Development of Advanced School of Extension and Development Prof. Meenal Mishra
Computing (C-DAC), MeitY Studies, IGNOU, New Delhi Prof. Benidhar Deshmukh
Pune, Maharashtra
Prof. Nehal Farooqi Prof. R. Baskar
Dr. Amit Kumar
School of Extension and Development Dr. M. Prashanth
Environmental Technology Division
Studies, IGNOU, New Delhi Dr. Kakoli Gogoi
CSIR-Institute of Himalayan
Bioresource Technology Prof. Deepika Dr. Omkar Verma
Palampur, H.P. School of Sciences
IGNOU, New Delhi
Programme Coordinators: Prof. Benidhar Deshmukh and Prof. Meenal Mishra
2
MGY-005: TECHNIQUES IN REMOTE SENSING AND
DIGITAL IMAGE PROCESSING
Preparation Team of VOLUME 2: Thematic Information Extraction and Introduction
to R Programming
Course Contributors
Dr. Sourish Chatterjee Dr. Sapana B. Chavan Ms. Leena K.
(Units 13 and 14) (Unit 15) (Unit 15)
Centre for Climate Change and Senior Software Engineer Tower-B, Noida One
Sustainability, Tech Mahindra Sector-62, Noida
Azim Premji University Hyderabad Uttar Pradesh
Bengaluru, Karnataka Telangana
Dr. Anupam Anand Dr. Neeti Prof. Benidhar Deshmukh
(Units 14) (Units 17, 18, 19, 20 and 21) (Units 15 and 16)
Centre for Climate Change and School of Sciences
Sustainability IGNOU, New Delhi
Azim Premji University
Bengaluru, Karnataka
Content Editor
Acknowledgement: Ms. Savita Sharma for assistance in the preparation of CRC and some of the figures.
Cover Page Design: Prof. Benidhar Deshmukh
April, 2024
© Indira Gandhi National Open University, 2024
Disclaimer: Any materials adapted from web-based resources in this module are being used for educational purposes
only and not for commercial purposes.
All rights reserved. No part of this work may be reproduced in any form, by mimeograph or any other means, without
permission in writing from the Indira Gandhi National Open University.
Further information on the Indira Gandhi National Open University courses may be obtained from the University’s office
at Maidan Garhi, New Delhi-110 068 or visit University official website http://www.ignou.ac.in.
Printed and published on behalf of Indira Gandhi National Open University, New Delhi by the Registrar,
MPDD, IGNOU.
3
Volume 2: Thematic Information Extraction and Introduction
to R Programming
The course MGY-005: Techniques in Remote Sensing and Digital Image Processing consists
of four blocks, which have been packaged in two volumes. The Volume 1 deals with
techniques in remote sensing and consists of two blocks namely, Remote Sensing Techniques
and Image Pre-classification Techniques. In the first block you have been introduced to various
types of remote sensing techniques such as aerial photography, multispectral, thermal,
hyperspectral, microwave, LiDAR and UAV based remote sensing, and in the second block
with image statistics, and various pre-classification techniques such as image corrections,
image enhancement and transformation including image fusion and principal component
analysis.
The Volume 2 covers some other aspects of digital image processing viz. image classification
and post-classification techniques and introduction to R programming. It comprises two blocks
namely, Image Classification and Change Detection Techniques, and Introduction to R
Programming.
The first block of this volume, Block 3: Image Classification and Change Detection
Techniques introduces you with techniques of unsupervised and supervised image
classification, change detection and also accuracy assessment, which is an important step
after extraction of thematic information from remote sensing images.
The second block of this volume, Block 2: Introduction to R Programming, introduces you
to fundamentals of computer programming and with using R package for data exploration,
image processing and data plotting.
Expected Learning Outcomes
After studying this volume, you should be able to:
discuss various image classification techniques and commonly used classification
algorithms;
describe the techniques employed for change detection analysis in remote sensing;
explain accuracy assessment approaches suitable to various types of classification
outputs;
write about R programming, its potential in geospatial data exploration, processing and
creation of plots.
After studying this volume, you will be equipped with the basic knowledge of image
classification and post-classification techniques and R programming for geospatial data
exploration and analysis.
We wish you all success in this endeavour!
4
MGY-005: Techniques in Remote Sensing and Digital
Image Processing
Block 1 : Remote Sensing Techniques
Unit 1 : Aerial Photography and Photogrammetry
Unit 2 : Multispectral and Thermal Remote Sensing
Unit 3 : Hyperspectral Remote Sensing
Unit 4 : Microwave Remote Sensing I
Unit 5 : Microwave Remote Sensing II
Unit 6 : LiDAR Remote Sensing and UAV based Remote Sensing
Block 2 : Image Pre-classification Techniques
Unit 7 : Image Statistics
Unit 8 : Radiometric Image Corrections
Unit 9 : Geometric Image Corrections
Unit 10 : Image Enhancement
Unit 11 : Image Filtering and Band Ratioing
Unit 12 : Image Fusion and Principal Component Analysis
Block 3 : Image Classification and Change Detection Techniques
Unit 13 : Unsupervised Classification
Unit 14 : Supervised Classification
Unit 15 : Change Detection Techniques
Unit 16 : Accuracy Assessment of Thematic Maps
Block 4 : Introduction to R Programming
Unit 17 : Fundamentals of Computer Programming
Unit 18 : Basics of R programming
Unit 19 : Using R for Basic Data Exploration
Unit 20 : Using R for Spatial Data Exploration
Unit 21 : Using R for Basic Image Processing
5
6
MGY-005
TECHNIQUES IN REMOTE
Indira Gandhi National Open University
School of Sciences
SENSING AND DIGITAL
IMAGE PROCESSING
Block
3
IMAGE CLASSIFICATION AND CHANGE DETECTION
TECHNIQUES
UNIT 13
Unsupervised Classification 09
UNIT 14
Supervised Classification 39
UNIT 15
Change Detection Techniques 61
UNIT 16
Accuracy Assessment of Thematic Maps 75
Glossary 107
7
BLOCK 3: IMAGE CLASSIFICATION AND CHANGE
DETECTION TECHNIQUES
You have learnt about various kinds of remote sensing techniques in Block-1. In Block-2, you
have studied that there are several pre-classification techniques that are required for
radiometric and geometric correction of digital images and also enhancement and
transformation techniques employed to enhance certain features of interests.
There are several techniques that are employed to extract thematic information from remote
sensing images. Image classification techniques such as the unsupervised and supervised are
the ones generally used for the purpose. We also apply change detection techniques to
understand changes taken place in an area. Once we have extracted thematic information
through some thematic information extraction techniques, the outputs are subjected to post-
classification techniques including accuracy assessment. It is required to assess accuracy of
the derived maps or information to know how close or far it is from the reality. It is also
important due to the fact such maps/information can be used with confidence for some
planning and decision making purpose with some confidence. This block covers following four
units related to these aspects:
Unit 13 “Unsupervised Classification” introduces you to image classification, its types and
specifically unsupervised mode of classification, its requirements and various methods used for
the purpose.
Unit 14 “Supervised Classification” deals specifically with supervised classification and
discusses various types of approaches used along with giving their comparison.
Unit 15 “Change Detection Techniques” focuses on the aspects related to change detection
such as requirement, pre and post classification approaches and various methods used for this.
Unit 16 “Accuracy Assessment of Thematic Maps” builds upon the accuracy assessment
concept that you have learnt in MGY-102 and introduces you to the sources of errors,
accuracy assessment approaches used for variety of outputs along with challenges and recent
developments in the field.
Expected Learning Outcomes
After studying this block, you should be able to:
discuss about unsupervised classification technique and various algorithms used;
describe supervised classification technique along with a brief on use of artificial intelligence
and machine learning in image classification;
write different types of change detection techniques applicable for different types of data and
stages; and
recognise various approaches of accuracy assessment suitable for different kinds of image
classification outputs and data types.
We wish you all the best and hope you will enjoy reading this course.
8
UNIT 13
UNSUPERVISED CLASSIFICATION
Structure______________________________________________
13.1 Introduction ISODATA Clustering
Expected Learning Outcomes Hierarchical clustering
13.2 Image Classification Self Organising Maps
Approaches Fuzzy C-Means Clustering
Stages 13.6 Some Other Approaches
13.3 Types of Classification Gaussian Mixture Models (GMMs)
Unsupervised and Supervised Density-Based Spatial Clustering of
Classification Applications with Noise (DBSCAN)
Hard/Crisp and Soft/Fuzzy Classification 13.7 Challenges and Recent Developments
Pixel and Object based Classification 13.8 Summary
Parametric and Non-parametric 13.9 Terminal Questions
Classification 13.10 References
13.4 Steps in Unsupervised Classification 13.11 Further/Suggested Readings
13.5 Commonly Used Approaches 13.12 Answers
K-means Clustering
13.1 INTRODUCTION
In Block-2 of this course you have learnt about various pre-classification techniques. This unit and
the next one deal with a very key aspect of digital image processing and thematic information
extraction known as image classification. As you have read about image classification in the course
MGY-102, you know that it involves conversion of raster data into finite set of classes that represent
surface types i.e. a class in the theme of your interest in the imagery. This fundamental task
involves either training a model to categorise images into predefined classes (i.e. supervised
classification) or it is achieved by using algorithms that autonomously identify patterns and
similarities within an image dataset without explicit class labels (i.e. unsupervised classification).
The goal of this process is to enable machines to recognise and interpret visual content, mimicking
11
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
parametric classification, as well as statistical, ensemble, and machine learning
classification. Some broad typology of the classification is given in the Table
13.1.
Table 13.1: Typology of image classification (modified from Lu and Weng,
2007).
Basis of
Classifica- Type Characteristics Suitability Advantage Limitation
tion
Image is partitioned into
spectral classes based
Labelling by
Unsupervised
No training
on statistical information When field analyst
using clustering information is not data required,
required after
available, used automatic
algorithms; analyst classification;
for exploratory grouping of
merges and labels the analysis sensitive to
pixels parameters
spectral classes into
information classes
Use of
training Classifier classifies
samples spectral data into
Information classes When
SuperviSed
Simple, widely
Assigns each pixel or Binary decision Does not handle
Hard /
crisp
used, clear
object to a single, boundaries, crisp mixed pixels or
class
discrete class classification uncertainty well
Output type boundaries
or
assignment Handles
Better handles
of class Provides probabilistic or uncertainty;
mixed pixels, Computationally
Soft / fuzzy
imagery, enhanced
neighbouring pixel requires high-
heterogeneous object
quality data,
information is used in landscapes, recognition,
interpretation
image classification urban and better
challenges,
agricultural boundary
struggles with
monitoring delineation,
fragmented
handles mixed
landscapes
pixels
13
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Basis of
Classifica- Type Characteristics Suitability Advantage Limitation
tion
Spectral and spatial Improves
information is used in classification
Spectral contextual
(/spectral-spatial)
Classification; High- accuracy;
parametric or non- resolution reduces
Computationally
parametric classifiers imagery misclassificatio
intensive;
are used to generate where spatial n of
requires careful
initial classification features provide spectrally
feature selection
images and then valuable similar classes;
contextual classifiers information preserves
are implemented in the spatial
classified images structure
So, you have learnt about different types of classification. Let us now discuss
some of these types of classifiers in some detail.
Let us first learn about the unsupervised and supervised classification.
15
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
algorithms Distance, K-means clustering, GEOBIA, Region Growing,
Spectral Angle Mapper., etc. Classification and Regression Trees
(CART)
Easier to interpret but less More realistic and easier to interpret
Interpretation
realistic for complex scenes in complex environments
In the following sections, we will discuss the steps and commonly used
approaches of unsupervised classification in some detail.
17
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
(a)
(b) (c)
Fig. 13.3: On the left is the cropped ROI from FCC (identical to Fig 13.1) of
Landsat 8 OLI image. The next two images are results of K-means
performed on two separate tries. Inside the yellow box one can
observe how the K-means provide different classification on different
runs, despite having the same parameters.
(a)
(b) (c)
Fig. 13.4: On the top is the cropped ROI from the FCC (identical to Fig 13.2) of
Landsat 8 OLI image (a). The two images on the lower panel are the
results of ISODATA clustering performed with initial k clusters 3 (b)
and 10 (c). It can be noted that despite specifying 10 clusters the final
result was produced in 5 clusters as seen in (c), this shows the
adaptive nature of ISODATA to cluster variability.
SAQ I
a) What are the types of image classification?
25
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Limitation
The method can be computationally expensive, especially for large
datasets, as it requires the calculation of all pairwise distances and
repeated merging operations.
It is sensitive to outliers, which can significantly affect cluster formation.
The approach is less practical for very large datasets due to memory
and processing time constraints.
The final clustering result depends mainly on the chosen linkage
method, which may not always capture the most meaningful groupings.
27
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
It is sensitive to the choice of parameters such as learning rate,
neighborhood function, and map size, which can lead to suboptimal
clustering and representation.
The network structure (e.g., grid size and shape) is fixed before training. If
the chosen structure does not match the complexity of the data, the
resulting map may not effectively capture the data’s relationships.
Interpreting the resultant map and understanding the specific meaning of
clusters can be complex, especially for high-dimensional or abstract data.
Visualisation of the map may not always clearly convey the clustering
results.
Outputs of SOMs can vary due to converge to local minima, depending on
initialisation and training conditions.
13.5.5 Fuzzy C-Means Clustering
It is a type of clustering algorithm that is widely used for soft classification. In
hard classification, each data point belongs to exactly one cluster, whereas
FCM allows each data point to belong to multiple clusters with varying degrees
of membership. This approach is particularly useful where land cover types
often have mixed characteristics, making it difficult to classify them into discrete
categories. FCM assigns a membership value to each data point for all clusters.
These membership values range between 0 and 1, with the sum of membership
values for each data point equal to 1. In a typical graphical output of FCM
clustering in remote sensing, each pixel in an image is represented by a colour
or shade corresponding to its membership values across different clusters. In
FCM clustering, the cluster centers are represented as distinct points in the
feature space, typically marked by different colours. Each pixel in the image is
coloured based on its highest membership value, with the intensity indicating
the degree of membership. Further, unlike the hard clustering approaches you
have studied till now, in FCM clustering, boundaries between clusters are not
sharp, rather, there are smooth transitions indicating the fuzziness.
The FCM is particularly suitable for remote sensing applications where land
cover types are not clearly separable. It is used in case of land use land cover
classification, where boundaries between classes such as urban, vegetation,
water, and barren land, may be unclear. Also, it is used in case of vegetation
and soil mapping when different types of vegetation and soil types have
overlapping spectral signatures as there may be gradual transition from one
type to another. Further, it is also useful in change detection studies where
images from different time periods show gradual changes in the classes of
interest.
Following are the major steps in FCM clustering:
1. Initialisation:
The first step is to select the number of clusters, the fuzziness parameter
(typically >1), and the maximum number of iterations or convergence criterion.
Then the membership matrix is randomly initialised, where each element of the
matrix should satisfy certain criteria.
28 Contributor: Dr. Sourish Chatterjee
Unit 13 Unsupervised Classification
…………………………….…………………………………….……………………………………………….…
2. Computation of Cluster Centers:
Then cluster centers are calculated using the current membership matrix. The
cluster center for a cluster is computed satisfying criteria for data point and total
number of data points.
3. Updating of Membership Matrix:
The next step is updating membership values or degrees based on the distance
of each pixel to the centroids.
4. Checking Convergence:
Next step is evaluation of convergence criterion to check if the membership
matrix or cluster centers have converged. The convergence can be determined
by either change in membership (by assessing if the change in membership
degrees between iterations is below a predefined threshold) or change in
cluster centres (by assessing if the change in cluster centers between iterations
is below a predefined threshold). If convergence criteria are met or the
maximum number of iterations is reached then the algorithm is terminated
otherwise the process continues until convergence is achieved.
5. Output Clusters:
Once convergence is achieved, the final cluster centers and membership matrix
represent the clustering results. Each data point will have membership values
for each cluster, indicating the degree to which it belongs to each cluster.
Let us now learn the advantages and limitations of this algorithm.
Advantage
It does soft classification and captures fuzziness in thematic classes,
providing more realistic representation for mixed pixels.
It is effective in areas where class boundaries are not clear or well-defined.
It is flexible and allows for degree of class membership thereby
accommodating complex scenarios.
Limitation
The method can be computationally demanding and time consuming, as
compared to hard classifiers.
The algorithm may converge to local minima, depending on the initial cluster
centers, which can affect the final outcome.
It is sensitive to the initial membership value and also choice of fuzziness
parameter.
Although it is robust to within-class variations, it can still be sensitive to
noise, particularly in the cases, when the noise levels are high relative to the
actual signal.
You have become familiar with several commonly used algorithms for
unsupervised classification. Let us now see their comparison in Table 13.6.
Table 13.6: Comparison of various types of unsupervised classifiers.
29
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Fuzzy C-
Feature / K-Means ISODATA Hierarchical Self-Organising
Means
Method Clustering Clustering Clustering Maps (SOM)
(FCM)
Algorithm Partition- Neural Network- Partition-
Partition-based Hierarchical
Type based based based
Arbitrary
Arbitrary
Cluster Spherical / Spherical / (varies with Spherical /
(depends on
Shape Convex Convex distance Convex
map grid)
metric)
Dynamic
Dynamic (can Fixed
Number of Fixed (depends on Fixed
merge/split (predefined map
Clusters (predefined) dendrogram (predefined)
clusters) size)
cut)
Initialisati Random or Random or Random or Random or
Not applicable
on heuristic heuristic heuristic heuristic
Can be slow
Can be slow and and
Can be slow
Good for large Good for large memory- memory-
Scalability for large
datasets datasets intensive for intensive for
datasets
large datasets large
datasets
Minimises
Convergence of Minimises
Converge Minimises within-cluster Based on
weights and weighted
nce within-cluster variance, allows dendrogram
membership within-cluster
Criteria variance for cut-off
matrix variance
merging/splitting
Cluster centers Dendrogram, Membership
Cluster
Cluster and assignment, cluster degrees for
assignments
Output centers and plus assignments each data
and visualization
assignment merging/splitting at various point in each
on 2D map
information levels cluster
Flexible (soft
Highly flexible
Less flexible More flexible Flexible (visual clustering
(varies with
Flexibility (fixed number (dynamic cluster representation with
dendrogram
of clusters) number) and clustering) membership
cut)
degrees)
Clustering
Well- Hierarchical
Complex data Visualising and with
separated relationships,
with varying clustering high- overlapping
Suitability clusters, fixed variable
cluster shapes dimensional data or
number of number of
and sizes data ambiguous
clusters clusters
boundaries
31
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Core Points: DBSCAN identifies core points as those with a minimum
number of neighboring points within a specified distance.
Density-Reachability: Points within the defined distance of a core point are
considered part of the same cluster. This creates a region of connected
points that are dense enough to form a cluster.
Border Points: Points within the specified distance of a core point but do not
meet the minimum density criteria become border points. They are part of
the cluster but not considered as influential as core points.
Noise Points: Points that are not core points and do not have enough
neighboring points within the distance threshold are considered noise. These
points do not belong to any cluster.
Cluster Formation: The algorithm iteratively expands clusters by examining
the density-reachability relationship. It continues this process until all points
are assigned to a cluster or labeled as noise.
Parameter Tuning: DBSCAN requires two main parameters - the minimum
number of points and the distance threshold. The effectiveness of DBSCAN
can be sensitive to the appropriate selection of these parameters, and they
need to be chosen based on the characteristics of the data.
Result: The final result is a set of clusters, each containing core and border
points. Noise points are treated as outliers.
Let us now learn the advantages and limitations of this algorithm.
Advantage
Flexibility in Cluster Shape: DBSCAN is capable of identifying clusters with
irregular shapes, making it suitable for datasets where clusters may not
conform to traditional geometric shapes. Unlike algorithms like k-means,
which assume spherical clusters, DBSCAN excels at discovering clusters of
varying shapes and sizes, providing more flexibility in capturing complex
patterns within the data.
Noise Handling: DBSCAN effectively handles noise and outliers in the data.
It classifies data points that do not belong to any cluster as noise, allowing
for a more robust clustering result. This feature is particularly valuable in
real-world datasets, which often contain irregularities or anomalies that can
distort the clustering outcomes of other algorithms.
Automatic Determination of Cluster Number: DBSCAN does not require
the user to specify the number of clusters in advance, as opposed to
algorithms like k-means that rely on this input. The algorithm dynamically
identifies clusters based on the density of data points, making it well-suited
for situations where the optimal number of clusters is unknown or may vary
across different parts of the dataset. This adaptability simplifies the
clustering process and is especially advantageous in exploratory data
analysis.
Let us spend 5 minutes to check your progress.
SAQ II
a) What is Gaussian mixture model and it usefulness in thematic information
extraction?
b) What are the limitations of density-based spatial clustering?
33
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
High-Dimensional Data Handling: Advanced algorithms are being used
alongside clustering to manage high-dimensional hyperspectral data.
Multi-Scale and Multi-Resolution Approaches: Techniques that combine
information from multiple scales or resolutions to capture more detailed and
accurate features in remote sensing data are being used.
Multi-Source Data Integration and Feature Fusion: Combining data from
different sources (e.g., optical, radar, LiDAR) to improve clustering outcomes
is another trend. Data fusion methods integrate diverse datasets to provide
richer and more accurate information for unsupervised classification. Also,
feature fusion i.e. merging features from different types of data (e.g.,
spectral, spatial, temporal) is being done to enhance the clustering process
and extract more meaningful patterns.
Hybrid Models: Hybrid models such as combining traditional unsupervised
clustering methods with deep learning features are being used. For example,
using SOMs or FCM with features extracted by deep learning models to
enhance clustering performance.
Handling Big Data: Development of algorithms that handle large volumes of
remote sensing data efficiently is another trend. Techniques such as parallel
processing and distributed computing are used to manage and analyse big
data in remote sensing applications. Cloud computing platforms is also being
leveraged for scalable storage and processing of remote sensing data,
thereby allowing for more extensive and complex analyses.
Dimensionality Reduction: Advanced techniques are being used for
reducing the dimensionality of high-dimensional remote sensing data while
preserving its structure.
Adaptive and Dynamic Clustering methods: Algorithms that can adjust
cluster parameters dynamically based on the data characteristics are being
used, which improve the adaptability of clustering algorithms to varying data
distributions and complexities. Further, techniques that evolve clusters over
time or as more data becomes available, useful for monitoring changes in
remote sensing data are also being devleloped.
Uncertainty and Robustness: Handling uncertainty is a major issue in the
field so methods are being developed that can quantify and manage
uncertainty in clustering results. Also, the algorithms are being designed to
be less sensitive to outliers and noise, thereby improving the reliability of
clustering results in challenging environments.
13.8 SUMMARY
Let us now summarise what we studied in this unit:
Image classification is the process of partitioning image in certain groups of
information classes based on their spectral characteristics. There are
broadly two approaches of image classification i.e. unsupervised and
supervised.
Unsupervised image classification is the process of image classification in
which user input is minimum and the process is guided by the spectral
34 Contributor: Dr. Sourish Chatterjee
Unit 13 Unsupervised Classification
…………………………….…………………………………….……………………………………………….…
similarity of the objects present in the image. This unit essentially describes
the fundamentals of unsupervised classification approaches
The commonly used algorithms in unsupervised image classification are k-
means and ISODATA clustering.
There are some other algorithms used in unsupervised image classification.
However, each algorithm has its own advantages and limitations and choice
of the algorithm is guided by the nature of data being used, number and
nature of features present in the image and also computational resources.
13.10 REFERENCES
Banerjee, B., Bovolo, F., Bhattacharya, A., Bruzzone, L., Chaudhuri, S., &
Mohan, B. K. (2014). A new self-training-based unsupervised satellite image
classification technique using cluster ensemble strategy. IEEE Geoscience
and Remote Sensing Letters, 12(4), 741-745.
Bezdek, J. C. (1981). Pattern Recognition with Fuzzy Objective Function
Algorithms. Springer.
Bishop, C.M. (2006). Pattern recognition and machine learning by
Christopher M. Bishop. Springer Science+ Business Media, LLC.
Comaniciu, D., & Meer, P. (2002). Mean shift: A robust approach toward
feature space analysis. *IEEE Transactions on Pattern Analysis and
Machine Intelligence.
Ester, M., Kriegel, H. P., Sander, J., & Xu, X. (1996). A density-based
algorithm for discovering clusters in large spatial databases with noise.
Proceedings of the 2nd International Conference on Knowledge Discovery
and Data Mining.
Foody, G. M. (1996). Fuzzy modelling of vegetation from remotely sensed
imagery. Ecological Modelling.
Foody, G. M. (1999). Applications of the self-organizing feature map neural
network in community data analysis. Ecological Modelling.
Jain, A. K. & Dubes, R. C. (1988). Algorithms for clustering data. Prentice
Hall.
Jolliffe, I. T. (1986). Principal Component Analysis. Springer-Verlag.
Kaufman, L., & Rousseeuw, P. J. (1990). Finding Groups in Data: An
Introduction to Cluster Analysis. Wiley-Interscience.
Kohonen, T. (2001). Self-Organizing maps. Springer Series in Information
Sciences.
35
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Lee, T. W., & Lewicki, M. S. (2002). Unsupervised image classification,
segmentation, and enhancement using ICA mixture models. IEEE
Transactions on Image Processing, 11(3), 270-279.
Lillesand, T. M., Kiefer, R. W. & Chipman, J. W. (2015). Remote Sensing
and Image Interpretation. John Wiley & Sons.
Liu, J., & Mason, P. J. (2009) Essential Image Processing and GIS for
Remote Sensing. John Wiley & Sons.
Lloyd, S. (1982). Least squares quantization in PCM. IEEE transactions on
information theory, 28(2), 129-137.
Lu, D., & Weng, Q. (2007) A survey of image classification methods and
techniques for improving classification performance. International journal of
Remote sensing, 28(5), 823-870.
Mather, P. M. & Tso, B. (2016). Classification Methods for Remotely Sensed
Data. CRC Press.
Mather, P. M., & Tso, B. (2009) Classification Methods for Remotely Sensed
Data. CRC Press.
Ng, A. Y., Jordan, M. I., & Weiss, Y. (2002). On spectral clustering: Analysis
and an algorithm. Advances in Neural Information Processing Systems.
Pal, N. R., & Bezdek, J. C. (1995). On cluster validity for the fuzzy c-means
model. IEEE Transactions on Fuzzy Systems.
Salah, M. (2017). A survey of modern classification techniques in remote
sensing for improved image classification. Journal of Geomatics, 11(1), 21.
Schmarje, L., Santarossa, M., Schröder, S. M., & Koch, R. (2021). A survey
on semi-, self-and unsupervised learning for image classification. IEEE
Access, 9, 82146-82168.
Shekhar, S., Schrater, P. R., Vatsavai, R.R., Wu, W. and Chawla, S. (2002).
Spatial contextual classification and prediction models for mining geospatial
data. IEEE Transactions on Multimedia, 4(2), 174-188.
Vatsavai, R.R., Symons, C.T., Chandola, V. and Jun, G. (2011). GX-Means:
A model-based divide and merge algorithm for geospatial image clustering.
Procedia Computer Science, 4, 186-195.
Wang, Q., Li, Q., Liu, H., Wang, Y., & Zhu, J. (2014, October). An improved
ISODATA algorithm for hyperspectral image classification. In 2014 7th
International Congress on Image and Signal Processing (pp. 660-664).
IEEE.
Wang, X. and Wang, J. (2010). Using clustering methods in geospatial
information systems. Geomatica, 64(3), 347-361.
13.12 ANSWERS
SAQ I
a) Please refer to section 13.1.
b) Please refer to subsection 13.2.1.
c) Please refer to subsection 13.2.2.
SAQ II
a) Gaussian Mixture Models (GMMs) are probabilistic models used in statistics
and machine learning to represent complex data distributions.
37
Contributor: Dr. Sourish Chatterjee
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
b) Comprising a mixture of Gaussian (normal) distributions, GMMs can model
diverse patterns within a dataset. Each Gaussian component represents a
potential cluster, allowing GMMs to effectively capture intricate structures.
c) Please refer to subsection 13.3.2.
Terminal Questions
1. Refer to section 13.4.
2. You answer should major include the difference in terms of mode,
requirement of field information, stage of analysis, etc.
3. Refer to sections 13.4 and 13.5.
SUPERVISED CLASSIFICATION
Structure______________________________________________
14.1 Introduction Parallelepiped Classifier
14.1 INTRODUCTION
You have learnt that image classification involves mapping of the digital image data into a finite set
of classes that represent surface types in the imagery. It may be used to identify vegetation types,
anthropogenic structures, mineral resources, etc. or transient changes in any of these features.
Additionally, a classified image that can be considered to a raster layer in a geographic information
system. In the previous unit, you have learnt about image classification, typology of image
classification and more about unsupervised image classification and its various approaches. In this
unit, we will learn about the other generic type of image classification i.e. supervised image
classification.
Water absorbs much longer wavelength at visible and NIR radiation than
shorter visible wavelengths. Thus, water typically looks blue or blue-green due
to stronger reflectance at these shorter wavelengths and darker if viewed at red
or NIR wavelengths. If there is suspended sediment present in the upper layers
of a water body, then this will allow better reflectivity and a brighter appearance
of the water. The apparent colour of the water will show a slight shift to longer
wavelengths. Suspended sediment can be easily confused with shallow (but
clear) water, since these two phenomena appear very similar. Chlorophyll in
algae absorbs more of the blue wavelengths and reflects green, making water
appear greener in colour when algae are present. The topography of the water
surface such as rough, smooth or floating materials can also lead to
complications for water related interpretation due to potential problems of
specular reflection and other influences on colour and brightness.
Spectral signatures, however, are not always “pure” which means the sensor
might record some signatures that may be emitted by surrounding objects.
“Pure” spectral signature for individual materials or classes can be determined
best under laboratory conditions, where the sensor is placed very close to the
target (Fig. 14.4). There is no interference in a closed and controlled
environment such as a laboratory. Agencies such as ISRO, US Department of
Agriculture and several universities maintain large repositories of spectral
Fig. 14.4: Spectral curves from a spectral library. (Source: Spectral library of US
Geological Survey)
You have leant that spectral signature is the unique pattern of reflectance or
radiance values of an object or surface across various wavelengths of the
electromagnetic spectrum. Understanding spectral signature is important
because they are used to differentiate between different classes or materials in
remote sensing images. Accurate signatures are crucial for distinguishing
between similar classes and improving classification accuracy.
Let us now study about training site selection.
14.3.2 Training Site Selection
After selection of suitable classification scheme and the data for classification,
the supervised image classification process consists of three important stages:
training, signature evaluation and decision making as you have read earlier
(Fig. 14.5).
Fig. 14.7: Using supervised classification, pixels are classified into different
categories.
Fig. 14.10: Using the parallelepiped approach, pixel 1 is classified as forest and
pixel 2 is classified as urban.
SAQ I
a) What is supervised image classification?
b) What is training?
This is because of the fact that while we use spectral information to create
classes in the unsupervised classes, performance of the supervised classes is
largely dependent on the training samples. The training samples used for the
above classification (Fig. 14.11) is not sufficient to cover the entire spectrum of
a particular class and, therefore, we get a generalised image. The results of the
supervised classification can be further improved by collecting more training
samples which would further help to reduce the differences between classes
which may be due to mixtures within each pixel e.g. grass and forest.
It is not easy to answer the question which classification method is suitable for
a study because different classification results can be obtained from different
methods and each method has its own merits and demerits. However, for a
general guideline it can be said that factors such as spatial resolution of the
remote sensing data, source of data, classification scheme and availability of
Contributors: Dr. Anupam Anand and Dr. Sourish Chatterjee 55
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
classification software must be taken into account while selecting a
classification method for use.
SAQ I
a) What is signature?
14.8 SUMMARY
Let us now summarise what you have studied in this unit:
There are two approaches to image classification, i.e. unsupervised and
supervised. Unsupervised classification is useful for complex terrains and it
can significantly reduce the cost of ground data collection than the
supervised classification.
14.10 REFERENCES
Abburu, S. & Golla, S. B. (2015). Satellite image classification methods and
techniques: A review. International journal of computer applications, 119(8).
Bishop, C.M. (2006). Pattern recognition and machine learning by
Christopher M. Bishop. Springer Science+ Business Media, LLC.
Dhingra, S. & Kumar, D. (2019). A review of remotely sensed satellite image
classification. International Journal of Electrical and Computer Engineering,
9(3), 1720.
ERDAS Field Guide, http://www.gis.usu.edu/unix/imagine/FieldGuide.pdf
(data retrieved in February, 2012).
Jensen, J.R. (1986) Introductory Digital Image Processing: A Remote
Sensing Perspective, Prentice-Hall, New Jersey.
Lillesand, T. M., Kiefer, R. W. & Chipman, J. W. (2015). Remote Sensing
and Image Interpretation. John Wiley & Sons.
Lillesand, T.M. and Kiefer, R.W. (1994) Remote Sensing and Image
Interpretation, John Wiley and Sons, Hoboken.
Liu, J., & Mason, P. J. (2009) Essential Image Processing and GIS for
Remote Sensing. John Wiley & Sons.
14.12 ANSWERS
SAQ I
58 Contributors: Dr. Anupam Anand and Dr. Sourish Chatterjee
Unit 14 Supervised Classification
…………………………….…………………………………….……………………………………………….…
a) Image classification can be defined as a process of assigning land cover
classes/themes to pixels in an image.
b) Training is the process of generating spectral signature of each class.
SAQ II
a) Please refer to subsection 14.3.1.
b) Please refer to section 14.3.
Terminal Questions
1. Please refer to section 14.2.1.
2. Please refer to section 14.3.2.
3. Please refer to section 14.3.1.
4. Please refer to section 14.8.
5. Please refer to section 14.6.
What does Image Processing's Change Change Detection using UAV Technology
Detection Entail?
Machine Learning for Change Detection
The algorithm for Change Detection
15.5 Summary
Factors for Implementing Change Detection
15.6 Terminal Question
Operation of GIS-Based Change Detection
15.7 References
Change Agent
15.8 Further/Suggested Readings
15.3 Technique for Spectral Change
15.9 Answers
Detection
15.1 INTRODUCTION
This unit mainly deals with various aspects of change detection techniques and factors responsible
to understand the changes. In this unit, we will discuss about change detection necessity, their
change agents, requirement of image processing change detection, steps involved for spectral
change detection, and applications of change detection. You will also comprehend the significance
of change detection.
61
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
63
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
remote sensing data have been employed widely as primary resource for
change detection.
15.2.2 What does image processing's change detection
entail?
Analyzing changes in satellite photos collected at two separate times is a
process called "change detection." Different methods can be used to calculate
the changes in these multi-temporal satellite photos.
15.2.3 The algorithm for change detection
Utilizing several temporal data sets, change detection entails quantifying
temporal impacts. Satellite data is frequently employed when someone is
interested in tracking changes over vast areas and at frequent intervals. The
algorithms utilized have a significant impact on the outcomes of the digital
analysis.
15.2.4 Factors for implementing change detection
The following prerequisites must be met before change detection analysis is put
into practice: (1) accurate radiometric and atmospheric calibration or
normalization between multi-temporal images; (2) precise multi-temporal image
registration; (3) identical phenological states between multi-temporal images;
and (4) detection of change
15.2.5 Operation of GIS-Based Change Detection
To derive insightful information, GIS software mixes spatial data with statistical
data. Equipment like drones, unmanned aerial vehicles, or satellites are used to
remotely perceive and collect the aforementioned spatial data. Numerous
sources can be used to gather statistical data, and satellites, UAVs, and drones
used for remote sensing can acquire geospatial data. Due to open data access,
satellite change detection is becoming more and more popular today and is
frequently the quickest and least expensive choice.
Now, if the data is gathered and examined over time, this provides us with a
change detection dashboard to identify a given feature's aspect through time.
This dashboard can then be used in a variety of ways, such as to comprehend
changes in ice sheets and forest cover, among other things. It can recognize a
certain feature's aspect in two distinct time frames. For instance, change
detection can be used to track retail businesses to find any differences between
the quantity of stores that opened five years ago and currently. Change
detection can also be used to monitor the size, shape, and movements of a
particular feature.
The applications of geospatial change detection are numerous. The technique
is used to monitor changes in crop status, land usage, urban growth, vehicle
mobility, glacier cracking, and other aspects of the environment. The discovery
of manmade climate change in the world's oceans aids in understanding the
extent of the issue and developing a successful response strategy.
GIS examines statistical and spatial data for change detection.
SAQ I
a) What is the goal of change detection?
b) Which analysis is popular in recent years for remote sensing applications
such as mapping from high resolution photos or building and updating GIS
databases?
c) What are the anthropogenic change agents?
65
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
4. Change Vector Analysis: A change vector of a pixel is the vector
difference between the multi-band digital vector of the pixel on two different
dates. The amplitude and direction of change from date one to date two are
described by a spectral change vector. The output consists of two images,
one of which shows the change vector's magnitude and the other its
direction. If the magnitude of changes exceeds a certain threshold, the
change is determined, and the type of change is indicated by the direction
of the change vector.
5. Vegetation Index Differencing: In studies of vegetation, a ratio
(sometimes referred to as a vegetation index) is employed to emphasize
spectral contrasts between vegetation's strong near-infrared reflectance and
the chlorophyll-absorption band (red portion) of the spectrum. Ratio
Vegetation Index, Normalized Vegetation Index, and Transformed
Vegetation Index are examples of common vegetation indices.
6. Multi-date Principal Component Analysis: In this method, two images
from the same location taken at different times are superimposed and
analyzed as a single image. The local alterations are revealed by the minor
component images, while the major component images display the
reflectance and radiometric variances (minor changes). This method is
more useful than the post classification method for tracking rapid changes in
land use and urban growth. On two photos from various dates, they
performed a principal component analysis, and on the compressed PCA
image, an interactive supervised categorization of land-use change was
performed. The new method and the traditional post-classification
methodology were compared using photos from two.
7. Post-classification Technique: In the post-classification method, each
image is assigned a unique categorization and label. The area of changes is
then extracted after a direct comparison of the categorization findings. This
method employs both supervised and unsupervised classifications. The
difficulty of adjusting for atmospheric and sensor changes between two
dates is reduced to a minimum by individual classification of two image
dates.
The key drawback of this approach is the results of the categorization are
accuracy dependent. Individual categorization errors result in the
propagation of uncertainties in the change map, which produces erroneous
information about changes in land use. The categorization methods, source
image error, and change determination as the sources of uncertainty in
change detection.
They outlined the following three primary error factors for classification-
based change detection using Maximum Likelihood (ML):
a. Subjective data collection is utilized to gather training data;
b. the ML classifier presumes that the probability distribution of each class
is normal; and
c. the method used to assess changes is not objective (based on number
of uncertainties). The usual way to express the uncertainty in a
categorized remote sensing image is as a confusion matrix (error).
66 Contributor: Dr. Sapana B. Chavan
Unit 15 Change Detection Techniques
…………………………….…………………………………….……………………………………………….…
Confusion matrix can be used to derive a number of error indicators,
including error of commission.
8. On-screen Digitization: This technique is typically applied to scanned aerial
photos and high-resolution data from distant sensors. This technique uses
on-screen picture interpretation of high-resolution images to update
inaccurate government urban infrastructure records.
SAQ II
a) What is the Image ratioing?
b) Explain the Multi-date Principal Component Analysis.
c) What is the Change vector?
d) ............., …………….., and ………………. are examples of common
vegetation indices.
Fig. 15.3: Change Detection for Land Use/ Land Cover Application.
69
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
15.6 SUMMARY
In this Unit, you have learned the following:
Identifying, describing, and quantifying variations between images of the
same scene taken at different times or under various circumstances are the
goal of change detection analysis.
Remote sensing and image processing technology advancements in
particular have made it possible to determine vast areas in detail and, in this
regard, produce expanded and trustworthy recent data swiftly
Analyzing changes in satellite photos collected at two separate times is a
process called "change detection." Different methods can be used to
calculate the changes in these multi-temporal satellite photos.
GIS examines statistical and spatial data for change detection.
Urbanization, agriculture, insect attacks, fire, logging, mining, storms, and
petroleum exploration and production are only a few examples of the
change agents that have been the subject of most studies.
Images from two different dates are combined to create a new single-band
or multi-band image that contains the spectral alterations in spectral change
detection.
Image differencing, Image ratioing, Image regression, change vector
analysis, Vegetation index differencing, Multi date principal component
analysis, Post classification techniques and on-screen digitization are the
techniques of spectral change detection
Remote sensing provides a method for quickly gathering and displaying
data on land cover, whether it be on a regional or small scale
71
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Various application of change detection such as Land use/Land cover
analysis, Coastal area management, Forest fire, change detection using
UAV technology, Machine learning for change detection.
15.7 REFERENCES
Asokan, A., & Anitha, J. J. E. S. I. (2019). Change detection techniques for
remote sensing applications: A survey. Earth Science Informatics, 12, 143-
160.
Jianya, G., Haigang, S., Guorui, M., & Qiming, Z. (2008). A review of multi-
temporal remote sensing data change detection algorithms. The
International Archives of the Photogrammetry, Remote Sensing and Spatial
Information Sciences, 37(B7), 757-762.
Bai, T., Wang, L., Yin, D., Sun, K., Chen, Y., Li, W., & Li, D. (2023). Deep
learning for change detection in remote sensing: a review. Geo-spatial
Information Science, 26(3), 262-288.
Lu, D., Mausel, P., Brondizio, E., & Moran, E. (2004). Change detection
techniques. International journal of remote sensing, 25(12), 2365-2401.
Shafique, A., Cao, G., Khan, Z., Asad, M., & Aslam, M. (2022). Deep
learning-based change detection in remote sensing images: A review.
Remote Sensing, 14(4), 871.
Coppin, P. R., & Bauer, M. E. (1996). Digital change detection in forest
ecosystems with remote sensing imagery. Remote sensing reviews, 13(3-
4), 207-234.
Singh, A. (1989). Review article digital change detection techniques using
remotely-sensed data. International journal of remote sensing, 10(6), 989-
1003.
Coppin, P., Jonckheere, I., Nackaerts, K., Muys, B., & Lambin, E. (2004).
Review ArticleDigital change detection methods in ecosystem monitoring: a
review. International journal of remote sensing, 25(9), 1565-1596.
Hussain, M., Chen, D., Cheng, A., Wei, H., & Stanley, D. (2013). Change
detection from remotely sensed images: From pixel-based to object-based
approaches. ISPRS Journal of photogrammetry and remote sensing, 80,
91-106.
Willis, K. S. (2015). Remote sensing change detection for ecological
monitoring in United States protected areas. Biological Conservation, 182,
233-242.
Coppin, P. R., & Bauer, M. E. (1996). Digital change detection in forest
ecosystems with remote sensing imagery. Remote sensing reviews, 13(3-
4), 207-234.
Wen, D., Huang, X., Bovolo, F., Li, J., Ke, X., Zhang, A., & Benediktsson, J.
A. (2021). Change detection from very-high-spatial-resolution optical remote
sensing images: Methods, applications, and future directions. IEEE
Geoscience and Remote Sensing Magazine, 9(4), 68-101.
73
Contributor: Dr. Sapana B. Chavan
Block 3 Image Classification and Change Detection Techniques
…………………………………………………………………….…………………………………………………
Shi, W., Zhang, M., Zhang, R., Chen, S., & Zhan, Z. (2020). Change
detection based on artificial intelligence: State-of-the-art and challenges.
Remote Sensing, 12(10), 1688.
Qin, R., Tian, J., & Reinartz, P. (2016). 3D change detection–approaches
and applications. ISPRS Journal of Photogrammetry and Remote Sensing,
122, 41-56.
Hemati, M., Hasanlou, M., Mahdianpari, M., & Mohammadimanesh, F.
(2021). A systematic review of landsat data for change detection
applications: 50 years of monitoring the earth. Remote sensing, 13(15),
2869.
15.9 ANSWERS
SAQ I
a) Identifying, describing, and quantifying variations between images of the
same scene taken at different times or under various circumstances are the
goal of change detection analysis, which includes a wide range of
techniques.
b) Automatic object-based image analysis has been increasingly popular in
recent years for remote sensing applications such as mapping from high
resolution photos or building and updating GIS databases.
c) The agents responsible for anthropogenic changes refers to human-induced
change, which is typically tied to land use and includes activities like urban
development, farming, logging, and mining.
SAQ II
a) Image ratioing is utilization of the same two co-registered image dates, each
band is rationed pixel by pixel.
b) Multi-date Principal Component Analysis is analysis where two images from
the same location taken at different times are superimposed and analyzed
as a single image. The local alterations are revealed by the minor
component images, while the major component images display the albedo
(reflectance) and radiometric variances (minor changes).
c) A change vector of a pixel is the vector difference between the multi-band
digital vector of the pixel on two different dates.
d) Ratio Vegetation Index, Normalized Vegetation Index, and Transformed
Vegetation Index are examples of common vegetation indices.
Terminal Questions
1. An inventory level of data indicating the location, nature, and extent of
change is provided by the satellite land cover change information
2. The line of land-water contact is referred to as the shoreline.
3. UAVs are a feasible alternative for gathering data from distant sensing for a
variety of real-world uses.
ACCURACY ASSESSMENT
OF THEMATIC MAPS
Structure_______________________________________________
16.1 Introduction For Hard and Soft Classification Outputs
Expected Learning Outcomes 16.6 Approaches to Accuracy Assessment for
other Data/Outputs
16.2 Accuracy Assessment
Change Detection Outputs
Purpose
Digital Elevation Models
Types
Modelled Outputs
Metrics
Vector Data
Steps
16.7 Challenges and Recent Trends
16.3 Historical Development
Challenges
16.4 Considerations for Accuracy
Assessment Approaches to Improving Accuracy
Sources of Errors in Classification Outputs Recent Trends and Developments
Factors Affecting Accuracy Assessment 16.8 Summary
Sampling Design Consideration 16.9 Terminal Questions
Determining Sample Size 16.10 References
16.5 Approaches to Accuracy Assessment 16.11 Further/Suggested Readings
For Per-pixel and Object based 16.12 Answers
Classification Outputs
16.1 INTRODUCTION
You have studied in the course MGY-102 that digital image processing involves three broad
image functions namely, image pre- processing, image processing and image post-processing. In
the Block 2 of this course, i.e. MGY-005, you have studied in detail about the image pre-
processing techniques that include image correction, image enhancement and transformation
techniques.
76
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
80
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
Fig. 16.1: Broad steps in accuracy assessment.
Usually, the first step is the collection of reference data either from the ground or
from a higher resolution data based on the carefully designed sampling strategy
considering number of type of approach employed for thematic information
extraction, classes present, characteristics of the features, accessibility on the
ground, etc. Care should be taken that the chosen or collected ground truth data
accurately represents the classes in the study area for the land use land cover
or any other theme as per the thematic map prepared.
After collecting the reference data, the second step is to create error matrix
(also called confusion matrix) to compare the classification results with the
reference data. The third step is calculation of accuracy metrics such as overall
accuracy, producer’s and user’s accuracy, and the Kappa coefficient. Next step
is to analyse the metrics to understand the strengths and weaknesses of the
classification. Further, at the last step the findings are documented and are used
to refine classification algorithms and data processing techniques.
83
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
efficiency of validation. These systems can dynamically update and refine
reference datasets as new data becomes available.
The use of crowdsourced data and citizen science initiatives, such as through
platforms like Google Earth Engine, has expanded the availability of reference
data. This has led to the development of new methods to assess and validate
classifications using data from non-traditional sources.
The evolution of accuracy assessment in remote sensing reflects the broader
technological advancements in the field. From simple qualitative comparisons in
the early days to sophisticated statistical and AI-driven assessments today, the
field continues to develop new methods that enhance the reliability of remote
sensing data. This historical progression underscores the importance of
accuracy assessment as a dynamic and essential component of remote sensing
analysis, adapting to the increasing complexity and demands of modern
geospatial applications.
Let us spend 5 minutes to check your progress.
SAQ I
a) What are the types of accuracy assessment?
84
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
Source of Error Description Impact Possible Solution
and noise levels can can introduce resolution, and apply noise
affect the data quality inaccuracies in reduction techniques, if
classification needed
Use additional spectral
bands, indices (e.g., NDVI),
Different land cover Results in or apply advanced
Spectral classes may have misclassification algorithms like machine
overlap and overlapping or similar thereby reducing learning models that can
similarity spectral signatures e.g. classification accuracy better handle overlapping
between water and shadow, as similar classes are classes or incorporate
classes leading to not distinguished additional data (e.g.,
misclassification properly texture, elevation) and
incorporate contextual
information in classification
Use sub-pixel classification
Causes
techniques, spectral
misclassification,
Pixels containing more unmixing techniques, fuzzy
especially in
than one land cover classification approach to
heterogeneous or
Mixed pixels types, especially common handle mixed pixels better
transitional zones and
and boundary at class boundaries due or higher spatial resolution
overestimation or
effects to the spatial resolution, data; Use boundary
underestimation of
leading to ambiguous smoothing techniques,
class boundaries,
classification post-classification filtering,
affecting area
or segmentation-based
calculations
classification
Radiometric noise,
Degrades the quality of Apply preprocessing steps
Noise and atmospheric effects,
spectral information, like radiometric correction,
distortions in sensor errors and
leading to incorrect atmospheric correction,
data preprocessing artifacts
classification and image normalisation
may affect data quality
Differences in sunlight Results in inconsistent
due to time of day, reflectance values,
season, or topography, affects classification Apply topographic
Shadow and
shadows from terrain, performance in shaded normalisation, shadow
illumination
buildings, or vegetation areas, especially in detection and removal, or
effects
obscure surface features, urban and forested use of multi-temporal data
altering their spectral regions or other low-
properties reflectance classes
Atmospheric effects like Causes spectral Apply atmospheric
haze, clouds, and aerosol distortion, reducing the correction like Dark Object
scattering affect sensor ability to differentiate Subtraction (DOS) or
measurements and alter between classes, atmospheric radiative
Atmospheric
the signal received by results in gaps or transfer models, use cloud
conditions
sensors, presence of incorrect classifications masking, gap-filling
clouds and their shadows in cloud-covered areas, algorithms, or multi-
obscure the land surface, especially in optical temporal compositing to
leading to missing data data reduce cloud impacts
Alters spectral
Terrain-induced signatures, causes
Topographic variations in reflectance misclassification in Apply topographic
effects due to slope, aspect, and mountainous regions or correction techniques
elevation differences areas with varied
topography
Changes in land cover Leads to outdated or
between image misleading Use recent data,
acquisition dates and the classification outputs, incorporate multi-temporal
Temporal
ground truth data especially in dynamic datasets, or apply change
changes in
collection affect landscapes; Causes detection techniques; use
landscape/
classification consistency; temporal mismatches multi-seasonal data,
seasonal
differences in vegetation and classification phenological adjustment, or
variations
phenology and land cover errors, especially in seasonal compositing
conditions across agriculture and forest techniques
seasons monitoring
Errors due to sensor Results in spatial Apply geometric correction,
Geometric
geometry, earth misalignment and orthorectification, and
distortion/
curvature, or relief incorrect positioning of accurate georeferencing
misregistration
displacement affecting features and affecting using precise ground
85
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
Source of Error Description Impact Possible Solution
pixel locations the classification control points
Differences in data scales Can lead to bias and Normalise or standardise
Scaling issues between training and poor classification data to ensure consistent
classification datasets performance scaling across datasets
Improve training data
Insufficient or through field validation,
Inaccurate, insufficient, or
unrepresentative expert annotation, and
Inadequate non-representative
training samples can sufficient number of high-
training sample training data can
lead to biased or quality, well-distributed
size and quality misguide the
inaccurate classification training samples ensuring
classification model
results balanced class
representation
Standardise procedures,
Errors introduced during use automated approaches
Leads to
Human manual classification or when possible, and
inconsistencies and
interpretation training data collection incorporate multiple
inaccuracies in
errors due to subjective interpreters for removal of
classification outputs
judgment subjectivity in interpretation
and to minimise errors
Use techniques like
Uneven distribution of Dominant classes are oversampling,
Sample design/
classes in training data over-represented, while undersampling, or
class
can bias the model minority classes are Synthetic Minority Over-
imbalance
toward dominant classes under-classified sampling Technique
(SMOTE)
Use of irrelevant or
Increases the risk of
Suboptimal redundant features that Use feature selection
overfitting, leading to
feature do not contribute methods like Principal
decreased performance
selection significantly to Component Analysis (PCA)
on unseen data
classification accuracy
Choosing an unsuitable Results in overly Use appropriate
Inappropriate classification scheme that simplistic or overly classification scheme
classification does not capture the complex classifications, suitable to resolution, scale
scheme variability of land cover leading to high error and level of classes being
classes rates mapped
Results in reduced
Inherent weaknesses in classification accuracy Use ensemble methods,
classification algorithms due to inability to hybrid approaches, or
Algorithm in handling complex or correctly model data advanced machine learning
limitations overlapping classes and complexities especially algorithms such as
non-linear class in complex or Random Forests or Neural
boundaries heterogeneous Networks
landscapes
Mistakes in collecting or Use careful field validation,
Leads to biased training
Ground truth labelling ground truth automated data collection
data and unreliable
data quality data used for training and tools, and quality control
accuracy assessments
validation measures
Errors in classification outputs in remote sensing can stem from various factors
including data quality, algorithm limitations, environmental effects, and human
errors. Each error source can significantly impact the overall classification
accuracy, leading to incorrect land cover maps and unreliable results for
decision-making. Understanding and addressing these sources of errors is
crucial to improving the accuracy of classification outputs in remote sensing,
leading to more reliable and actionable thematic maps. Use of appropriate
preprocessing techniques for correcting geometric and radiometric distortions,
atmospheric corrections, and shadow removal techniques can help reduce
errors related to data quality. Judicious selection of suitable classification
scheme and algorithm, choice of advanced classification techniques that better
handle complex data patterns and non-linear relationships among classes and
use of indices (e.g. NDVI), incorporation of contextual information such as
86
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
texture, etc. and techniques to enhance the separability of classes can
significantly improve classification outputs. Further, ensuring high-quality,
representative, and adequately balanced training data can improve the
robustness of the classifier. Application of post-classification processing
techniques such as smoothing filtering, or object-based post-processing
techniques can help reduce classification noise and improve the spatial
coherence of the output. Lastly, careful validation can greatly improve the
reliability of classification outputs in remote sensing applications.
After learning about the sources of errors in classification outputs of remote
sensing data, let us now understand the factors that affect accuracy
assessment.
16.4.2 Factors Affecting Accuracy Assessment
There are several factors that may affect assessment of accuracy. Some of the
key factors influencing accuracy assessment are given in Table 16.2.
Table 16.2: Factors affecting assessment of accuracy in remote sensing
data analysis.
The categories or classes defined for classification should be consistent with reference
Classification scheme to the data being used for mapping; inconsistent or incompatible schemes can cause
errors in accuracy assessment
Inaccurate, outdated, or imprecise ground truth data used for validation can affect
Quality of ground truth
data
validation accuracy e.g. inaccurate location data can lead to misleading accuracy
metrics and poor validation of classification results
The choice of method used and poor sampling design to collect reference data, (e.g.,
Sampling design random, systematic, stratified) can influence the representativeness of validation data,
bias results and under-represent classes
Inaccessibility to Inaccessibility to the sites being used for validation and over-sampling in easily
sampling location on accessible areas would lead to bias for those classes
ground
Characteristics of the The level of detail in terms of spatial and spectral resolutions of the remote sensing data
reference remote being used as reference data; can impact the ability to distinguish between different
sensing data being features; frequency of data acquisition and its relevance to the study period can affects
used for validation the relevance of data to the study period leading to incorrect reference data
Choice of suitable Maps generated through different classification method may require suitable accuracy
accuracy assessment assessment approach, e.g. hard and soft classification, per-pixel, object based, object
approach detection outputs may require separate approaches for accuracy assessment
Site specific Unique features of the study area that may affect data interpretation can lead to
characteristics misclassification, if not accounted for
Difference in the geographic coordinates in the data from which mapping has been
Positional accuracy done and on the ground can lead to wrong interpretation and affect quality of the data
being collected on ground
Differences in the time of data acquisition between the remote sensing data used for
Temporal mismatch classification and reference data being collected (e.g., seasonal variations) can lead to
inaccurate comparisons
These methods help in visualising how sampling designs are implemented and
the trade-offs between them. Each method has its unique strengths and
weaknesses, which influence the choice of method depending on the specific
goals and constraints of the remote sensing project.
16.4.4 Determining Sample Size
You have learnt earlier that sample size is an important consideration while
assessing the accuracy of remotely sensed data. Proportion of correctly
identified locations will represent the accuracy of the map. Determination of
sample size depends on number of map classes and level of detail/rigor
required. Collection of sample size requires time and resources. Therefore, it
must be kept to a minimum. Yet it is critical to maintain a large enough sample
size so that analysis performed is statistically valid. In remote sensing literature,
many researchers have published equations and guidelines for choosing the
appropriate sample size. Majority of them have used an equation based on the
binomial distribution or the normal approximation to the binomial probability
distribution to compute the required sample size. A generally accepted rule of
thumb is to use a minimum of 50 samples for each class in the error matrix but
also to adjust based on class variability and landscape complexity. This rule
also tends to agree with the results of computing sample size using the
multinomial distribution. Given an error matrix with n land cover classes, for a
given class there is 1 (one) correct answer and n–1 incorrect answers. Sufficient
samples must be acquired to be able to adequately represent this confusion.
Therefore, use of the multinomial probability distribution is recommended.
Determining the appropriate sample size for accuracy assessment in remote
sensing is crucial to ensure reliable and statistically valid results. The sample
size influences the accuracy, precision, and confidence of the assessment.
There are different methods (as given in Table 16.4) used to calculate sample
90
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
size based on statistical principles, data characteristics, and the purpose of the
study.
Table 16.4: Commonly used methods for determining sample size.
Method Description Suitability Advantage Limitation
A simple heuristic
often used to
for preliminary
determine sample size
estimates or when Easy to apply; quick May not be statistically
Rule of thumb based on general
detailed information is estimation rigorous; lacks precision
recommendations or
unavailable
past experience for
quick estimation
for stratified
Sample size is
populations to ensure
Proportional allocated to different Ensures representation Requires accurate
each group is
allocation strata or groups in proportional to class size knowledge of strata sizes
proportionally
proportion to their size
represented
Can over- or under-sample
Equal samples for for situations where
Simple to implement; easy certain classes as does
Equal allocation each class regardless strata are considered
to balance across strata not account for differences
of their size equally important
in stratum sizes
Divides the population for heterogeneous
Increases precision by
Stratified into distinct strata and populations where Requires detailed
accounting for population
random samples randomly each stratum must be information on strata
heterogeneity
within each stratum represented
Calculates sample Assumes normal
for estimating
size for a specified Provides precise sample distribution; complex for
Cochran’s proportions with
level of precision and size for given confidence large populations; requires
formula known or estimated
confidence and precision knowledge of population
proportions
considering variability size
Used for binary
outcomes (e.g.,
Limited to binary
presence/absence) to binary classifications Simple to apply;
Binomial outcomes; does not
estimate the sample with known or statistically robust sample
distribution handle multiclass
size needed for a assumed proportions size
scenarios
specific level of
accuracy
Extends binomial to Requires knowledge of
Handles multiple classes;
Multinomial multiple categories or suitable for problems proportions for all classes;
more flexible than binomial
method classes for multiclass with multiple classes more complex
classification
Estimates sample size
required to detect a for hypothesis testing
Accounts for effect size, Requires prior knowledge
specific effect size and experiments
Power analysis power, and significance of effect size and variance;
with a given level of requiring power
level; flexible complex calculations
statistical power and estimation
significance
Dynamically adjusts
Suitable for Efficient for Requires ongoing
the sample size based
Adaptive heterogeneous or heterogeneous monitoring and
on observed data
sampling spatially variable populations; can reduce adjustments; can be
patterns or conditions
populations costs complex
during the study
Determines sample Suitable when
size based on expert statistical methods Provides practical Highly subjective; may
Expert
opinion and are impractical or recommendations based lack consistency and
judgement
experience rather than when expert insights on experience. scientific rigor
statistical formulas are crucial
SAQ II
a) List the sources of errors in image classification.
94
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
The assessment of classification accuracy for hard and soft classification
outputs involves different approaches and considerations due to the nature of
each classification type. In case of hard classification focuses on discrete class
assignments and uses traditional accuracy metrics whereas in case of soft
classification, it accounts for partial memberships and uses extended metrics to
capture the nuances of class distributions. Important aspects of the accuracy
assessment of both hard and soft classification outputs are given in Table 16.5.
In this section, you have learnt about accuracy assessment approaches
applicable to various kinds of classification outputs and object detection results.
However, there are other types of data that do not fall under these categories. In
the next section, you will be introduced to approaches of accuracy assessment
for other kinds of data or outputs.
Let us spend 5 minutes to check your progress.
SAQ III
a) Differentiate between the accuracy assessment of hard and soft
classification outputs?
96
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
best fit. However, other metrics like MAE, standard deviation, and error
distribution are also important for a holistic assessment. MAE is measured as
the average absolute difference between the predicted values and the actual
values. You must note that accuracy of the reference data (i.e. GCPs, high-
precision DEMs) significantly impacts the reliability of the assessment.
Let us now learn about the accuracy assessment approaches used for
modelled outputs.
16.6.3 Modelled Outputs
Accuracy assessment of modeled outputs in remote sensing involves evaluating
how well the predicted outputs (e.g., land cover maps, change detection results,
environmental models) match the reference or ground truth data. This process
is crucial for validating model performance and ensuring that outputs are reliable
for decision-making. Depending on the nature of the modelled data, the type of
output, and the complexity of the task various types of methods are used for the
purpose. Each method has limitations, such as sensitivity to specific data
characteristics, computational demands, or subjective biases, which must be
considered during assessment. Combining multiple methods often provides a
more comprehensive evaluation, especially when dealing with complex or
heterogeneous data.
Error matrix-based statistics, Root Mean Square Error (RMSE), Coefficient of
Determination [R-squared (R²)], Bias and Variance, Mean Absolute Error (MAE),
Correlation coefficients (e.g., Pearson’s r, Spearman’s Rank, ρ), Cohen’s Kappa
Coefficient, F1 Score and Precision-Recall analysis, NDVI metrics, Cross-
validation, Spatial Autocorrelation metrics (e.g., Moran’s I), and visual
interpretation are some of the methods used for the purpose.
Error matrix-based statistics extends confusion matrix with omission and
commission errors and is useful for thematic maps and categorical data. Root
Mean Square Error (RMSE) measures average squared difference between
predicted and observed values and is useful for continuous outputs such as
elevation models, temperature predictions, and biophysical variables.
Coefficient of Determination [R-squared (R²)] indicates the proportion of
variance in the dependent variable predictable from the independent variables
and is good for assessing the fit of linear regression models. Bias and variance
measures the error due to bias (systematic error) and variance (sensitivity to
fluctuations in the training set) and is good for evaluating trade-offs in model
complexity. Mean Absolute Error (MAE) measures average magnitude of errors
between predicted and observed values, regardless of direction and is suitable
for continuous data to see error between predicted and observed values or
pollution levels. Correlation coefficients (e.g., Pearson’s r, Spearman’s Rank, ρ)
measures the strength and direction of the linear (or rank-based) relationship
between predicted and observed values and is useful for continuous model
outputs, such as biophysical variables. Cohen’s Kappa Coefficient measures the
agreement between predicted and observed classes, adjusting for the
agreement that occurs by chance and is good for classification outputs. F1-
Score and Precision-Recall analysis combines precision (positive predictive
value) and recall (sensitivity) into a single metric for class performance and is
suitable for binary classification outputs and imbalanced classes. NDVI metrics
97
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
uses indices (like NDVI) to compare model outputs to reference data, assessing
performance in vegetation mapping and is useful for assessing models in
agriculture, forestry, and environmental monitoring.
Cross-validation splits the dataset into training and validation subsets multiple
times to assess model performance and variability and is useful for machine
learning models to ensure robust accuracy estimates. Spatial Autocorrelation
metrics (e.g., Moran’s I) measures the degree of spatial clustering of errors in
the model outputs and is ideal for spatial models, particularly when assessing
spatial bias. Visual interpretation method involves expert analysis and visual
comparison of model outputs with reference data and is useful when
quantitative assessment is challenging, such as with complex or novel outputs.
You will learn more about various types of spatial models and uncertainty
analysis in the course MGY-006.
Let us now learn about the approaches employed for accuracy assessment of
vector data.
16.5.5 Vector Data
Some of the data may not be raster data in that case some other approaches
are used. Let us now learn about the approaches of accuracy assessment used
for vector data.
Accuracy assessment of vector data involves evaluating positional, class and
temporal attributes, topological accuracies, etc. Depending on the nature of the
data and its characteristics and also the requirements of accuracy assessment
task various types of methods are used for the purpose. Commonly used
methods are presented in the Table 16.6.
Table 16.6: Comparison of the commonly used methods employed for
accuracy assessment of vector data outputs.
Positional Evaluates the accuracy of the geographic coordinates of Root Mean Square Error
accuracy vector data, e.g. road networks, use of high-precision (RMSE), Mean Absolute
assessment GPS and correction techniques is recommended Error (MAE)
98
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
These methods help in evaluating the accuracy of vector data by providing
various metrics and approaches to identify strengths and weaknesses.
99
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
Incorporating ancillary data: supplementing remote sensing images with
additional data sources, such as digital elevation models (DEMs), soil maps, or
land use data, temporal data, contextual information, etc. can improve
classification as they provide additional context that pure spectral information
may not capture, particularly in complex landscapes.
Data augmentation and synthesis: Data augmentation techniques artificially
increase the diversity of training data by creating variations thereby helping
improve robustness of classification models. It reduces overfitting by exposing
the model to a broader range of conditions and are particularly useful when
training data is limited.
Ensemble methods: combining multiple classifiers or models (ensemble
learning) can boost classification accuracy by leveraging the strengths of each
model. While Bagging (Bootstrap Aggregating) aggregates multiple models
trained on different subsets of the data to reduce variance and improve stability,
Boosting sequentially trains models to correct the errors of previous models
thereby enhancing accuracy. And, stacking combines outputs from different
models using a meta-classifier for final prediction. This approach increases
overall model robustness and reduces the impact of individual model
weaknesses.
Post-processing techniques: these refine the classified outputs to reduce
noise and improve spatial coherence. While smoothing filters reduces
classification noise and improve the visual quality of the map majority filters
corrects isolated misclassified pixels by replacing them with the majority class of
neighboring pixels. These techniques enhance the visual consistency and
accuracy of the final classified map and reduce salt-and-pepper noise often
seen in pixel-based classifications. And as you have learnt in subsection 16.4.2
with Fig. 16.1, contextual editing incorporating spatial rules (e.g., adjacency,
size constraints, location, association, etc.) can be used to refine and improve
classification results.
Active learning and interactive classification: it involves iteratively selecting
the most informative samples for labelling, which improves classification
accuracy by focusing on uncertain or misclassified areas. It involves expert input
to refine classification decisions interactively and focuses on additional sampling
efforts on areas with high classification uncertainty. It improves accuracy with
targeted data collection and also helps to fine-tune models based on expert
feedback.
Cross-validation and hyperparameter tuning: while cross-validation divides
the data into multiple subsets to ensure the model is evaluated on different
parts, and hyperparameter tuning optimises the model’s settings. For example,
K-fold cross-validation provides a more reliable accuracy estimate by using
multiple subsets of the data for training and validation and method such as grid
search/random search systematically explores different combinations of
hyperparameters to find the best model configuration. Such methods reduce the
risk of overfitting and improve model generalisation and ensures the selected
model parameters are optimal for classification.
100
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
Improving classification accuracy in remote sensing involves a combination of
data pre-processing, advanced algorithms, incorporation of ancillary data,
ensemble approaches, and careful tuning of model parameters. Applying these
methods helps to create more reliable, accurate classifications that better
represent real-world conditions.
16.7.3 Recent Trends and Developments
We have discussed about various approaches to accuracy assessment for
variety of raster and vector data in the previous two sections. We have also
learnt about various challenges and ways to improve accuracy in the previous
two subsections. Let us now learn about the recent trends and developments
taking place in this field.
There are several developments taking place in the field. These trends aimed at
enhancing the precision, reliability, and applicability of remote sensing
classifications reflect advancements in terms of data availability and
methodology. Morales-Barquero et al (2019) have analysed how accuracy
assessment practices have evolved over the past two decades and provide
valuable insights into the current state and future directions of accuracy
assessment in remote sensing, and emphasise on the need for more rigorous
and standardised practices.
Some of the recent trends and developments in the field are mentioned here:
Integration of machine learning and deep learning to automate the process,
to identify and analyse classification errors more effectively.
Incorporation of multi-source and multi-resolution data to improve
classification accuracy and assessment, and also for cross-sensor validation.
Enhanced spatial and temporal accuracy assessment
Crowdsourcing and citizen science to gather reference data and validate
remote sensing classification outputs.
Development of new metrics and methodologies such as development of
fuzzy kappa coefficient and other fuzzy metrics that account for the
uncertainty in soft classifications, new methods such as spatial kappa and
object-based accuracy measures for evaluating spatial accuracy and
coherence of classifications, and even combining multiple metrics to provide
a more comprehensive assessment
Integration of contextual and auxiliary information including demographic or
infrastructure information, to validate and improve classifications (e.g. using
demographic data to validate classifications of social or economic features).
Standardisation and benchmarking: to ensure consistency and comparability,
and creating benchmark datasets for evaluating and comparing classification
algorithms and accuracy assessment methods.
Google Earth Engine has emerged as a platform for thematic information
extraction including image classification. Accuracy assessment of the classified
maps generated through this are also required to be evaluated. Recent trends in
accuracy assessment for remote sensing are driven by advancements in
technology, the integration of diverse data sources, and the development of new
methodologies. These trends aim to improve the precision, reliability, and
101
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
applicability of remote sensing classifications, providing more accurate and
actionable insights for a range of applications.
Let us spend 5 minutes to check your progress.
SAQ IV
a) List the approaches used for accuracy assessment of digital elevation
models.
16.8 SUMMARY
Let us now summarise what you have studied in this unit:
Accuracy assessment is an important step in remote sensing after
preparation of thematic maps. It determines the suitability of the maps for
further decision making and planning.
There are several metrics used for assessing accuracy. The most common
are user’s, producer’s, overall and kappa coefficient. Besides, precision,
recall, F-1 score, r, R2, mAP, MAE, etc. are some other metrics.
Assessing accuracy for each class as well as for the whole image is essential
to compare the results of various classification techniques and quality and
reliability of the results obtained.
Error/confusion matrix can be used for accuracy and reliability assessments.
Overall accuracy is a measure of accuracy for the whole image across all
categories. Kappa coefficient is another method for accuracy assessment
having a number of advantages over other methods.
Accuracy assessment has evolved from initial visual comparison to now
incorporation of machine learning approaches.
Sampling size and pattern are important consideration for accuracy
assessment and sufficient number of samples should be taken for the same.
The approaches for accuracy assessment vary greatly depending upon the
types of image classification methods employed, types of outputs, types of
data, etc.
There are several challenges to accuracy assessment and suitable measures
can be employed to improve accuracy.
102
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
3. Discuss the approaches of accuracy assessment for various types of image
classification outputs.
4. Describe the approach of accuracy assessment of change detection outputs.
16.10 REFERENCES
Abdullah, Q. A. (2023) Photogrammetric Engineering & Remote Sensing
Vol. 89, No. 10, October 2023, pp. 581-588.
https://www.asprs.org/wpcontent/uploads/2024/03/October2023_HLA-
Positional_Accuracy_Standards.pdf
ASPRS (2014) New ASPRS positional accuracy standards for digital
geospatial data (2014) by Smith, D. L., Abdullah, Q. A., Maune, D. &
Heidemann, K. H.
ASPRS (2023) ASPRS Positional Accuracy Standards for Digital Geospatial
Data, 2nd Ed., Version 1.0, August 23, 2023.
https://publicdocuments.asprs.org/PositionalAccuracyStd-Ed2-V1
ASPRS (2024) ASPRS Positional Accuracy Standards for Digital Geospatial
Data, 2nd Ed., Version 2. https://www.asprs.org/revisions-to-the-asprs-
positional-accuracy-standards-for-geospatial-data-2023
Chavez Jr, P. S. (1988) An improved dark-object subtraction technique for
atmospheric scattering correction of multispectral data. Remote sensing of
environment, 24(3), 459-479.
Chavez, P. S. (1996) Image-based atmospheric corrections-revisited and
improved. Photogrammetric engineering and remote sensing, 62(9), 1025-
1035.
Cohen, J. (1960) A coefficient of agreement for nominal scales. Educational
and Psychological Measurement, 20, pp. 37-46.
Congalton, R. G. & Green, K. (2019) Assessing the Accuracy of Remotely
Sensed Data: Principles and Practices, 3rd Edition, CRC Press.
Congalton, R. G. (1991) A review of assessing the accuracy of
classifications of remotely sensed data, Remote Sensing of Environment,
Vol 37, pp. 35-46.
Congalton, R. G., & Green, K. (2008) Assessing the Accuracy of Remotely
Sensed Data: Principles and Practices. CRC Press.
Deshmukh, B., Bahuguna, A., Nayak, S., Dhargalkar, V. K. & Jagtap, T. G.
(2005) Eco-geomorphological zonation of the Bangaram reef,
Lakshadweep. Journal of the Indian Society of Remote Sensing, 33, 99-106.
Foody, G. M. (2002). Status of land cover classification accuracy
assessment. Remote sensing of environment, 80(1), 185-201.
Foody, G. M. (2020) Explaining the unsuitability of the kappa coefficient in
the assessment and comparison of the accuracy of thematic maps obtained
by image classification. Remote sensing of environment, 239, 111630.
Foody, G. M. & Cox, D. P. (1994) Sub-pixel land cover composition
estimation using a linear mixture model and fuzzy membership functions.
Remote sensing, 15(3), 619-631.
Gu, J., Congalton, R. G. & Pan, Y. (2015) The impact of positional errors on
soft classification accuracy assessment: A simulation analysis. Remote
Sensing, 7(1), 579-599.
103
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….……………………………………….
https://deepchecks.com/how-to-check-the-accuracy-of-your-machine-
learning-model/
https://medium.com/@prathameshamrutkar3/the-complete-guide-to-object-
detection-evaluation-metrics-from-iou-to-map-and-more-1a23c0ea3c9d
https://pressbooks.lib.vt.edu/remotesensing/chapter/chapter-25-accuracy-
assessment/
https://pro.arcgis.com/en/pro-app/latest/tool-reference/image-analyst/how-
compute-accuracy-for-object-detection-works.htm
https://storymaps.arcgis.com/stories/fddf80b8900f4168addbfefec336f4bd
https://www.ridgerun.ai/post/mean-average-precision-map-and-other-object-
detection-metrics
Landis, J. R. & Koch, G. G. (1977) The measurement of observer
agreement for categorical data. Biometrics, 33, 159-174.
Liu, J. & Mason, P. J. (2009) Essential Image Processing and GIS for
Remote Sensing. John Wiley & Sons.
Lowell, K. & Jaton, A. (1999) Spatial Accuracy Assessment: Land
Information Uncertainty in Natural Resources, Ann Arbor Press, Michigan.
Lu, D. & Weng, Q. (2007) A survey of image classification methods and
techniques for improving classification performance. International journal of
Remote sensing, 28(5), 823-870.
Lunetta, R. S. & Lyon, J. G. (2004) Remote sensing and GIS accuracy
assessment. CRC press.
Lunetta, R., Congalton, R., Fenstermaker, L., Jensen, J., Mcgwire, K. &
Tinney, L. R. (1991) Remote sensing and geographic information system
data integration- Error sources and research issues. Photogrammetric
engineering and remote sensing, 57(6), 677-687.
Mather, P. M. & Tso, B. (2009) Classification Methods for Remotely Sensed
Data. CRC Press.
Morales-Barquero, L., Lyons, M. B., Phinn, S. R. & Roelfsema, C. M. (2019)
Trends in remote sensing accuracy assessment approaches in the context
of natural resources. Remote sensing, 11(19), 2305.
Mustak, S. (2013) Correction of atmospheric haze in Resourcesat-1 Liss-4
Mx Data for urban analysis: an improved dark object subtraction approach.
The International Archives of the Photogrammetry, Remote Sensing and
Spatial Information Sciences, 40, 283-287.
Schott, J. R., Salvaggio, C. & Volchok, W. J. (1988) Radiometric scene
normalization using pseudoinvariant features. Remote sensing of
Environment, 26(1), 1-16.
Stehman, S. V. & Czaplewski, R. L. (1998) Design and analysis for thematic
map accuracy assessment: fundamental principles. Remote sensing of
environment, 64(3), 331-344.
Story, M. & Congalton, R. G. (1986) Accuracy assessment: a user’s
perspective. Photogrammetric Engineering and remote sensing, 52(3), pp.
397-399.
16.11 FURTHER/SUGGESTED READINGS
Chapter 25: Accuracy Assessment – Remote Sensing with ArcGIS Pro
(second edition) (vt.edu)
104
Contributor: Prof. Benidhar Deshmukh
Unit 16 Accuracy Assessment of Thematic Maps
…………………………….……………………………………...………………………………………..
https://pressbooks.lib.vt.edu/remotesensing/chapter/chapter-25-accuracy-
assessment/#sdfootnote1anc
Foody, G. M. (2020) Explaining the unsuitability of the kappa coefficient in
the assessment and comparison of the accuracy of thematic maps obtained
by image classification. Remote sensing of environment, 239, 111630.
Hand, D. J. (2012) Assessing the performance of classification methods,
International Statistical Review, 80 (3), 400-414.
https://doc.arcgis.com/en/imagery/workflows/resources/accuracy-
assessment-of-orthomosaics.htm
https://openmrv.org/web/guest/w/modules/mrv/modules_3/sampling-design-
for-estimation-of-area-and-map-accuracy
https://pro.arcgis.com/en/pro-app/latest/tool-reference/spatial-
analyst/generalization-of-classified-raster-imagery.htm
Janssen, L. L. F. and van der Wel, F. J. M. (1994) Accuracy assessment of
satellite derived land-cover data: a review, Photogrammetric Engineering and
Remote Sensing, Vol 60, pp 419-426.
Nicolau, A.P., Dyson, K., Saah, D., & Clinton, N. (2024). Accuracy
Assessment: Quantifying Classification Quality. In: Cardille, J.A., Crowley,
M.A., Saah, D., Clinton, N.E. (Eds) Cloud-Based Remote Sensing with
Google Earth Engine. Springer, Cham. https://doi.org/10.1007/978-3-031-
26588-4_7
Olofsson, P., Foody, G. M., Herold, M., Stehman, S. V., Woodcock, C. E., &
Wulder, M. A. (2014) Good practices for estimating area and assessing
accuracy of land change. Remote sensing of Environment, 148, 42-57.
https://doi.org/10.1016/j.rse.2014.02.015
Stehman, S. V. & Foody, G. M. (2019) Key issues in rigorous accuracy
assessment of land cover products. Remote Sens Environ
231:111199. https://doi.org/10.1016/j.rse.2019.05.018
Stehman, S. V. (2009) Sampling designs for accuracy assessment of land
cover. International Journal of Remote Sensing, 30(20), 5243–5272.
https://doi.org/10.1080/01431160903131000
16.12 ANSWERS
SAQ I
a) There are two types: qualitative and quantitative accuracy. Define the two
and also the site-specific and non-site specific accuracy as mentioned in
subsection 16.2.2.
b) There have been several developments in the standardisation and
methodological refinement era. Include the milestones in your answer as
discussed in section 16.3.
c) Major steps in accuracy assessment are discussed in subsection 16.2.4.
Refer to it to write your answer.
SAQ II
a) There are several sources of errors in image classification as listed in a
table under subsection 16.4.1. Refer to it to prepare your answer.
b) Discuss the point as given under subsection 16.4.1.
105
Contributor: Prof. Benidhar Deshmukh
Block 4 Image Classification and Change Detection Techniques
…………………………………………………………………….………………………………………
c) There are several methods for determining sample size in the context of
accuracy assessment as discussed under subsection 16.4.4. Refer to it
to write your answer.
SAQ III
a) Several differences are discussed in subsection 16.5.2. Refer to it to
write your answer.
b) Several approaches are discussed in subsection 16.5.1. Refer to it to
write your answer for accuracy assessment of object based classification
outputs.
c) There is a basic difference in how object detection outputs are assessed
for their accuracy. Refer to subsection 16.5.1 for preparing your answer.
SAQ IV
a) Some approaches used for accuracy assessment of digital elevation
models are mentioned under subsection 16.6.2. Prepare your answer
from it.
b) Please refer to subsection 16.6.3.
c) Please refer to subsection 16.6.4.
d) Please refer to subsection 16.7.3.
Terminal Questions
1. Please refer to subsection 16.2.3.
2. Please refer to section 16.3.
3. Please refer to section 16.5.
4. Please refer to subsection 16.6.1.
5. Please refer to subsection 16.7.2.
107
It is a type of clustering algorithm widely used for soft
classification. It allows each data point to belong to
multiple clusters with varying degrees of membership.
This approach is particularly useful where land cover
Fuzzy C-Means types often have mixed characteristics, making it difficult
:
(FCM) clustering to classify them into discrete categories. It assigns a
membership value to each data point for all clusters.
These membership values range between 0 and 1, with
the sum of membership values for each data point equal
to 1.
Hard / crisp An image classification process in which each pixel or
: object is assigned to a single, discrete class.
classification
It is a commonly used technique for unsupervised
classification of remote sensing images. It groups pixels
into clusters based on their attributes (i.e. spectral
Hierarchical
: similarity) without prior knowledge of the class labels.
clustering
Hierarchical clustering organises data into a tree-like
structure called a dendrogram, which represents the
nested grouping of pixels or objects and their similarities.
It is a process through which pixels in the image is
grouped into various informational classes/ objects based
Image classification : on their spectral signatures or reflectance properties.
There are broadly two approaches of image classification
i.e. unsupervised and supervised.
ISODATA (Iterative Self-Organising Data Analysis
Technique) algorithm is an unsupervised clustering
method designed to partition a dataset into a predefined
number of clusters based on the data's inherent
ISODATA clustering : characteristics. Developed by Stuart Lloyd in 1982,
ISODATA draws inspiration from the K-means clustering
algorithm but introduces adaptive mechanisms to handle
varying cluster shapes and sizes during the clustering
process.
108
accuracy (random chance). In this off-diagonal elements
of an error matrix are incorporated as a product of the row
and column marginal totals.
Mean Average
: mAP is the average of AP of each class.
Precision (mAP)
Mixel pixel : In a remote sensing image, in transition zones or at the
edges of different land cover classes you see a pixel
covering more than one land cover class because spatial
resolution of the image i.e. the area covered by a pixel is
larger than the individual objects on the ground. Such
pixels are called mixed pixels and its signature is a
combination of the land cover classes covered in it. Mixed
pixels pose challenges in the analysis of remote sensing
data because its spectral signature does not correspond
to any single land cover type hence resulting in
misclassification.
ROC curve : ROC curve is created by plotting the true positive rate
110
(TPR) against the false positive rate (FPR) at various
threshold settings.
(Where, TPR is the proportion of positive data points
correctly considered as positive, with respect to all
positive data points; and FPR is the proportion of negative
data points that are mistakenly considered as positive,
with respect to all negative data points).
112