123
123
2. Explain:
i. Coordinate System
A reference framework for locating geographic features. It defines how coordinates are assigned to points.
OR
A coordinate system in GIS is a reference framework used to define the location of geographic features on the
Earth's surface. It consists of mathematical rules that specify how coordinates are assigned to points.
There are two main types of coordinate systems used in GIS:
1. Geographic Coordinate System (GCS)
2. Projected Coordinate System (PCS)
✔ Globally consistent and widely used. ✔ Essential for GPS and global datasets.
Disadvantages of GCS:
OR
Definition of GIS
A Geographic Information System (GIS) is a computer-based system that provides tools for collecting,
storing, analyzing, managing, and displaying spatial (geographically referenced) data.
Components of GIS
A working GIS consists of five major components:
1. Hardware
Hardware includes all physical devices and infrastructure needed to run GIS software and process data.
a) Computers & Servers
• Workstations/PCs: Used for analysis, visualization, and map creation.
• Servers: Store large spatial databases; allow remote access and collaboration.
b) Input Devices
• Scanners: Convert paper maps into digital formats.
• Digitizers: Transform analog map data into vector formats.
• GPS Devices: Collect real-time location data.
• Drones/LiDAR: Capture high-resolution geospatial data.
c) Output Devices
• Monitors: Display interactive maps and data.
• Printers/Plotters: Produce hardcopy maps for reporting.
2. Software
GIS software is the engine of the system—it provides the tools to analyze, manage, and visualize geospatial
data.
a) Types of GIS Software
• Desktop GIS: E.g., ArcGIS Pro, QGIS.
• Web GIS: E.g., ArcGIS Online, Google Maps.
• Enterprise GIS: Organization-wide GIS platforms with cloud and database integration.
b) Core Functions
• Data Input & Editing: Import, digitize, clean, and update data.
• Spatial Analysis: Buffering, overlay, interpolation, etc.
• Visualization: Map creation, 3D rendering.
• Database Management: Store and query spatial and attribute data.
c) Examples
• Proprietary: ArcGIS, MapInfo, AutoCAD Map 3D.
• Open Source: QGIS, GRASS GIS, GeoServer.
3. Data
Data is the heart of GIS. It includes spatial (location-based) and attribute (descriptive) information.
a) Spatial Data
• Vector: Points (e.g., trees), Lines (e.g., roads), Polygons (e.g., lakes).
• Raster: Grid-based, e.g., satellite images, elevation models.
• TIN/GRID: Used in 3D and terrain modeling.
b) Attribute Data
• Describes features (e.g., road name, population).
• Stored in tables linked to spatial features via IDs.
c) Data Sources
• Remote sensing
• GPS surveys
• Government records (e.g., census, land records)
• Field surveys
4. People
People are essential for operating, analyzing, and interpreting GIS data effectively.
Roles in GIS:
• GIS Analysts: Perform analysis and produce maps.
• Cartographers: Design high-quality maps.
• Developers: Build GIS applications and plugins.
• Surveyors: Collect geospatial data.
• Policy Makers/Managers: Use GIS for informed decision-making.
Importance: Well-trained professionals ensure data accuracy and meaningful analysis.
5. Methods
Methods refer to the workflows, models, and standards used in GIS for consistent and accurate results.
GIS Workflow Includes:
1. Data Collection
2. Data Processing
3. Spatial Analysis
4. Visualization & Reporting
5. Data Sharing & Management
Best Practices:
• Follow OGC standards for interoperability.
• Maintain metadata and quality assurance.
• Use standardized models for repeatable analysis.
Key Differences
License Cost Requires paid license Free to use, modify, and distribute
Customization Limited flexibility; only vendor can modify Highly customizable for specific needs
Training and Comprehensive and official documentation, May require self-learning or third-party
Documentation training available tutorials
• ✖ Expensive licensing
• ✖ Limited customization
Open Source Software
• ✔ No licensing cost
OR
In Geographic Information Systems (GIS), data is the core component that enables spatial analysis and
decision-making. GIS primarily works with two major types of data:
1. Spatial Data
Spatial data (also called geospatial data or geographic data) represents the location, shape, and geometry of
geographic features on the Earth's surface.
Types of Spatial Data Representations:
a) Vector Data
Represents discrete features using geometric shapes:
Advantages:
• High precision
• Efficient for storing attribute data
• Suitable for topological and network analysis
Disadvantages:
• Complex overlay analysis
• Higher processing needs for large datasets
b) Raster Data
Represents continuous features as a grid of cells (pixels), where each cell has a specific value (e.g., elevation or
temperature).
Examples:
• Satellite images (e.g., Landsat, Sentinel)
• Digital Elevation Models (DEM)
• Aerial photographs
Advantages:
• Best for continuous data
• Simple grid structure
• Effective in remote sensing and image processing
Disadvantages:
• Large file sizes
• Less precise for boundaries
• Requires resampling for resolution changes
c) TIN (Triangulated Irregular Network)
• Represents 3D surfaces using interconnected triangles.
• Commonly used in terrain modeling and elevation representation.
d) GRID Data Model
• Uses a regular matrix of square cells like raster data.
• Common for representing land cover, elevation, or climate data.
2. Attribute Data
Attribute data provides descriptive information about spatial features. It answers the question: “What is it?”
Characteristics:
• Stored in tables linked to spatial data via unique identifiers (ID).
• Each row = one feature; each column = one attribute.
Examples:
Summary Table:
Spatial Data Location & Shape Vector, Raster, TIN, GRID Roads, rivers, satellite images
Attribute Data Descriptive details Tables (RDBMS) City names, population, land use
OR
In GIS, spatial data and attribute data are two fundamental data types that work together to represent and
describe geographic features.
1. Spatial Data
• Definition: Represents the location, shape, and geometry of geographic features on Earth’s surface.
• Purpose: Answers the question "Where is it?"
• Representation:
o Vector: Points (e.g., wells), Lines (e.g., roads), Polygons (e.g., lakes)
o Raster: Gridded data (e.g., satellite imagery, elevation)
• Examples:
o A river shown as a line
o A building outlined as a polygon
o A GPS point representing a tree location
2. Attribute Data
• Definition: Provides descriptive information about spatial features.
• Purpose: Answers the question "What is it?"
• Format: Stored in tables (Relational Database), where:
o Each row = one spatial feature
o Each column = one attribute (e.g., name, type, value)
• Examples:
o A river (spatial) has name, length, flow rate (attributes)
o A building (spatial) has owner name, height, usage type (attributes)
Comparison Table
Definition Data about geographic location and shape Data that describes spatial features
Format Vector (Point, Line, Polygon), Raster Tables (Rows = Features, Columns = Attributes)
Examples Roads, rivers, land parcels Road name, width, type; Land use, population
Use in GIS Mapping, visualization, spatial analysis Queries, thematic mapping, reports
OR
In GIS, attribute data refers to the descriptive information linked to spatial features. These attributes help
explain what the geographic features represent (e.g., name of a city, population, land use type).
Attribute
Scale Description Example
Type
1. Nominal Labels or names with no order or Land use type (Residential, Commercial),
Categorical
Data numeric meaning Soil type (Clay, Sand)
2. Ordinal Ordered data, but without equal Road hierarchy (Highway > Main Road >
Ranked
Data spacing between values Street), Risk level (High, Medium, Low)
ID City Name Type (Nominal) Risk Level (Ordinal) Temperature (Interval) Population (Ratio)
Summary Table
Importance:
• Guides correct statistical and GIS operations
• Impacts how data is visualized (e.g., choropleth maps vs proportional symbols)
Module 2
1. Differentiate Between Relational Database and Object-Oriented Database (or Explain Database
Structures)
Data Representation Tables (rows & columns) Objects (with attributes and methods)
Ideal for Structured data, transactions, SQL queries Complex, evolving data with relationships
Relational DBs excel in structured environments. Object-oriented DBs are better suited for systems aligned
with object-oriented programming where data structure evolves.
OR
What is a Database Structure?
A Database Structure is the organized framework that defines how data is stored, accessed, and managed
within a Database Management System (DBMS). It includes:
• Data storage formats (e.g., tables, objects)
• Data types and constraints
• Relationships and rules between data elements
The two widely used database structures in GIS and other applications are:
• Relational Database
• Object-Oriented Database
1. Relational Database (RDBMS)
Definition:
A Relational Database is based on the relational model introduced by E.F. Codd. Data is stored in tables (also
called relations), where each table consists of:
• Rows: Represent records (tuples)
• Columns: Represent attributes (fields)
Key Features:
• Uses SQL (Structured Query Language) for data querying and manipulation.
• Primary keys uniquely identify records.
• Foreign keys establish relationships between tables.
Examples:
• MySQL, PostgreSQL, Oracle, SQL Server
Definition:
An Object-Oriented Database stores data as objects, similar to those used in object-oriented programming.
Each object includes:
• Attributes (data) and
• Methods (functions) that operate on the data
Key Features:
• Aligns well with object-oriented languages (e.g., Java, C++)
• Supports inheritance, encapsulation, and polymorphism
• Uses Object Query Languages (OQL) instead of SQL
Examples:
• ObjectDB, db4o, Versant
Data Structure Tables (rows & columns) Objects (with attributes & methods)
Relationships Handled via keys (PK, FK) Handled via object references
Ideal Use Case Structured data, transactions Complex, dynamic data relationships
Performance Optimized for large, structured data Better for multimedia & scientific data
OR
An entity is a real-world object or concept that can be uniquely identified and has data stored about it in a
database.
Entity: A distinguishable thing, object, or concept in the real world that can be represented in a database.
Entities can be:
• Tangible: Like buildings, roads, land parcels
• Intangible: Like services, ownerships, transactions
Each entity has:
• Attributes: Descriptive information (e.g., Name, ID, Location)
• Unique Identifier (Primary Key): To distinguish each instance
Components of ER Diagram:
Cardinality (Implied)
3. Explain Types of Data Models or Explain Conceptual, Logical, and Physical Models
How the system will be Data architects, Includes detailed attributes, data types,
Logical
structured analysts relationships
OR
In GIS and database design, data models help in structuring how data is conceptualized, logically organized,
and physically stored. These models exist in three main levels:
1. Conceptual Data Model
Definition:
A Conceptual Data Model is the high-level design of the database. It focuses on identifying the entities, their
attributes, and the relationships between them without worrying about how they will be implemented in a
database system.
Key Features:
• Focuses on what data is required.
• Independent of software or hardware.
• Simple and understandable by both technical and non-technical stakeholders.
• Commonly represented using Entity-Relationship (ER) diagrams.
Example in GIS:
Entities like Land Parcel, Building, and Occupant are identified, along with their relationships:
• A Land Parcel contains Buildings.
• A Building has an Occupant.
Diagram Sample (from the notes):
• Entities: Rectangles (e.g., Land Parcel, Building)
• Attributes: Ovals (e.g., Parcel ID, Building Type)
• Relationships: Diamonds/lines (e.g., contains, has)
2. Logical Data Model
Definition:
The Logical Data Model defines how the data will be organized and managed, but still independent of specific
database systems. It describes:
• Tables
• Fields (columns)
• Data types
• Relationships (one-to-many, many-to-many)
Key Features:
• Defines keys (Primary Key, Foreign Key)
• Logical structure suitable for a Relational Database Management System (RDBMS)
• Maintains normalization to reduce redundancy
• More detailed than conceptual, still not implementation-specific
Example in GIS:
• Tables: Cities, Roads, Rivers
• Fields: City_ID (PK), City_Name, Population
• Relationships: A foreign key like City_ID in a "Parks" table linking to the "Cities" table
3. Physical Data Model
Definition:
The Physical Data Model is the lowest level of data abstraction. It shows how the database will be
implemented in a specific system (like PostgreSQL/PostGIS or Oracle Spatial).
Key Features:
• Includes data storage methods, indexing, partitioning
• Specifies actual data types (e.g., VARCHAR(50), INT)
• Includes performance optimization techniques like indexing
• Deals with storage structure, access paths, and security
Example in GIS:
• Spatial indexing using R-Trees
• Use of formats like GeoTIFF or Shapefile
• Implementation of spatial data types (POINT, LINESTRING, POLYGON)
Summary Table
Conceptual High-level data structure Entities, attributes, relationships Land parcels contain buildings
Use in GIS
These models together guide the design of GIS databases:
• Ensure efficient storage and retrieval of spatial and attribute data
• Help in maintaining data integrity
• Support spatial relationships and real-world mapping
OR
Spatial data models describe how geographic features are represented and stored in a GIS. The two primary
types of spatial data models are:
1. Raster Data Model
Definition:
The Raster Data Model represents spatial data as a matrix (grid) of cells or pixels. Each cell holds a single
value that represents information such as elevation, temperature, or land cover.
Structure:
• Data is stored in rows and columns of equal-sized cells.
• Each cell has a value representing a specific attribute (e.g., vegetation type, elevation).
• Suited for continuous data like satellite images and terrain models.
Types of Raster Data:
• Satellite Imagery
• Digital Elevation Models (DEM)
• Digital Orthophotos
• Scanned maps and images (JPEG, TIFF, etc.)
Advantages:
• Simple data structure.
• Suitable for continuous phenomena.
• Faster for mathematical operations and overlays.
Disadvantages:
• Large file sizes.
• Lower spatial accuracy (depends on resolution).
• Poor representation of discrete features and topology.
Applications:
• Land use classification
• Environmental modeling
• Terrain and elevation analysis
2. Vector Data Model
Definition:
The Vector Data Model represents spatial data using points, lines, and polygons, defined by precise
coordinates.
Structure:
• Point: Represents a single location (e.g., tree, well).
• Line: Represents linear features (e.g., road, river).
• Polygon: Represents area features (e.g., lake, land parcel).
Types of Vector Models:
• Spaghetti Model: Simple model where features are stored independently.
• Topological Model: Maintains relationships between features (adjacency, connectivity).
Advantages:
• High spatial accuracy.
• Efficient storage for discrete features.
• Topology support (helps in spatial queries and network analysis).
Disadvantages:
• More complex data structure.
• Slower for raster-style analysis (like overlays).
• Not ideal for continuous surfaces.
Applications:
• Cadastral mapping
• Transportation and utility networks
• Administrative boundaries
Conclusion
• Raster Model is best for continuous data analysis (e.g., elevation, temperature).
• Vector Model is better for representing precise locations and boundaries (e.g., roads, land parcels).
Both models are fundamental in GIS and are often used together in hybrid GIS systems to take advantage of
their respective strengths.
OR
What is a Raster Data Structure?
Raster data structure refers to the way raster data (gridded or pixel-based spatial data) is stored and organized
in a computer system. Since raster data is composed of a matrix of cells, how this matrix is read, written,
compressed, or accessed significantly affects performance, storage efficiency, and processing.
Basic Concept:
• A raster is a grid of rows and columns.
• Each cell (pixel) has a value representing a geographic attribute (e.g., elevation, land cover).
• Data is stored row-wise or column-wise in files.
BIL All band values per line Image display and line-based analysis
Block Code Square blocks + values Thematic maps with repeated patterns
Quadtree Recursive division into quadrants Large, complex datasets with redundancy
Conclusion:
Raster data structures are essential for efficient storage, retrieval, and analysis in GIS applications. The
choice of structure depends on:
• Data type (satellite image, DEM, thematic layer),
• Storage availability,
• Processing speed,
• Intended GIS operations.
6. Explain Raster Data Compression
• Why Needed: Raw raster is large; compression reduces storage.
• Techniques:
o Run-Length Encoding (RLE): Store values with run count.
o Quadtree: Divide into homogenous quadrants.
o Huffman Coding: Frequency-based binary encoding.
o LZ77 / LZW: Based on data repetition (used in TIFF, PNG).
o JPEG Compression: Lossy, commonly for imagery.
OR
What is Raster Data Compression?
Raster data compression refers to techniques used to reduce the file size of raster (grid-based) datasets by
storing pixel values more efficiently. Since raster datasets (e.g., satellite images, elevation maps) can be very
large, compression helps:
• Save storage space
• Speed up data transmission
• Improve processing performance
Compression methods can be lossless (no data lost) or lossy (some data discarded to reduce size).
Principle:
• Stores repeating values in a row as a single value plus the number of times it repeats.
Example:
Instead of storing:
000011100
You store:
(0,4), (1,3), (0,2)
Advantages:
• Very effective when large blocks of the same value exist.
• Easy to implement and decode.
Disadvantages:
• Not efficient if data has high variability or noise.
2. Raster Chain Coding
Principle:
• Used to store boundaries of features by tracing their edges using directional steps (N, S, E, W).
• Only the outline of a region is stored using a chain of movements.
Example:
From a starting point (x, y), the direction might be stored like:
E3, S4, W1, N4
Advantages:
• Efficient for boundary representation.
Disadvantages:
• Hard to modify or edit.
• Redundant for adjacent polygons (shared boundaries stored multiple times).
3. Block Coding (Two-Dimensional Run-Length Coding)
Principle:
• Stores blocks of grid cells (usually square-shaped) with the same value.
• Each block is stored with:
o Starting location
o Size (e.g., 2×2)
o Value
Example:
A 3×3 block of value "1" is stored as:
(Start: X5,Y5, Size: 3x3, Value: 1)
Advantages:
• Reduces data size more than RLE for large homogeneous regions.
• Good for thematic raster layers.
4. Quadtrees
Principle:
• Recursively divides the raster into quadrants.
• Only subdivides further if a quadrant has mixed values.
• Homogeneous blocks are stored as-is.
Advantages:
• Efficient for large and complex raster maps.
• Stores data at variable resolution (detail where needed).
• Compact for maps with large uniform areas.
Disadvantages:
• More complex to implement.
• Not suitable for highly variable data.
Stores square blocks of same Thematic maps with repeating Less effective for irregular
Block Coding
value regions shapes
Conclusion
Raster data compression helps in:
• Reducing storage requirements
• Improving processing efficiency
• Managing large spatial datasets
The best compression technique depends on the type of raster data, spatial patterns, and intended GIS
application.
OR
What is a Vector Data Structure?
Vector data structures represent geographic features using geometric shapes such as points, lines, and
polygons, based on coordinate systems. They are ideal for discrete spatial features like roads, buildings,
rivers, and land parcels.
Each feature is stored as a record in a database, with associated attribute data stored in linked tables.
Definition:
• Each feature is stored independently as a set of coordinate points.
• No spatial relationships (topology) are stored.
Features:
• Simple and easy to create.
• Features like polygons do not share boundaries.
• Each line or polygon is stored as a separate object.
Disadvantages:
• Redundant storage (e.g., adjacent polygons store shared boundaries twice).
• No support for topological analysis (e.g., adjacency, connectivity).
Example:
Two adjacent land parcels each store their boundary lines separately, even if they are identical.
2. Topological Model
Definition:
• Stores spatial relationships between features, such as:
o Adjacency (which polygons share a border),
o Connectivity (how lines connect),
o Containment (what lies inside a polygon).
Features:
• Efficient for network analysis (e.g., roads, rivers).
• Prevents data redundancy (shared edges stored once).
• Enhances data integrity (no gaps or overlaps between polygons).
Components:
• Node: Intersection point of two or more lines.
• Link/Arc: A line segment connecting nodes.
• Polygon: A closed sequence of links.
• Topology Tables:
o Store relationships (e.g., which nodes/links form which polygon).
Example:
A topological model of a city map allows the system to identify:
• Which roads connect at a junction.
• Which building lies within which ward.
Data Structure Simple list of coordinates Stores nodes, arcs, and relationships
Redundancy High (shared boundaries repeated) Low (shared elements stored once)
Disadvantages
• Complex structure compared to raster.
• Slower for some operations (e.g., overlays).
• Difficult to represent continuous surfaces like elevation.
Applications
• Cadastral mapping (land parcels)
• Transportation networks (roads, railways)
• Utilities (electric lines, pipelines)
• Urban planning and zoning maps
Conclusion
Vector data structures are ideal for applications where precision, topology, and feature-based representation
are crucial. While the spaghetti model is simple, the topological model is preferred for advanced GIS analysis
and maintaining spatial relationships between features.
8. Differentiate Between Raster and Vector Models with Advantages and Disadvantages
OR
Spatial data in GIS is represented using two main models: Raster and Vector. Both models have their own
structure, use cases, strengths, and limitations.
Represents spatial data as a grid of equally sized cells Represents spatial features as points, lines,
Definition
(pixels), each with a value representing an attribute. and polygons defined by precise coordinates.
Data Best for continuous data like elevation, temperature, Best for discrete data like roads, boundaries,
Type satellite images. land parcels.
Data
Grid of cells (pixels) Coordinates: Points, Lines, Polygons
Representation
Continuous data (e.g., elevation, temperature, Discrete features (e.g., roads, parcels,
Best Suited For
NDVI) buildings)
File Size Larger (especially with high resolution) Smaller (efficient storage)
Zoom Behavior Becomes pixelated when zoomed in too far Maintains clarity at all scales
Simple data structure Large file sizes (especially for high-resolution data)
Efficient for continuous data analysis (e.g., elevation) Less accurate — resolution dependent
Suitable for remote sensing and satellite imagery Topology not inherently stored
Faster for certain spatial operations (e.g., buffers) Can be inefficient for storing attribute data
Advantages Disadvantages
Supports topological relationships (e.g., adjacency, Requires more effort for editing and maintaining
network) topology
Compact storage (only stores coordinates and attributes) Overlay operations are computationally intensive
Ideal for cartographic display and feature-based queries Complex for raster-style processing like remote sensing
Conclusion
• Raster Model is ideal for continuous data, image analysis, and modeling natural phenomena.
• Vector Model is best for precise, feature-based spatial data, especially where topology matters.
Many modern GIS systems use a hybrid approach, combining both raster and vector data to leverage the
strengths of each.
9. Write a Note on:
a. TIN (Triangulated Irregular Network)
• Vector-based model for representing terrain using triangles.
• Flexible representation of elevation with fewer data points.
• Supports accurate calculation of:
o Slope, aspect, volume, surface area
• Ideal for small-area, high-precision modeling
b. GRID Model
• Raster-based model with structured cells.
• Used to represent:
o Elevation
o Land use
o Climate data
• Types:
o Integer grids → Discrete data
o Floating-point grids → Continuous data
• Used in ArcGIS and other raster-based GIS software.
OR
a. TIN (Triangulated Irregular Network)
Definition:
TIN is a vector-based data model used to represent terrain surfaces by connecting irregularly spaced
elevation points into non-overlapping triangles. Each triangle is formed using three points (nodes), and
together they form a network covering the entire surface.
Key Features:
• Represents surface as connected triangles (mesh).
• Triangle vertices are elevation points collected from surveys, GPS, or LiDAR.
• Elevation is interpolated within each triangle.
Advantages:
• Variable resolution: More triangles in complex areas, fewer in flat regions.
• Efficient storage: Stores data only where needed (no unnecessary detail).
• High accuracy: Suitable for engineering and 3D terrain modeling.
• Captures breaklines and sharp elevation changes (e.g., cliffs, ridges).
Disadvantages:
• Complex data structure and processing.
• Difficult for raster-based analysis (e.g., overlays).
• Requires more computational resources.
Applications:
• 3D terrain visualization
• Watershed and slope analysis
• Civil engineering (e.g., road construction)
• LiDAR data processing
b. GRID Data Model
Definition:
The GRID data model is a raster-based method of representing spatial data using a matrix of uniform square
cells. Each cell holds a value representing a specific attribute (e.g., elevation, land cover).
Key Features:
• Divides space into regular grid cells (pixels).
• Every cell has equal size and represents one value.
• Suitable for continuous surface representation (e.g., temperature, slope).
Advantages:
• Simple to store and process.
• Compatible with satellite imagery and remote sensing.
• Good for map algebra, classification, and modeling.
• Easily integrated with digital image data.
Disadvantages:
• Large file sizes, especially with fine resolution.
• Less accurate for representing complex shapes or boundaries.
• Fixed resolution may result in data loss in detailed areas.
Applications:
• Digital Elevation Models (DEMs)
• Land cover classification
• Hydrological modeling
• Environmental and climatic studies
Storage Size Smaller (stores only needed points) Larger (stores every cell)
Applications LiDAR, slope, contour analysis DEM, NDVI, satellite data analysis
Module 3 PDF
OR
A scanner is a crucial device in Geographic Information Systems (GIS) used to convert analog sources such as
paper maps, photographs, and documents into digital raster format. This conversion is essential for
processing, analyzing, and integrating spatial data within a GIS environment.
Limitations of Scanners
• Initial cost can be high.
• Scanned images often require manual editing (e.g., removing noise, identifying features).
• Difficult to extract text, symbols, or labels accurately.
• Accuracy depends on map quality and scanner resolution.
• Not ideal for feature-specific data capture compared to manual digitizing.
Conclusion
Scanners are indispensable tools in GIS for automated spatial data input, especially for converting analog
maps into raster format. While they offer speed and accuracy, they are best used when the source maps are
clean, clear, and well-prepared. The resulting digital files form the foundation for many GIS analyses and
operations after georeferencing and vectorization.
OR
Raster data input is a fundamental process in GIS (Geographic Information Systems) used to incorporate spatial
data represented in a grid-cell (pixel-based) format into the system for further analysis, mapping, and
interpretation.
Characteristic Description
Cell size (resolution) Determines the spatial detail; smaller cells = higher resolution
Band-based Stores data in separate bands (e.g., RGB for color images)
Attribute linkage Each cell can store a value linked to attributes (especially in discrete rasters)
Mode Description
Manual Entry Keyboard input for attribute data (limited use for raster)
Direct Digital Conversion Converting CAD, remote sensing, or scanned map files
Voice Input (experimental) Tried for controlling digitizers; not widely used
Summary
• Raster data input is crucial for integrating spatial data like images and maps into a GIS.
• Involves converting analog data or using digital imagery.
• Despite its cost and complexity, raster input is essential for analyzing continuous data.
• Advancements in scanning and remote sensing continue to improve raster data handling.
OR
Raster data file formats define how raster (grid-based) spatial data is stored, compressed, managed, and shared
in a GIS. These formats differ based on their data structure, metadata, compression technique, and intended
use.
Raster formats are essential for representing and analyzing real-world phenomena like satellite imagery, aerial
photos, scanned maps, land cover, elevation, and temperature.
1. Grids
• Used to store both discrete (categorical) and continuous (numeric) data.
• Grids are especially common in ESRI GIS software.
a. Discrete Grids
• Cells store integer values.
• Represent categorical data like land use, soil type, or vegetation class.
• Example: Forest = 1, Water = 2, Urban = 3.
• Each cell value has a corresponding attribute stored in a table.
• Attribute tables are typically in INFO format.
• Efficient for thematic mapping and statistical analysis.
b. Continuous Grids
• Cells store floating-point values (decimal numbers).
• Represent continuous phenomena like elevation, temperature, or rainfall.
• Each cell has a unique value.
• These grids do not have attribute tables.
• Useful in surface modeling, interpolation, and environmental analysis.
2. Images
• Store brightness values of light or radiation (e.g., visible, infrared, UV).
• Commonly used for:
o Aerial photos
o Satellite imagery
o Scanned paper maps
• Can be displayed as background layers or used as attributes linked to features (e.g., house photos in real
estate).
• Must be georeferenced to be useful in GIS (aligned to real-world coordinates).
GeoTIFF TIFF with embedded GIS metadata. Supports georeferencing and compression.
ECW Enhanced Compressed Wavelet format by ERDAS. Lightweight and fast loading.
CADRG/CIB Raster formats used by military and NGA (National Geospatial Agency).
Esri Grid Binary or ASCII format used in ArcGIS to store raster data in grid format.
netCDF-CF Scientific format for storing climate and Earth science data.
RPF/DTED Raster formats for elevation data used in military or topographic analysis.
Importance in GIS
• Choosing the right raster format is crucial for:
o Data sharing
o Storage optimization
o Software compatibility
o Analytical accuracy
• Georeferenced raster data allows integration with vector layers and supports spatial analysis, overlay,
and map production.
Summary
Raster data file formats determine how raster information is stored and utilized in GIS. They fall into two main
categories—grids (discrete or continuous) and images—each suited for different types of analysis. The proper
selection of file format ensures efficient data handling, visualization, and spatial analysis in a GIS environment.
4. Write a note on Geo-referencing
Geo-referencing assigns real-world coordinates to digital images/maps using reference points. It enables spatial
alignment for analysis in GIS. Methods include manual point placement or automatic transformation using
control points. Applications include mapping, urban planning, disaster management, and environmental
analysis.
Geo-referencing is the process of assigning real-world coordinates to a map or image so it aligns correctly with
spatial data.
Steps in Geo-referencing:
1. Select image/map.
2. Acquire, rectify, enhance.
3. Associate with known coordinates.
4. Apply transformation.
5. Save as georeferenced data.
Methods:
• Vector Referencing: Aligning with known vector features (like roads).
• Raster Referencing: Aligning using grids or control points.
Applications:
Used in urban planning, disaster management, land use mapping, etc.
OR
Geo-referencing is the process of assigning real-world geographic coordinates (like latitude and longitude) to a
map, image, or dataset so that it can be accurately aligned and overlaid with other spatial data in a Geographic
Information System (GIS).
It ensures that spatial datasets fit within a common coordinate framework, enabling accurate mapping, analysis,
and integration of geographic information from different sources.
Importance of Geo-referencing
• Aligns scanned maps, aerial images, and satellite data to real-world locations.
Methods of Geo-referencing
a. Image-to-Map
• Aligns an image (like a satellite photo) to a known coordinate system or a reference map.
• Commonly used in remote sensing and environmental studies.
b. Image-to-Image
• Uses another already geo-referenced image as the reference.
• GCPs are selected based on common features between the two images.
• Useful for creating consistency among multiple raster layers.
Summary
Geo-referencing is an essential preprocessing step in GIS that transforms unreferenced spatial data into
accurate, location-based information. By aligning maps and images to a coordinate system using ground control
points, it enables reliable integration and spatial analysis of diverse geographic datasets.
5. Write a note on Vector Data Input
Vector data is input through digitization from printed or scanned maps using devices like:
• Digitizing tables (most common),
• Mouse (used in on-screen digitizing),
• Keyboard (for entering coordinates/attributes).
Modern tools support heads-up digitization and automated vector extraction.
OR
Vector data represents geographic features using points, lines, and polygons.
Input Methods:
• Digitising Table: Map is traced with a puck (like a mouse) on a special table.
• Mouse (on-screen digitising): Tracing over scanned images using GIS software.
• Keyboard: Used to manually enter coordinates or attribute data.
Digitising Table Features:
• High accuracy
• Needs control/reference points
• Captures real-world coordinates
6. Explain Digitiser
A digitiser converts analog information/inputs (like images, sound, motion) into digital format form
Examples:
• Digital Camera: Converts light into image files.
• Audio Digitiser: Converts sound to digital signals.
• Tablets/Stylus: Converts hand-drawn input into vector graphics.
• Graphic tablets (capture drawings),
• Accelerometers & Gyroscopes: Convert motion into data.
Used in GIS to trace features from maps, enabling creation of vector datasets.
In GIS, digitisers are mainly used for converting printed maps into digital vector data.
Defines the origin and orientation of coordinate systems (e.g., WGS84, NAD27). It models Earth's shape.
OR
Definition of Topology in GIS
Topology in GIS refers to the spatial relationships among geographic features such as points, lines, and
polygons. It is a set of rules and behaviors that define how these features share geometry and relate to each
other in a consistent and logical way.
Topology ensures that spatial data is logically structured, which is critical for:
• Maintaining data integrity
• Supporting spatial queries
• Performing network analysis, overlay operations, and map editing
In topological models, features are often stored using nodes, edges, and faces to express how elements are
connected or bounded.
Illustration:
[Parcel A] | [Parcel B]
↑
Shared boundary = adjacency
2. Connectivity
• Definition: Connectivity describes how linear features (like roads, rivers, pipelines) are linked at
nodes or junctions.
• Example: A road network where intersections allow movement from one road to another.
• Use Case: Vital in network analysis, such as finding the shortest path, or modeling the flow of utilities
(water, electricity).
Illustration:
Road A ----•---- Road B
↑
Node (connected point)
3. Containment
• Definition: Containment indicates that a point or feature lies within a polygon or area.
• Example: A school located within a city boundary or a tree inside a park.
• Use Case: Helps in zoning regulations, identifying features inside a boundary, or spatial joins.
Illustration:
[Park Polygon]
• Tree (contained feature)
Summary
Topology Rules defining spatial relationships among features Shared edges, connected nodes
Containment One feature lies within the bounds of another A school inside a district boundary
Module 4
OR
Data quality in GIS refers to the accuracy, reliability, and fitness of spatial and attribute data for a specific
purpose or application. Good data quality ensures:
• Correct decision-making
• Accurate spatial analysis
• Reliable mapping outputs
Poor-quality GIS data can lead to misinterpretation, errors, and costly mistakes.
GIS data quality is commonly evaluated using six key aspects:
1. Completeness
2. Logical Consistency
3. Positional Accuracy
4. Temporal Accuracy
5. Thematic Accuracy
6. Lineage
f. Lineage
Records the history and processing steps of a dataset, including sources, transformations, dates, and processing
methods. Helps determine data reliability.
OR
Describes the history and origin of the data, including:
• Source (e.g., GPS, satellite)
• Processing steps (e.g., projection, editing)
• Accuracy reports
Example: A forest cover map derived from 2020 imagery should mention source, classification method, and
edits.
OR
Metadata = “Data about data” in GIS.
It provides essential information about a dataset, such as:
• Source
• Accuracy
• Projection
• Format
• Purpose
Types of Metadata:
1. Descriptive: Title, keywords, description.
2. Spatial: Coordinate system, projection, extent.
3. Attribute: Data fields, units.
4. Lineage: Source and processing history.
5. Quality: Accuracy and resolution.
Benefits of Metadata:
• Helps users evaluate and understand datasets.
• Supports data discovery, sharing, and decision-making.
• Maintains data investment and ensures updates are planned.
• Critical during emergency response for fast access across agencies.
OR
GIS Standards are rules and protocols that ensure consistency, accuracy, and interoperability of spatial data
across systems and organizations.
OR
Interoperability is the ability of different GIS systems, software, and data formats to work together
seamlessly. It allows smooth data exchange, integration, and collaboration.
Types of Interoperability:
1. Syntactic – Understanding file formats (e.g., reading GeoJSON, KML).
2. Semantic – Understanding meaning of data (e.g., classifying land use consistently).
3. Technical – Hardware and software compatibility (e.g., GPS integration).
4. Organizational – Effective data and workflow sharing between agencies.
Importance:
• Enables cross-platform operations.
• Supports integration with GPS, remote sensing, and web services.
• Reduces duplication and improves data sharing.
OR
a. OGC (Open Geospatial Consortium)
The Open Geospatial Consortium (OGC) is an international organization that develops open standards to
promote interoperability in GIS.
OGC Standards:
1. WMS (Web Map Service) – Delivers map images over the web.
2. WFS (Web Feature Service) – Access and edit vector data online.
3. WCS (Web Coverage Service) – Shares raster data (e.g., satellite images).
4. GML (Geography Markup Language) – XML-based universal spatial data format.
5. KML (Keyhole Markup Language) – Used in Google Earth/Maps.
6. SOS (Sensor Observation Service) – Real-time sensor data (e.g., weather).
7. SF (Simple Features) – Basic shapes like points, lines, polygons.
Benefits:
• Supports data sharing and cross-platform integration
• Used in disaster management, urban planning, agriculture, environment
• Promotes open and accessible GIS systems
OR
b. Spatial Data Infrastructure (SDI)
Spatial Data Infrastructure (SDI) is a framework that allows efficient sharing, access, and use of spatial
data across organizations and systems.
Components of SDI:
1. Data – Maps, imagery, demographics, etc.
2. People – Users, creators, organizations.
3. Policies & Standards – Ensure quality and interoperability.
4. Technology – GIS software, cloud platforms, databases.
5. Network Infrastructure – Web services like WMS, geoportals.
Levels of SDI:
• Local SDI: City-level GIS applications.
• National SDI (NSDI): Country-wide data integration.
• Regional SDI: Shared data across regions (e.g., EU's INSPIRE).
• Global SDI: Worldwide efforts like UN-GGIM.
Benefits of SDI:
• Easier data access and retrieval
• Promotes collaboration
• Ensures standardized data management
• Helps in disaster response, planning, and policy-making
Summary Table
Concept Description
Interoperability GIS systems working together across formats, platforms, and organizations.
OGC Organization developing open standards for GIS (e.g., WMS, WFS, GML).
SDI Framework to manage and share spatial data using standards, technology, and networks.
Module 5
OR
Importing Data:
Importing in GIS means bringing in spatial (vector/raster) or non-spatial (tabular) data from external sources for
visualization and analysis.
Common Import Formats:
• Vector: .shp, .geojson, .kml, .dxf, .gml
• Raster: .tif, .jpg, .png, GRID, .nc
• Tabular: .csv, .xls/.xlsx, .dbf, SQL databases
Import Methods:
• File import in software (e.g., QGIS, ArcGIS)
• Database connections (PostGIS, SQL Server)
• Web services (WMS, WFS, APIs)
• GPS devices (GPX, NMEA)
Exporting Data:
Exporting is saving GIS data in a format suitable for other applications, sharing, or reporting.
Export Formats:
• Vector: .shp, .geojson, .kml, .dxf
• Raster: .tif, .jpg, .png
• Tabular: .csv, .xlsx, .json
• Cartographic: .pdf, .svg, .ai
Challenges:
• Format compatibility
• Coordinate system mismatches
• Metadata loss
• File size issues
Best Practices:
• Check CRS
• Use standard formats
• Validate data
• Preserve metadata
• Compress large files
OR
Data Management in GIS ensures efficient storage, retrieval, analysis, and sharing of spatial data.
Key Functions:
1. Data Storage:
o Relational databases (RDBMS), object-oriented DBMS, and file-based storage.
o Spatial indexing for faster access.
2. Data Input & Integration:
o Importing, georeferencing, digitizing, GPS integration.
3. Data Processing & Manipulation:
o Spatial queries, table editing, coordinate transformations, raster-vector conversions.
4. Data Quality Control:
o Error correction, metadata maintenance, validation.
5. Data Analysis & Modeling:
o Overlay, buffering, interpolation, network analysis.
6. Export & Sharing:
o Export formats, web services, cloud GIS.
Importance:
• Improves accessibility, ensures data integrity, reduces redundancy, and supports collaboration.
OR
Raster to Vector (Vectorization) involves converting pixel-based data into point, line, or polygon features.
Methods:
• Manual Digitization
• Thresholding (binary classification)
• Edge Detection (for contours)
• AI-based Vectorization
Applications:
• Scanned maps to vector layers
• Satellite images to roads/buildings
• DEMs to contour lines
Challenges:
• Detail loss
• Topological errors
• Computational load
• Post-processing cleanup
OR
Raster to Vector Conversion Steps:
1. Select Raster Layer
o Choose the input raster image (e.g., a scanned map, satellite image, or classified land-use raster).
o The raster must have clear pixel values representing different features (e.g., roads, water, land
types).
2. Choose Vectorization Method
Depending on the type of feature to extract:
o Polygonization: Converts groups of similar pixel values into polygons.
o Contouring: Converts continuous raster values (e.g., elevation) into contour lines.
o Thresholding / Feature extraction: Identifies specific pixel values or ranges and converts them
into vector features.
3. Set Parameters
o Pixel value ranges to convert (e.g., only extract water bodies from blue pixels).
o Smoothing/tolerance: Controls how precise or simplified the resulting vector lines/polygons
will be.
o Output type: Point, line, or polygon, based on feature geometry.
4. Run Vectorization Tool
o The software scans through raster cells.
o Groups cells with similar values.
o Converts those groups into vector geometries.
o Saves the result as a vector layer (e.g., shapefile, GeoJSON, etc.).
What Happens During Raster to Vector Conversion?
• The raster is scanned pixel-by-pixel.
• Regions of similar pixel values are grouped.
• Each group is turned into a vector feature:
o If area → Polygon
o If boundary → Line
o If point features (isolated pixels) → Point
• The resulting vector layer can now be edited, analyzed, and stored with attributes.
Example Use Case:
Imagine you have a land cover raster where:
• 1 = Water
• 2 = Forest
• 3 = Urban
After raster-to-vector conversion:
• Water pixels become water polygons.
• Forest pixels become forest polygons.
• Urban areas become urban polygons.
4. Explain Conversion of Vector to Raster in GIS
Definition: The transformation of vector data into raster format, where features are encoded in pixel values.
Why it's done:
• For raster-based analysis (e.g., terrain modeling)
• Compatibility with remote sensing data
Steps:
1. Select vector layer
2. Choose rasterization method
3. Set output resolution (cell size)
4. Run conversion
Tools: QGIS (Rasterize), ArcGIS (Feature to Raster, Polygon to Raster)
Considerations:
• Some detail may be lost
• Pixel size affects accuracy
OR
Vector to Raster (Rasterization) transforms points, lines, or polygons into grid-based data.
Methods:
• Cell-based Encoding: Assign values to raster cells.
• Interpolation: Create continuous surfaces (e.g., elevation).
• Density Estimation: Generate heat maps.
Applications:
• Heat maps
• Spatial modeling
• Remote sensing input
Challenges:
• Resolution loss
• Storage needs
• Generalization of sharp boundaries
OR
Vector to Raster Conversion:
This process converts vector features (points, lines, polygons) into a raster format (grid of pixels), where each
pixel holds a value that represents a feature or attribute from the vector data.
1. Select Vector Layer
• What happens: You begin by selecting the vector layer (e.g., shapefile, GeoJSON, feature class) that
contains the spatial features you want to convert.
• Example: A polygon layer showing land use types (forest, water, agriculture) or a point layer showing
school locations.
2. Choose Rasterization Method
• What happens: You decide how the vector features should be represented in raster form. This involves
selecting:
o Geometry type: Point, line, or polygon.
o Attribute field: Which attribute from the vector file should be used as the pixel value (e.g., land
use type, elevation, population).
o Conversion rule: How values are assigned if multiple features fall within a single pixel (e.g.,
most frequent, average, max, etc.).
3. Set Output Resolution (Cell Size)
• What happens: You define the size of each raster cell (pixel), usually in map units (e.g., meters).
o Smaller cells = higher resolution, more detail, but larger file size.
o Larger cells = lower resolution, less detail, smaller file.
• Example:
o A cell size of 10 meters means each pixel in the raster will represent a 10x10 meter area on the
ground.
4. Run Conversion
• What happens:
o GIS software scans each pixel in the output raster.
o It checks if any part of the vector geometry intersects that pixel.
o If yes, it assigns a pixel value based on the selected attribute.
o The result is a raster dataset where each cell contains a value from the vector layer.
• Output: A raster file (e.g., GeoTIFF) representing the vector features.
Example:
Converting a vector polygon layer of land use to raster:
• Attribute: LandType (1 = Forest, 2 = Water, 3 = Urban)
• Cell size: 30 meters
• Result: A raster grid where each pixel value is 1, 2, or 3 depending on land use.
OR
Map Compilation is the process of assembling GIS data into a final map layout.
Key Elements:
• Title
• Legend
• Scale bar
• North arrow
• Grid
• Labels
• Metadata
• Data layers
Steps:
1. Data Collection
2. Layer Selection & Symbolization
3. Layout Design & Labeling
4. Cartographic Enhancements
5. Export & Sharing (PDF, PNG, GeoTIFF, KML)
Types of Maps:
• Topographic, Thematic, Choropleth, Isoline, Network, 3D Maps
Challenges:
• Data inconsistency
• Color confusion
• Label clutter
OR
Charts/Graphs in GIS visualize attribute data for better understanding and analysis.
Common Types:
1. Bar Charts – Categorical comparisons (e.g., land use types)
2. Pie Charts – Proportional data (e.g., population distribution)
3. Histograms – Frequency distribution (e.g., elevation range)
4. Line Graphs – Trends over time (e.g., rainfall data)
5. Scatter Plots – Correlation between variables (e.g., NDVI vs. temperature)
6. Bubble Charts – Data with 3 variables (e.g., city size vs. income vs. pollution)
Usage in GIS:
• Linked to attribute tables
• Dynamic and interactive in modern GIS dashboards
OR
Multimedia elements like images, audio, video, and hyperlinks enhance GIS maps by providing context and
real-world references.
Examples:
• Photos of landmarks
• Videos from surveillance drones
• Sound clips of environmental data
• Useful in tourism, heritage, and education
Examples ArcGIS Enterprise, QGIS Server ArcGIS Pro, QGIS, Global Mapper
OR
Feature Desktop GIS Enterprise GIS
c. Distributed GIS
Distributed GIS uses multiple connected systems to store, process, and deliver spatial data.
Features:
• Access from different locations
• Scalability and collaboration
• Web/mobile applications
Examples:
• Google Maps, ArcGIS Online, Bing Maps
Applications:
• Logistics, agriculture, disaster response, utilities
OR
Distributed GIS is a system where GIS components (databases, processing, interfaces) are spread across
multiple servers or locations.
Features:
• Uses internet and cloud services
• Enables real-time applications (e.g., traffic, weather)
• Supports IoT integration
Advantages:
• Scalability
• Real-time data access
• Global accessibility
Examples: Google Maps API, NASA EarthData, OpenStreetMap
OR
Data Output refers to how GIS data is presented or shared.
Output Formats:
• Maps (paper, digital PDFs)
• Tables (Excel, CSV)
• Reports (PDF, DOC)
• Web Maps (KML, GeoJSON, WMS)
Uses:
• Decision support
• Reporting
• Visualization
• Sharing with non-GIS users
Usage Organization-wide collaboration and data sharing Individual or small team analysis
Examples ArcGIS Enterprise, PostGIS, QGIS Server ArcGIS Pro, QGIS, Global Mapper
Scalability Highly scalable, supports big data and many users Limited scalability
OR
Criteria Desktop GIS Enterprise GIS
Performance Fast for local data Depends on network & server setup