0% found this document useful (0 votes)
20 views63 pages

123

The document provides an overview of Geographic Information Systems (GIS), detailing basic spatial concepts, coordinate systems, and the components of GIS. It explains the history of GIS, differentiates between proprietary and open-source software, and outlines types of data used in GIS, including spatial and attribute data. Additionally, it covers the various levels of measurement for attribute data and their significance in GIS analysis.

Uploaded by

Naveen SB
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views63 pages

123

The document provides an overview of Geographic Information Systems (GIS), detailing basic spatial concepts, coordinate systems, and the components of GIS. It explains the history of GIS, differentiates between proprietary and open-source software, and outlines types of data used in GIS, including spatial and attribute data. Additionally, it covers the various levels of measurement for attribute data and their significance in GIS analysis.

Uploaded by

Naveen SB
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 63

Module 1

1. Explain Basic Spatial Concepts in GIS


Basic spatial concepts in GIS help represent and analyze geographic data. Key concepts include:
• Space and Location
o Absolute Location: Fixed using coordinates (latitude, longitude).
o Relative Location: Described in relation to other features (e.g., “north of the river”).
• Coordinate Systems
o Used to define accurate geographic locations using either Geographic or Projected Coordinate
Systems.
• Spatial Data
o Vector Data: Points, lines, and polygons for discrete features.
o Raster Data: Grid cells representing continuous data like elevation or temperature.
• Spatial Relationships
o Topology: Defines relationships like adjacency and connectivity.
o Proximity: How close features are.
o Overlay: Combines multiple datasets for analysis.
• Scale and Resolution
o Scale: Ratio between map and real-world distances.
o Resolution: Level of detail (especially in raster data).
• Spatial Attributes and Data Models
o Spatial data (location), attribute data (details), and models like raster and vector.
• Spatial Analysis Techniques
o Includes buffering, interpolation, and network analysis.

2. Explain:
i. Coordinate System
A reference framework for locating geographic features. It defines how coordinates are assigned to points.
OR
A coordinate system in GIS is a reference framework used to define the location of geographic features on the
Earth's surface. It consists of mathematical rules that specify how coordinates are assigned to points.
There are two main types of coordinate systems used in GIS:
1. Geographic Coordinate System (GCS)
2. Projected Coordinate System (PCS)

ii. Geographic Coordinate System (GCS)


• Uses a 3D spherical surface (Earth’s surface).
• Based on:
o Latitude and Longitude
o Datum (e.g., WGS84)
o Prime Meridian
Advantages: Global consistency, essential for GPS
Disadvantages: Distortion in area and distance calculations
OR
A Geographic Coordinate System (GCS) uses a three-dimensional spherical surface to define locations on the
Earth. It is based on:
Components of GCS:
• Latitude and Longitude:
o Latitude: Measures north-south position (0° at the Equator, ±90° at the poles).
o Longitude: Measures east-west position (0° at the Prime Meridian, ±180°).
• Datum: A mathematical model representing the Earth's shape (e.g., WGS84, NAD83).
• Prime Meridian: The reference meridian (0° longitude) is used to define east and west directions.
Advantages of GCS:

✔ Globally consistent and widely used. ✔ Essential for GPS and global datasets.
Disadvantages of GCS:

✖ Distortion occurs when measuring distances or areas due to Earth's curvature.

✖ Not suitable for precise spatial analysis or mapping at a local scale.

iii. Map Projections


• Mathematical methods to flatten Earth's surface onto a map.
• Types:
o Cylindrical (e.g., Mercator)
o Conic (e.g., Lambert Conformal Conic)
o Azimuthal (e.g., Stereographic)

iv. Projected Coordinate System (PCS)


• A 2D representation of Earth based on a projection.
• Units: Meters or feet
• Examples: UTM, State Plane
Advantages: Accurate for local distances and areas
Disadvantages: Distortion increases away from the projection center
OR
A Projected Coordinate System (PCS) is a two-dimensional representation of the Earth's surface, created by
projecting geographic coordinates onto a flat surface.
Components of PCS:
• Projection Method: A mathematical formula used to flatten the Earth’s surface.
• Origin Point: The reference point where the coordinates start (e.g., equator and central meridian).
• Units of Measurement: –Typically meters or feet instead of degrees.

3. Define GIS? What are the components of GIS?


Definition:
A Geographic Information System (GIS) is a computer-based system for capturing, storing, checking,
manipulating, analyzing, and displaying geographically referenced data.
Components of GIS:
1. Hardware: Computers, servers, scanners, GPS, drones, printers.
2. Software: Tools for analysis and visualization (e.g., ArcGIS, QGIS).
3. Data: Spatial and attribute data from surveys, remote sensing, etc.
4. People: Analysts, developers, cartographers, policymakers.
5. Methods: Procedures for data collection, analysis, visualization, and sharing.

OR
Definition of GIS
A Geographic Information System (GIS) is a computer-based system that provides tools for collecting,
storing, analyzing, managing, and displaying spatial (geographically referenced) data.
Components of GIS
A working GIS consists of five major components:
1. Hardware
Hardware includes all physical devices and infrastructure needed to run GIS software and process data.
a) Computers & Servers
• Workstations/PCs: Used for analysis, visualization, and map creation.
• Servers: Store large spatial databases; allow remote access and collaboration.
b) Input Devices
• Scanners: Convert paper maps into digital formats.
• Digitizers: Transform analog map data into vector formats.
• GPS Devices: Collect real-time location data.
• Drones/LiDAR: Capture high-resolution geospatial data.
c) Output Devices
• Monitors: Display interactive maps and data.
• Printers/Plotters: Produce hardcopy maps for reporting.
2. Software
GIS software is the engine of the system—it provides the tools to analyze, manage, and visualize geospatial
data.
a) Types of GIS Software
• Desktop GIS: E.g., ArcGIS Pro, QGIS.
• Web GIS: E.g., ArcGIS Online, Google Maps.
• Enterprise GIS: Organization-wide GIS platforms with cloud and database integration.
b) Core Functions
• Data Input & Editing: Import, digitize, clean, and update data.
• Spatial Analysis: Buffering, overlay, interpolation, etc.
• Visualization: Map creation, 3D rendering.
• Database Management: Store and query spatial and attribute data.
c) Examples
• Proprietary: ArcGIS, MapInfo, AutoCAD Map 3D.
• Open Source: QGIS, GRASS GIS, GeoServer.
3. Data
Data is the heart of GIS. It includes spatial (location-based) and attribute (descriptive) information.
a) Spatial Data
• Vector: Points (e.g., trees), Lines (e.g., roads), Polygons (e.g., lakes).
• Raster: Grid-based, e.g., satellite images, elevation models.
• TIN/GRID: Used in 3D and terrain modeling.
b) Attribute Data
• Describes features (e.g., road name, population).
• Stored in tables linked to spatial features via IDs.
c) Data Sources
• Remote sensing
• GPS surveys
• Government records (e.g., census, land records)
• Field surveys
4. People
People are essential for operating, analyzing, and interpreting GIS data effectively.
Roles in GIS:
• GIS Analysts: Perform analysis and produce maps.
• Cartographers: Design high-quality maps.
• Developers: Build GIS applications and plugins.
• Surveyors: Collect geospatial data.
• Policy Makers/Managers: Use GIS for informed decision-making.
Importance: Well-trained professionals ensure data accuracy and meaningful analysis.
5. Methods
Methods refer to the workflows, models, and standards used in GIS for consistent and accurate results.
GIS Workflow Includes:
1. Data Collection
2. Data Processing
3. Spatial Analysis
4. Visualization & Reporting
5. Data Sharing & Management
Best Practices:
• Follow OGC standards for interoperability.
• Maintain metadata and quality assurance.
• Use standardized models for repeatable analysis.

4. Explain History of GIS


• Pre-1960s: Manual cartography, early surveying, remote sensing.
• 1960s–1970s:
o CGIS by Roger Tomlinson (Father of GIS)
o Harvard Lab developed SYMAP (early digital mapping)
• 1980s–1990s:
o Commercial software (e.g., Arc/INFO, MapInfo)
o Integration with GPS and satellite data
• 2000s–Present:
o Web GIS (e.g., Google Maps, ArcGIS Online)
o Open-source tools (e.g., QGIS)
o Integration with AI, big data, drones, LiDAR

5. Differentiate between Proprietary and Open Source Software

Feature Proprietary GIS Open Source GIS

Cost Paid, licensed Free

Support Professional support Community support

Flexibility Limited customization Highly customizable

Examples ArcGIS, MapInfo QGIS, GRASS GIS

Proprietary Pros: Stable, support, advanced tools


Cons: Expensive, less flexible
Open Source Pros: Free, flexible, active community
Cons: Less official support, steeper learning curve
OR
GIS software is essential for analyzing and visualizing spatial data. It is broadly classified into:
• Proprietary GIS Software – Commercial software developed by companies.
• Open Source GIS Software – Free software with publicly available source code.

Key Differences

Feature Proprietary GIS Software Open Source GIS Software

Ownership Owned by private companies Developed by community or institutions

License Cost Requires paid license Free to use, modify, and distribute

Closed source (not available for


Source Code Open source (fully accessible)
modification)

Customization Limited flexibility; only vendor can modify Highly customizable for specific needs

Community-driven support (forums,


Support Professional, vendor-supported
documentation)

Frequent updates by the open-source


Updates Regular updates by the vendor
community

May need additional setup or plugins for


Stability Often more stable and optimized
advanced features

ArcGIS (Esri), MapInfo, AutoCAD Map


Examples QGIS, GRASS GIS, GeoServer, PostGIS
3D, Global Mapper

Training and Comprehensive and official documentation, May require self-learning or third-party
Documentation training available tutorials

Advantages and Disadvantages


Proprietary Software

• ✔ Professional technical support

• ✔ Advanced tools and enterprise integration

• ✖ Expensive licensing

• ✖ Limited customization
Open Source Software

• ✔ No licensing cost

• ✔ Flexible and customizable

• ✖ Limited official support

• ✖ Steeper learning curve for beginners.


6. Explain Types of Data in GIS
GIS uses:
1. Spatial Data (Geographic):
o Vector: Points (wells), lines (roads), polygons (lakes)
o Raster: Grid-based (e.g., satellite images)
o TIN: Triangular networks for elevation
o GRID: Cell-based storage for terrain/climate
2. Attribute Data (Descriptive):
o Describes characteristics of spatial features (e.g., population)

OR
In Geographic Information Systems (GIS), data is the core component that enables spatial analysis and
decision-making. GIS primarily works with two major types of data:

1. Spatial Data
Spatial data (also called geospatial data or geographic data) represents the location, shape, and geometry of
geographic features on the Earth's surface.
Types of Spatial Data Representations:
a) Vector Data
Represents discrete features using geometric shapes:

Geometry Type Represents Examples

Point Single location Trees, wells, GPS coordinates

Line Linear features Roads, rivers, pipelines

Polygon Enclosed areas Lakes, land parcels, buildings

Advantages:
• High precision
• Efficient for storing attribute data
• Suitable for topological and network analysis
Disadvantages:
• Complex overlay analysis
• Higher processing needs for large datasets
b) Raster Data
Represents continuous features as a grid of cells (pixels), where each cell has a specific value (e.g., elevation or
temperature).
Examples:
• Satellite images (e.g., Landsat, Sentinel)
• Digital Elevation Models (DEM)
• Aerial photographs
Advantages:
• Best for continuous data
• Simple grid structure
• Effective in remote sensing and image processing
Disadvantages:
• Large file sizes
• Less precise for boundaries
• Requires resampling for resolution changes
c) TIN (Triangulated Irregular Network)
• Represents 3D surfaces using interconnected triangles.
• Commonly used in terrain modeling and elevation representation.
d) GRID Data Model
• Uses a regular matrix of square cells like raster data.
• Common for representing land cover, elevation, or climate data.

2. Attribute Data
Attribute data provides descriptive information about spatial features. It answers the question: “What is it?”
Characteristics:
• Stored in tables linked to spatial data via unique identifiers (ID).
• Each row = one feature; each column = one attribute.
Examples:

Feature (Spatial) Attribute Data

Road Line Name, type, width

Lake Polygon Name, area, water quality

City Point Population, elevation, region

Relationship Between Spatial and Attribute Data

Spatial Data Tells Where

Road, Lake, City Geometry & location

Attribute Data Tells What

Name, area, type Descriptive details

They are linked through unique IDs to perform operations like:


• Thematic mapping
• Queries (e.g., find cities with population > 1 million)
• Buffer and overlay analysis

Summary Table:

Data Type Purpose Format Examples

Spatial Data Location & Shape Vector, Raster, TIN, GRID Roads, rivers, satellite images

Attribute Data Descriptive details Tables (RDBMS) City names, population, land use

7. Differentiate between Spatial and Attribute Data

Feature Spatial Data Attribute Data

Meaning Represents geographic features Describes features

Format Vector (points, lines, polygons) or Raster (grids) Tables (RDBMS)

Purpose Shows "where" features are Describes "what" features are

Example: A road (spatial) with its name and type (attribute)

OR
In GIS, spatial data and attribute data are two fundamental data types that work together to represent and
describe geographic features.

1. Spatial Data
• Definition: Represents the location, shape, and geometry of geographic features on Earth’s surface.
• Purpose: Answers the question "Where is it?"
• Representation:
o Vector: Points (e.g., wells), Lines (e.g., roads), Polygons (e.g., lakes)
o Raster: Gridded data (e.g., satellite imagery, elevation)
• Examples:
o A river shown as a line
o A building outlined as a polygon
o A GPS point representing a tree location

2. Attribute Data
• Definition: Provides descriptive information about spatial features.
• Purpose: Answers the question "What is it?"
• Format: Stored in tables (Relational Database), where:
o Each row = one spatial feature
o Each column = one attribute (e.g., name, type, value)
• Examples:
o A river (spatial) has name, length, flow rate (attributes)
o A building (spatial) has owner name, height, usage type (attributes)

Comparison Table

Feature Spatial Data Attribute Data

Definition Data about geographic location and shape Data that describes spatial features

Purpose Answers "Where?" Answers "What?"

Format Vector (Point, Line, Polygon), Raster Tables (Rows = Features, Columns = Attributes)

Storage GIS layers, map files, shapefiles Attribute tables, databases

Examples Roads, rivers, land parcels Road name, width, type; Land use, population

Use in GIS Mapping, visualization, spatial analysis Queries, thematic mapping, reports

8. What are the different types of attribute data in GIS?


Based on levels of measurement:
1. Nominal: Categories (e.g., land use type)
2. Ordinal: Ranked (e.g., risk levels)
3. Interval: Numeric without true zero (e.g., temperature)
4. Ratio: Numeric with true zero (e.g., population, distance)
These are stored in tables linked to spatial data via IDs.

OR
In GIS, attribute data refers to the descriptive information linked to spatial features. These attributes help
explain what the geographic features represent (e.g., name of a city, population, land use type).

Types of Attribute Data in GIS


Attribute data is classified into four major types, based on levels of measurement as defined by Stevens’
measurement scale:

Attribute
Scale Description Example
Type

1. Nominal Labels or names with no order or Land use type (Residential, Commercial),
Categorical
Data numeric meaning Soil type (Clay, Sand)

2. Ordinal Ordered data, but without equal Road hierarchy (Highway > Main Road >
Ranked
Data spacing between values Street), Risk level (High, Medium, Low)

3. Interval Numeric (no Numeric values with equal intervals,


Temperature in °C or °F, pH levels
Data true zero) but no true zero point

4. Ratio Numeric (true Numeric values with meaningful zero,


Population, Area, Distance, Elevation
Data zero) allowing full mathematical operations
Explanation of Each Type
1. Nominal Data
• Purely descriptive.
• Categories are mutually exclusive.
• No logical order.
• Operations allowed: Classification, thematic mapping
• Not allowed: Arithmetic operations
2. Ordinal Data
• Ordered categories (ranked), but no consistent difference between them.
• Operations allowed: Sorting, ranking, thematic mapping
• Not allowed: Mean, standard deviation
3. Interval Data
• Numeric values with equal intervals, but no absolute zero.
• Allows meaningful addition and subtraction.
• Examples: Temperature (0°C ≠ absence of temperature)
• Not allowed: Multiplication, ratio comparisons
4. Ratio Data
• Numeric values with a true zero (zero = absence of the variable).
• Allows all arithmetic operations (addition, subtraction, multiplication, division).
• Examples: Distance, population, rainfall, area

Attribute Table Example

ID City Name Type (Nominal) Risk Level (Ordinal) Temperature (Interval) Population (Ratio)

1 City A Urban High 35°C 500,000

2 City B Suburban Medium 30°C 200,000

3 City C Rural Low 25°C 50,000

Summary Table

Type Level Can Rank? Can Do Math? Examples

Nominal Categorical Land use, soil type

Ordinal Ordered Road type, priority level

Interval Numeric (no zero) Add/Subtract only Temperature, pH

Ratio Numeric (true zero) All operations Population, area


9. Write a note on scales/levels of measurement
Scales determine what analysis is appropriate:

Scale Description Examples Operations

Nominal Categorical Land use Classification

Ordinal Ranked Risk levels Sorting

Interval Numeric (no true zero) Temperature Add/Subtract

Ratio Numeric (true zero) Population, area All math ops

Importance:
• Guides correct statistical and GIS operations
• Impacts how data is visualized (e.g., choropleth maps vs proportional symbols)

Module 2

1. Differentiate Between Relational Database and Object-Oriented Database (or Explain Database
Structures)

Aspect Relational Database Object-Oriented Database

Data Representation Tables (rows & columns) Objects (with attributes and methods)

Schema Flexibility Rigid (predefined schema) Flexible (can evolve dynamically)

Ideal for Structured data, transactions, SQL queries Complex, evolving data with relationships

Query Language SQL Object Query Language

Use Cases Banking, ERP, Analytics CAD, GIS, multimedia databases

Examples MySQL, PostgreSQL db4o, ObjectDB

Relational DBs excel in structured environments. Object-oriented DBs are better suited for systems aligned
with object-oriented programming where data structure evolves.

OR
What is a Database Structure?
A Database Structure is the organized framework that defines how data is stored, accessed, and managed
within a Database Management System (DBMS). It includes:
• Data storage formats (e.g., tables, objects)
• Data types and constraints
• Relationships and rules between data elements
The two widely used database structures in GIS and other applications are:
• Relational Database
• Object-Oriented Database
1. Relational Database (RDBMS)

Definition:
A Relational Database is based on the relational model introduced by E.F. Codd. Data is stored in tables (also
called relations), where each table consists of:
• Rows: Represent records (tuples)
• Columns: Represent attributes (fields)

Key Features:
• Uses SQL (Structured Query Language) for data querying and manipulation.
• Primary keys uniquely identify records.
• Foreign keys establish relationships between tables.

Examples:
• MySQL, PostgreSQL, Oracle, SQL Server

Best Suited For:


• Structured data with predefined schema
• Transactional systems (e.g., banking, inventory)
• Reporting and analytics

2. Object-Oriented Database (OODBMS)

Definition:
An Object-Oriented Database stores data as objects, similar to those used in object-oriented programming.
Each object includes:
• Attributes (data) and
• Methods (functions) that operate on the data

Key Features:
• Aligns well with object-oriented languages (e.g., Java, C++)
• Supports inheritance, encapsulation, and polymorphism
• Uses Object Query Languages (OQL) instead of SQL

Examples:
• ObjectDB, db4o, Versant

Best Suited For:


• Complex, evolving data structures
• Applications with hierarchical or networked data (e.g., CAD, GIS, multimedia systems)

Comparison Table: Relational vs Object-Oriented Database


Aspect Relational Database Object-Oriented Database

Data Structure Tables (rows & columns) Objects (with attributes & methods)

Schema Flexibility Rigid; predefined schema Flexible; schema can evolve

Query Language SQL OQL or embedded in code

Relationships Handled via keys (PK, FK) Handled via object references

Ideal Use Case Structured data, transactions Complex, dynamic data relationships

Integration Easier with traditional apps Seamless with OOP applications

Performance Optimized for large, structured data Better for multimedia & scientific data

2. Define Entities. Explain ER Diagram with Example


• Entity: A real-world object or thing about which data is stored. E.g., student, bank account.
• Entity Set: Collection of similar entities (e.g., all students).
• Attributes: Properties (e.g., Name, Age).
• Primary Key: Uniquely identifies each entity.
Types of Entities
• Strong Entity: Exists independently (e.g., Car).
• Weak Entity: Depends on strong entity (e.g., Car color).
• Tangible & Intangible Entities: Physical (employee) vs conceptual (service type).
ER Diagram
• Uses:
o Rectangles → Entities
o Ovals → Attributes
o Diamonds → Relationships
Example:
• Entities: Student, Course
• Relationship: Enrolled
• Attributes: Student(Name, ID), Course(Title)

OR
An entity is a real-world object or concept that can be uniquely identified and has data stored about it in a
database.

Entity: A distinguishable thing, object, or concept in the real world that can be represented in a database.
Entities can be:
• Tangible: Like buildings, roads, land parcels
• Intangible: Like services, ownerships, transactions
Each entity has:
• Attributes: Descriptive information (e.g., Name, ID, Location)
• Unique Identifier (Primary Key): To distinguish each instance

ER Diagram (Entity-Relationship Diagram)


An ER Diagram is a visual representation of the entities in a database and the relationships between them.

Why ER Diagrams are Important in GIS:


• Organize complex spatial and non-spatial data logically
• Help in database design and normalization
• Represent spatial relationships like adjacency, containment, intersection
• Maintain data integrity and consistency

Components of ER Diagram:

Component Symbol Description

Entity Rectangle ▭ Object to be stored (e.g., Building)

Attribute Oval ⃝ Properties of an entity (e.g., Height)

Relationship Diamond ♦ Links between entities (e.g., "Has")

Cardinality Notation (1:1, 1:N, M:N) Defines number of allowed connections

Example: ER Diagram for GIS Land Management


Entities:
• Land Parcel
• Building
• Occupant
Attributes:
• Land Parcel: ID, Owner, Address
• Building: ID, Type, Number of Floors
• Occupant: ID, Name, Person Info
Relationships:

• Contains → Land Parcel ➝ Building

• Has → Building ➝ Occupant


Interpretation:
• A Land Parcel can contain one or more Buildings
• A Building can have one or more Occupants
Diagram Description (Textual Format)
[Land Parcel] — Contains —> [Building] — Has —> [Occupant]

Cardinality (Implied)

• One Land Parcel ➝ Many Buildings (1:N)

• One Building ➝ Many Occupants (1:N)

Purpose of ER Diagram in GIS


• Provides a blueprint for creating a GIS database
• Helps in efficient data retrieval and querying
• Ensures logical relationships between spatial and non-spatial data

3. Explain Types of Data Models or Explain Conceptual, Logical, and Physical Models

Model Purpose Created By Details

High-level overview, entities, attributes,


Conceptual What the system contains Business stakeholders
relationships

How the system will be Data architects, Includes detailed attributes, data types,
Logical
structured analysts relationships

Includes tables, keys, indexes, constraints, data


Physical How to implement in DBMS Developers, DBAs
types

Conceptual → Basic structure


Logical → Expanded version
Physical → Ready-to-implement schema

OR
In GIS and database design, data models help in structuring how data is conceptualized, logically organized,
and physically stored. These models exist in three main levels:
1. Conceptual Data Model
Definition:
A Conceptual Data Model is the high-level design of the database. It focuses on identifying the entities, their
attributes, and the relationships between them without worrying about how they will be implemented in a
database system.
Key Features:
• Focuses on what data is required.
• Independent of software or hardware.
• Simple and understandable by both technical and non-technical stakeholders.
• Commonly represented using Entity-Relationship (ER) diagrams.
Example in GIS:
Entities like Land Parcel, Building, and Occupant are identified, along with their relationships:
• A Land Parcel contains Buildings.
• A Building has an Occupant.
Diagram Sample (from the notes):
• Entities: Rectangles (e.g., Land Parcel, Building)
• Attributes: Ovals (e.g., Parcel ID, Building Type)
• Relationships: Diamonds/lines (e.g., contains, has)
2. Logical Data Model
Definition:
The Logical Data Model defines how the data will be organized and managed, but still independent of specific
database systems. It describes:
• Tables
• Fields (columns)
• Data types
• Relationships (one-to-many, many-to-many)
Key Features:
• Defines keys (Primary Key, Foreign Key)
• Logical structure suitable for a Relational Database Management System (RDBMS)
• Maintains normalization to reduce redundancy
• More detailed than conceptual, still not implementation-specific
Example in GIS:
• Tables: Cities, Roads, Rivers
• Fields: City_ID (PK), City_Name, Population
• Relationships: A foreign key like City_ID in a "Parks" table linking to the "Cities" table
3. Physical Data Model
Definition:
The Physical Data Model is the lowest level of data abstraction. It shows how the database will be
implemented in a specific system (like PostgreSQL/PostGIS or Oracle Spatial).
Key Features:
• Includes data storage methods, indexing, partitioning
• Specifies actual data types (e.g., VARCHAR(50), INT)
• Includes performance optimization techniques like indexing
• Deals with storage structure, access paths, and security
Example in GIS:
• Spatial indexing using R-Trees
• Use of formats like GeoTIFF or Shapefile
• Implementation of spatial data types (POINT, LINESTRING, POLYGON)
Summary Table

Level Purpose Key Focus Example in GIS

Conceptual High-level data structure Entities, attributes, relationships Land parcels contain buildings

Table: Building (ID, Type,


Logical Abstract database structure Tables, keys, data types
Parcel_ID)

Implementation in DB Indexing, partitions, PostGIS tables with geometry


Physical
system performance columns

Use in GIS
These models together guide the design of GIS databases:
• Ensure efficient storage and retrieval of spatial and attribute data
• Help in maintaining data integrity
• Support spatial relationships and real-world mapping

4. Explain Spatial Data Models (Raster and Vector)


Raster Model
• Grid of cells (pixels), each with one value
• Best for continuous data (e.g., elevation, temperature)
• Common in satellite imagery, DEMs
Vector Model
• Discrete features: Points, Lines, Polygons
• Best for boundaries and objects (e.g., roads, parcels)
• Each feature links to attribute data

OR
Spatial data models describe how geographic features are represented and stored in a GIS. The two primary
types of spatial data models are:
1. Raster Data Model
Definition:
The Raster Data Model represents spatial data as a matrix (grid) of cells or pixels. Each cell holds a single
value that represents information such as elevation, temperature, or land cover.
Structure:
• Data is stored in rows and columns of equal-sized cells.
• Each cell has a value representing a specific attribute (e.g., vegetation type, elevation).
• Suited for continuous data like satellite images and terrain models.
Types of Raster Data:
• Satellite Imagery
• Digital Elevation Models (DEM)
• Digital Orthophotos
• Scanned maps and images (JPEG, TIFF, etc.)
Advantages:
• Simple data structure.
• Suitable for continuous phenomena.
• Faster for mathematical operations and overlays.
Disadvantages:
• Large file sizes.
• Lower spatial accuracy (depends on resolution).
• Poor representation of discrete features and topology.
Applications:
• Land use classification
• Environmental modeling
• Terrain and elevation analysis
2. Vector Data Model
Definition:
The Vector Data Model represents spatial data using points, lines, and polygons, defined by precise
coordinates.
Structure:
• Point: Represents a single location (e.g., tree, well).
• Line: Represents linear features (e.g., road, river).
• Polygon: Represents area features (e.g., lake, land parcel).
Types of Vector Models:
• Spaghetti Model: Simple model where features are stored independently.
• Topological Model: Maintains relationships between features (adjacency, connectivity).
Advantages:
• High spatial accuracy.
• Efficient storage for discrete features.
• Topology support (helps in spatial queries and network analysis).
Disadvantages:
• More complex data structure.
• Slower for raster-style analysis (like overlays).
• Not ideal for continuous surfaces.
Applications:
• Cadastral mapping
• Transportation and utility networks
• Administrative boundaries

Comparison: Raster vs. Vector

Feature Raster Model Vector Model

Representation Grid of cells Coordinates (points, lines, polygons)

Best Suited For Continuous data Discrete data

Accuracy Lower (depends on resolution) Higher (precise coordinates)

File Size Larger Smaller

Topology Support Limited Strong

Processing Speed Faster for overlays Efficient for spatial queries

Example Use Satellite images, DEMs Road networks, parcel maps

Conclusion
• Raster Model is best for continuous data analysis (e.g., elevation, temperature).
• Vector Model is better for representing precise locations and boundaries (e.g., roads, land parcels).
Both models are fundamental in GIS and are often used together in hybrid GIS systems to take advantage of
their respective strengths.

5. Explain Raster Data Structures


1. Cell-by-Cell Encoding: Every pixel value stored individually. Suitable for DEMs.
2. Run-Length Encoding (RLE): Stores repeated values efficiently.
3. Quadtree Encoding: Recursive division of grid for efficient compression and storage.

OR
What is a Raster Data Structure?
Raster data structure refers to the way raster data (gridded or pixel-based spatial data) is stored and organized
in a computer system. Since raster data is composed of a matrix of cells, how this matrix is read, written,
compressed, or accessed significantly affects performance, storage efficiency, and processing.
Basic Concept:
• A raster is a grid of rows and columns.
• Each cell (pixel) has a value representing a geographic attribute (e.g., elevation, land cover).
• Data is stored row-wise or column-wise in files.

Types of Raster Data Structures


There are three major raster data storage formats:
1. Banded Interleaved by Pixel (BIP)
Structure:
• Each pixel stores values for all bands (layers) together.
• Stored as a sequence of band values per pixel:
[Pixel1_Band1, Pixel1_Band2, ..., Pixel2_Band1, Pixel2_Band2, ...]
Use Case:
• Useful when all bands are needed simultaneously (e.g., satellite imagery).
Example:
If an image has 4 bands and 3 pixels, the data would be stored as:
[P1B1, P1B2, P1B3, P1B4, P2B1, P2B2, ..., P3B4]
2. Banded Interleaved by Line (BIL)
Structure:
• Each line (row) of data is stored for all bands before moving to the next line.
• Stored as:
[Line1_Band1, Line1_Band2, ..., Line2_Band1, Line2_Band2, ...]
Use Case:
• Efficient for line-by-line processing and visualization.
3. Banded Sequential Format (BSQ)
Structure:
• Each band is stored separately in a distinct file or section.
• Stored as:
[Band1 data] → [Band2 data] → [Band3 data] ...
Use Case:
• Useful when individual bands are processed separately.
• Easier to skip bands that are not needed.

Raster Data Compression Techniques


Raster files are often large. To save space and processing time, the following compression methods are used:
1. Run-Length Encoding (RLE)
• Stores sequences of the same value as a single value and a count.
• Reduces repetition in rows.
• Example: Instead of 0 0 0 1 1 0, it stores as (0,3), (1,2), (0,1)
2. Raster Chain Coding
• Stores boundaries by direction from a start point (e.g., N, E, S, W).
• Efficient for storing shapes, but hard to edit and modify.
3. Block Coding
• Stores square blocks of grid cells with the same value.
• Each block has a starting point, size, and value.
• Saves space by grouping cells.
4. Quadtrees
• Divides raster recursively into quadrants.
• Stores only areas with homogeneous values.
• Efficient for large datasets and variable resolution.

Summary Table: Raster Data Structures

Format Structure Best For

BIP All band values per pixel Multi-band simultaneous access

BIL All band values per line Image display and line-based analysis

BSQ Separate file per band Independent band analysis

RLE Value + count Repeated values (simple patterns)

Chain Code Directions from start point Boundary representation

Block Code Square blocks + values Thematic maps with repeated patterns

Quadtree Recursive division into quadrants Large, complex datasets with redundancy

Conclusion:
Raster data structures are essential for efficient storage, retrieval, and analysis in GIS applications. The
choice of structure depends on:
• Data type (satellite image, DEM, thematic layer),
• Storage availability,
• Processing speed,
• Intended GIS operations.
6. Explain Raster Data Compression
• Why Needed: Raw raster is large; compression reduces storage.
• Techniques:
o Run-Length Encoding (RLE): Store values with run count.
o Quadtree: Divide into homogenous quadrants.
o Huffman Coding: Frequency-based binary encoding.
o LZ77 / LZW: Based on data repetition (used in TIFF, PNG).
o JPEG Compression: Lossy, commonly for imagery.

OR
What is Raster Data Compression?
Raster data compression refers to techniques used to reduce the file size of raster (grid-based) datasets by
storing pixel values more efficiently. Since raster datasets (e.g., satellite images, elevation maps) can be very
large, compression helps:
• Save storage space
• Speed up data transmission
• Improve processing performance
Compression methods can be lossless (no data lost) or lossy (some data discarded to reduce size).

Common Raster Compression Techniques


1. Run-Length Coding (RLE)

Principle:
• Stores repeating values in a row as a single value plus the number of times it repeats.

Example:
Instead of storing:
000011100
You store:
(0,4), (1,3), (0,2)

Advantages:
• Very effective when large blocks of the same value exist.
• Easy to implement and decode.

Disadvantages:
• Not efficient if data has high variability or noise.
2. Raster Chain Coding

Principle:
• Used to store boundaries of features by tracing their edges using directional steps (N, S, E, W).
• Only the outline of a region is stored using a chain of movements.

Example:
From a starting point (x, y), the direction might be stored like:
E3, S4, W1, N4

Advantages:
• Efficient for boundary representation.

Disadvantages:
• Hard to modify or edit.
• Redundant for adjacent polygons (shared boundaries stored multiple times).
3. Block Coding (Two-Dimensional Run-Length Coding)

Principle:
• Stores blocks of grid cells (usually square-shaped) with the same value.
• Each block is stored with:
o Starting location
o Size (e.g., 2×2)
o Value

Example:
A 3×3 block of value "1" is stored as:
(Start: X5,Y5, Size: 3x3, Value: 1)

Advantages:
• Reduces data size more than RLE for large homogeneous regions.
• Good for thematic raster layers.
4. Quadtrees

Principle:
• Recursively divides the raster into quadrants.
• Only subdivides further if a quadrant has mixed values.
• Homogeneous blocks are stored as-is.

Advantages:
• Efficient for large and complex raster maps.
• Stores data at variable resolution (detail where needed).
• Compact for maps with large uniform areas.

Disadvantages:
• More complex to implement.
• Not suitable for highly variable data.

Summary Table: Raster Compression Techniques


Technique How It Works Best For Limitation

Run-Length Stores repeating values as value


Uniform rows of same values Poor for variable data
Coding + count

Stores feature boundaries via


Chain Coding Boundary encoding Redundant for shared borders
direction

Stores square blocks of same Thematic maps with repeating Less effective for irregular
Block Coding
value regions shapes

Variable resolution, large


Quadtree Recursive division of the map Complex implementation
datasets

Conclusion
Raster data compression helps in:
• Reducing storage requirements
• Improving processing efficiency
• Managing large spatial datasets
The best compression technique depends on the type of raster data, spatial patterns, and intended GIS
application.

7. Explain Vector Data Structures


1. Spaghetti Model: Simple, unstructured lines/points. No topology.
2. Feature-Encoded: Recognizes independent objects with IDs and attributes.
3. Topologically Encoded:
o Stores relationships (connectivity, adjacency)
o Supports advanced spatial analysis
o Used in Arc/Info

OR
What is a Vector Data Structure?
Vector data structures represent geographic features using geometric shapes such as points, lines, and
polygons, based on coordinate systems. They are ideal for discrete spatial features like roads, buildings,
rivers, and land parcels.
Each feature is stored as a record in a database, with associated attribute data stored in linked tables.

Basic Geometries in Vector Data

Geometry Description Examples

Point A single (x, y) coordinate Tree, Well, School

Line A series of connected points Road, River, Pipeline

Polygon A closed shape formed by lines Lake, Forest boundary, Parcel


Types of Vector Data Structures
There are two major types of vector data structures:
1. Spaghetti Model

Definition:
• Each feature is stored independently as a set of coordinate points.
• No spatial relationships (topology) are stored.

Features:
• Simple and easy to create.
• Features like polygons do not share boundaries.
• Each line or polygon is stored as a separate object.

Disadvantages:
• Redundant storage (e.g., adjacent polygons store shared boundaries twice).
• No support for topological analysis (e.g., adjacency, connectivity).

Example:
Two adjacent land parcels each store their boundary lines separately, even if they are identical.
2. Topological Model

Definition:
• Stores spatial relationships between features, such as:
o Adjacency (which polygons share a border),
o Connectivity (how lines connect),
o Containment (what lies inside a polygon).

Features:
• Efficient for network analysis (e.g., roads, rivers).
• Prevents data redundancy (shared edges stored once).
• Enhances data integrity (no gaps or overlaps between polygons).

Components:
• Node: Intersection point of two or more lines.
• Link/Arc: A line segment connecting nodes.
• Polygon: A closed sequence of links.
• Topology Tables:
o Store relationships (e.g., which nodes/links form which polygon).

Example:
A topological model of a city map allows the system to identify:
• Which roads connect at a junction.
• Which building lies within which ward.

Comparison: Spaghetti vs. Topological Models

Feature Spaghetti Model Topological Model

Data Structure Simple list of coordinates Stores nodes, arcs, and relationships

Redundancy High (shared boundaries repeated) Low (shared elements stored once)

Topology Support No Yes

Editing Easier More complex

Analysis Limited Advanced spatial analysis supported

Advantages of Vector Data Structures


• High spatial accuracy (exact coordinates).
• Efficient storage (especially for discrete data).
• Supports topological analysis.
• Good for map rendering and feature-based queries.

Disadvantages
• Complex structure compared to raster.
• Slower for some operations (e.g., overlays).
• Difficult to represent continuous surfaces like elevation.

Applications
• Cadastral mapping (land parcels)
• Transportation networks (roads, railways)
• Utilities (electric lines, pipelines)
• Urban planning and zoning maps

Conclusion
Vector data structures are ideal for applications where precision, topology, and feature-based representation
are crucial. While the spaghetti model is simple, the topological model is preferred for advanced GIS analysis
and maintaining spatial relationships between features.

8. Differentiate Between Raster and Vector Models with Advantages and Disadvantages

Feature Raster Vector

Data Type Continuous Discrete

Structure Grid/Matrix Points, Lines, Polygons

Storage Large file size Efficient for attributes


Feature Raster Vector

Analysis Suited for surface analysis Network and spatial analysis

Resolution Limited by cell size High geometric precision

Examples Satellite images, DEM Roads, land parcels

Raster Pros: Great for continuous data, easy overlay


Raster Cons: Large, less precise
Vector Pros: Precise, attribute-rich
Vector Cons: Complex topology, slower for some operations

OR
Spatial data in GIS is represented using two main models: Raster and Vector. Both models have their own
structure, use cases, strengths, and limitations.

Aspect Raster Data Model Vector Data Model

Represents spatial data as a grid of equally sized cells Represents spatial features as points, lines,
Definition
(pixels), each with a value representing an attribute. and polygons defined by precise coordinates.

Data Best for continuous data like elevation, temperature, Best for discrete data like roads, boundaries,
Type satellite images. land parcels.

Key Differences between Raster and Vector Data Models

Feature Raster Model Vector Model

Data
Grid of cells (pixels) Coordinates: Points, Lines, Polygons
Representation

Continuous data (e.g., elevation, temperature, Discrete features (e.g., roads, parcels,
Best Suited For
NDVI) buildings)

Spatial Accuracy Lower (depends on resolution) Higher (precise coordinates)

File Size Larger (especially with high resolution) Smaller (efficient storage)

Weak (difficult to establish


Topology Support Strong (supports topological relationships)
adjacency/connectivity)

Fast for overlays and pixel-based


Data Processing Efficient for queries and network analysis
calculations

Storage Format GeoTIFF, GRID, JPEG, IMG Shapefile, GeoJSON, KML

Zoom Behavior Becomes pixelated when zoomed in too far Maintains clarity at all scales

Editing More complex due to geometry and topology


Easier to edit as entire area is covered
Complexity maintenance

Advantages and Disadvantages

Raster Data Model


Advantages Disadvantages

Simple data structure Large file sizes (especially for high-resolution data)

Easy to overlay multiple layers Poor representation of discrete features

Efficient for continuous data analysis (e.g., elevation) Less accurate — resolution dependent

Suitable for remote sensing and satellite imagery Topology not inherently stored

Faster for certain spatial operations (e.g., buffers) Can be inefficient for storing attribute data

Vector Data Model

Advantages Disadvantages

High spatial accuracy Complex data structure

Slower for continuous data analysis (e.g., terrain


Efficient for representing discrete features
modeling)

Supports topological relationships (e.g., adjacency, Requires more effort for editing and maintaining
network) topology

Compact storage (only stores coordinates and attributes) Overlay operations are computationally intensive

Ideal for cartographic display and feature-based queries Complex for raster-style processing like remote sensing

When to Use Which Model

Scenario Recommended Model

Elevation modeling, terrain analysis Raster

Land use classification from satellite images Raster

Cadastral mapping and land parcel analysis Vector

Utility network management (e.g., pipelines) Vector

Environmental modeling Raster

Administrative boundary mapping Vector

Conclusion
• Raster Model is ideal for continuous data, image analysis, and modeling natural phenomena.
• Vector Model is best for precise, feature-based spatial data, especially where topology matters.
Many modern GIS systems use a hybrid approach, combining both raster and vector data to leverage the
strengths of each.
9. Write a Note on:
a. TIN (Triangulated Irregular Network)
• Vector-based model for representing terrain using triangles.
• Flexible representation of elevation with fewer data points.
• Supports accurate calculation of:
o Slope, aspect, volume, surface area
• Ideal for small-area, high-precision modeling
b. GRID Model
• Raster-based model with structured cells.
• Used to represent:
o Elevation
o Land use
o Climate data
• Types:
o Integer grids → Discrete data
o Floating-point grids → Continuous data
• Used in ArcGIS and other raster-based GIS software.

OR
a. TIN (Triangulated Irregular Network)

Definition:
TIN is a vector-based data model used to represent terrain surfaces by connecting irregularly spaced
elevation points into non-overlapping triangles. Each triangle is formed using three points (nodes), and
together they form a network covering the entire surface.

Key Features:
• Represents surface as connected triangles (mesh).
• Triangle vertices are elevation points collected from surveys, GPS, or LiDAR.
• Elevation is interpolated within each triangle.

Advantages:
• Variable resolution: More triangles in complex areas, fewer in flat regions.
• Efficient storage: Stores data only where needed (no unnecessary detail).
• High accuracy: Suitable for engineering and 3D terrain modeling.
• Captures breaklines and sharp elevation changes (e.g., cliffs, ridges).

Disadvantages:
• Complex data structure and processing.
• Difficult for raster-based analysis (e.g., overlays).
• Requires more computational resources.

Applications:
• 3D terrain visualization
• Watershed and slope analysis
• Civil engineering (e.g., road construction)
• LiDAR data processing
b. GRID Data Model

Definition:
The GRID data model is a raster-based method of representing spatial data using a matrix of uniform square
cells. Each cell holds a value representing a specific attribute (e.g., elevation, land cover).

Key Features:
• Divides space into regular grid cells (pixels).
• Every cell has equal size and represents one value.
• Suitable for continuous surface representation (e.g., temperature, slope).

Advantages:
• Simple to store and process.
• Compatible with satellite imagery and remote sensing.
• Good for map algebra, classification, and modeling.
• Easily integrated with digital image data.

Disadvantages:
• Large file sizes, especially with fine resolution.
• Less accurate for representing complex shapes or boundaries.
• Fixed resolution may result in data loss in detailed areas.

Applications:
• Digital Elevation Models (DEMs)
• Land cover classification
• Hydrological modeling
• Environmental and climatic studies

Summary Table: TIN vs GRID

Feature TIN (Triangulated Irregular Network) GRID (Raster/Grid Model)

Data Type Vector-based Raster-based

Structure Irregular triangles Regular grid of square cells

Resolution Variable Fixed


Feature TIN (Triangulated Irregular Network) GRID (Raster/Grid Model)

Accuracy High (adaptive to terrain complexity) Medium (depends on pixel size)

Storage Size Smaller (stores only needed points) Larger (stores every cell)

Best For 3D modeling, engineering Terrain modeling, classification

Applications LiDAR, slope, contour analysis DEM, NDVI, satellite data analysis

Module 3 PDF

1. Write a note on Scanner


Scanners convert analog maps or photographs into digital raster image data. This data can then be vectorized
through digitization. Scanners use light sources and optical systems to record image data. Types include:
• Flat-bed scanners (small, less accurate),
• Drum scanners (high accuracy, slow and expensive),
• Large-format feed scanners (accurate and cost-effective),
• CCD and video cameras (for digital image capture).
Image processing steps include despeckling, greyscaling, adjusting brightness/contrast, and
thresholding.
OR
A scanner is a device that converts analog maps, photographs, or paper documents into digital raster format.
This digital image consists of pixels (tiny squares) that represent map features and background.
Types of Scanners:
• Flat-bed Scanner: Map is placed flat; uses light to scan. It’s compact but less accurate.
• Drum Scanner: The map is fixed on a rotating drum. It provides high accuracy but is expensive and
slow.
• Large-format Feed Scanner: Highly accurate, cost-effective, and suitable for large maps. Uses Contact
Image Sensors (CIS).
• CCD Camera/Scanner: More accurate than video cameras; widely used for digital image acquisition.
Image Enhancement Techniques:
• Despeckling: Removes stray marks or pixels.
• Greyscaling: Converts color images to grayscale.
• Brightness & Contrast: Adjusts tone for better visibility.
• Thresholding: Converts grayscale image to binary (black & white).

OR
A scanner is a crucial device in Geographic Information Systems (GIS) used to convert analog sources such as
paper maps, photographs, and documents into digital raster format. This conversion is essential for
processing, analyzing, and integrating spatial data within a GIS environment.

Purpose of Scanners in GIS


• Scanners help transform hardcopy maps, CAD drawings, and images into digital format.
• This digital raster data is necessary for:
o Vectorization
o Digital storage
o Reduced physical wear and tear
o Improved accessibility and sharing of maps

Types of Scanners Used in GIS


Scanners are generally categorized based on their mechanism and the quality of output:
1. Mechanical Scanner (Drum Scanner):
o A map is mounted on a rotating drum.
o The sensor moves line by line to digitize the image.
o Advantage: High accuracy.
o Disadvantage: Slow speed and expensive.
2. Video Camera with CRT:
o Captures small map areas using a cathode ray tube-based camera.
o Advantage: Low cost.
o Disadvantage: Not very accurate.
3. CCD Camera (Digital Still Camera):
o Uses a Charge-Coupled Device for image acquisition.
o More stable and accurate than a video camera.
4. CCD Scanner:
o Commonly flatbed or roll-feed type.
o Uses linear CCD sensors to scan large documents.
o Advantage: Accurate digitization in both mono-tone and color.
o Disadvantage: Expensive.

Main Categories of GIS Scanners


1. Flatbed Scanner:
o Small, affordable, and easy to use.
o Less accurate compared to other types.
2. Rotating Drum Scanner:
o Provides high-resolution outputs.
o Slow and expensive; suitable for highly detailed applications.
3. Large Format Feed Scanner:
o Designed for GIS.
o Balanced in cost, speed, and accuracy.
Process of Raster Scanning
• Scanners work by moving across a map line by line, recording reflected light intensity.
• Black and White Scans: Stored using a single byte (256 gray levels from 0–255).
• Color Scans: Use three gray scale layers—Red (R), Green (G), and Blue (B).
• The result is a digital raster image that GIS software can further process and analyze.

Advantages of Using Scanners


• Digitizes legacy paper maps for modern GIS applications.
• Increases data accessibility and storage efficiency.
• Supports automation of map analysis and updates.
• Reduces physical degradation of maps over time.

Limitations of Scanners
• Initial cost can be high.
• Scanned images often require manual editing (e.g., removing noise, identifying features).
• Difficult to extract text, symbols, or labels accurately.
• Accuracy depends on map quality and scanner resolution.
• Not ideal for feature-specific data capture compared to manual digitizing.

Conclusion
Scanners are indispensable tools in GIS for automated spatial data input, especially for converting analog
maps into raster format. While they offer speed and accuracy, they are best used when the source maps are
clean, clear, and well-prepared. The resulting digital files form the foundation for many GIS analyses and
operations after georeferencing and vectorization.

2. Explain Raster Data Input


Raster data input involves converting analog data like maps, photographs, or prints into digital raster format
using scanners or direct acquisition from aerial/satellite imagery. The scanned image is stored as pixels with
values representing features or background. Raster input is the foundation for digital mapping and analysis.
OR
Raster data represents images as a matrix of cells (pixels), each with a specific value. Sources include:
• Aerial photographs
• Satellite imagery
• Scanned analog maps
Scanners convert these into digital raster files which can later be vectorized (converted into lines, points, and
polygons) through digitisation or vectorisation.

OR
Raster data input is a fundamental process in GIS (Geographic Information Systems) used to incorporate spatial
data represented in a grid-cell (pixel-based) format into the system for further analysis, mapping, and
interpretation.

What is Raster Data?


Raster data consists of a matrix of cells or pixels, organized into rows and columns, where each cell contains a
value representing information—such as temperature, elevation, land cover, or rainfall—at a particular
geographic location.

Why Raster Data Input is Important in GIS


• It allows real-world phenomena (e.g., aerial images, satellite photos, scanned maps) to be input into
GIS systems.
• Raster format is ideal for continuous data like elevation, precipitation, vegetation index, and
temperature.
• Essential for spatial analysis, modeling, and visualization in GIS.

Methods of Raster Data Input


There are several ways raster data can be input into a GIS:
1. Scanning of Hardcopy Maps
• Analog maps are scanned using scanners (flatbed, drum, CCD).
• Resulting image is stored as a raster (bitmap) file.
• Needs further processing like georeferencing, cleaning, and vectorization.
2. Satellite Imagery and Aerial Photography
• Automatically captured by remote sensing devices (e.g., Landsat, Sentinel satellites).
• Images are already in raster format and can be directly imported into GIS after georeferencing.
3. Digital Raster Files from Other Sources
• Includes data from CAD files, engineering drawings, or other raster datasets in digital format (e.g.,
GeoTIFF, JPEG, PNG).

Characteristics of Raster Data Input

Characteristic Description

Cell size (resolution) Determines the spatial detail; smaller cells = higher resolution

Georeferencing Assigns geographic coordinates to the raster image

Band-based Stores data in separate bands (e.g., RGB for color images)

Attribute linkage Each cell can store a value linked to attributes (especially in discrete rasters)

Cost & Effort Involved


• Expensive and time-consuming: Data input may account for over 80% of a GIS project's cost.
• Labor-intensive and error-prone, especially when digitizing analog maps.
• Requires quality control to avoid errors in analysis.

Challenges with Raster Data Input


• Manual editing required for scanned maps due to distortions, overlaps, and noise.
• Difficulty in distinguishing features like text, contours, and symbols during automated raster capture.
• Maps must often be redrafted or cleaned before raster input to ensure quality.
Solutions and Tools
• Automated tools in GIS software help improve input efficiency.
• Tools like ArcGIS, QGIS, and ERDAS Imagine support raster input, georeferencing, and conversion.
• Voice input, though tried, is generally unreliable and requires recalibration.

Summary of Modes of Raster Data Input

Mode Description

Manual Entry Keyboard input for attribute data (limited use for raster)

Manual Locating Devices E.g., digitizers for vector inputs

Automated Devices Scanners, digital cameras, satellite sensors

Direct Digital Conversion Converting CAD, remote sensing, or scanned map files

Voice Input (experimental) Tried for controlling digitizers; not widely used

Summary
• Raster data input is crucial for integrating spatial data like images and maps into a GIS.
• Involves converting analog data or using digital imagery.
• Despite its cost and complexity, raster input is essential for analyzing continuous data.
• Advancements in scanning and remote sensing continue to improve raster data handling.

3. Explain Raster Data File Formats


Raster file formats include:
• PNG, JPEG/JPEG2000, TIFF, GeoTIFF, BMP, GIF
• MrSID, ECW (wavelet-compressed)
• netCDF, DEM, DRG, ADRG, CADRG, RPF
• BSQ, BIL, BIP (based on data layout)
• GeoTIFF: Includes spatial metadata
Each has specific use-cases, compression methods, and compatibility with GIS tools.
OR
There are many raster formats used in GIS:
• PNG: Lossless compression, supports transparency.
• JPEG2000: Supports lossy/lossless compression; used for satellite images.
• TIFF/GeoTIFF: Popular, supports georeferencing (coordinates embedded).
• MrSID/ECW: Highly compressed formats for large aerial imagery.
• DEM: Elevation data in raster grid form.
• DRG/ADRG/CADRG: Digitized raster graphics for maps; used in defense mapping.
• BIL, BSQ, BIP: Storage methods for multiband satellite data.
• BMP, GIF: Standard bitmap formats.
• NetCDF: Scientific data format for multidimensional data.
Each has specific uses based on compression, metadata support, and compatibility with GIS software.

OR
Raster data file formats define how raster (grid-based) spatial data is stored, compressed, managed, and shared
in a GIS. These formats differ based on their data structure, metadata, compression technique, and intended
use.
Raster formats are essential for representing and analyzing real-world phenomena like satellite imagery, aerial
photos, scanned maps, land cover, elevation, and temperature.

Two Main Raster Data Types

1. Grids
• Used to store both discrete (categorical) and continuous (numeric) data.
• Grids are especially common in ESRI GIS software.
a. Discrete Grids
• Cells store integer values.
• Represent categorical data like land use, soil type, or vegetation class.
• Example: Forest = 1, Water = 2, Urban = 3.
• Each cell value has a corresponding attribute stored in a table.
• Attribute tables are typically in INFO format.
• Efficient for thematic mapping and statistical analysis.
b. Continuous Grids
• Cells store floating-point values (decimal numbers).
• Represent continuous phenomena like elevation, temperature, or rainfall.
• Each cell has a unique value.
• These grids do not have attribute tables.
• Useful in surface modeling, interpolation, and environmental analysis.

2. Images
• Store brightness values of light or radiation (e.g., visible, infrared, UV).
• Commonly used for:
o Aerial photos
o Satellite imagery
o Scanned paper maps
• Can be displayed as background layers or used as attributes linked to features (e.g., house photos in real
estate).
• Must be georeferenced to be useful in GIS (aligned to real-world coordinates).

Common Raster File Formats


Format Description

GeoTIFF TIFF with embedded GIS metadata. Supports georeferencing and compression.

JPEG2000 Open-source format with lossy or lossless compression. Efficient size.

IMG ERDAS Imagine format; supports large raster datasets.

MrSID Multi-resolution, compressed wavelet format; good for large images.

ECW Enhanced Compressed Wavelet format by ERDAS. Lightweight and fast loading.

DRG Digital Raster Graphic; scanned USGS topographic maps.

CADRG/CIB Raster formats used by military and NGA (National Geospatial Agency).

ADRG ARC Digitized Raster Graphics (older military raster format).

Esri Grid Binary or ASCII format used in ArcGIS to store raster data in grid format.

netCDF-CF Scientific format for storing climate and Earth science data.

RPF/DTED Raster formats for elevation data used in military or topographic analysis.

Key Differences Between Formats

Feature Grids Images

Data Type Integer (discrete) / Float (continuous) Brightness values (DN)

Attribute Table Present (discrete only) Absent

Use Case Land cover, elevation, rainfall Satellite/aerial photos

Metadata Support Yes (GeoTIFF, IMG, etc.) Varies

Compression Often supported Required for large image files

Coordinate Reference Required for spatial alignment Must be georeferenced

Importance in GIS
• Choosing the right raster format is crucial for:
o Data sharing
o Storage optimization
o Software compatibility
o Analytical accuracy
• Georeferenced raster data allows integration with vector layers and supports spatial analysis, overlay,
and map production.

Summary
Raster data file formats determine how raster information is stored and utilized in GIS. They fall into two main
categories—grids (discrete or continuous) and images—each suited for different types of analysis. The proper
selection of file format ensures efficient data handling, visualization, and spatial analysis in a GIS environment.
4. Write a note on Geo-referencing
Geo-referencing assigns real-world coordinates to digital images/maps using reference points. It enables spatial
alignment for analysis in GIS. Methods include manual point placement or automatic transformation using
control points. Applications include mapping, urban planning, disaster management, and environmental
analysis.
Geo-referencing is the process of assigning real-world coordinates to a map or image so it aligns correctly with
spatial data.
Steps in Geo-referencing:
1. Select image/map.
2. Acquire, rectify, enhance.
3. Associate with known coordinates.
4. Apply transformation.
5. Save as georeferenced data.
Methods:
• Vector Referencing: Aligning with known vector features (like roads).
• Raster Referencing: Aligning using grids or control points.
Applications:
Used in urban planning, disaster management, land use mapping, etc.

OR
Geo-referencing is the process of assigning real-world geographic coordinates (like latitude and longitude) to a
map, image, or dataset so that it can be accurately aligned and overlaid with other spatial data in a Geographic
Information System (GIS).
It ensures that spatial datasets fit within a common coordinate framework, enabling accurate mapping, analysis,
and integration of geographic information from different sources.

Importance of Geo-referencing

• Aligns scanned maps, aerial images, and satellite data to real-world locations.

• Allows overlay and comparison with existing GIS layers.

• Essential for urban planning, disaster management, environmental monitoring, etc.

• Helps in data sharing and interoperability across GIS platforms.

Where Geo-referencing is Needed


• Scanned topographic maps
• Aerial photographs
• Satellite images
• Historical or legacy maps
• Any spatial data that lacks geographic referencing

Key Concepts in Geo-referencing


1. Ground Control Points (GCPs)
• These are known, easily identifiable locations on both the image/map and in the real world.
• Also called: tie points, tick points, or conjugate points.
• Examples: road intersections, landmarks, corner boundaries.
• Must be precise, stable, and well-distributed across the image.
2. Coordinate Systems & Projections
• Geo-referencing requires selecting an appropriate:
o Coordinate System (e.g., UTM, WGS 84)
o Projection (e.g., Mercator, Lambert Conformal Conic)
3. Transformation Models
• The software uses a mathematical model to adjust the image based on control points.
• Common models:
o Affine transformation (simple scaling, rotation, translation)
o Polynomial transformation
o Rubber sheeting (complex warping for distorted maps)
4. Residuals
• These are the errors or differences between the known real-world coordinates and the coordinates
estimated by the transformation model.
• Lower residuals = more accurate geo-referencing.

Methods of Geo-referencing

a. Image-to-Map
• Aligns an image (like a satellite photo) to a known coordinate system or a reference map.
• Commonly used in remote sensing and environmental studies.

b. Image-to-Image
• Uses another already geo-referenced image as the reference.
• GCPs are selected based on common features between the two images.
• Useful for creating consistency among multiple raster layers.

Software Tools for Geo-referencing


• Commercial GIS tools: ArcMap, ERDAS Imagine, PCI Geomatica, TNTmips
• Open-source GIS tools: QGIS, SAGA GIS, GRASS GIS
• These tools allow selection of GCPs, applying transformations, and saving georeferenced outputs.

Summary
Geo-referencing is an essential preprocessing step in GIS that transforms unreferenced spatial data into
accurate, location-based information. By aligning maps and images to a coordinate system using ground control
points, it enables reliable integration and spatial analysis of diverse geographic datasets.
5. Write a note on Vector Data Input
Vector data is input through digitization from printed or scanned maps using devices like:
• Digitizing tables (most common),
• Mouse (used in on-screen digitizing),
• Keyboard (for entering coordinates/attributes).
Modern tools support heads-up digitization and automated vector extraction.
OR
Vector data represents geographic features using points, lines, and polygons.
Input Methods:
• Digitising Table: Map is traced with a puck (like a mouse) on a special table.
• Mouse (on-screen digitising): Tracing over scanned images using GIS software.
• Keyboard: Used to manually enter coordinates or attribute data.
Digitising Table Features:
• High accuracy
• Needs control/reference points
• Captures real-world coordinates

6. Explain Digitiser
A digitiser converts analog information/inputs (like images, sound, motion) into digital format form
Examples:
• Digital Camera: Converts light into image files.
• Audio Digitiser: Converts sound to digital signals.
• Tablets/Stylus: Converts hand-drawn input into vector graphics.
• Graphic tablets (capture drawings),
• Accelerometers & Gyroscopes: Convert motion into data.
Used in GIS to trace features from maps, enabling creation of vector datasets.
In GIS, digitisers are mainly used for converting printed maps into digital vector data.

7. Differentiate between Datum, Projection, and Re-projection


Datum: Reference model for Earth's shape and location (e.g., WGS84).

Defines the origin and orientation of coordinate systems (e.g., WGS84, NAD27). It models Earth's shape.

• Horizontal Datum: Used for latitude/longitude (e.g., WGS84).


• Vertical Datum: Used for elevation (e.g., mean sea level).

Projection: Method to display 3D Earth on 2D maps (e.g., conical, cylindrical).


Transforms Earth’s 3D curved surface into a 2D map. Types:
• Cylindrical (e.g., Mercator)
• Conic (e.g., Albers)
• Planar (Azimuthal)
Each has distortion in area, shape, distance, or direction.
Re-projection: Transforming data from one coordinate system to another for alignment and analysis.
Changing data from one projection or coordinate system to another (e.g., from UTM to Latitude/Longitude).

8. Write a note on Coordinate Transformation


Coordinate transformation converts spatial data from one system to another, accounting for different datums
and projections. It includes translation, rotation, and scaling to align datasets from various sources accurately.
Tools in GIS automate these transformations.
OR
This refers to converting data from one coordinate system to another (e.g., from local to global).
Includes:
• Datum transformation
• Projection transformation
• Mathematical operations (translation, rotation, scaling)
Crucial for combining data from multiple sources accurately.

9. Define Topology and explain Adjacency, Connectivity, and Containment


• Topology: Spatial relationships among vector features.
• Adjacency: Features sharing boundaries.
• Connectivity: Line features connected at nodes (used in networks).
• Containment: One feature inside another (e.g., island in a lake).
OR
Topology is the spatial relationship between features in vector data.
• Adjacency: Two polygons share a common edge.
• Connectivity: Lines connected at nodes (e.g., road networks).
• Containment: One feature lies inside another (e.g., island in lake).
Topology is essential for maintaining spatial accuracy and for analyses like routing or area computation.

OR
Definition of Topology in GIS
Topology in GIS refers to the spatial relationships among geographic features such as points, lines, and
polygons. It is a set of rules and behaviors that define how these features share geometry and relate to each
other in a consistent and logical way.
Topology ensures that spatial data is logically structured, which is critical for:
• Maintaining data integrity
• Supporting spatial queries
• Performing network analysis, overlay operations, and map editing
In topological models, features are often stored using nodes, edges, and faces to express how elements are
connected or bounded.

Why Topology is Important


• Prevents gaps, overlaps, or duplicate features in spatial datasets.
• Enables spatial navigation (e.g., tracing roads or rivers).
• Supports automated error checking and data validation.
• Useful in applications such as utility networks, land parcel management, and cadastral mapping.

Key Topological Terms


1. Adjacency
• Definition: Adjacency refers to the relationship where two or more polygons share a common
boundary (edge).
• Example: Two neighboring land parcels or districts that touch each other.
• Use Case: Determining which zones or properties are next to each other for zoning or buffer analysis.

Illustration:
[Parcel A] | [Parcel B]

Shared boundary = adjacency
2. Connectivity
• Definition: Connectivity describes how linear features (like roads, rivers, pipelines) are linked at
nodes or junctions.
• Example: A road network where intersections allow movement from one road to another.
• Use Case: Vital in network analysis, such as finding the shortest path, or modeling the flow of utilities
(water, electricity).

Illustration:
Road A ----•---- Road B

Node (connected point)
3. Containment
• Definition: Containment indicates that a point or feature lies within a polygon or area.
• Example: A school located within a city boundary or a tree inside a park.
• Use Case: Helps in zoning regulations, identifying features inside a boundary, or spatial joins.

Illustration:
[Park Polygon]
• Tree (contained feature)

Real-World Applications of Topology


• Land administration and cadastral systems.
• Utility network flow (electricity, water, sewer).
• Transportation and navigation systems.
• Environmental monitoring (e.g., identifying neighboring forest areas).
• Data validation to detect and correct topological errors like:
o Gaps or overlaps between polygons.
o Dangling or unconnected lines.
o Slivers or duplicate boundaries.

Summary

Concept Description Example

Topology Rules defining spatial relationships among features Shared edges, connected nodes

Adjacency Polygons sharing a boundary Two adjacent land parcels

Connectivity Lines meeting at nodes (junctions) Connected roads or rivers

Containment One feature lies within the bounds of another A school inside a district boundary

10. Explain Topological Consistency


Topological consistency ensures correct spatial relationships (no gaps/overlaps) between features. It is essential
for data quality, analysis (like routing), and spatial queries.
OR
Ensures all spatial relationships are correctly maintained:
• No overlapping polygons
• Lines meet at nodes
• Polygons close properly
Maintaining consistency is critical for accurate GIS operations and analyses.

11. Explain Non-topological File Formats


Non-topological formats (e.g., shapefiles without topology) store vector data without inherent spatial
relationships. They are easier to use but may lack spatial integrity, leading to errors in advanced analyses.
OR
These formats (like shapefiles) store geometry but don’t enforce topological rules.
Pros:
• Simple, widely used
• Easy to edit
Cons:
• May lead to errors in analysis (e.g., gaps between polygons)
• No enforced relationships
12. Explain Attribute Data Linking
Attribute data linking involves associating spatial features with descriptive information (e.g., population, land
use) using identification codes. This allows querying and analysis based on both spatial and non-spatial data.
OR
Attribute data describes features (e.g., name, population, area).
GIS links spatial features to this data using a unique ID or key.
E.g., a polygon (district) is linked to a table with district name, population, literacy rate, etc.

13. Explain Linking External Databases


External databases (like SQL, Access) can be linked to GIS systems to retrieve and join attribute data using
keys. This enhances the flexibility and scalability of GIS databases.
OR
GIS allows linking with external databases (e.g., SQL Server, MS Access) for:
• Dynamic attribute updates
• Large datasets
• Real-time data access
Databases are linked through common keys, enabling better data management and analysis.

14. Explain GPS Data Integration


GPS integration enables importing real-time or recorded coordinates into GIS, allowing accurate mapping of
locations and features. It supports field data collection and spatial updates.
GPS provides real-time location data using satellites.
Integration in GIS allows:
• Accurate field data collection
• Mapping of features like roads, utilities, boundaries
• Real-time tracking
GPS data is usually in WGS84 datum and can be transformed to local coordinate systems.

Module 4

1. Explain Data Quality


Data Quality in GIS refers to how accurate, reliable, complete, and consistent a dataset is for its intended use.
High data quality ensures efficient processes, informed decision-making, customer satisfaction, and trusted
analysis.
Key Dimensions of Data Quality:
• Accuracy: How closely data reflects real-world conditions (e.g., correct coordinates).
• Completeness: Whether all required data is present.
• Consistency: No contradictions within or between datasets.
• Uniqueness: No duplicates; all entries are distinct.
• Timeliness: Data is available when needed.
• Validity: Conforms to defined formats and rules.
• Currency: Data is up-to-date.
• Integrity: Logical relationships among data are preserved throughout processing.

OR
Data quality in GIS refers to the accuracy, reliability, and fitness of spatial and attribute data for a specific
purpose or application. Good data quality ensures:
• Correct decision-making
• Accurate spatial analysis
• Reliable mapping outputs
Poor-quality GIS data can lead to misinterpretation, errors, and costly mistakes.
GIS data quality is commonly evaluated using six key aspects:
1. Completeness
2. Logical Consistency
3. Positional Accuracy
4. Temporal Accuracy
5. Thematic Accuracy
6. Lineage

2. Explain the terms:


a. Completeness
It measures whether all necessary geographic features, attributes, and relationships are included. Missing data
(omission) or excessive/inappropriate data (commission) affects completeness.
OR
• Measures whether all required features and attributes are included in the dataset.
• Includes:
o Omission Errors: Missing features (e.g., roads).
o Commission Errors: Extra or redundant features.
• Example: A land-use map missing some roads is incomplete.
b. Logical Consistency
Describes the structural soundness of spatial data, including proper topology, valid geometry, and attribute
rules. For instance, roads must connect properly, and attributes must match logical expectations (e.g., no road
type without a road name).
OR
• Ensures that data follows geometric, topological, and attribute rules.
• Includes:
o Geometric consistency: No overlaps or gaps between polygons.
o Topological consistency: Roads must be connected properly.
o Attribute consistency: Attribute values are within valid ranges (e.g., no negative population).
• Example: A road network with gaps or overlaps violates logical consistency.
c. Positional Accuracy
Refers to how well the locations of features in a dataset match their true locations. It includes:
• Absolute Accuracy: Proximity to the true Earth position.
• Relative Accuracy: Correct positioning relative to other nearby features.
OR
Reflects how accurately features are placed in relation to their true geographic location.
Types:
• Absolute Accuracy: Deviation from real-world location (e.g., GPS).
• Relative Accuracy: Accuracy between features (e.g., building distance).
Example: A river misplaced compared to satellite imagery indicates low positional accuracy.
d. Temporal Accuracy
Relates to how accurate the data is with respect to time. This includes:
• Currency: Data reflects current conditions.
• Lineage: Correct sequence of updates and event history.
OR
Refers to the timeliness of the dataset.
Issues include:
• Outdated imagery or maps.
• Wrong time stamps.
Example: A road map that doesn’t include new highways built recently.
e. Thematic Accuracy
Reflects correctness of attribute data:
• Quantitative: Numerical measurements.
• Qualitative (Semantic): Correct naming/classification of features.
OR

• Measures the correctness of attribute data or classification labels.


• Includes:
o Misclassification: Incorrect land cover types.
o Wrong attributes: Incorrect demographic values.
• Example: Classifying urban land as agriculture leads to low thematic accuracy.

f. Lineage
Records the history and processing steps of a dataset, including sources, transformations, dates, and processing
methods. Helps determine data reliability.
OR
Describes the history and origin of the data, including:
• Source (e.g., GPS, satellite)
• Processing steps (e.g., projection, editing)
• Accuracy reports
Example: A forest cover map derived from 2020 imagery should mention source, classification method, and
edits.

3. Explain briefly Metadata


Metadata is "data about data" in GIS. It documents the who, what, when, where, why, and how of a spatial
dataset. It helps users discover, assess, and properly use geospatial data.
Components of Metadata:
1. Identification: Title, keywords, description.
2. Contact: Information about the creator, publisher, and distributor.
3. Quality: Accuracy reports, completeness, and lineage.
4. Spatial Reference: Coordinate system and extent.
5. Entity and Attribute: Describes data types, features, and attributes.
6. Lineage: Processing history.
7. Legal: Access and usage restrictions.
8. Temporal: Data time span and update frequency.
9. Metadata Reference: Who created the metadata.
10. Standards: Formats used (e.g., ISO 19115, FGDC).

OR
Metadata = “Data about data” in GIS.
It provides essential information about a dataset, such as:
• Source
• Accuracy
• Projection
• Format
• Purpose

Types of Metadata:
1. Descriptive: Title, keywords, description.
2. Spatial: Coordinate system, projection, extent.
3. Attribute: Data fields, units.
4. Lineage: Source and processing history.
5. Quality: Accuracy and resolution.

Benefits of Metadata:
• Helps users evaluate and understand datasets.
• Supports data discovery, sharing, and decision-making.
• Maintains data investment and ensures updates are planned.
• Critical during emergency response for fast access across agencies.

4. Explain briefly GIS Standards


GIS Standards are protocols and guidelines that ensure compatibility and consistency in handling geospatial
data, tools, and services.
Types of Standards:
• Data Format Standards: (e.g., KML, GPKG, AIXM)
• Metadata Standards: (e.g., ISO 19115, FGDC CSDGM)
• Service Standards: (e.g., WMS, WFS, WPS)
Importance:
• Ensure interoperability between systems.
• Improve data sharing and integration.
• Reduce redundancy and errors.
• Support international data exchange.

OR
GIS Standards are rules and protocols that ensure consistency, accuracy, and interoperability of spatial data
across systems and organizations.

Why GIS Standards are Important:


• Facilitate data sharing
• Ensure accuracy and compatibility
• Support collaboration and decision-making
• Reduce errors and redundancy

Types of GIS Standards:


1. Data Standards (e.g., ISO 19107 for geometry)
2. Metadata Standards (e.g., ISO 19115, FGDC)
3. Data Exchange Standards (e.g., Shapefile, GML, WMS, WFS)
4. Coordinate Reference System Standards (e.g., WGS84, UTM)
5. Data Quality Standards (e.g., NSSDA, ISO 19157)
6. GIS Software Standards (e.g., CityGML, OGC SFS)
7. Legal and Policy Standards (e.g., INSPIRE, ODbL)
8. Cartographic Standards (e.g., ISO 19117 for symbology)
5. Explain briefly Interoperability
Interoperability is the ability of different GIS systems, applications, and datasets to work together smoothly.
Types:
• IT Interoperability: Working across various hardware and platforms.
• GIS Interoperability: Sharing spatial data regardless of format.
• Web Interoperability: Using standard services (like WMS, REST APIs) across web clients.
Benefits:
• Facilitates collaboration.
• Allows integration of data from multiple sources.
• Enhances decision-making.
Challenges:
• Lack of standard formats.
• Vendor-specific platforms.
• Data translation complexity.

OR
Interoperability is the ability of different GIS systems, software, and data formats to work together
seamlessly. It allows smooth data exchange, integration, and collaboration.

Types of Interoperability:
1. Syntactic – Understanding file formats (e.g., reading GeoJSON, KML).
2. Semantic – Understanding meaning of data (e.g., classifying land use consistently).
3. Technical – Hardware and software compatibility (e.g., GPS integration).
4. Organizational – Effective data and workflow sharing between agencies.

Importance:
• Enables cross-platform operations.
• Supports integration with GPS, remote sensing, and web services.
• Reduces duplication and improves data sharing.

6. Write a note on:


a. OGC – Open Geospatial Consortium
OGC is an international non-profit organization that develops open standards for geospatial content, services,
and technologies. Founded in 1994, it promotes interoperability through standardized protocols.
Key Standards:
• WMS (Web Map Service)
• WFS (Web Feature Service)
• GML (Geography Markup Language)
• KML (Keyhole Markup Language)
• GeoPackage, SensorML, GeoSPARQL
Programs:
• Standards Program: Develops specifications.
• Compliance Program: Tests and certifies software.
• Community & Outreach: Promotes adoption.

OR
a. OGC (Open Geospatial Consortium)
The Open Geospatial Consortium (OGC) is an international organization that develops open standards to
promote interoperability in GIS.

OGC Standards:
1. WMS (Web Map Service) – Delivers map images over the web.
2. WFS (Web Feature Service) – Access and edit vector data online.
3. WCS (Web Coverage Service) – Shares raster data (e.g., satellite images).
4. GML (Geography Markup Language) – XML-based universal spatial data format.
5. KML (Keyhole Markup Language) – Used in Google Earth/Maps.
6. SOS (Sensor Observation Service) – Real-time sensor data (e.g., weather).
7. SF (Simple Features) – Basic shapes like points, lines, polygons.

Benefits:
• Supports data sharing and cross-platform integration
• Used in disaster management, urban planning, agriculture, environment
• Promotes open and accessible GIS systems

b. Spatial Data Infrastructure (SDI)


SDI is a framework that enables efficient access, sharing, and use of spatial data.
Components:
• Spatial data: Maps, satellite images, geodatabases.
• Metadata: Describes datasets.
• Users and institutions: Government, academia, public.
• Software: GIS tools for data access and processing.
• Standards: For data format and communication.
Functions:
• Helps in decision-making.
• Reduces data duplication.
• Supports national and global spatial data integration.
• Enables open data access and sharing.
Users:
Governments, private firms, NGOs, researchers, and the public.

OR
b. Spatial Data Infrastructure (SDI)
Spatial Data Infrastructure (SDI) is a framework that allows efficient sharing, access, and use of spatial
data across organizations and systems.

Components of SDI:
1. Data – Maps, imagery, demographics, etc.
2. People – Users, creators, organizations.
3. Policies & Standards – Ensure quality and interoperability.
4. Technology – GIS software, cloud platforms, databases.
5. Network Infrastructure – Web services like WMS, geoportals.

Levels of SDI:
• Local SDI: City-level GIS applications.
• National SDI (NSDI): Country-wide data integration.
• Regional SDI: Shared data across regions (e.g., EU's INSPIRE).
• Global SDI: Worldwide efforts like UN-GGIM.

Benefits of SDI:
• Easier data access and retrieval
• Promotes collaboration
• Ensures standardized data management
• Helps in disaster response, planning, and policy-making

Summary Table

Concept Description

Data Quality Accuracy and reliability of GIS data for decision-making.

Metadata Data about data (e.g., source, format, accuracy).

GIS Standards Rules to ensure interoperability and consistency of spatial data.

Interoperability GIS systems working together across formats, platforms, and organizations.

OGC Organization developing open standards for GIS (e.g., WMS, WFS, GML).

SDI Framework to manage and share spatial data using standards, technology, and networks.
Module 5

1. Explain Import/Export Data in GIS


Importing in GIS means bringing spatial data from external sources (like shapefiles, databases, CSVs, or web
services) into GIS software for use and analysis.
Exporting is the process of saving or transferring GIS data from the software to another system or file format
for sharing, storage, or external use.
Common Formats:
• Vector: Shapefiles (.shp), GeoJSON, KML/KMZ
• Raster: TIFF, JPEG, PNG
• Tabular: CSV, Excel
• Geodatabases: File or personal geodatabases
Software Tools: ArcGIS Pro, QGIS, Locus GIS, Google Earth Pro

OR
Importing Data:
Importing in GIS means bringing in spatial (vector/raster) or non-spatial (tabular) data from external sources for
visualization and analysis.
Common Import Formats:
• Vector: .shp, .geojson, .kml, .dxf, .gml
• Raster: .tif, .jpg, .png, GRID, .nc
• Tabular: .csv, .xls/.xlsx, .dbf, SQL databases
Import Methods:
• File import in software (e.g., QGIS, ArcGIS)
• Database connections (PostGIS, SQL Server)
• Web services (WMS, WFS, APIs)
• GPS devices (GPX, NMEA)

Exporting Data:
Exporting is saving GIS data in a format suitable for other applications, sharing, or reporting.
Export Formats:
• Vector: .shp, .geojson, .kml, .dxf
• Raster: .tif, .jpg, .png
• Tabular: .csv, .xlsx, .json
• Cartographic: .pdf, .svg, .ai
Challenges:
• Format compatibility
• Coordinate system mismatches
• Metadata loss
• File size issues
Best Practices:
• Check CRS
• Use standard formats
• Validate data
• Preserve metadata
• Compress large files

2. Briefly explain Data Management Functions in GIS


Data management in GIS involves the processes of collecting, storing, editing, organizing, retrieving, and
maintaining geospatial and attribute data.
Key Functions:
1. Data Acquisition: Importing from shapefiles, web services, or sensors.
2. Storage & Organization: Using geodatabases and layers.
3. Editing & Manipulation: Geoprocessing, buffering, overlay, spatial joins.
4. Query & Retrieval: Attribute-based or spatial-based data selection.
5. Quality Control: Validating data accuracy, consistency.
6. Security & Versioning: Managing access, data history.
7. Metadata Management: Recording data source, accuracy, and structure.

OR
Data Management in GIS ensures efficient storage, retrieval, analysis, and sharing of spatial data.

Key Functions:
1. Data Storage:
o Relational databases (RDBMS), object-oriented DBMS, and file-based storage.
o Spatial indexing for faster access.
2. Data Input & Integration:
o Importing, georeferencing, digitizing, GPS integration.
3. Data Processing & Manipulation:
o Spatial queries, table editing, coordinate transformations, raster-vector conversions.
4. Data Quality Control:
o Error correction, metadata maintenance, validation.
5. Data Analysis & Modeling:
o Overlay, buffering, interpolation, network analysis.
6. Export & Sharing:
o Export formats, web services, cloud GIS.

Importance:
• Improves accessibility, ensures data integrity, reduces redundancy, and supports collaboration.

3. Explain Conversion of Raster to Vector in GIS


Definition: The process of converting raster data (pixel-based) into vector data (points, lines, polygons). Also
called vectorization.
Why it's done:
• For precise editing
• Improved spatial analysis
• Efficient storage
Methods:
• Polygonization: Converts pixel regions into polygons.
• Contouring: Extracts elevation lines from DEMs.
• Feature extraction: Classifies pixels and derives vector features.
Tools: QGIS (Polygonize), ArcGIS (Raster to Polygon), Google Earth Engine

OR
Raster to Vector (Vectorization) involves converting pixel-based data into point, line, or polygon features.

Methods:
• Manual Digitization
• Thresholding (binary classification)
• Edge Detection (for contours)
• AI-based Vectorization

Applications:
• Scanned maps to vector layers
• Satellite images to roads/buildings
• DEMs to contour lines

Challenges:
• Detail loss
• Topological errors
• Computational load
• Post-processing cleanup

OR
Raster to Vector Conversion Steps:
1. Select Raster Layer
o Choose the input raster image (e.g., a scanned map, satellite image, or classified land-use raster).
o The raster must have clear pixel values representing different features (e.g., roads, water, land
types).
2. Choose Vectorization Method
Depending on the type of feature to extract:
o Polygonization: Converts groups of similar pixel values into polygons.
o Contouring: Converts continuous raster values (e.g., elevation) into contour lines.
o Thresholding / Feature extraction: Identifies specific pixel values or ranges and converts them
into vector features.
3. Set Parameters
o Pixel value ranges to convert (e.g., only extract water bodies from blue pixels).
o Smoothing/tolerance: Controls how precise or simplified the resulting vector lines/polygons
will be.
o Output type: Point, line, or polygon, based on feature geometry.
4. Run Vectorization Tool
o The software scans through raster cells.
o Groups cells with similar values.
o Converts those groups into vector geometries.
o Saves the result as a vector layer (e.g., shapefile, GeoJSON, etc.).
What Happens During Raster to Vector Conversion?
• The raster is scanned pixel-by-pixel.
• Regions of similar pixel values are grouped.
• Each group is turned into a vector feature:
o If area → Polygon
o If boundary → Line
o If point features (isolated pixels) → Point
• The resulting vector layer can now be edited, analyzed, and stored with attributes.
Example Use Case:
Imagine you have a land cover raster where:
• 1 = Water
• 2 = Forest
• 3 = Urban
After raster-to-vector conversion:
• Water pixels become water polygons.
• Forest pixels become forest polygons.
• Urban areas become urban polygons.
4. Explain Conversion of Vector to Raster in GIS
Definition: The transformation of vector data into raster format, where features are encoded in pixel values.
Why it's done:
• For raster-based analysis (e.g., terrain modeling)
• Compatibility with remote sensing data
Steps:
1. Select vector layer
2. Choose rasterization method
3. Set output resolution (cell size)
4. Run conversion
Tools: QGIS (Rasterize), ArcGIS (Feature to Raster, Polygon to Raster)
Considerations:
• Some detail may be lost
• Pixel size affects accuracy

OR
Vector to Raster (Rasterization) transforms points, lines, or polygons into grid-based data.

Methods:
• Cell-based Encoding: Assign values to raster cells.
• Interpolation: Create continuous surfaces (e.g., elevation).
• Density Estimation: Generate heat maps.

Applications:
• Heat maps
• Spatial modeling
• Remote sensing input

Challenges:
• Resolution loss
• Storage needs
• Generalization of sharp boundaries

OR
Vector to Raster Conversion:
This process converts vector features (points, lines, polygons) into a raster format (grid of pixels), where each
pixel holds a value that represents a feature or attribute from the vector data.
1. Select Vector Layer
• What happens: You begin by selecting the vector layer (e.g., shapefile, GeoJSON, feature class) that
contains the spatial features you want to convert.
• Example: A polygon layer showing land use types (forest, water, agriculture) or a point layer showing
school locations.
2. Choose Rasterization Method
• What happens: You decide how the vector features should be represented in raster form. This involves
selecting:
o Geometry type: Point, line, or polygon.
o Attribute field: Which attribute from the vector file should be used as the pixel value (e.g., land
use type, elevation, population).
o Conversion rule: How values are assigned if multiple features fall within a single pixel (e.g.,
most frequent, average, max, etc.).
3. Set Output Resolution (Cell Size)
• What happens: You define the size of each raster cell (pixel), usually in map units (e.g., meters).
o Smaller cells = higher resolution, more detail, but larger file size.
o Larger cells = lower resolution, less detail, smaller file.
• Example:
o A cell size of 10 meters means each pixel in the raster will represent a 10x10 meter area on the
ground.
4. Run Conversion
• What happens:
o GIS software scans each pixel in the output raster.
o It checks if any part of the vector geometry intersects that pixel.
o If yes, it assigns a pixel value based on the selected attribute.
o The result is a raster dataset where each cell contains a value from the vector layer.
• Output: A raster file (e.g., GeoTIFF) representing the vector features.

Example:
Converting a vector polygon layer of land use to raster:
• Attribute: LandType (1 = Forest, 2 = Water, 3 = Urban)
• Cell size: 30 meters
• Result: A raster grid where each pixel value is 1, 2, or 3 depending on land use.

5. Explain Briefly Map Compilation in GIS


Map Compilation is the process of assembling and organizing spatial data from multiple sources to produce a
map.
Steps:
1. Define purpose and scale
2. Gather data (from maps, GPS, imagery)
3. Verify data quality and accuracy
4. Transform and symbolize data
5. Design map layout (legend, scale, orientation)
6. Generate final output
Sources: Satellite images, surveys, GPS, geodatabases
Importance:
• Provides accurate representation for planning and decision-making
• Often semi-automated but still requires human input for quality assurance

OR
Map Compilation is the process of assembling GIS data into a final map layout.

Key Elements:
• Title
• Legend
• Scale bar
• North arrow
• Grid
• Labels
• Metadata
• Data layers

Steps:
1. Data Collection
2. Layer Selection & Symbolization
3. Layout Design & Labeling
4. Cartographic Enhancements
5. Export & Sharing (PDF, PNG, GeoTIFF, KML)

Types of Maps:
• Topographic, Thematic, Choropleth, Isoline, Network, 3D Maps

Challenges:
• Data inconsistency
• Color confusion
• Label clutter

6. Define Chart/Graphs and Explain types of Chart/Graphs in GIS


Charts/Graphs in GIS visualize data patterns, trends, and relationships complementing maps.
Common Types:
1. Line Graphs – Show trends over time.
2. Bar Charts – Compare categories.
3. Pie Charts – Show proportions.
4. Histograms – Display frequency distribution.
5. Scatter Plots – Reveal relationships between variables.
6. Venn Diagrams – Show overlapping categories.
7. Area Charts – Show cumulative values.
8. Spline Charts – Smooth curves over time.
9. Box and Whisker – Display data quartiles.
10. Bubble Charts – Three-variable comparison.
11. Dot Plots – Frequency of values.
12. Radar Charts – Multi-variable comparison.
13. Pictographs – Use symbols to represent values.
14. Pyramid Charts – Represent hierarchies.
Uses:
• Data exploration and comparison
• Enhancing map interpretation
• Communicating trends visually

OR
Charts/Graphs in GIS visualize attribute data for better understanding and analysis.

Common Types:
1. Bar Charts – Categorical comparisons (e.g., land use types)
2. Pie Charts – Proportional data (e.g., population distribution)
3. Histograms – Frequency distribution (e.g., elevation range)
4. Line Graphs – Trends over time (e.g., rainfall data)
5. Scatter Plots – Correlation between variables (e.g., NDVI vs. temperature)
6. Bubble Charts – Data with 3 variables (e.g., city size vs. income vs. pollution)
Usage in GIS:
• Linked to attribute tables
• Dynamic and interactive in modern GIS dashboards

7. Write Notes on:


a. Multimedia in GIS
Multimedia integrates text, audio, images, video, and animation to enhance GIS applications. It supports:
• Education: Interactive tutorials, training
• Entertainment: Game-based simulations
• Engineering: 3D modeling and visualizations
• Journalism: Storytelling with maps and media
• Medicine: Virtual surgeries and anatomy maps

OR
Multimedia elements like images, audio, video, and hyperlinks enhance GIS maps by providing context and
real-world references.
Examples:
• Photos of landmarks
• Videos from surveillance drones
• Sound clips of environmental data
• Useful in tourism, heritage, and education

b. Enterprise vs. Desktop GIS

Feature Enterprise GIS Desktop GIS

Purpose Organization-wide Individual or small teams

Deployment Web/cloud/server-based Installed on local computer

Data Access Centralized with sharing Local storage

Collaboration High (multi-user editing) Limited

Examples ArcGIS Enterprise, QGIS Server ArcGIS Pro, QGIS, Global Mapper

OR
Feature Desktop GIS Enterprise GIS

Users Single Multiple (organization-wide)

Storage Local machine Central database/server

Access Standalone Network-based (intranet/internet)

Examples QGIS, ArcGIS Pro ArcGIS Enterprise, GeoServer

Scalability Limited Highly scalable

Collaboration Minimal Supports teamwork

Data Sharing Manual Automated, real-time

c. Distributed GIS
Distributed GIS uses multiple connected systems to store, process, and deliver spatial data.
Features:
• Access from different locations
• Scalability and collaboration
• Web/mobile applications
Examples:
• Google Maps, ArcGIS Online, Bing Maps
Applications:
• Logistics, agriculture, disaster response, utilities

OR
Distributed GIS is a system where GIS components (databases, processing, interfaces) are spread across
multiple servers or locations.
Features:
• Uses internet and cloud services
• Enables real-time applications (e.g., traffic, weather)
• Supports IoT integration
Advantages:
• Scalability
• Real-time data access
• Global accessibility
Examples: Google Maps API, NASA EarthData, OpenStreetMap

d. Data Output in GIS


Data output refers to how GIS data is displayed, shared, or communicated.
Forms:
• Maps (digital, print)
• Charts & Graphs
• Tables
• Reports
• Web apps
• 3D models or simulations
Factors Affecting Output:
• Audience
• Purpose
• Type of data
• Software used

OR
Data Output refers to how GIS data is presented or shared.
Output Formats:
• Maps (paper, digital PDFs)
• Tables (Excel, CSV)
• Reports (PDF, DOC)
• Web Maps (KML, GeoJSON, WMS)
Uses:
• Decision support
• Reporting
• Visualization
• Sharing with non-GIS users

8. Differentiate Between Enterprise and Desktop GIS

Aspect Enterprise GIS Desktop GIS

Usage Organization-wide collaboration and data sharing Individual or small team analysis

Deployment Web/cloud-based systems Installed locally

Data Storage Central servers or cloud databases Local hard drive

Collaboration Multi-user editing, version control Minimal or manual sharing

Examples ArcGIS Enterprise, PostGIS, QGIS Server ArcGIS Pro, QGIS, Global Mapper

Scalability Highly scalable, supports big data and many users Limited scalability

Security Role-based access, encryption, authentication Basic file-level access control

OR
Criteria Desktop GIS Enterprise GIS

User Access Single user Multi-user

Data Storage Local (PC, laptop) Centralized (database/server)

Collaboration Limited High (team-based, department-wide)

Internet Required Not essential Required for remote access

Scalability Low High (enterprise-grade solutions)

Examples QGIS, ArcGIS Pro ArcGIS Enterprise, Google Earth Engine

Maintenance Easier and cheaper Requires IT infrastructure

Performance Fast for local data Depends on network & server setup

You might also like