Milestone 1

This document discusses automatic image annotation using multi-kernel learning for image patch clustering. Image patches are extracted from images using dense sampling at multiple scales with overlap. Features are extracted from each patch and multi-kernel learning is applied to cluster visually similar patches into groups within each category. Multi-kernel learning is also used to discover cross-category patch groups. The relevance of each group to its category tag is determined. A "cell graph" is constructed with categories and patch groups to represent associations. Knowledge graph construction and contextual relationship discovery are discussed to annotate images. The main ideas were extracted from two referenced papers on multi-kernel learning for tracking and knowledge graph-based image classification.

Uploaded by

MALARVILI A/P NALLAYAN


Title: Automatic Image Annotation

1.1 Image patch generation

Image patches can be extracted in several ways, including image segmentation, dense sampling,
and salient region detection. Dense sampling is the most widely used because of its simplicity:
patches are sampled uniformly across the image with a fixed step. Dense sampling on a regular
grid gives good coverage of entire objects or scenes and a constant number of features per unit
of image area. Regions with less contrast contribute equally to the overall image representation.

Scene images often contain many objects of interest against various backgrounds. We first divide images
into patches and then generate patch groups within the same category. Although a rigid partition of an
image into a grid preserves certain spatial information, it often breaks an object into several blocks or
puts different objects into a single block. Thus, visual information about an object, which could be
beneficial to image categorization, may be destroyed by a rigid partition. We therefore extract image
patches by dense sampling at multiple scales, scanning each grid type densely over the image with overlap.

In this experiment, the grid scales are set to 60 x 60, 120 x 120, and 180 x 180. The corresponding
overlaps are set to 15, 30, and 45, respectively. Partitions in the horizontal and vertical directions are
added to preserve consistent structure information (Xie et al., 2018). This process produces a highly
redundant collection of image patches in each image category. Each group is defined as a collection of
image patches that are visually similar to one another.

After patch extraction, we obtain a patch set for each category, $P_{cat} = \{p_1, p_2, \ldots, p_n\}$,
where $p_i$ is the i-th patch and $n$ is the number of patches in the category.
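
The multi-scale scan described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the overlap is the number of units shared by adjacent windows (so the step is size minus overlap) and that windows which would cross the image border are dropped. The function name `patch_boxes` and the 360 x 240 image size are hypothetical.

```python
from typing import Iterator, List, Tuple

# (window size, overlap) pairs from the text. The step is an assumption:
# adjacent windows are taken to share `overlap` units, so step = size - overlap.
SCALES: List[Tuple[int, int]] = [(60, 15), (120, 30), (180, 45)]

Box = Tuple[int, int, int]  # (left, top, size) of a square patch


def patch_boxes(width: int, height: int,
                scales: List[Tuple[int, int]] = SCALES) -> Iterator[Box]:
    """Yield square patch boxes that fit fully inside a width x height image."""
    for size, overlap in scales:
        step = size - overlap
        # range() is empty when the window is larger than the image,
        # so scales that do not fit simply contribute no patches.
        for top in range(0, height - size + 1, step):
            for left in range(0, width - size + 1, step):
                yield (left, top, size)


# Hypothetical 360 x 240 image: collect the boxes for all three scales.
boxes = list(patch_boxes(360, 240))
print(len(boxes), boxes[:3])
```

Pooling the boxes from every image of a category gives the patch set for that category. Working with boxes rather than pixel copies keeps the highly redundant collection cheap to store until features are actually needed.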

Figure 1: Illustration of the overlapping sliding-window patch extraction.

1.2 Patch grouping within a category

The goal of this step is to obtain several discriminative, dense patch groups for each image
category. Each group contains visually similar image patches. Finding clusters in data is a challenging
task when the clusters differ widely in shape, size, and density. State-of-the-art methods find
dense subgraphs of the affinity graph as the dominant clusters. However, the time and space complexity
of those methods is dominated by the construction of the affinity graph, which is quadratic in the
number of data points and thus impractical for large data sets.
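
To make the quadratic cost concrete, the short calculation below estimates the memory for a dense affinity matrix. It assumes one 8-byte floating-point entry per pair of patches; the helper name `affinity_matrix_bytes` is hypothetical.

```python
def affinity_matrix_bytes(num_patches: int, bytes_per_entry: int = 8) -> int:
    """Memory needed for a dense num_patches x num_patches affinity matrix."""
    return num_patches * num_patches * bytes_per_entry


# Ten times more patches costs one hundred times more memory.
for n in (1_000, 10_000, 100_000):
    print(f"{n} patches: {affinity_matrix_bytes(n) / 1e9} GB")
```

Under this assumption, 100,000 patches already require 80 GB, which is why a redundant dense-sampling patch collection cannot be clustered this way without some reduction step.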

We extract three kinds of features to describe the visual content of an image patch: the 128-dimensional
SIFT feature, the 256-dimensional Local Binary Pattern (LBP), and the 128-dimensional color histogram.
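
As an illustration of one of these descriptors, the sketch below computes a 256-bin LBP histogram in plain Python. The text does not say which LBP variant is used; this assumes the basic 3 x 3, 8-neighbour form, whose 8-bit codes give exactly 256 bins. The function name `lbp_histogram` and the list-of-lists grayscale input are hypothetical choices. SIFT and color histograms would normally come from an image library and are not shown.

```python
from typing import List, Sequence

# Clockwise 8-neighbour offsets (row, col), starting at the top-left pixel.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
              (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_histogram(gray: Sequence[Sequence[int]]) -> List[float]:
    """Return a 256-bin, L1-normalised histogram of basic 3x3 LBP codes."""
    rows, cols = len(gray), len(gray[0])
    hist = [0.0] * 256
    # Border pixels are skipped because they lack a full 3x3 neighbourhood.
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            center = gray[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(NEIGHBOURS):
                # A neighbour at least as bright as the center sets its bit.
                if gray[r + dr][c + dc] >= center:
                    code |= 1 << bit
            hist[code] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


# A flat patch: every neighbour equals the center, so every code is 255.
flat = [[10] * 5 for _ in range(5)]
print(lbp_histogram(flat)[255])
```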

Rough idea

- Multi-kernel learning should now be applied to cluster patches into patch groups within the
same category (Fan, H., & Xiang, J.: follow the approach discussed in this paper, with first-stage
and second-stage multiple kernel learning used to assign different weights to the patches, and
use an appropriate clustering technique to cluster the patches). Image patches that are visually
similar to each other compose a patch group within the same category. Each category will
contain many patch groups of visually similar patches. The number of groups generated in each
category can vary. We first collect the category name as the tag.
- Next, discover sets of visually similar patch groups across categories by applying multi-kernel
learning, using the same technique that was used to cluster patches within a category.
- Then, find the relevance degree of each group to its coarse category tag.
o It has been shown that the object of interest is often located near the center of the image
o Its size is relatively large
o It is located near the center of the image
- Construct the “cell graph” with the category as the center node and the image patch groups as
the side nodes. The association values in the “cell graph” are obtained from the relevance degree.
- Use WordNet to find semantic associations.
- Combine every related “cell graph” into a subgraph.
- Contextual relationship discovery in the knowledge graph.
- Knowledge graph construction as in Paper 2.
- Finally, annotate the image (refer to Paper 1).
- The main papers from which the ideas were extracted are highlighted in red under References.
Dataset: same as Paper 2
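
The kernel combination at the heart of the clustering step can be sketched as below. This is not the two-stage method of Fan and Xiang: it only shows how one base kernel per feature type can be merged into a single patch-to-patch affinity, with the weights supplied by hand rather than learned. The RBF base kernel, the `gamma` value, the feature names, and the toy 2-dimensional vectors are all assumptions for illustration.

```python
import math
from typing import Dict, List, Sequence

# One vector per feature type, e.g. {"sift": [...], "lbp": [...], "color": [...]}.
Features = Dict[str, Sequence[float]]


def rbf_kernel(x: Sequence[float], y: Sequence[float], gamma: float = 1.0) -> float:
    """Gaussian (RBF) kernel exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def combined_kernel(p: Features, q: Features, weights: Dict[str, float]) -> float:
    """Convex combination of one RBF kernel per feature type."""
    total = sum(weights.values())  # normalise so the weights sum to 1
    return sum(w / total * rbf_kernel(p[name], q[name])
               for name, w in weights.items())


def affinity_matrix(patches: List[Features],
                    weights: Dict[str, float]) -> List[List[float]]:
    """Dense pairwise affinities; quadratic in the number of patches."""
    return [[combined_kernel(p, q, weights) for q in patches] for p in patches]


# Toy patches with 2-dimensional features (real ones would be 128/256/128-d).
patches = [
    {"sift": [0.0, 1.0], "lbp": [1.0, 0.0], "color": [0.5, 0.5]},
    {"sift": [0.1, 0.9], "lbp": [0.9, 0.1], "color": [0.4, 0.6]},
    {"sift": [1.0, 0.0], "lbp": [0.0, 1.0], "color": [0.9, 0.1]},
]
weights = {"sift": 1.0, "lbp": 1.0, "color": 1.0}  # placeholder, not learned
A = affinity_matrix(patches, weights)
```

The resulting matrix can be passed to any clustering technique that accepts precomputed affinities. A multi-kernel learning stage would replace the placeholder weights with learned ones.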

- Two contributions are expected from this research: multi-kernel learning for image patch
clustering, and knowledge graph construction using image patches.
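
The relevance degree and the “cell graph” from the outline can be sketched together. The scoring rule below is only an assumption built from the two cues in the notes (objects of interest tend to be near the image center and relatively large); the notes do not give a formula. The names `patch_relevance` and `build_cell_graph`, the category "beach", and the shared 240 x 240 image size are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Box = Tuple[int, int, int]  # (left, top, size) of a square patch


def patch_relevance(box: Box, width: int, height: int) -> float:
    """Hypothetical score in [0, 1]: higher for large patches near the center."""
    left, top, size = box
    cx, cy = left + size / 2, top + size / 2
    # Distance of the patch center from the image center,
    # normalised by the half-diagonal of the image.
    dist = math.hypot(cx - width / 2, cy - height / 2)
    centrality = 1.0 - dist / math.hypot(width / 2, height / 2)
    # Patch side relative to the shorter image side, capped at 1.
    relative_size = min(1.0, size / min(width, height))
    return centrality * relative_size


def build_cell_graph(category: str, groups: Dict[str, List[Box]],
                     width: int, height: int) -> Dict[str, Dict[str, float]]:
    """Star graph: the category is the center node, each group a side node."""
    edges: Dict[str, float] = {}
    for group_id, boxes in groups.items():
        if not boxes:
            continue  # an empty group has no relevance to report
        scores = [patch_relevance(b, width, height) for b in boxes]
        edges[group_id] = sum(scores) / len(scores)  # mean patch relevance
    return {category: edges}


# Two hypothetical groups: one central and large, one small in a corner.
graph = build_cell_graph(
    "beach",
    {"g1": [(60, 60, 120)], "g2": [(0, 0, 60)]},
    width=240, height=240,
)
print(graph)
```

Each edge weight plays the role of the association value between the category node and a patch group. Several such star graphs could later be merged into a subgraph through shared or semantically related tags.
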
References

Fan, H., & Xiang, J. (2015). Patch-based visual tracking with two-stage multiple kernel learning. Lecture
Notes in Computer Science, 9219, 20–33. https://doi.org/10.1007/978-3-319-21969-1_3
Xie, L., Lee, F., Liu, L., Yin, Z., Yan, Y., Wang, W., Zhao, J., & Chen, Q. (2018). Improved spatial pyramid
matching for scene recognition. Pattern Recognition, 82, 118–129.
https://doi.org/10.1016/j.patcog.2018.04.025
(Paper 2) Zhang, D., Cui, M., Yang, Y., Yang, P., Xie, C., Liu, D., Yu, B., & Chen, Z. (2019). Knowledge
Graph-Based Image Classification Refinement. IEEE Access, 7, 57678–57690.
https://doi.org/10.1109/ACCESS.2019.2912627
(Paper 1) Zhang, S., Tian, Q., Hua, G., Huang, Q., & Gao, W. (2014). ObjectPatchNet: Towards scalable
and semantic image annotation and retrieval. Computer Vision and Image Understanding, 118, 16–
29. https://doi.org/10.1016/j.cviu.2013.03.008
