SlideShare a Scribd company logo
1 of 8
Download to read offline
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 7
A NOVEL APPROACH FOR ANNOTATING IMAGES BY SEMANTIC
SIMILARITY & KEYWORD BASED ASSOCIATION ANALYSIS
Payal Gulati,
Dept. of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India
gulatipayal@yahoo.co.in
A.K. Sharma
Dept. of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India
ABSTRACT
Tremendous growth of multimedia data has resulted in enhancing the number of images over the
web.As a consequence of this, search engines face challenges in finding relevant images. Therefore,
images must be properly annotated and effectively indexed for increasing relevancy. Image
annotation has been a topic of ongoing research for more than a decade leading to several interesting
techniques. However, retrieval results are not satisfactory because there is a semantic gap between
low-level visual features and high-level semantic concepts. This motivates our work. In this work, a
novel approach for finding relevant annotation based on semantic similarity and subsequently
enrichingtheannotationbyusing keywordbasedassociationanalysisisproposed.
Index Terms—Image Annotation, Page Manager, Association Rule Mining, Knowledge Base,
Wordnet,QueryProcessor,ImageRepository.
architectureforannotatingimagesbysemantic
similarity and keyword based association
analysis. Algorithm and flow diagram are
discussed in section IV and V respectively.
Implementation and comparison of the
proposed and existing systems is presented in
section VI. Finally section VII comprises of the
conclusion.
II. RELATEDWORK
Image annotation is to assign metadata with
images based on visual features of the images
and as well as the associated text [2]. It is an
active research topic of recent years for
understanding images and simplifying the web
image retrieval process [3]. The metadata added
to images allows effective searches. Manually
adding metadata is an extremely laborious and
time-consuming task [4]. To overcome this,
many researchers have developed an approach
for automatically annotating images [5, 6, 7] by
using text from the image tag contained in image
fileandthetextsurrounding theimage.
Content descriptive and content dependent
information can be associated with the images
I. INTRODUCTION
The size of the web has already surpassed 13.86
billion pages [1] and yet there is no sign of
leveling off; causing a challenge for image
search engines in searching images over these
pages. Image annotation or automatically
annotating images with keywords is a solution to
this problem. Images are annotated by
extracting relevant keyword from page title,
surrounding text, metadata, image caption etc.
Proper annotations of images increase relevancy
andefficiencyofretrieval.
Despite the continuous efforts in inventing new
annotation algorithms, the annotation
performance is usually not satisfactory due to
limited vocabulary. Moreover, images over the
web are not annotated with semantic
descriptors, raising difficulty in searching for
relevant images. In this paper, a novel approach
for annotating images both by semantic
similarity and keyword based association
analysisisbeingproposed.
This paper is organized in the following way.
Section II discusses the related work done in this
domain. Section III presents the proposed
[8]. In Content-descriptive annotation, images
areannotatedby extractingrelevantkeyword[9]
from page title, surrounding text, metadata,
image caption, alt tags etc. However indexing
embedded image by image caption can be quite
confusing as image caption may not always
relate with image semantics. For instance an
apple image can be captioned as image1.gif on
the web page. Also, extraction of information
from the surrounding text is somewhat
heuristics as it is difficult to determine which
area is more relevant. Despite the continuous
efforts in inventing new annotation algorithms,
the annotation performance is usually not
satisfactory due to limited vocabulary leading to
inaccurateimageannotations.
Content-dependent metadata consists of color
histogram features, texture features calculated
by different algorithms [10] etc. However,
retrieval results of CBIR systems are not
satisfactory because there is always a semantic
gap between low-level visual features and high-
level semantic concepts [11, 12]. For instance,
the user queries in natural language and seldom
specifies the visual features of the image while
submitting the query. Targeting this problem, Li
and Sun presented an approach that incorporates
lexical semantics from WordNet into the image
annotation process [13]. Jin et al. proposed a
novel approach that prunes irrelevant keywords
by the usage of WordNet [14]. Paek et al.
combined visual and textual features for image
annotation [15]. Some researchers have
annotated images using ontologies. Hanbury
reviewed three image annotation approaches:
free text annotation, keyword annotation and
annotationbasedonontologies[16].
In this work, the keywords are extracted by text
based annotation. The extracted keywords are
further enriched by semantics using WordNet
and keywords inferred from association
knowledgebase.
III. PROPOSEDARCHITECTURE
This section describes the architecture of the
proposed systemshown inFig.1.
Following subsections describe various
componentsoftheproposed system.
A.PageManagerandValidImageChecker
Page manager collects web pages from the web,
downloads HTML source codes, checks for the
presence of image tags and hands the page to the
next phase: valid image checker. In this phase,
web page containing images are analyzed for
checking image's validity. Image is considered
as valid if its height is more than 50 pixels. Thus,
it will ignore the unwanted images such as
backgrounds,icons,advertisementsetc.
B.Parser
It crawls the downloaded HTML code of web
page, does link extraction, collects images and
extracts associated text from page title,
surrounding text, image caption and alt tag. The
extracted text is then buffered, from which
frequent keywords are determined by the
annotator.
C.Annotator
Apart from determining frequent keywords;
annotator helps in finding relevant annotation
according to the semantic similarity and
enrichingthefinalannotationby using keyword
Fig.1. Annotation on the basis of Semantic Similarity and
Keyword Association Analysis
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 8
based association analysis, leading to accurate
imageannotations.
D.AssociationRuleMining
Association rule mining is the data mining
technique for extracting patterns from the
database or web page's HTML source code. The
major statistics computed for the association
rules, support and confidence are given in
equation (1) and (2). In the proposed work, it is
used for extracting patterns [17] from the
downloaded source code, which are then stored
inassociationknowledgebase.
A.AssociationKnowledgeBase
The knowledge base is a repository of extracted
rules which have been derived using association
rule mining [17]. A sample knowledge base
containing rules is shown in Fig.2.The inference
derived from these association rules help in
annotationprocess.
F.Wordnet
WordNet is a lexical database for the English
language [18] with synsets (set of synonyms) as
its building block. A synset denotes concepts
paired with description (a.k.a. gloss) of the
synset.
Apart from this, they are interconnected with
relational links, such as hypernymy, meronymy,
antonymy and others. In this work the noun-
taxonomy of WordNet, is used by annotator for
nrichingthesemantics.
G. QueryProcessor
Query pre-processor processes user queries. It
returns images from the web image repository
correspondingtothequeriesfiredbytheuser.
I.ALGORITHM
In this section an algorithm has been proposed
whose stepbystepsequenceisshown below.
I.FLOWDIAGRAM
R1 : star + galaxy sky
R2 : star + cloud sky
R3 : star + cloud moon
R4 : ……………………..
R5 : …………………….
Fig.2. Knowledge Base
Algorithm Semantic_KeywordAsso_Annotation (p0)
begin
p0 is a valid web URL hyperlink
Q is a queue of valid hyperlinks
P is a set of web pages
H is a set of hyperlinks
B is a Buffer for temporary storing information from
tags and
surrounding text
Q ← p0 // insert p0 into the queue Q
while |Q| ≠ Ø do
p← Q // get head of queue Q
P← P U p //appending P by inserting page p
D Download HTML Source Code of web page, p
←
Extracted_ImageSearch for image tag in D
If (HeightOf(Extracted_Image) >50 pixels)
// i.e. not an unwanted image
//Extract image caption, page title, metadata, alt tag in
buffer
B [Info(ImageCaption) U Info(Page_Title) U
←
Info(Alt_Text) U Info(Surrounding_Text)]
Frequent_Keywords Frequent Keywords Analyzer(B)
//extracting frequent keywords from buffered
information
Inferred_KeywordsAssociationAnalysis (B)
//extracting keywords from inference rules from
//Knowledge Base
Semantic_KeywordsEnrichedSemantics(B)
//extracting noun taxonomy from WordNet for all
//keywords in buffer
empty B
B (Frequent_Keywords U Inferred_Keywords U
Semantic_Keywords) // update buffer B
//index image details of the form
<ImageCaption, B,URL, date, size>
end if
extract URL hyperlinks h contained in p into H
for all h Є H, h Q do
Q ← h
end for
empty B
end while
end
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 9
The flow diagram of the algorithm discussed in
sectionIVisshown inFig. 3.
Let p0, Q and Pbe seed URL, URLqueue and set
of pages respectively. Seed URL is the starting
URLfromwithcrawlingforwebpagesstarts.
For instance,
http://en.wikipedia.org/wiki/Jawaharlal_Nehru
[19] be the seed URL i.e. p0, then on the basis of
algorithmdiscussed Q andPareasfollows.
Qhttp://en.wikipedia.org/wiki/Jawaharlal
_Nehru (inserting p0 in Q) P P U
http://en.wikipedia.org/wiki/Jawaharlal_Nehru
(updatingPbyinsertingpagep0)
Page Manager downloads HTML source code
for the page p0. Fig.4. and Fig.5. shows the
contents of web page p0 and its HTML code
respectively.
Now,thevalidimagecheckerchecksthe
validity of the image on p0. It analyzes the width
and height of the image. The width and height
forimage
(Bundesarchiv_Bild_183618490001,_Indien,_
Otto_Grotewohl_bei_Ministerpr%c3%A4side
nt_Nehru_cropped.jpg) on page p0 is 220 and
273 pixels respectively, hence image is valid.
Fig.6. shows thevalidimageonpagep0.
The information extracted from various tags
(page title, image caption, Image alt tag) and text
surrounding the image (Fig. 6.) is shown in
Fig.7. The frequent keywords from extracted
informationarebuffered.
Theenrichedannotationusing associationrules
Fig.3. Flow Diagram
Fig.4. Web Page p0 containing image
Fig.5. p0 in HTML format
Fig.6. Valid Image
Fig. 7. Information Extraction and Buffering
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 10
andontologyisshown inFig.8.
At last all the hyperlinks on p0 are inserted in
queueH.TheresultantqueueH is asfollows
I. IMPLEMENTATION AND SIMULATED
RESULTS
The prototype for the proposed system has been
developed in Java. Implementation Results for
URL
http://en.wikipedia.org/wiki/Jawaharlal_Nehru
areshown inFig.9.andFig. 10.below.
Extensiveexperimentshavebeenconducted
over certain web pages containing valid images.
Table 1. shows the annotation process done by
the proposed system, and it is observed that the
proposed system does better annotation as
compared to original annotation, followed by
existingsystems.
Fig.8. .Enriching Annotation
H H U http://ibnlive.in.com/news/gandhi-jinnah-both-
failed-jaswant/99323-37.html U
http://nuclearweaponarchive.org/India/Bhabha.html U
http://www.jnmf.in/history.html U http://www.india-
intro.com/remembering-nehru-and-others.html U
http://www.indohistory.com/jawaharlalnehru.html U
http://www.harappa.com/sounds/nehru.html U
http://www.india-
today.com/itoday/millennium/100people/nehru.htmlU
http://indrus.in/articles/2011/08/02/12_indians_who_are_fa
mous_in_russia_12823.html
Fig.9. Extracting Keywords from Tags
Fig. 10. Enriching keywords by semantics and Association
Knowledge base
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 11
URL
http://en.wikipedia.org/
wiki/Star_cluster
[20]
http://www.culturalindia.net/
national-symbols/
national-bird.html
[21]
image
Table 1 Proposed System Annotations
Parsing Phase Pagetitle:Starcluster,
Imagecaption:
M92_arp_750pix.jpg,
Surrounding Text : Star clusters or
star clouds are groups of stars. Two
types of star clusters can be
distinguished: globular clusters are
tight groups of hundreds of thousands
of very old stars which are
gravitationally bound, while open
clusters, a more loosely clustered
group of stars, generally contain less
than a few hundred members, and are
often very young. Open clusters
become disrupted over time by the
gravitational influence of giant
molecular clouds as they move
through the, but cluster members will
continue to move in broadly the same
direction through space even though
they are no longer gravitationally
bound; they are then known as a
stellar association, sometimes also
referredtoasamovinggroup.
Image alt tag: img alt = ""
(MISSING)
Page title: National Bird Of India -
National Bird India - India National
Bird-
IndianNationalBird
Image caption:
national-peacock.jpg
Surrounding Text : Peacock (Pavo
cristatus), which is a symbol of grace,
joy, beauty and love is the national
bird of India. Peacock occupies a
respectable position in Indian culture
and is protected not only by religious
sentiments but also by parliamentary
statute. The Indian peacock is a
colorful, swan-sized bird with a fan-
shaped crest of feathers on its head, a
white patch under the eye and a long-
slender neck. The male peacock is
morecolorfulthanthefemaleone.
Imagealttag: NationalBirdofIndia
Original Annotation star, cluster, cloud National, bird, Indian National
bird, peacock
Expanded Annotation
using Association
rule mining
star + galaxy sky
star + cloud sky
star + cloud moon
National + bird peacock
Symbol+ peacock joy + grace
+ beauty + loveRain peacock
Enriched Annotation
using Semantics
Proposed System
Annotation
celestial body, galaxy,
constellation, heavenly body,
extragalactic nebula
star, cluster, cloud, galaxy,
constellation, heavenly body,
extragalactic nebula, sky
Peafowl, Pavo cristatus
Peafowl, Pavo cristatus,
national bird of India,
joy grace beauty love
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 12
VII. CONCLUSION
The size of image collections is increasing
rapidly and as a consequence of this; images
must be effectively indexed and properly
annotated for increasing relevancy. This work
proposes a novel approach for finding relevant
annotation, expanding it by using keyword
based association analysis and enriching key
terms using semantics. The proposed approach
proves to be more efficient than existing
approaches because the annotator annotates by
referring semantics from WordNet and by
inferring rules from association knowledge
base; leading to accurate image annotations.
Further the simulated results show the
correctnessoftheproposedwork.
REFERENCES.
[1] World Wide Web Size.[online]
http://www.worldwidewebsize.com
(Accessed21 January2013).
[2] Sumathi, T., Hemalatha, M.(2011) 'A
combined hierarchical model for
automatic image annotation and retrieval',
Advanced Computing (ICoAC), Third
International Conference on , vol., no.,
pp.135-139,14-16 Dec.
[3] Pecenovic,Z.,Do,M.,Ayer, S. and
Vetterli,M.(1998) 'New methods for
image retrieval', In Proc. of ICPS'98
Congress on Exploring New Tracks in
Imaging,pp.242-246,Antwerp,Belgium.
[4] Inoue,M.(2004) 'On the need for
annotation-based image retrieval',
Workshop on Information Retrieval in
Context.
[5] Sumathi T., Lakshmi C. D., and
Hemalatha M., “An Overview of
Automated Image Annotation
Approaches”, International Journal of
Research and Reviews in Information
Sciences,Vol. 1,No. 1,2011.
[6] Li,X.,Chen,L.,Zhang,L.,Lin,L.andMa,
W.-Y. (2006) 'Image annotation by large-
scale content-based image retrieval', In
ACM Multimedia, pages607-610.
[7] Tsai, Chih-Fong., Hung, Chihli.(2008)
'Automatically Annotating Images with
Keywords:AReview of ImageAnnotation
Systems', Recent Patents on Computer
Science,1,55-68.
[8] Shah,K., and Sheth, A (1998). Logical
Information Modeling of Web- accessible
Heterogeneous Digital Assets. Proc. of
the Forum on Research and Technology
Advances in Digital Libraries, (ADL'98),
SantaBarbara,CA.April,pp266-275.
[9] Jung, K., Kim, K. and Jain,A. (2004) 'Text
Information Extraction in Images and
Video: A Survey', Pattern Recognition,
vol.37,no.5,pp. 977-997.
[10] Smeulders, A. W. M., Worring, M.,
Santini, S., Gupta, A., Jain, R.(2000)
'Content-based image retrieval at the end
of the early years', IEEE Transactions on
PatternAnalysis and Machine Intelligence
22(12)1349–1380.
[11] Wang, X.-J., Zhang, L., Li, X. and Ma,
W.-Y. (2008) 'Annotating images by
mining image search results', IEEE
Transactions on Pattern Analysis and
MachineIntelligence,30(11):1919–1932.
[12] Zhang, Dengsheng, Islam, Md. Monirul,
Lu, Guojun.(2012) 'A review on
automatic image annotation techniques',
Pattern Recogntion, Volume 45, Issue 1,
January,Pages346-362.
[13] Li, Wei., Sun, Maosong.(2006)
'Automatic Image Annotation Based on
WordNet and Hierarchical Ensembles',
Computational Linguistics and Intelligent
Text Processing Lecture Notes in
Computer Science Volume 3878,pp 417-
428.
[14] Jin,Y.,Khan,L.,Wang, L.and Awad,M.
(2005) 'Image annotations by combining
multiple evidence & Wordnet', In ACM
Multimedia,pages706-715.
[15] Paek, S., Sable, C.L., Hatzivassiloglou, V.,
Jaimes, A., Schiffman, B.H., Chang, S.-
F., McKeown, K.R.(1999) 'Integration
of visual and text- based approaches for
the content labeling and classification of
photographs', In: Proc. ACM SIGIR
workshop on multimedia indexing and
retrieval.
[16] Hanbury, A. (2008) 'A Survey of Methods
for Image Annotation', Journal of Visual
Languages & Computing, vol. 19, no. 5,
pp. 617-627.
[17] Zhao, Q., and Bhowmick, S. S.(2003)
'Association Rule Mining: A Survey',
Technical Report, CAIS, Nanyang
Technological University, Singapore, No.
2003116.
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 13
[18] Miller, G. A. (1995) 'WORDNET: A
Lexical Database for English',
CommunicationsofACM(11):39-41.
[19] Wikipedia Jawaharlal Nehru.[online]
http://en.wikipedia.org/wiki/Jawaharlal_
Nehru(Accessed14November2012)
[20] Wikipedia Star Cluster. [online]
http://en.wikipedia.org/wiki/Star_cluster
(Accessed14November2012).
[21] CulturalIndia.[online]
http://www.culturalindia.net/national-
symbols/national-bird.html (Accessed 14
November2012)
Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 14

More Related Content

Similar to A Novel Approach For Annotating Images By Semantic Similarity Keyword Based Association Analysis

Image based search engine
Image based search engineImage based search engine
Image based search engineIRJET Journal
 
A Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on User Personalized Tag Based Image Search by Tag RelevanceA Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on User Personalized Tag Based Image Search by Tag RelevanceIRJET Journal
 
IRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
IRJET-A Review on User Personalized Tag Based Image Search by Tag RelevanceIRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
IRJET-A Review on User Personalized Tag Based Image Search by Tag RelevanceIRJET Journal
 
M phil-computer-science-pattern-recognition-projects
M phil-computer-science-pattern-recognition-projectsM phil-computer-science-pattern-recognition-projects
M phil-computer-science-pattern-recognition-projectsVijay Karan
 
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...Journal For Research
 
An Impact on Content Based Image Retrival A Perspective View
An Impact on Content Based Image Retrival A Perspective ViewAn Impact on Content Based Image Retrival A Perspective View
An Impact on Content Based Image Retrival A Perspective Viewijtsrd
 
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...IOSR Journals
 
Image Search by using various Reranking Methods
Image Search by using various Reranking MethodsImage Search by using various Reranking Methods
Image Search by using various Reranking MethodsIRJET Journal
 
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...cscpconf
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search enginecsandit
 
Multivariate feature descriptor based cbir model to query large image databases
Multivariate feature descriptor based cbir model to query large image databasesMultivariate feature descriptor based cbir model to query large image databases
Multivariate feature descriptor based cbir model to query large image databasesIJARIIT
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...researchinventy
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Scienceresearchinventy
 
IRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text RankIRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text RankIRJET Journal
 
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance Feedback
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance FeedbackIRJET-Semi-Supervised Collaborative Image Retrieval using Relevance Feedback
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance FeedbackIRJET Journal
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Emily Kolvitz
 

Similar to A Novel Approach For Annotating Images By Semantic Similarity Keyword Based Association Analysis (20)

Ts2 c topic
Ts2 c topicTs2 c topic
Ts2 c topic
 
Ts2 c topic (1)
Ts2 c topic (1)Ts2 c topic (1)
Ts2 c topic (1)
 
Image based search engine
Image based search engineImage based search engine
Image based search engine
 
A Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on User Personalized Tag Based Image Search by Tag RelevanceA Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on User Personalized Tag Based Image Search by Tag Relevance
 
IRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
IRJET-A Review on User Personalized Tag Based Image Search by Tag RelevanceIRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
IRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
 
M phil-computer-science-pattern-recognition-projects
M phil-computer-science-pattern-recognition-projectsM phil-computer-science-pattern-recognition-projects
M phil-computer-science-pattern-recognition-projects
 
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
 
Ko3419161921
Ko3419161921Ko3419161921
Ko3419161921
 
An Impact on Content Based Image Retrival A Perspective View
An Impact on Content Based Image Retrival A Perspective ViewAn Impact on Content Based Image Retrival A Perspective View
An Impact on Content Based Image Retrival A Perspective View
 
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
 
Image Search by using various Reranking Methods
Image Search by using various Reranking MethodsImage Search by using various Reranking Methods
Image Search by using various Reranking Methods
 
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
 
Computing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engineComputing semantic similarity measure between words using web search engine
Computing semantic similarity measure between words using web search engine
 
Multivariate feature descriptor based cbir model to query large image databases
Multivariate feature descriptor based cbir model to query large image databasesMultivariate feature descriptor based cbir model to query large image databases
Multivariate feature descriptor based cbir model to query large image databases
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Science
 
IRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text RankIRJET- Automatic Text Summarization using Text Rank
IRJET- Automatic Text Summarization using Text Rank
 
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance Feedback
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance FeedbackIRJET-Semi-Supervised Collaborative Image Retrieval using Relevance Feedback
IRJET-Semi-Supervised Collaborative Image Retrieval using Relevance Feedback
 
Ijcet 06 06_006
Ijcet 06 06_006Ijcet 06 06_006
Ijcet 06 06_006
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 

More from April Smith

Format Sample Abstract For Paper Presentation - Schemcon Abstract
Format Sample Abstract For Paper Presentation - Schemcon AbstractFormat Sample Abstract For Paper Presentation - Schemcon Abstract
Format Sample Abstract For Paper Presentation - Schemcon AbstractApril Smith
 
Love Letter Pad Stationery
Love Letter Pad StationeryLove Letter Pad Stationery
Love Letter Pad StationeryApril Smith
 
PPT - Write My Term Paper PowerPoint Presentation, F
PPT - Write My Term Paper PowerPoint Presentation, FPPT - Write My Term Paper PowerPoint Presentation, F
PPT - Write My Term Paper PowerPoint Presentation, FApril Smith
 
Embossed Paper - OEM Or Cust
Embossed Paper - OEM Or CustEmbossed Paper - OEM Or Cust
Embossed Paper - OEM Or CustApril Smith
 
Technical Paper Writing Services Technical Paper Sample
Technical Paper Writing Services Technical Paper SampleTechnical Paper Writing Services Technical Paper Sample
Technical Paper Writing Services Technical Paper SampleApril Smith
 
020 Personal Essay Prompts Example Thatsnotus
020 Personal Essay Prompts Example Thatsnotus020 Personal Essay Prompts Example Thatsnotus
020 Personal Essay Prompts Example ThatsnotusApril Smith
 
Free Printable Friendly Letter Writing Paper - Discover Th
Free Printable Friendly Letter Writing Paper - Discover ThFree Printable Friendly Letter Writing Paper - Discover Th
Free Printable Friendly Letter Writing Paper - Discover ThApril Smith
 
Pin On Study.pdfPin On Study
Pin On Study.pdfPin On StudyPin On Study.pdfPin On Study
Pin On Study.pdfPin On StudyApril Smith
 
Buy Cheap Essay - Buy Cheap
Buy Cheap Essay - Buy CheapBuy Cheap Essay - Buy Cheap
Buy Cheap Essay - Buy CheapApril Smith
 
Help With Scholarship Essa
Help With Scholarship EssaHelp With Scholarship Essa
Help With Scholarship EssaApril Smith
 
Article Summary Outline
Article Summary OutlineArticle Summary Outline
Article Summary OutlineApril Smith
 
Free Compare And Contrast Essay Ex
Free Compare And Contrast Essay ExFree Compare And Contrast Essay Ex
Free Compare And Contrast Essay ExApril Smith
 
Chapter 1 Essay Instructions
Chapter 1 Essay InstructionsChapter 1 Essay Instructions
Chapter 1 Essay InstructionsApril Smith
 
Business Persuasive Letter Example Beautiful Sample
Business Persuasive Letter Example Beautiful SampleBusiness Persuasive Letter Example Beautiful Sample
Business Persuasive Letter Example Beautiful SampleApril Smith
 
620 Write My Paper Ideas In
620 Write My Paper Ideas In620 Write My Paper Ideas In
620 Write My Paper Ideas InApril Smith
 
005 Application Essay Mhmfsn8U3K Thatsnotus
005 Application Essay Mhmfsn8U3K  Thatsnotus005 Application Essay Mhmfsn8U3K  Thatsnotus
005 Application Essay Mhmfsn8U3K ThatsnotusApril Smith
 
Essay Help Grammar
Essay Help GrammarEssay Help Grammar
Essay Help GrammarApril Smith
 
How To Write A Winning Position Paper - Peachy E
How To Write A Winning Position Paper - Peachy EHow To Write A Winning Position Paper - Peachy E
How To Write A Winning Position Paper - Peachy EApril Smith
 
Writing Paper Clipart Writing Cartoon Clipart -
Writing Paper Clipart  Writing Cartoon Clipart -Writing Paper Clipart  Writing Cartoon Clipart -
Writing Paper Clipart Writing Cartoon Clipart -April Smith
 
Good University Essay Introductio
Good University Essay IntroductioGood University Essay Introductio
Good University Essay IntroductioApril Smith
 

More from April Smith (20)

Format Sample Abstract For Paper Presentation - Schemcon Abstract
Format Sample Abstract For Paper Presentation - Schemcon AbstractFormat Sample Abstract For Paper Presentation - Schemcon Abstract
Format Sample Abstract For Paper Presentation - Schemcon Abstract
 
Love Letter Pad Stationery
Love Letter Pad StationeryLove Letter Pad Stationery
Love Letter Pad Stationery
 
PPT - Write My Term Paper PowerPoint Presentation, F
PPT - Write My Term Paper PowerPoint Presentation, FPPT - Write My Term Paper PowerPoint Presentation, F
PPT - Write My Term Paper PowerPoint Presentation, F
 
Embossed Paper - OEM Or Cust
Embossed Paper - OEM Or CustEmbossed Paper - OEM Or Cust
Embossed Paper - OEM Or Cust
 
Technical Paper Writing Services Technical Paper Sample
Technical Paper Writing Services Technical Paper SampleTechnical Paper Writing Services Technical Paper Sample
Technical Paper Writing Services Technical Paper Sample
 
020 Personal Essay Prompts Example Thatsnotus
020 Personal Essay Prompts Example Thatsnotus020 Personal Essay Prompts Example Thatsnotus
020 Personal Essay Prompts Example Thatsnotus
 
Free Printable Friendly Letter Writing Paper - Discover Th
Free Printable Friendly Letter Writing Paper - Discover ThFree Printable Friendly Letter Writing Paper - Discover Th
Free Printable Friendly Letter Writing Paper - Discover Th
 
Pin On Study.pdfPin On Study
Pin On Study.pdfPin On StudyPin On Study.pdfPin On Study
Pin On Study.pdfPin On Study
 
Buy Cheap Essay - Buy Cheap
Buy Cheap Essay - Buy CheapBuy Cheap Essay - Buy Cheap
Buy Cheap Essay - Buy Cheap
 
Help With Scholarship Essa
Help With Scholarship EssaHelp With Scholarship Essa
Help With Scholarship Essa
 
Article Summary Outline
Article Summary OutlineArticle Summary Outline
Article Summary Outline
 
Free Compare And Contrast Essay Ex
Free Compare And Contrast Essay ExFree Compare And Contrast Essay Ex
Free Compare And Contrast Essay Ex
 
Chapter 1 Essay Instructions
Chapter 1 Essay InstructionsChapter 1 Essay Instructions
Chapter 1 Essay Instructions
 
Business Persuasive Letter Example Beautiful Sample
Business Persuasive Letter Example Beautiful SampleBusiness Persuasive Letter Example Beautiful Sample
Business Persuasive Letter Example Beautiful Sample
 
620 Write My Paper Ideas In
620 Write My Paper Ideas In620 Write My Paper Ideas In
620 Write My Paper Ideas In
 
005 Application Essay Mhmfsn8U3K Thatsnotus
005 Application Essay Mhmfsn8U3K  Thatsnotus005 Application Essay Mhmfsn8U3K  Thatsnotus
005 Application Essay Mhmfsn8U3K Thatsnotus
 
Essay Help Grammar
Essay Help GrammarEssay Help Grammar
Essay Help Grammar
 
How To Write A Winning Position Paper - Peachy E
How To Write A Winning Position Paper - Peachy EHow To Write A Winning Position Paper - Peachy E
How To Write A Winning Position Paper - Peachy E
 
Writing Paper Clipart Writing Cartoon Clipart -
Writing Paper Clipart  Writing Cartoon Clipart -Writing Paper Clipart  Writing Cartoon Clipart -
Writing Paper Clipart Writing Cartoon Clipart -
 
Good University Essay Introductio
Good University Essay IntroductioGood University Essay Introductio
Good University Essay Introductio
 

Recently uploaded

Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 

Recently uploaded (20)

Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 

A Novel Approach For Annotating Images By Semantic Similarity Keyword Based Association Analysis

  • 1. Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 7 A NOVEL APPROACH FOR ANNOTATING IMAGES BY SEMANTIC SIMILARITY & KEYWORD BASED ASSOCIATION ANALYSIS Payal Gulati, Dept. of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India gulatipayal@yahoo.co.in A.K. Sharma Dept. of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India ABSTRACT Tremendous growth of multimedia data has resulted in enhancing the number of images over the web.As a consequence of this, search engines face challenges in finding relevant images. Therefore, images must be properly annotated and effectively indexed for increasing relevancy. Image annotation has been a topic of ongoing research for more than a decade leading to several interesting techniques. However, retrieval results are not satisfactory because there is a semantic gap between low-level visual features and high-level semantic concepts. This motivates our work. In this work, a novel approach for finding relevant annotation based on semantic similarity and subsequently enrichingtheannotationbyusing keywordbasedassociationanalysisisproposed. Index Terms—Image Annotation, Page Manager, Association Rule Mining, Knowledge Base, Wordnet,QueryProcessor,ImageRepository. architectureforannotatingimagesbysemantic similarity and keyword based association analysis. Algorithm and flow diagram are discussed in section IV and V respectively. Implementation and comparison of the proposed and existing systems is presented in section VI. Finally section VII comprises of the conclusion. II. RELATEDWORK Image annotation is to assign metadata with images based on visual features of the images and as well as the associated text [2]. It is an active research topic of recent years for understanding images and simplifying the web image retrieval process [3]. The metadata added to images allows effective searches. Manually adding metadata is an extremely laborious and time-consuming task [4]. To overcome this, many researchers have developed an approach for automatically annotating images [5, 6, 7] by using text from the image tag contained in image fileandthetextsurrounding theimage. Content descriptive and content dependent information can be associated with the images I. INTRODUCTION The size of the web has already surpassed 13.86 billion pages [1] and yet there is no sign of leveling off; causing a challenge for image search engines in searching images over these pages. Image annotation or automatically annotating images with keywords is a solution to this problem. Images are annotated by extracting relevant keyword from page title, surrounding text, metadata, image caption etc. Proper annotations of images increase relevancy andefficiencyofretrieval. Despite the continuous efforts in inventing new annotation algorithms, the annotation performance is usually not satisfactory due to limited vocabulary. Moreover, images over the web are not annotated with semantic descriptors, raising difficulty in searching for relevant images. In this paper, a novel approach for annotating images both by semantic similarity and keyword based association analysisisbeingproposed. This paper is organized in the following way. Section II discusses the related work done in this domain. Section III presents the proposed
  • 2. [8]. In Content-descriptive annotation, images areannotatedby extractingrelevantkeyword[9] from page title, surrounding text, metadata, image caption, alt tags etc. However indexing embedded image by image caption can be quite confusing as image caption may not always relate with image semantics. For instance an apple image can be captioned as image1.gif on the web page. Also, extraction of information from the surrounding text is somewhat heuristics as it is difficult to determine which area is more relevant. Despite the continuous efforts in inventing new annotation algorithms, the annotation performance is usually not satisfactory due to limited vocabulary leading to inaccurateimageannotations. Content-dependent metadata consists of color histogram features, texture features calculated by different algorithms [10] etc. However, retrieval results of CBIR systems are not satisfactory because there is always a semantic gap between low-level visual features and high- level semantic concepts [11, 12]. For instance, the user queries in natural language and seldom specifies the visual features of the image while submitting the query. Targeting this problem, Li and Sun presented an approach that incorporates lexical semantics from WordNet into the image annotation process [13]. Jin et al. proposed a novel approach that prunes irrelevant keywords by the usage of WordNet [14]. Paek et al. combined visual and textual features for image annotation [15]. Some researchers have annotated images using ontologies. Hanbury reviewed three image annotation approaches: free text annotation, keyword annotation and annotationbasedonontologies[16]. In this work, the keywords are extracted by text based annotation. The extracted keywords are further enriched by semantics using WordNet and keywords inferred from association knowledgebase. III. PROPOSEDARCHITECTURE This section describes the architecture of the proposed systemshown inFig.1. Following subsections describe various componentsoftheproposed system. A.PageManagerandValidImageChecker Page manager collects web pages from the web, downloads HTML source codes, checks for the presence of image tags and hands the page to the next phase: valid image checker. In this phase, web page containing images are analyzed for checking image's validity. Image is considered as valid if its height is more than 50 pixels. Thus, it will ignore the unwanted images such as backgrounds,icons,advertisementsetc. B.Parser It crawls the downloaded HTML code of web page, does link extraction, collects images and extracts associated text from page title, surrounding text, image caption and alt tag. The extracted text is then buffered, from which frequent keywords are determined by the annotator. C.Annotator Apart from determining frequent keywords; annotator helps in finding relevant annotation according to the semantic similarity and enrichingthefinalannotationby using keyword Fig.1. Annotation on the basis of Semantic Similarity and Keyword Association Analysis Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 8
  • 3. based association analysis, leading to accurate imageannotations. D.AssociationRuleMining Association rule mining is the data mining technique for extracting patterns from the database or web page's HTML source code. The major statistics computed for the association rules, support and confidence are given in equation (1) and (2). In the proposed work, it is used for extracting patterns [17] from the downloaded source code, which are then stored inassociationknowledgebase. A.AssociationKnowledgeBase The knowledge base is a repository of extracted rules which have been derived using association rule mining [17]. A sample knowledge base containing rules is shown in Fig.2.The inference derived from these association rules help in annotationprocess. F.Wordnet WordNet is a lexical database for the English language [18] with synsets (set of synonyms) as its building block. A synset denotes concepts paired with description (a.k.a. gloss) of the synset. Apart from this, they are interconnected with relational links, such as hypernymy, meronymy, antonymy and others. In this work the noun- taxonomy of WordNet, is used by annotator for nrichingthesemantics. G. QueryProcessor Query pre-processor processes user queries. It returns images from the web image repository correspondingtothequeriesfiredbytheuser. I.ALGORITHM In this section an algorithm has been proposed whose stepbystepsequenceisshown below. I.FLOWDIAGRAM R1 : star + galaxy sky R2 : star + cloud sky R3 : star + cloud moon R4 : …………………….. R5 : ……………………. Fig.2. Knowledge Base Algorithm Semantic_KeywordAsso_Annotation (p0) begin p0 is a valid web URL hyperlink Q is a queue of valid hyperlinks P is a set of web pages H is a set of hyperlinks B is a Buffer for temporary storing information from tags and surrounding text Q ← p0 // insert p0 into the queue Q while |Q| ≠ Ø do p← Q // get head of queue Q P← P U p //appending P by inserting page p D Download HTML Source Code of web page, p ← Extracted_ImageSearch for image tag in D If (HeightOf(Extracted_Image) >50 pixels) // i.e. not an unwanted image //Extract image caption, page title, metadata, alt tag in buffer B [Info(ImageCaption) U Info(Page_Title) U ← Info(Alt_Text) U Info(Surrounding_Text)] Frequent_Keywords Frequent Keywords Analyzer(B) //extracting frequent keywords from buffered information Inferred_KeywordsAssociationAnalysis (B) //extracting keywords from inference rules from //Knowledge Base Semantic_KeywordsEnrichedSemantics(B) //extracting noun taxonomy from WordNet for all //keywords in buffer empty B B (Frequent_Keywords U Inferred_Keywords U Semantic_Keywords) // update buffer B //index image details of the form <ImageCaption, B,URL, date, size> end if extract URL hyperlinks h contained in p into H for all h Є H, h Q do Q ← h end for empty B end while end Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 9
  • 4. The flow diagram of the algorithm discussed in sectionIVisshown inFig. 3. Let p0, Q and Pbe seed URL, URLqueue and set of pages respectively. Seed URL is the starting URLfromwithcrawlingforwebpagesstarts. For instance, http://en.wikipedia.org/wiki/Jawaharlal_Nehru [19] be the seed URL i.e. p0, then on the basis of algorithmdiscussed Q andPareasfollows. Qhttp://en.wikipedia.org/wiki/Jawaharlal _Nehru (inserting p0 in Q) P P U http://en.wikipedia.org/wiki/Jawaharlal_Nehru (updatingPbyinsertingpagep0) Page Manager downloads HTML source code for the page p0. Fig.4. and Fig.5. shows the contents of web page p0 and its HTML code respectively. Now,thevalidimagecheckerchecksthe validity of the image on p0. It analyzes the width and height of the image. The width and height forimage (Bundesarchiv_Bild_183618490001,_Indien,_ Otto_Grotewohl_bei_Ministerpr%c3%A4side nt_Nehru_cropped.jpg) on page p0 is 220 and 273 pixels respectively, hence image is valid. Fig.6. shows thevalidimageonpagep0. The information extracted from various tags (page title, image caption, Image alt tag) and text surrounding the image (Fig. 6.) is shown in Fig.7. The frequent keywords from extracted informationarebuffered. Theenrichedannotationusing associationrules Fig.3. Flow Diagram Fig.4. Web Page p0 containing image Fig.5. p0 in HTML format Fig.6. Valid Image Fig. 7. Information Extraction and Buffering Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 10
  • 5. andontologyisshown inFig.8. At last all the hyperlinks on p0 are inserted in queueH.TheresultantqueueH is asfollows I. IMPLEMENTATION AND SIMULATED RESULTS The prototype for the proposed system has been developed in Java. Implementation Results for URL http://en.wikipedia.org/wiki/Jawaharlal_Nehru areshown inFig.9.andFig. 10.below. Extensiveexperimentshavebeenconducted over certain web pages containing valid images. Table 1. shows the annotation process done by the proposed system, and it is observed that the proposed system does better annotation as compared to original annotation, followed by existingsystems. Fig.8. .Enriching Annotation H H U http://ibnlive.in.com/news/gandhi-jinnah-both- failed-jaswant/99323-37.html U http://nuclearweaponarchive.org/India/Bhabha.html U http://www.jnmf.in/history.html U http://www.india- intro.com/remembering-nehru-and-others.html U http://www.indohistory.com/jawaharlalnehru.html U http://www.harappa.com/sounds/nehru.html U http://www.india- today.com/itoday/millennium/100people/nehru.htmlU http://indrus.in/articles/2011/08/02/12_indians_who_are_fa mous_in_russia_12823.html Fig.9. Extracting Keywords from Tags Fig. 10. Enriching keywords by semantics and Association Knowledge base Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 11 URL http://en.wikipedia.org/ wiki/Star_cluster [20] http://www.culturalindia.net/ national-symbols/ national-bird.html [21] image Table 1 Proposed System Annotations
  • 6. Parsing Phase Pagetitle:Starcluster, Imagecaption: M92_arp_750pix.jpg, Surrounding Text : Star clusters or star clouds are groups of stars. Two types of star clusters can be distinguished: globular clusters are tight groups of hundreds of thousands of very old stars which are gravitationally bound, while open clusters, a more loosely clustered group of stars, generally contain less than a few hundred members, and are often very young. Open clusters become disrupted over time by the gravitational influence of giant molecular clouds as they move through the, but cluster members will continue to move in broadly the same direction through space even though they are no longer gravitationally bound; they are then known as a stellar association, sometimes also referredtoasamovinggroup. Image alt tag: img alt = "" (MISSING) Page title: National Bird Of India - National Bird India - India National Bird- IndianNationalBird Image caption: national-peacock.jpg Surrounding Text : Peacock (Pavo cristatus), which is a symbol of grace, joy, beauty and love is the national bird of India. Peacock occupies a respectable position in Indian culture and is protected not only by religious sentiments but also by parliamentary statute. The Indian peacock is a colorful, swan-sized bird with a fan- shaped crest of feathers on its head, a white patch under the eye and a long- slender neck. The male peacock is morecolorfulthanthefemaleone. Imagealttag: NationalBirdofIndia Original Annotation star, cluster, cloud National, bird, Indian National bird, peacock Expanded Annotation using Association rule mining star + galaxy sky star + cloud sky star + cloud moon National + bird peacock Symbol+ peacock joy + grace + beauty + loveRain peacock Enriched Annotation using Semantics Proposed System Annotation celestial body, galaxy, constellation, heavenly body, extragalactic nebula star, cluster, cloud, galaxy, constellation, heavenly body, extragalactic nebula, sky Peafowl, Pavo cristatus Peafowl, Pavo cristatus, national bird of India, joy grace beauty love Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 12 VII. CONCLUSION The size of image collections is increasing rapidly and as a consequence of this; images must be effectively indexed and properly annotated for increasing relevancy. This work proposes a novel approach for finding relevant annotation, expanding it by using keyword based association analysis and enriching key terms using semantics. The proposed approach proves to be more efficient than existing approaches because the annotator annotates by referring semantics from WordNet and by inferring rules from association knowledge base; leading to accurate image annotations. Further the simulated results show the correctnessoftheproposedwork. REFERENCES. [1] World Wide Web Size.[online] http://www.worldwidewebsize.com (Accessed21 January2013).
  • 7. [2] Sumathi, T., Hemalatha, M.(2011) 'A combined hierarchical model for automatic image annotation and retrieval', Advanced Computing (ICoAC), Third International Conference on , vol., no., pp.135-139,14-16 Dec. [3] Pecenovic,Z.,Do,M.,Ayer, S. and Vetterli,M.(1998) 'New methods for image retrieval', In Proc. of ICPS'98 Congress on Exploring New Tracks in Imaging,pp.242-246,Antwerp,Belgium. [4] Inoue,M.(2004) 'On the need for annotation-based image retrieval', Workshop on Information Retrieval in Context. [5] Sumathi T., Lakshmi C. D., and Hemalatha M., “An Overview of Automated Image Annotation Approaches”, International Journal of Research and Reviews in Information Sciences,Vol. 1,No. 1,2011. [6] Li,X.,Chen,L.,Zhang,L.,Lin,L.andMa, W.-Y. (2006) 'Image annotation by large- scale content-based image retrieval', In ACM Multimedia, pages607-610. [7] Tsai, Chih-Fong., Hung, Chihli.(2008) 'Automatically Annotating Images with Keywords:AReview of ImageAnnotation Systems', Recent Patents on Computer Science,1,55-68. [8] Shah,K., and Sheth, A (1998). Logical Information Modeling of Web- accessible Heterogeneous Digital Assets. Proc. of the Forum on Research and Technology Advances in Digital Libraries, (ADL'98), SantaBarbara,CA.April,pp266-275. [9] Jung, K., Kim, K. and Jain,A. (2004) 'Text Information Extraction in Images and Video: A Survey', Pattern Recognition, vol.37,no.5,pp. 977-997. [10] Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., Jain, R.(2000) 'Content-based image retrieval at the end of the early years', IEEE Transactions on PatternAnalysis and Machine Intelligence 22(12)1349–1380. [11] Wang, X.-J., Zhang, L., Li, X. and Ma, W.-Y. (2008) 'Annotating images by mining image search results', IEEE Transactions on Pattern Analysis and MachineIntelligence,30(11):1919–1932. [12] Zhang, Dengsheng, Islam, Md. Monirul, Lu, Guojun.(2012) 'A review on automatic image annotation techniques', Pattern Recogntion, Volume 45, Issue 1, January,Pages346-362. [13] Li, Wei., Sun, Maosong.(2006) 'Automatic Image Annotation Based on WordNet and Hierarchical Ensembles', Computational Linguistics and Intelligent Text Processing Lecture Notes in Computer Science Volume 3878,pp 417- 428. [14] Jin,Y.,Khan,L.,Wang, L.and Awad,M. (2005) 'Image annotations by combining multiple evidence & Wordnet', In ACM Multimedia,pages706-715. [15] Paek, S., Sable, C.L., Hatzivassiloglou, V., Jaimes, A., Schiffman, B.H., Chang, S.- F., McKeown, K.R.(1999) 'Integration of visual and text- based approaches for the content labeling and classification of photographs', In: Proc. ACM SIGIR workshop on multimedia indexing and retrieval. [16] Hanbury, A. (2008) 'A Survey of Methods for Image Annotation', Journal of Visual Languages & Computing, vol. 19, no. 5, pp. 617-627. [17] Zhao, Q., and Bhowmick, S. S.(2003) 'Association Rule Mining: A Survey', Technical Report, CAIS, Nanyang Technological University, Singapore, No. 2003116. Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 13
  • 8. [18] Miller, G. A. (1995) 'WORDNET: A Lexical Database for English', CommunicationsofACM(11):39-41. [19] Wikipedia Jawaharlal Nehru.[online] http://en.wikipedia.org/wiki/Jawaharlal_ Nehru(Accessed14November2012) [20] Wikipedia Star Cluster. [online] http://en.wikipedia.org/wiki/Star_cluster (Accessed14November2012). [21] CulturalIndia.[online] http://www.culturalindia.net/national- symbols/national-bird.html (Accessed 14 November2012) Journal of Multi Disciplinary Engineering Technologies Volume 7 No.1 Jan. 2013 14