Bridging semantic gap: learning and integrating semantics for content-based retrieval

Name: Bridging semantic gap: learning and integrating semantics for content-based retrieval
Creator: Lim, Joo Hwee
Keywords: Image processing., Image processing, Database design., Information retrieval., Information retrieval, Semantics., Database design, Semantics, 004

Lim, Joo Hwee

Found an issue? Give us feedback

downloadFull-Text

UNSWorksarrow_drop_down

UNSWorks

Doctoral thesis . 2004

License: CC BY NC ND

Full-Text: http://hdl.handle.net/1959.4/56732

Data sources: Bielefeld Academic Search Engine (BASE)

https://dx.doi.org/10.26190/un...

Doctoral thesis . 2004

License: CC BY NC ND

Data sources: Datacite

DBLP

Doctoral thesis

Data sources: DBLP

Bridging semantic gap: learning and integrating semantics for content-based retrieval

descriptionPublicationkeyboard_double_arrow_right Doctoral thesis 01 Jan 2004 Australia Publisher:UNSW Sydney

Authors: Lim, Joo Hwee;

doi: 10.26190/unsworks/5224

handle: 1959.4/56732

Bridging semantic gap: learning and integrating semantics for content-based retrieval

- Summary
- Subjects
- Metrics

Abstract

Digital cameras have entered ordinary homes and produced^incredibly large number of photos. As a typical example of broad image domain, unconstrained consumer photos vary significantly. Unlike professional or domain-specific images, the objects in the photos are ill-posed, occluded, and cluttered with poor lighting, focus, and exposure. Content-based image retrieval research has yet to bridge the semantic gap between computable low-level information and high-level user interpretation. In this thesis, we address the issue of semantic gap with a structured learning framework to allow modular extraction of visual semantics. Semantic image regions (e.g. face, building, sky etc) are learned statistically, detected directly from image without segmentation, reconciled across multiple scales, and aggregated spatially to form compact semantic index. To circumvent the ambiguity and subjectivity in a query, a new query method that allows spatial arrangement of visual semantics is proposed. A query is represented as a disjunctive normal form of visual query terms and processed using fuzzy set operators. A drawback of supervised learning is the manual labeling of regions as training samples. In this thesis, a new learning framework to discover local semantic patterns and to generate their samples for training with minimal human intervention has been developed. The discovered patterns can be visualized and used in semantic indexing. In addition, three new class-based indexing schemes are explored. The winnertake- all scheme supports class-based image retrieval. The class relative scheme and the local classification scheme compute inter-class memberships and local class patterns as indexes for similarity matching respectively. A Bayesian formulation is proposed to unify local and global indexes in image comparison and ranking that resulted in superior image retrieval performance over those of single indexes. Query-by-example experiments on 2400 consumer photos with 16 semantic queries show that the proposed approaches have significantly better (18% to 55%) average precisions than a high-dimension feature fusion approach. The thesis has paved two promising research directions, namely the semantics design approach and the semantics discovery approach. They form elegant dual frameworks that exploits pattern classifiers in learning and integrating local and global image semantics.

Country

Australia

Related Organizations

UNSW Sydney
Australia

Keywords

Image processing., Image processing, Database design., Information retrieval., Information retrieval, Semantics., Database design, Semantics, 004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green