Querying Spatial Patterns by Keyword
|
Keyword based search in text rich spatial datasets leads to many novel applications and services. It also enhances the functionality of existing map search services, image search applications, and GIS systems. Currently, we are studying the problem of searching top-k sets of spatially close spatial objects for a given set of keywords such that each set contains objects for each keyword and no proper subset of the set does so. We are also developing algorithms to mine interesting connected object sets in spatial datasets. |
|
Nearest Neighbor Search in Very High Dimensional Spaces
Similarity Search in Images and Videos
| Duplicate Video Detection |
|
Copyright infringements and data piracy have recently become serious concerns for the ever growing online video repositories. Approaches based on content-based copy detection and watermarking are used to detect such infringements. In this work we develop an efficient and accurate method for duplicate video detection in a large database using video fingerprints. We use vector quantized color layout descriptor to create fingerprints. We propose a new nonmetric distance measure to find the similarity between the query and a database video fingerprint. Efficient search cannot be performed for high-dimensional data using a nonmetric distance measure with existing indexing techniques. Therefore, we develop novel search algorithms based on precomputed distances and new dataset pruning techniques yielding practical retrieval times. |
| Sub-Image Similarity Search |
|
Sub-image search with high accuracy in natural images still remains a challenging problem. We develop a new feature vector called profile for a keypoint in a bag of visual words model of an image. The profile of a keypoint captures the spatial geometry of all the other keypoints in an image with respect to itself, and is very effective in discriminating true matches from false matches. Sub-image search using profiles is a single-phase process requiring no geometric validation, yields high precision on natural images, and works well on small visual codebook. The proposed search technique differs from traditional methods that first generate a set of candidates disregarding spatial information and then verify them geometrically. Conventional methods also use large codebooks. |
|