IIIT-Delhi Institutional Repository

Scalable algorithms for spatial-textual data join

Show simple item record

dc.contributor.author Gupta, Vivek
dc.contributor.author Goyal, Vikram (Advisor)
dc.date.accessioned 2015-05-27T04:59:08Z
dc.date.available 2015-05-27T04:59:08Z
dc.date.issued 2015-05-27T04:59:08Z
dc.identifier.uri https://repository.iiitd.edu.in/jspui/handle/123456789/272
dc.description.abstract Spatio-textual similarity join retrieves a set of pairs of objects wherein objects in each pair are close in spatial as well as textual dimensions. A lot of work has been done in the spatial dimension but no work has been done for spatial-textual joins. However, due to the ubiquity of GPS enabled devices, huge spatial-textual data is being generated which demand new methods to query and perform operations on this new data type. We study join operation for spatial-textual data and incorporate various optimizations/ heuristics such as e efficient grid partitioning for spatial dimension, use of a speci c pre x length of textual vector and ordering of elements in textual vectors on the basis of their TF-IDF scores. We also design and study algorithms using the above heuristics for spatial-textual data join on MapReduce Framework. Experimental results on two real life datasets, Flickr and Foursquare, show the e effectiveness of these optimizations in terms of computation time as well as pruning of non-candidates. en_US
dc.language.iso en_US en_US
dc.title Scalable algorithms for spatial-textual data join en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account