Journal article Open Access
Hooshmand, Sahar; Abedin, Paniz; Kulekci, M. Oguzhan; Thankachan, Sharma V.
<?xml version='1.0' encoding='utf-8'?> <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"> <dc:creator>Hooshmand, Sahar</dc:creator> <dc:creator>Abedin, Paniz</dc:creator> <dc:creator>Kulekci, M. Oguzhan</dc:creator> <dc:creator>Thankachan, Sharma V.</dc:creator> <dc:date>2021-01-01</dc:date> <dc:description>The non-overlapping indexing problem is defined as follows: pre-process a given text T[1, n] of length n into a data structure such that whenever a pattern P[1, m] comes as an input, we can efficiently report the largest set of non-overlapping occurrences of P in T. The best-known solution is by Cohen and Porat [ISAAC 2009]. The size of their structure is O(n) words and the query time is optimal O(m + nocc), where nocc is the output size. Later, Ganguly et al. [CPM 2015 and Algorithmica 2020] proposed a compressed space solution. We study this problem in the cache-oblivious model and present a new data structure of size O(n log n) words. It can answer queries in optimal O(m/B + log_B n + nocc/B) I/O operations, where B is the block size. The space can be improved to O(n log_{M/B} n) in the cache-aware model, where M is the size of main memory. Additionally, we study a generalization of this problem with an additional range [s, e] constraint. Here the task is to report the largest set of non-overlapping occurrences of P in T, that are within the range [s, e]. 
We present an O(n log^2 n) space data structure in the cache-aware model that can answer queries in optimal O(m/B + log_B n + nocc_{[s,e]}/B) I/O operations, where nocc_{[s,e]} is the output size.</dc:description> <dc:identifier>https://aperta.ulakbim.gov.trrecord/237248</dc:identifier> <dc:identifier>oai:aperta.ulakbim.gov.tr:237248</dc:identifier> <dc:rights>info:eu-repo/semantics/openAccess</dc:rights> <dc:rights>http://www.opendefinition.org/licenses/cc-by</dc:rights> <dc:source>THEORETICAL COMPUTER SCIENCE 857 1-7</dc:source> <dc:title>I/O-efficient data structures for non-overlapping indexing</dc:title> <dc:type>info:eu-repo/semantics/article</dc:type> <dc:type>publication-article</dc:type> </oai_dc:dc>
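To make the problem statement in the description concrete, the following is a minimal in-memory sketch of what a query is asked to report. It is not the I/O-efficient data structure from the article: it simply scans the text and greedily keeps the leftmost occurrence that does not overlap the previous one, which is the standard way to obtain a largest non-overlapping set. The function name `non_overlapping_occurrences` and its parameters are hypothetical. The sketch also assumes that an occurrence counts as being "within the range [s, e]" only when it lies entirely inside T[s, e]; the description does not spell this out.

```python
def non_overlapping_occurrences(text, pattern, s=None, e=None):
    """Return 1-based start positions of a largest set of non-overlapping
    occurrences of `pattern` in `text`.

    If s and e are given, only occurrences lying entirely inside text[s..e]
    (1-based, inclusive) are considered. That reading of the range
    constraint is an assumption of this sketch.
    """
    n, m = len(text), len(pattern)
    if m == 0:
        return []
    lo = 1 if s is None else max(1, s)
    hi = n if e is None else min(n, e)

    chosen = []
    pos = lo  # 1-based position from which the next occurrence may start
    while pos + m - 1 <= hi:
        # str.find searches text[pos-1:hi] and returns a 0-based index,
        # so any match it reports ends at or before position hi.
        idx = text.find(pattern, pos - 1, hi)
        if idx == -1:
            break
        chosen.append(idx + 1)  # convert to 1-based
        pos = idx + 1 + m       # skip past this occurrence to avoid overlap
    return chosen
```

For example, with T = "abababa" and P = "aba", the occurrences start at positions 1, 3 and 5, and the sketch returns the non-overlapping positions 1 and 5. Each query costs a full scan of the text, whereas the structures in the article answer it from a precomputed index.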
Views | 14 |
Downloads | 3 |
Data volume | 507 Bytes |
Unique views | 14 |
Unique downloads | 3 |