Home
Scholarly Works
Fast and Simple Computations Using Prefix Tables...
Conference

Fast and Simple Computations Using Prefix Tables Under Hamming and Edit Distance

Abstract

In this article, we introduce a new and simple data structure, the prefix table under Hamming distance, and present two algorithms to compute it efficiently: one asymptotically fast; the other very fast on average and in practice. Because the latter approach avoids the computation of global data structures, such as the suffix array and the longest common prefix array, it yields algorithms much faster in practice than existing methods. We show how this data structure can be used to solve two string problems of interest: (a) approximate string matching under Hamming distance; and (b) longest approximate overlap under Hamming distance. Analogously, we introduce the prefix table under edit distance, and present an efficient algorithm for its computation. In the process, we also define the border array under both distance measures, and provide an algorithm for conversion between prefix tables and border arrays.

Authors

Barton C; Iliopoulos CS; Pissis SP; Smyth WF

Series

Lecture Notes in Computer Science

Volume

8986

Pagination

pp. 49-61

Publisher

Springer Nature

Publication Date

January 1, 2015

DOI

10.1007/978-3-319-19315-1_5

Conference proceedings

Lecture Notes in Computer Science

ISSN

0302-9743
View published work (Non-McMaster Users)

Contact the Experts team