Skip to main navigation Skip to search Skip to main content

Optimality in External Memory Hashing

  • Morten Skaarup Jensen
  • , Rasmus Pagh

    Research output: Journal Article or Conference Article in Journal β€Ί Journal article β€Ί Research β€Ί peer-review

    Abstract

    Hash tables on external memory are commonly used for indexing in database management systems. In this paper we present an algorithm that, in an asymptotic sense, achieves the best possible I/O and space complexities. Let B denote the number of records that fit in a block, and let N denote the total number of records. Our hash table uses 1+𝑂(1/√B) I/Os, expected, for looking up a record (no matter if it is present or not). To insert, delete or change a record that has just been looked up requires 1+𝑂(1/√B) I/Os, amortized expected, including I/Os for reorganizing the hash table when the size of the database changes. The expected external space usage is 1+𝑂(1/√B) times the optimum of N/B blocks, and just O(1) blocks of internal memory are needed.
    Original languageEnglish
    JournalAlgorithmica
    Volume52
    Issue number3
    Pages (from-to)403-411
    Number of pages9
    ISSN0178-4617
    DOIs
    Publication statusPublished - 2008

    Keywords

    • External memory
    • Dictionary
    • Hashing
    • Index

    Fingerprint

    Dive into the research topics of 'Optimality in External Memory Hashing'. Together they form a unique fingerprint.

    Cite this