CS186-L6: Indices & B+ Tree Refinements

HHZZ included in CS186

2024-08-14

General Notes

Issues to consider in any index structure (not just B+ trees)

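This lecture, however, only focuses on 1-d range search, equality search, and the B+ tree.

Pay attention to lexicographic order!

Below is a definition of Composite Keys: a key over multiple columns, where a query has equality conditions on the leading columns and a range condition only on the last one. Note the emphasis on the Lexicographic Range.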
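As a minimal sketch of what a lexicographic range looks like, assume a composite key `(age, salary)` stored as Python tuples (the column names and data are made up for illustration). Python compares tuples lexicographically, so a query with equality on the first column and a range on the last one maps to one contiguous slice of the sorted entries:

```python
from bisect import bisect_left, bisect_right

# Hypothetical composite-key entries (age, salary), kept in lexicographic order.
entries = sorted([(30, 5000), (31, 4000), (31, 5000), (31, 6500), (32, 3000)])

def lexicographic_range(sorted_keys, low, high):
    """Return all keys k with low <= k <= high under lexicographic order."""
    start = bisect_left(sorted_keys, low)
    end = bisect_right(sorted_keys, high)
    return sorted_keys[start:end]

# age = 31 AND 4500 <= salary <= 7000:
# equality on the leading column, range on the last column.
matches = lexicographic_range(entries, (31, 4500), (31, 7000))
print(matches)
```

A condition such as "any age, salary = 5000" would not be a single contiguous slice here, which is why the range is only allowed on the last column.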

index entry: (key, value)

index entry: (key, recordID), remember recordID is……

index entry: (key, list of recordIDs)
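The three kinds of index entry above can be sketched on a toy table. This is only an illustration: the `heap` dictionary, the field names, and the representation of a record ID as a `(page_no, slot_no)` pair are assumptions made for the example.

```python
from collections import defaultdict

# Toy table: record ID -> record.
# Assumption: a record ID is a (page_no, slot_no) pair.
heap = {
    (0, 0): {"name": "ann", "age": 31},
    (0, 1): {"name": "bob", "age": 25},
    (1, 0): {"name": "cat", "age": 31},
}

# (key, value): the entry holds the record itself.
alt1 = sorted(((rec["age"], rec) for rec in heap.values()), key=lambda e: e[0])

# (key, recordID): one entry per matching record.
alt2 = sorted((rec["age"], rid) for rid, rec in heap.items())

# (key, list of recordIDs): one entry per distinct key.
grouped = defaultdict(list)
for rid, rec in sorted(heap.items()):
    grouped[rec["age"]].append(rid)
alt3 = sorted(grouped.items())

print(alt2)
print(alt3)
```

With duplicate keys (two records with `age == 31`), the second form repeats the key once per record, while the third form stores it once with a longer list.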

A clustered index is more efficient in I/Os 🤔 and for range search, and it supports “compression” 🤔

Redefine the Occupancy Invariant (for when the index key is not an integer)
Fit more index entries on a page to shorten the tree (avoiding slow I/Os):
- prefix key compression (only in leaf level 🤔, slightly change the order of keys?)
- suffix key compression
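Two ideas commonly used to fit more keys on a page can be sketched as below. This is a rough illustration under assumptions: the helper names `shortest_separator` and `factor_common_prefix` and the sample keys are hypothetical, and the sketch does not settle which tree level each technique applies to.

```python
import os

def shortest_separator(left_key, right_key):
    """Shortest prefix p of right_key such that left_key < p <= right_key."""
    assert left_key < right_key
    for i in range(1, len(right_key) + 1):
        prefix = right_key[:i]
        if left_key < prefix:
            return prefix
    return right_key

def factor_common_prefix(keys):
    """Store the shared prefix once and keep only the differing suffixes."""
    prefix = os.path.commonprefix(keys)
    return prefix, [k[len(prefix):] for k in keys]

# Truncate a key to the shortest prefix that still separates its neighbours.
print(shortest_separator("Dannon Yogurt", "David Smith"))

# Move a shared prefix to one place and keep only the suffixes.
print(factor_common_prefix(["MacBook Air", "MacBook Pro", "MacBook Pro Max"]))
```

Both tricks shrink the bytes per key, so more entries fit on a page, fanout goes up, and the tree gets shorter.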

New assumptions are introduced here:

store by reference (see Alt. 2)
clustered index with 2/3 full heap file pages
- clustered -> heapfile is initially sorted
- fanout is larger ~ $O(Ref)$
- assume static index

The notation is as follows:

side note:

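Scan all records: the factor $3/2$ comes from the 2/3 occupancy. With $B'$ the number of pages actually read, $\frac{2}{3}B' = B \Rightarrow B' = \frac{3}{2}B \Rightarrow B'D = \frac{3}{2}BD$
Equality Search: the constant goes $1 \Rightarrow 2$ !! It comes from reading the slot in the page to get the specific index entry and then reading the value; $\log_F(BR/E)$ is the search for the page
Range Search: should be $(\log_F(BR/E) + 1 + 3 \cdot pages) \cdot D$
Insert & Delete: should be $(\log_F(BR/E) + 4) \cdot D$: read index 1, read value 1, change value 1, change index 1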
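To sanity-check the formulas in this side note, here is a small calculator. The symbol meanings are assumptions, since the notation figure is not reproduced here: `B` data pages, `R` records per page, `D` average time per page I/O, `F` fanout, `E` entries per leaf, and `pages` the number of pages covered by the range. The function name and the sample numbers are made up.

```python
import math

def clustered_index_costs(B, R, D, F, E, pages):
    """Cost estimates for the clustered index, using the side note's formulas."""
    search = math.log(B * R / E, F)  # log_F(BR/E): descend to the right leaf page
    return {
        "scan": 1.5 * B * D,                    # 3/2 * B * D (pages are 2/3 full)
        "equality": (search + 2) * D,           # leaf lookup + record read
        "range": (search + 1 + 3 * pages) * D,  # the note's corrected form
        "insert_delete": (search + 4) * D,      # read/write index, read/write record
    }

# Assumed sample values: BR/E = 1000 leaf pages, fanout 100.
costs = clustered_index_costs(B=1000, R=100, D=1.0, F=100, E=100, pages=2)
for name, cost in costs.items():
    print(f"{name}: {cost:.2f}")
```

With these numbers the tree descent costs about 1.5 I/Os, so a full scan costs far more than any of the point operations.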

big-O notation: 😸

time-stamp: 01h42m07s