堆：以图文并茂的方式解读其工作原理

2023-11-07 05:32:37

了解堆：一种高效的优先级队列数据结构

在上一篇关于队列的文章中，我们探索了使用数组来实现优先级队列的方法。虽然这种方法可以提供 O(1) 的删除时间复杂度，但它的插入操作需要移动数组中平均一半的数据，导致 O(N) 的时间复杂度。为了克服这一限制，本文将深入探讨堆，一种能够以 O(logN) 时间复杂度执行插入和删除操作的优雅数据结构。

什么是堆？

堆是一种基于树状结构的非线性数据结构，满足以下两个关键性质：

1. 堆序性质： 每个节点的值都大于或等于（最大堆）或小于或等于（最小堆）其子节点的值。

2. 完全二叉树： 除了最后一层外，所有其他层中的每个节点都有两个子节点。

堆的操作

1. 插入操作：

向堆中插入一个新元素涉及以下步骤：

将元素附加到堆的最后一个叶节点。
与它的父节点比较，如果需要，向上移动并交换位置。
持续比较和交换，直到元素到达堆顶或找到合适的位置。

2. 删除操作：

从堆中删除一个元素，通常是堆顶元素，遵循以下步骤：

将堆顶元素与最后一个叶节点交换位置。
将叶节点与它的父节点进行比较，如果需要，向下移动并交换位置。
持续比较和交换，直到节点到达合适的位置或没有子节点。

堆的应用

堆在计算机科学中有着广泛的应用，尤其是在优先级队列的实现中。优先级队列允许以 O(logN) 的时间复杂度插入和删除优先级最高的数据，这在各种算法问题中至关重要，包括：

中位数查找
最短路径查找
最小生成树
图形渲染
堆排序

代码示例

Python 中实现一个最小堆：

class MinHeap:
    def __init__(self):
        self.heap = []

    def insert(self, value):
        self.heap.append(value)
        self._bubble_up(len(self.heap) - 1)

    def _bubble_up(self, index):
        while index > 0:
            parent_index = (index - 1) // 2
            if self.heap[index] < self.heap[parent_index]:
                self.heap[index], self.heap[parent_index] = self.heap[parent_index], self.heap[index]
            index = parent_index

    def extract_min(self):
        if not self.heap:
            return None

        min_value = self.heap[0]
        self.heap[0] = self.heap.pop()
        self._bubble_down(0)
        return min_value

    def _bubble_down(self, index):
        while index < len(self.heap):
            left_index = 2 * index + 1
            right_index = 2 * index + 2
            smaller_index = index
            if left_index < len(self.heap) and self.heap[left_index] < self.heap[smaller_index]:
                smaller_index = left_index
            if right_index < len(self.heap) and self.heap[right_index] < self.heap[smaller_index]:
                smaller_index = right_index
            if smaller_index == index:
                break
            self.heap[index], self.heap[smaller_index] = self.heap[smaller_index], self.heap[index]
            index = smaller_index