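[Image Segmentation] Estimating the Optimal Global Threshold with Otsu's Method and an Iterative Method for Image Segmentation (Implementation and Analysis)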

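This experiment requires understanding the concept of global threshold segmentation and implementing segmentation of text images. You should understand in depth how Otsu's method works and how the iterative method converges, and, by applying both, practice image segmentation on text images.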

The experiment is described below in three parts: principles, implementation, and analysis of results.

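Principles

Global threshold segmentation

Global threshold segmentation is a simple segmentation method for grayscale images. The idea is to compare the gray value of every pixel against a single fixed threshold T. A pixel whose gray value is greater than or equal to T is assigned to the foreground (usually the object or region of interest); otherwise it is assigned to the background.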
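The rule above can be illustrated with a minimal sketch. The function name `apply_global_threshold` and the small `patch` of gray values are hypothetical and exist only for this illustration; plain Python lists are used so the snippet runs without any dependencies.

```python
def apply_global_threshold(image, t):
    """Label each pixel 1 (foreground) if its gray value >= t, else 0 (background)."""
    return [[1 if pixel >= t else 0 for pixel in row] for row in image]


# Hypothetical 3x4 grayscale patch, made up for this example.
patch = [
    [12, 40, 200, 230],
    [25, 90, 180, 250],
    [10, 128, 127, 240],
]

mask = apply_global_threshold(patch, t=128)
for row in mask:
    print(row)
```

Note that the value 128 lands in the foreground and 127 in the background, which is the "greater than or equal to T" convention stated above.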

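Otsu's method

Otsu's method selects the global threshold automatically by maximizing the between-class variance (inter-class variance). The between-class variance measures how different the foreground and background pixel classes are; the larger it is, the better the two classes are separated.

The steps of Otsu's method are as follows: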

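a. Compute the gray-level histogram of the image: the histogram gives the frequency with which each gray level occurs in the image.

b. Compute the class probabilities: every candidate threshold T splits the image into two classes, pixels with gray value less than T and pixels with gray value greater than or equal to T. Count the pixels in each class (or, equivalently, compute the probability of each class).

c. Compute the between-class variance: it is the product of the two class probabilities multiplied by the squared difference of the two class mean gray values, i.e. w0 * w1 * (mu0 - mu1)^2. The larger the between-class variance, the more the two classes differ and the better the segmentation.

d. Find the optimal threshold: evaluate the between-class variance for every candidate threshold and choose the threshold that maximizes it as the optimal global threshold.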
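Steps a to d can be sketched compactly as follows. This is only an illustrative sketch, not the listing used in the experiment: the function name `otsu_threshold` and the synthetic pixel list are assumptions made for this example, and running integer sums are used so that each candidate threshold costs constant time.

```python
def otsu_threshold(pixels, levels=256):
    """Return the threshold t that maximizes the between-class variance.

    Class 0 holds gray values < t, class 1 holds gray values >= t.
    """
    # Step a: gray-level histogram
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    total_sum = sum(i * hist[i] for i in range(levels))

    best_t, best_var = 0, 0.0
    n0 = 0  # running pixel count of class 0
    s0 = 0  # running sum of gray values in class 0
    for t in range(1, levels):
        # Step b: gray level t - 1 moves into class 0 when the threshold becomes t
        n0 += hist[t - 1]
        s0 += (t - 1) * hist[t - 1]
        n1 = total - n0
        if n0 == 0 or n1 == 0:
            continue  # one class is empty, so the variance is undefined
        # Step c: between-class variance w0 * w1 * (mu0 - mu1)^2
        w0, w1 = n0 / total, n1 / total
        mu0, mu1 = s0 / n0, (total_sum - s0) / n1
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        # Step d: keep the first threshold that reaches the maximum
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


# Synthetic data made up for this example: a dark cluster and a bright cluster.
pixels = [10] * 50 + [20] * 50 + [200] * 50 + [210] * 50
t = otsu_threshold(pixels)
print(t)
```

For this synthetic data every threshold from 21 to 200 produces the same split between the dark and the bright cluster, so they all share the maximal variance; the sketch returns the first of them.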

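Image segmentation

Using the computed optimal global threshold, the original image is binarized: each pixel's gray value is converted to 0 (background) or 1 (foreground) according to the threshold, which segments the image. The code below writes the foreground as 255 instead of 1 so that the result can be viewed as an 8-bit image.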

Implementation

Input images

In this experiment, the group selected three grayscale images as input, as shown in the figure below.

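Code

Otsu's method and the iterative method are implemented in Python. The script includes a helper that generates a gradient grayscale test image (this step is commented out in the main block). For each input image, it computes the histogram and finds the best threshold with Otsu's method, and separately finds a threshold with the iterative method.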

import os

import cv2
import numpy as np


def get_file_paths(folder_path):
    # Collect the paths of all regular files in the folder
    return [
        os.path.join(folder_path, name)
        for name in os.listdir(folder_path)
        if os.path.isfile(os.path.join(folder_path, name))
    ]


def generate_gradient_image(width, height):
    # Build a horizontal gradient grayscale image
    gradient_image = np.zeros((height, width), dtype=np.uint8)
    for col in range(width):
        # Brightness grows linearly with the column index
        gradient_image[:, col] = int(255 * col / width)
    return gradient_image


def save_image(image, file_path):
    # Save the image to disk
    cv2.imwrite(file_path, image)


def iterative_thresholding(image, epsilon=1e-6, max_iter=100):
    # Initial threshold
    threshold = 128.0
    for _ in range(max_iter):
        # Split the image with the current threshold
        foreground = image > threshold
        # Stop if one class is empty; its mean would be undefined
        if foreground.all() or not foreground.any():
            break
        # Mean gray value of foreground and background
        mean_foreground = image[foreground].mean()
        mean_background = image[~foreground].mean()
        # New threshold is the midpoint of the two means
        new_threshold = 0.5 * (mean_foreground + mean_background)
        # Stop once the threshold changes by less than epsilon
        if abs(new_threshold - threshold) < epsilon:
            break
        threshold = new_threshold
    return threshold


def otsu_thresholding(image):
    # Compute the Otsu globally optimal threshold
    # Histogram, normalized to probabilities
    hist, _ = np.histogram(image.ravel(), bins=256, range=(0, 256))
    prob = hist.astype(float) / hist.sum()
    levels = np.arange(256)
    var_between = np.zeros(256)
    for t in range(1, 256):
        # Class 0: gray values < t, class 1: gray values >= t
        w0 = prob[:t].sum()
        w1 = prob[t:].sum()
        if w0 == 0 or w1 == 0:
            continue
        mu0 = (levels[:t] * prob[:t]).sum() / w0
        mu1 = (levels[t:] * prob[t:]).sum() / w1
        # Between-class variance
        var_between[t] = w0 * w1 * (mu0 - mu1) ** 2
    # Threshold with the largest between-class variance
    return int(np.argmax(var_between))


def threshold(image_path):
    # Produce and save the thresholded images for one input file
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
    if img is None:
        print(f'skipped (not a readable image): {image_path}')
        return
    image_name = os.path.basename(image_path)

    # Otsu threshold
    otsu_threshold = otsu_thresholding(img)
    print(f'otsu_threshold:{otsu_threshold}')
    # cv2.THRESH_BINARY keeps pixels strictly greater than the threshold,
    # so pass t - 1 to make pixels >= t foreground, matching the class split above
    _, binary_image = cv2.threshold(img, otsu_threshold - 1, 255, cv2.THRESH_BINARY)
    save_image(binary_image, f'Otsu_{image_name}')

    # Iterative threshold
    iterative_threshold = iterative_thresholding(img)
    print(f'iterative_threshold:{iterative_threshold}')
    _, binary_image = cv2.threshold(img, iterative_threshold, 255, cv2.THRESH_BINARY)
    save_image(binary_image, f'iterative_{image_name}')

    print(f'{os.path.splitext(image_name)[0]} test finished')


if __name__ == '__main__':
    # Optional: generate and save a gradient test image
    # width = 640
    # height = 480
    # image = generate_gradient_image(width, height)
    # save_image(image, 'exam0.jpg')

    folder_path = 'exam'
    # Paths of all files in the folder
    exam_paths = get_file_paths(folder_path)
    # Process each image in turn
    for image_path in exam_paths:
        print(f'image_name:{os.path.basename(image_path)}')
        threshold(image_path)
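The iterative rule used in the listing (repeatedly setting the threshold to the midpoint of the two class means) is often described as an ISODATA-style scheme. The following dependency-free sketch traces it on synthetic data so the convergence can be checked by hand. The function name `iterative_threshold` and the pixel values are assumptions made for this illustration, not part of the experiment.

```python
def iterative_threshold(pixels, initial=128.0, epsilon=1e-6, max_iter=100):
    """Move the threshold to the midpoint of the class means until it settles."""
    t = initial
    for _ in range(max_iter):
        foreground = [p for p in pixels if p > t]
        background = [p for p in pixels if p <= t]
        if not foreground or not background:
            break  # one class is empty, so its mean is undefined
        mean_fg = sum(foreground) / len(foreground)
        mean_bg = sum(background) / len(background)
        new_t = 0.5 * (mean_fg + mean_bg)
        converged = abs(new_t - t) < epsilon
        t = new_t
        if converged:
            break
    return t


# Synthetic data made up for this example: a dark cluster and a bright cluster.
pixels = [10] * 50 + [20] * 50 + [200] * 50 + [210] * 50
result = iterative_threshold(pixels)
print(result)
```

Starting from 128, the foreground mean is 205 and the background mean is 15, so the first update moves the threshold to 110. The second pass yields the same split and therefore the same midpoint, and the loop stops.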