Python实现Bbox的旋转变换

发布时间：2024-01-01 21:01:33

在Python中实现Bbox（边界框）的旋转变换可以使用OpenCV库。OpenCV提供了一个rotate function（旋转函数），可以旋转图像或图像中的边界框。

下面是一个使用OpenCV进行Bbox旋转变换的示例：

import cv2
import numpy as np

def rotate_bbox(bbox, angle, image_width, image_height):
    # 计算旋转矩阵
    center_x = (bbox[0] + bbox[2]) / 2
    center_y = (bbox[1] + bbox[3]) / 2
    rotation_matrix = cv2.getRotationMatrix2D((center_x, center_y), angle, 1)

    # 旋转边界框的四个顶点
    top_left = np.dot(rotation_matrix, [bbox[0], bbox[1], 1])
    top_right = np.dot(rotation_matrix, [bbox[2], bbox[1], 1])
    bottom_left = np.dot(rotation_matrix, [bbox[0], bbox[3], 1])
    bottom_right = np.dot(rotation_matrix, [bbox[2], bbox[3], 1])

    # 计算旋转后的边界框的新坐标
    x = min(top_left[0], top_right[0], bottom_left[0], bottom_right[0])
    y = min(top_left[1], top_right[1], bottom_left[1], bottom_right[1])
    w = max(top_left[0], top_right[0], bottom_left[0], bottom_right[0]) - x
    h = max(top_left[1], top_right[1], bottom_left[1], bottom_right[1]) - y

    # 将坐标限制在图像范围内
    x = max(0, min(x, image_width - 1))
    y = max(0, min(y, image_height - 1))
    w = max(0, min(w, image_width - x))
    h = max(0, min(h, image_height - y))

    return [x, y, x + w, y + h]

# 读取图像
image = cv2.imread('image.jpg')
image_height, image_width, _ = image.shape

# 定义初始边界框
bbox = [100, 100, 200, 200]

# 定义旋转角度
angle = 45

# 进行旋转变换
rotated_bbox = rotate_bbox(bbox, angle, image_width, image_height)

# 绘制旋转后的边界框
cv2.rectangle(image, (bbox[0], bbox[1]), (bbox[2], bbox[3]), (0, 255, 0), 2)
cv2.rectangle(image, (rotated_bbox[0], rotated_bbox[1]), (rotated_bbox[2], rotated_bbox[3]), (0, 0, 255), 2)

# 显示图像
cv2.imshow("Image", image)
cv2.waitKey(0)
cv2.destroyAllWindows()

在这个例子中，rotate_bbox函数接受一个初始边界框（由左上角和右下角的坐标定义），旋转角度以及图像的宽度和高度作为输入。它首先计算旋转矩阵，然后将边界框的四个顶点旋转。最后，函数计算旋转后的边界框的新坐标，并确保它们保持在图像的范围内。

在主程序中，我们首先读取输入图像并获取其高度和宽度。然后，我们定义初始边界框和旋转角度。接下来，我们调用rotate_bbox函数来获得旋转后的边界框。最后，我们在图像上绘制初始边界框和旋转后的边界框，并显示图像。

这是一个非常基本的例子，你可以根据你的需求进行修改和扩展。希望这对你有所帮助！