Real-Time Apple Detection Method for Picking Robots Based on Improved YOLOv5

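The paper "Real-Time Apple Target Detection Method for Picking Robots Based on Improved YOLOv5" presents a real-time apple detection method built on an improved YOLOv5 (You Only Look Once version 5) algorithm, intended to raise the performance of picking robots. YOLOv5 is an efficient and accurate object detection framework that is well suited to real-time applications. In agricultural automation, this kind of technology can improve both the efficiency and the accuracy of fruit detection and picking.

The core idea of the YOLO family is to fold object localization and classification into a single neural network, so that bounding boxes and class probabilities are predicted in one forward pass and the whole process is fast and direct. YOLOv5 further improves detection speed and accuracy by optimizing the network structure, adopting more advanced training strategies, and using more efficient computation. In this paper, the authors modify YOLOv5 to cope with the complex conditions of an orchard, such as changing illumination and occluded fruit.

The paper describes how the apple image data set was collected and annotated, a key step in training a deep learning model, because the quality and diversity of the data set directly affect how well the model generalizes. The authors also discuss hyperparameter selection, data augmentation, and loss function design during training, all of which matter for model performance.

The paper also addresses the challenges of real-time detection, such as limited computing resources and the need to sustain a high frame rate. To handle these, the research team may have adopted a lightweight network architecture or model compression so that the detector runs smoothly on the robot's hardware platform.

In the experimental section, the authors report the performance of the improved YOLOv5 on the apple detection task, including accuracy (such as mean average precision, mAP), detection speed, and a comparison with other detection algorithms. These results support the claim that the method balances real-time performance and accuracy well.

Finally, the paper discusses potential applications and future research directions, which may include extending the method to other fruit types, optimizing picking strategies, and integrating it into more complex agricultural robot systems. Overall, the work shows how an improved YOLOv5 algorithm can detect apple targets in real time, and it is of both theoretical and practical value for research on agricultural robots, fruit picking in particular.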

Remote Sens. 2021, 13, 1619 5 of 23

training set, data enhancement processing was applied to the data set to better extract the features of apples belonging to different labeled categories and to avoid overfitting of the trained model.

Table 2. Detailed information of images in test set.

Test Set             Sunny   Cloudy   Total
Number of images       100      100     200
Graspable apple        482      525    1007
Ungraspable apple      766      563    1329

Uncertain factors such as illumination angle and weather make the lighting conditions during image acquisition extremely complex. To improve the generalization ability of the apple target detection model, several image enhancement methods were applied to each of the 1014 training-set images using MATLAB (version 2016, The MathWorks Inc., Natick, MA, USA) and its image processing functions. The methods include brightness enhancement and reduction, horizontal mirroring, vertical mirroring, and rotation by 90°, 180°, and 270°. In addition, to account for the noise generated by the image acquisition equipment and for the blur caused by shaking of the equipment or the branches, Gaussian noise with a variance of 0.02 was added to the images and motion blur was applied. The procedures are detailed in the following.

Image brightness enhancement and reduction: First, the original image is converted to HSV space using the ‘rgb2hsv’ function. Second, the V (brightness) component of the image is multiplied by different coefficients. Finally, the resulting HSV image is converted back to RGB space using the ‘hsv2rgb’ function. In this study, brightness enhancement generates three brightness intensities, (H + S + 1.2 × V), (H + S + 1.4 × V) and (H + S + 1.6 × V), and brightness reduction generates two, (H + S + 0.6 × V) and (H + S + 0.8 × V).
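
The three steps above can be sketched in Python with the standard-library `colorsys` module standing in for ‘rgb2hsv’ and ‘hsv2rgb’. This is only an illustration, not the authors' MATLAB code: the helper names are hypothetical, pixels are assumed to be RGB tuples in [0, 1], and clipping V at 1.0 is an assumption, since the text does not say how values above the valid range are handled.

```python
import colorsys

# The five V-channel coefficients listed in the text.
FACTORS = [0.6, 0.8, 1.2, 1.4, 1.6]


def scale_brightness(pixel, factor):
    """Scale the V channel of one RGB pixel whose components lie in [0, 1]."""
    h, s, v = colorsys.rgb_to_hsv(*pixel)
    v = min(1.0, v * factor)  # assumption: clip to the valid range
    return colorsys.hsv_to_rgb(h, s, v)


def adjust_image(image, factor):
    """Apply scale_brightness to an image stored as rows of RGB tuples."""
    return [[scale_brightness(px, factor) for px in row] for row in image]


# A tiny 1 x 2 example image; H and S are left untouched, only V changes.
image = [[(0.5, 0.25, 0.25), (0.1, 0.2, 0.3)]]
variants = {factor: adjust_image(image, factor) for factor in FACTORS}
```

Each raw image yields five brightness variants this way, one per coefficient.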

Image mirroring (horizontal and vertical) was implemented using the MATLAB function ‘imwarp’. Horizontal mirroring swaps the left and right sides of the image about its vertical centerline; vertical mirroring swaps the upper and lower sides about its horizontal centerline.
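
As a rough illustration of the two mirroring operations, the sketch below flips an image stored as a list of pixel rows. It does not reproduce ‘imwarp’ (which applies a general geometric transform); the function names are hypothetical and the list-of-rows layout is an assumption made for brevity.

```python
def mirror_horizontal(image):
    """Swap left and right sides: reverse the pixels within each row."""
    return [row[::-1] for row in image]


def mirror_vertical(image):
    """Swap upper and lower sides: reverse the order of the rows."""
    return [row[:] for row in image[::-1]]


image = [[1, 2, 3],
         [4, 5, 6]]
flipped_lr = mirror_horizontal(image)  # [[3, 2, 1], [6, 5, 4]]
flipped_ud = mirror_vertical(image)    # [[4, 5, 6], [1, 2, 3]]
```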

For image rotation, the MATLAB function ‘imrotate’ was used to rotate the raw image by 90°, 180°, and 270°, set through the function parameter ‘angle’. Training on the rotated images helps the model correctly identify apples in different orientations, which improves detection performance.
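
For right-angle rotations no interpolation is needed, so the operation can be sketched with plain list manipulation. The snippet below is an illustration with hypothetical function names; it follows the ‘imrotate’ convention that a positive angle rotates counter-clockwise, and it assumes the image is a list of pixel rows.

```python
def rotate_90_ccw(image):
    """Rotate an image (list of rows) by 90 degrees counter-clockwise."""
    columns = [list(col) for col in zip(*image)]
    return columns[::-1]


def rotate(image, angle):
    """Rotate by a multiple of 90 degrees, counter-clockwise for positive angles."""
    if angle % 90 != 0:
        raise ValueError("this sketch only handles multiples of 90 degrees")
    for _ in range((angle // 90) % 4):
        image = rotate_90_ccw(image)
    return image


image = [[1, 2, 3],
         [4, 5, 6]]
rotated = {angle: rotate(image, angle) for angle in (90, 180, 270)}
# rotated[90]  is [[3, 6], [2, 5], [1, 4]]
# rotated[180] is [[6, 5, 4], [3, 2, 1]]
# rotated[270] is [[4, 1], [5, 2], [6, 3]]
```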

Four kinds of motion blur were applied so that the convolutional network model adapts well to blurred images. A predefined two-dimensional filter was created using the MATLAB function ‘fspecial’. The filter parameters LEN (the length, in pixels, of the linear camera motion) and THETA (θ, the motion angle in degrees, measured counter-clockwise) were set to (6, 30), (6, −30), (7, 45) and (7, −45), respectively. The MATLAB function ‘imfilter’ was then used to blur the image with the generated filter.
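
To make the roles of LEN and THETA concrete, here is a simplified Python sketch of a linear motion-blur filter. It is not a re-implementation of ‘fspecial’ or ‘imfilter’: the kernel weights come from point-sampling a line segment (MATLAB computes them differently), the image border is handled by replicating edge pixels (an assumption of this sketch), and all function names are hypothetical.

```python
import math

# The four (LEN, THETA) settings listed in the text.
PARAMS = [(6, 30), (6, -30), (7, 45), (7, -45)]


def motion_kernel(length, theta_deg, samples_per_pixel=10):
    """Approximate a linear-motion kernel by sampling points along a segment."""
    half = (length - 1) / 2.0
    radius = int(math.ceil(half))
    size = 2 * radius + 1
    kernel = [[0.0] * size for _ in range(size)]
    theta = math.radians(theta_deg)
    n = max(2, int(length * samples_per_pixel))
    for i in range(n):
        t = -half + 2 * half * i / (n - 1)
        x = t * math.cos(theta)
        y = -t * math.sin(theta)  # rows grow downward, so CCW means negative y
        kernel[radius + int(round(y))][radius + int(round(x))] += 1.0
    total = sum(map(sum, kernel))
    return [[w / total for w in row] for row in kernel]  # weights sum to 1


def blur(image, kernel):
    """Correlate a grayscale image (rows of floats) with the kernel."""
    h, w = len(image), len(image[0])
    r = len(kernel) // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for ky, krow in enumerate(kernel):
                yy = min(max(y + ky - r, 0), h - 1)  # replicate border rows
                for kx, weight in enumerate(krow):
                    if weight:
                        xx = min(max(x + kx - r, 0), w - 1)  # replicate columns
                        acc += weight * image[yy][xx]
            out[y][x] = acc
    return out


# A single bright pixel is smeared along the motion direction.
image = [[0.0] * 9 for _ in range(9)]
image[4][4] = 1.0
blurred = [blur(image, motion_kernel(length, theta)) for length, theta in PARAMS]
```

Because each kernel is normalized, the overall brightness of the image is preserved; only the sharpness along the motion direction is reduced.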

Furthermore, Gaussian noise with a variance of 0.02 was added to the raw images using the MATLAB function ‘imnoise’.
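
The noise step can be sketched as follows. This is an illustration rather than the ‘imnoise’ implementation: it assumes a grayscale image with intensities in [0, 1], a zero noise mean (the text gives only the variance), and clipping of the result back to [0, 1]; the function name and the `rng` parameter are hypothetical.

```python
import math
import random


def add_gaussian_noise(image, variance=0.02, mean=0.0, rng=None):
    """Add Gaussian noise to a grayscale image given as rows of floats in [0, 1]."""
    rng = rng or random.Random()
    sigma = math.sqrt(variance)  # gauss() expects a standard deviation
    return [
        [min(1.0, max(0.0, px + rng.gauss(mean, sigma))) for px in row]
        for row in image
    ]


# Seeding the generator makes the augmentation reproducible.
image = [[0.5] * 4 for _ in range(4)]
noisy = add_gaussian_noise(image, variance=0.02, rng=random.Random(0))
```

Note that a variance of 0.02 corresponds to a standard deviation of about 0.14 on the [0, 1] intensity scale, which is clearly visible noise.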

The final training set consists of 16,224 images used to train the apple target recognition model: 15,210 enhanced images and 1014 raw images. The detailed distribution of the training set data is shown in Figure 3. There was no overlap between the training set and the test set.
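
The reported totals are consistent with every raw image receiving each enhancement exactly once. The short check below makes that arithmetic explicit; the per-method breakdown is inferred from the settings listed in the text (five brightness coefficients, two mirrorings, three rotations, four motion-blur settings, one noise level), not quoted from the paper.

```python
RAW_IMAGES = 1014

# Number of enhanced variants produced per raw image by each method (inferred).
VARIANTS_PER_IMAGE = {
    "brightness": 5,      # V scaled by 0.6, 0.8, 1.2, 1.4, 1.6
    "mirroring": 2,       # horizontal and vertical
    "rotation": 3,        # 90, 180, 270 degrees
    "motion_blur": 4,     # (6, 30), (6, -30), (7, 45), (7, -45)
    "gaussian_noise": 1,  # variance 0.02
}

enhanced = RAW_IMAGES * sum(VARIANTS_PER_IMAGE.values())
total = RAW_IMAGES + enhanced
print(enhanced, total)  # 15210 16224
```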

