Real-time object detection and robotic manipulation
for agriculture using a YOLO-based learning
approach
Hongyu Zhao, Zezhi Tang∗, Zhenhong Li, Yi Dong, Yuancheng Si, Mingyang Lu, George Panoutsos
Abstract—The optimisation of harvesting processes for commonly cultivated crops is of great importance to agricultural industrialisation. The use of machine vision has enabled the automated identification of crops and improved harvesting efficiency, but challenges remain. This study presents a new framework that combines two separate convolutional neural network (CNN) architectures to simultaneously accomplish crop detection and harvesting (robotic manipulation) in a simulated environment. Crop images in the simulated environment are subjected to random rotations, cropping, and brightness and contrast adjustments to create augmented images for dataset generation. The You Only Look Once (YOLO) algorithmic framework is employed with traditional rectangular bounding boxes (R-Bbox) for crop localisation. The proposed method then feeds the acquired image data to a visual geometry group (VGG) model to determine the grasping positions for robotic manipulation.
Index Terms—Deep learning, YOLOv3-dense, robot grasping.
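As a concrete illustration of the augmentation step described in the abstract, the following is a minimal sketch of such a pipeline. It assumes a torchvision-based implementation; the library choice, the image file name, and all parameter values are illustrative assumptions rather than details taken from this paper.

import torchvision.transforms as T
from PIL import Image

# Random rotation, random crop, and brightness/contrast jitter, as in the
# abstract; all parameter values below are assumed for illustration.
augment = T.Compose([
    T.RandomRotation(degrees=30),                 # random angle in [-30, 30] degrees
    T.RandomResizedCrop(size=416),                # random crop resized to 416x416, a common YOLO input size
    T.ColorJitter(brightness=0.4, contrast=0.4),  # random brightness and contrast adjustments
])

image = Image.open("crop_sample.png").convert("RGB")  # hypothetical source image
augmented = [augment(image) for _ in range(8)]        # several augmented variants per source image

Note that for a detection dataset the geometric transforms (rotation and cropping) must also be applied to the bounding-box annotations, which is typically handled by detection-aware augmentation tooling rather than image-only transforms such as these.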
I. INTRODUCTION
The progression of automation can be observed on a global scale across several industries, and agricultural production is likewise being modernised and automated. The implementation of mechanised techniques in agriculture has facilitated the automation of diverse processes, resulting in enhanced efficiency in agricultural production [1]. Nevertheless, crop harvesting remains inadequately automated, with conventional robots encountering challenges in accurately perceiving crops and successfully executing the grasp.
H. Zhao is with the Department of Physics, Imperial College, London,
United Kingdom (email: hz2623@ic.ac.uk)
Z. Tang and G. Panoutsos are with the Department of Automatic Control
and Systems Engineering, University of Sheffield, Sheffield, S1 3JD, United
Kingdom (emails: zezhi.tang@sheffield.ac.uk, g.panoutsos@sheffield.ac.uk)
Z. Li is with the Department of Electrical and Electronic Engineering, University of Manchester, Manchester, United Kingdom (email: zhenhong.li@manchester.ac.uk)
Y. Dong is with the Department of Electronics and Computer Science, University of Southampton, Southampton, United Kingdom (email: yi.dong@soton.ac.uk)
Y. Si is with the Department of Economics, Fudan University, Shanghai,
200433, China (email: siyuancheng@fudan.edu.cn)
M. Lu is with the Center for Nondestructive Evaluation (CNDE), Iowa State
University, Ames, IA 50011, United States (email: mingylu@iastate.edu)
*Corresponding author
*© 2024 IEEE. Personal use of this material is permitted. Permission from
IEEE must be obtained for all other uses, in any current or future media,
including reprinting/republishing this material for advertising or promotional
purposes, creating new collective works, for resale or redistribution to servers
or lists, or reuse of any copyrighted component of this work in other works.
Traditional machines have faced challenges in harvesting crops, and manual labour is time-consuming and raises production costs; robots can therefore contribute to increased agricultural productivity [2]. On industrial production lines, robots typically perform specific roles within a production task, such as manipulating and placing products at a fixed location or executing specific steps within a specialised process [3]. Deploying robots in agriculture, however, requires enhanced object detection and grasping capabilities, so research on robot recognition and grasping techniques is necessary. Grasping in particular holds significant importance in automation, as the majority of automated systems rely on the precise and effective gripping of a designated object. A wide range of algorithms currently exists for object recognition in conjunction with robotic grasping.
The mask region-based convolutional neural network (Mask-RCNN) algorithm is employed to segment the scene and perform geometric stereo-matching in order to accurately determine the location of the object of interest in the camera's field of view; the robot manipulator then grasps the target object efficiently [4]. The grasp region-based convolutional neural network (GR-ConvNet) algorithm can generate grasping poses from RGB images; using n-channel images of the scene, it addresses the challenge of planning and executing grasps for a robot that is unfamiliar with the items in its environment [5].
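To make this detect-then-grasp pattern concrete, the following is a minimal sketch of the detection stage using torchvision's COCO-pretrained Mask R-CNN as an off-the-shelf stand-in; it is not the implementation of [4], and the placeholder input frame and confidence threshold are assumptions.

import torch
from torchvision.models.detection import maskrcnn_resnet50_fpn

# Load a COCO-pretrained Mask R-CNN as a generic instance detector.
model = maskrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

frame = torch.rand(3, 480, 640)    # placeholder for a normalised RGB camera frame
with torch.no_grad():
    output = model([frame])[0]     # dict with 'boxes', 'labels', 'scores', 'masks'

keep = output["scores"] > 0.8      # confidence threshold (assumed value)
boxes = output["boxes"][keep]      # candidate object locations for the grasp stage

In a pipeline such as [4], detections of this kind are then stereo-matched to recover the 3-D position of the object before the manipulator executes the grasp.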
The YOLO method is a computer vision technique for object recognition, renowned for its remarkable real-time detection capabilities, and it continues to be optimised and enhanced. As discussed in [6], YOLO effectively addresses the significant challenge of domain shift commonly seen in traditional target detection approaches, enabling the generalised underwater object detector (GUOD) to achieve commendable performance across diverse underwater settings. The authors of [7] propose a solution to the challenge of image detection on datasets with limited samples; their approach uses the real-time capabilities of YOLO, together with techniques such as transfer learning and data augmentation, to enhance detection rates and speed. In [8], YOLO is acknowledged for addressing the difficulty of accurately de-