Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling

Deep learning has already been successfully used in the development of decision support systems in various domains. Therefore, there is an incentive to apply it in other important domains such as agriculture. Fertilizers, electricity, chemicals, human labor, and water are the components of total ene...

Full description

Bibliographic Details
Main Authors: Khadijeh Alibabaei, Pedro D. Gaspar, Tânia M. Lima
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/14/11/3004
_version_ 1797533103581822976
author Khadijeh Alibabaei
Pedro D. Gaspar
Tânia M. Lima
author_facet Khadijeh Alibabaei
Pedro D. Gaspar
Tânia M. Lima
author_sort Khadijeh Alibabaei
collection DOAJ
description Deep learning has already been successfully used in the development of decision support systems in various domains. Therefore, there is an incentive to apply it in other important domains such as agriculture. Fertilizers, electricity, chemicals, human labor, and water are the components of total energy consumption in agriculture. Yield estimates are critical for food security, crop management, irrigation scheduling, and estimating labor requirements for harvesting and storage. Therefore, estimating product yield can reduce energy consumption. Two deep learning models, Long Short-Term Memory and Gated Recurrent Units, have been developed for the analysis of time-series data such as agricultural datasets. In this paper, the capabilities of these models and their extensions, called Bidirectional Long Short-Term Memory and Bidirectional Gated Recurrent Units, to predict end-of-season yields are investigated. The models use historical data, including climate data, irrigation scheduling, and soil water content, to estimate end-of-season yield. The application of this technique was tested for tomato and potato yields at a site in Portugal. The Bidirectional Long Short-Term memory outperformed the Gated Recurrent Units network, the Long Short-Term Memory, and the Bidirectional Gated Recurrent Units network on the validation dataset. The model was able to capture the nonlinear relationship between irrigation amount, climate data, and soil water content and predict yield with an MSE of 0.017 to 0.039. The performance of the Bidirectional Long Short-Term Memory in the test was compared with the most commonly used deep learning method, the Convolutional Neural Network, and machine learning methods including a Multi-Layer Perceptrons model and Random Forest Regression. The Bidirectional Long Short-Term Memory outperformed the other models with an <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula> score between 0.97 and 0.99. The results show that analyzing agricultural data with the Long Short-Term Memory model improves the performance of the model in terms of accuracy. The Convolutional Neural Network model achieved the second-best performance. Therefore, the deep learning model has a remarkable ability to predict the yield at the end of the season.
first_indexed 2024-03-10T11:09:49Z
format Article
id doaj.art-ec42ee323bb546a78d4783a7da002667
institution Directory Open Access Journal
issn 1996-1073
language English
last_indexed 2024-03-10T11:09:49Z
publishDate 2021-05-01
publisher MDPI AG
record_format Article
series Energies
spelling doaj.art-ec42ee323bb546a78d4783a7da0026672023-11-21T20:53:38ZengMDPI AGEnergies1996-10732021-05-011411300410.3390/en14113004Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation SchedulingKhadijeh Alibabaei0Pedro D. Gaspar1Tânia M. Lima2C-MAST Center for Mechanical and Aerospace Science and Technologies, University of Beira Interior, 6201-001 Covilhã, PortugalC-MAST Center for Mechanical and Aerospace Science and Technologies, University of Beira Interior, 6201-001 Covilhã, PortugalC-MAST Center for Mechanical and Aerospace Science and Technologies, University of Beira Interior, 6201-001 Covilhã, PortugalDeep learning has already been successfully used in the development of decision support systems in various domains. Therefore, there is an incentive to apply it in other important domains such as agriculture. Fertilizers, electricity, chemicals, human labor, and water are the components of total energy consumption in agriculture. Yield estimates are critical for food security, crop management, irrigation scheduling, and estimating labor requirements for harvesting and storage. Therefore, estimating product yield can reduce energy consumption. Two deep learning models, Long Short-Term Memory and Gated Recurrent Units, have been developed for the analysis of time-series data such as agricultural datasets. In this paper, the capabilities of these models and their extensions, called Bidirectional Long Short-Term Memory and Bidirectional Gated Recurrent Units, to predict end-of-season yields are investigated. The models use historical data, including climate data, irrigation scheduling, and soil water content, to estimate end-of-season yield. The application of this technique was tested for tomato and potato yields at a site in Portugal. The Bidirectional Long Short-Term memory outperformed the Gated Recurrent Units network, the Long Short-Term Memory, and the Bidirectional Gated Recurrent Units network on the validation dataset. The model was able to capture the nonlinear relationship between irrigation amount, climate data, and soil water content and predict yield with an MSE of 0.017 to 0.039. The performance of the Bidirectional Long Short-Term Memory in the test was compared with the most commonly used deep learning method, the Convolutional Neural Network, and machine learning methods including a Multi-Layer Perceptrons model and Random Forest Regression. The Bidirectional Long Short-Term Memory outperformed the other models with an <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula> score between 0.97 and 0.99. The results show that analyzing agricultural data with the Long Short-Term Memory model improves the performance of the model in terms of accuracy. The Convolutional Neural Network model achieved the second-best performance. Therefore, the deep learning model has a remarkable ability to predict the yield at the end of the season.https://www.mdpi.com/1996-1073/14/11/3004agriculturedeep learningLSTMsupport decision-making algorithmsyield estimationirrigation management
spellingShingle Khadijeh Alibabaei
Pedro D. Gaspar
Tânia M. Lima
Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
Energies
agriculture
deep learning
LSTM
support decision-making algorithms
yield estimation
irrigation management
title Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
title_full Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
title_fullStr Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
title_full_unstemmed Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
title_short Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling
title_sort crop yield estimation using deep learning based on climate big data and irrigation scheduling
topic agriculture
deep learning
LSTM
support decision-making algorithms
yield estimation
irrigation management
url https://www.mdpi.com/1996-1073/14/11/3004
work_keys_str_mv AT khadijehalibabaei cropyieldestimationusingdeeplearningbasedonclimatebigdataandirrigationscheduling
AT pedrodgaspar cropyieldestimationusingdeeplearningbasedonclimatebigdataandirrigationscheduling
AT taniamlima cropyieldestimationusingdeeplearningbasedonclimatebigdataandirrigationscheduling