DIEC-ViT: discriminative information enhanced contrastive vision transformer for the identification of plant diseases in complex environments
DIEC-ViT: discriminative information enhanced contrastive vision transformer for the identification of plant diseases in complex environments
Recently, vision transformer (ViT)-based methods have made breakthroughs on plant disease recognition tasks and have surpassed convolutional neural network (CNN)-based methods. They are now considered the state-of-the-art for such methods. However, ViT-based methods usually encode and decode images through global modeling, which introduces a large amount of noise information when dealing with plant disease images in complex environments. In addition, plant disease images in complex environments have significant intra- and inter-class differences, further limiting the performance of ViT-based methods. To address the above limitations, we propose the discriminative information enhanced contrastive vision transformer, in short DIEC-ViT, for plant disease recognition in complex environments. DIEC-ViT contains two key modules, namely, the discriminative information enhancement (DIE) module and the contrastive learning (CL) module. Specifically, the DIE module enhances the perception of discriminative regions of the ViT and suppresses complex backgrounds by counting multi-head self-attention for multi-levels of class tokens. To cope with the problem of intra- and inter-class differences in plant disease images, the CL module is introduced into the ViT to optimize the feature space by reducing the distance between positive pairs and increasing the distance between negative pairs. Extensive experiments verify the effectiveness of the two modules. In addition, DEIC-ViT outperforms state-of-the-art methods with three field plant disease datasets. The obtained results indicate the potential of our approach to drive further development of ViT in the field of plant disease monitoring.
Contrastive learning, Discriminative information enhancement modules, Plant disease recognition, Vision transformers
Lin, Jianwu
7db282c5-c8a9-447c-bad4-34241c034b32
Chen, Xiaoyulong
02c0a0f6-0927-47d3-80ab-68b70e81c9fb
Lou, Lunhong
fa075f70-efed-4a0d-8f81-3f235ad55c15
You, Lin
35c5c929-7427-49e7-8b2f-176bfcb6d1fc
Cernava, Tomislav
a13d65aa-2529-479a-ba90-69ebbc4ba07f
Huang, Dahui
05b13b54-035f-475b-8791-6f0cda3aba01
Qin, Yongbin
d1d0dabe-7b43-4287-b0f3-739863cd360f
Zhang, Xin
3056a795-80f7-4bbd-9c75-ecbc93085421
1 July 2025
Lin, Jianwu
7db282c5-c8a9-447c-bad4-34241c034b32
Chen, Xiaoyulong
02c0a0f6-0927-47d3-80ab-68b70e81c9fb
Lou, Lunhong
fa075f70-efed-4a0d-8f81-3f235ad55c15
You, Lin
35c5c929-7427-49e7-8b2f-176bfcb6d1fc
Cernava, Tomislav
a13d65aa-2529-479a-ba90-69ebbc4ba07f
Huang, Dahui
05b13b54-035f-475b-8791-6f0cda3aba01
Qin, Yongbin
d1d0dabe-7b43-4287-b0f3-739863cd360f
Zhang, Xin
3056a795-80f7-4bbd-9c75-ecbc93085421
Lin, Jianwu, Chen, Xiaoyulong, Lou, Lunhong, You, Lin, Cernava, Tomislav, Huang, Dahui, Qin, Yongbin and Zhang, Xin
(2025)
DIEC-ViT: discriminative information enhanced contrastive vision transformer for the identification of plant diseases in complex environments.
Expert Systems with Applications, 281, [127730].
(doi:10.1016/j.eswa.2025.127730).
Abstract
Recently, vision transformer (ViT)-based methods have made breakthroughs on plant disease recognition tasks and have surpassed convolutional neural network (CNN)-based methods. They are now considered the state-of-the-art for such methods. However, ViT-based methods usually encode and decode images through global modeling, which introduces a large amount of noise information when dealing with plant disease images in complex environments. In addition, plant disease images in complex environments have significant intra- and inter-class differences, further limiting the performance of ViT-based methods. To address the above limitations, we propose the discriminative information enhanced contrastive vision transformer, in short DIEC-ViT, for plant disease recognition in complex environments. DIEC-ViT contains two key modules, namely, the discriminative information enhancement (DIE) module and the contrastive learning (CL) module. Specifically, the DIE module enhances the perception of discriminative regions of the ViT and suppresses complex backgrounds by counting multi-head self-attention for multi-levels of class tokens. To cope with the problem of intra- and inter-class differences in plant disease images, the CL module is introduced into the ViT to optimize the feature space by reducing the distance between positive pairs and increasing the distance between negative pairs. Extensive experiments verify the effectiveness of the two modules. In addition, DEIC-ViT outperforms state-of-the-art methods with three field plant disease datasets. The obtained results indicate the potential of our approach to drive further development of ViT in the field of plant disease monitoring.
This record has no associated files available for download.
More information
Accepted/In Press date: 12 April 2025
e-pub ahead of print date: 15 April 2025
Published date: 1 July 2025
Additional Information:
Publisher Copyright:
© 2025 Elsevier Ltd
Keywords:
Contrastive learning, Discriminative information enhancement modules, Plant disease recognition, Vision transformers
Identifiers
Local EPrints ID: 501653
URI: http://eprints.soton.ac.uk/id/eprint/501653
ISSN: 0957-4174
PURE UUID: 7210e28d-e992-453d-b37a-62425ce945c2
Catalogue record
Date deposited: 04 Jun 2025 17:14
Last modified: 22 Aug 2025 02:38
Export record
Altmetrics
Contributors
Author:
Jianwu Lin
Author:
Xiaoyulong Chen
Author:
Lunhong Lou
Author:
Lin You
Author:
Tomislav Cernava
Author:
Dahui Huang
Author:
Yongbin Qin
Author:
Xin Zhang
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics