Survey of machine learning techniques for malware analysis
Survey of machine learning techniques for malware analysis
Coping with malware is getting more and more challenging, given their relentless growth in complexity and volume. One of the most common approaches in literature is using machine learning techniques, to automatically learn models and patterns behind such complexity, and to develop technologies to keep pace with malware evolution. This survey aims at providing an overview on the way machine learning has been used so far in the context of malware analysis in Windows environments, i.e. for the analysis of Portable Executables. We systematize surveyed papers according to their objectives (i.e., the expected output), what information about malware they specifically use (i.e., the features), and what machine learning techniques they employ (i.e., what algorithm is used to process the input and produce the output). We also outline a number of issues and challenges, including those concerning the used datasets, and identify the main current topical trends and how to possibly advance them. In particular, we introduce the novel concept of malware analysis economics, regarding the study of existing trade-offs among key metrics, such as analysis accuracy and economical costs.
123-147
Ucci, Daniele
a25d9fc6-0075-4d85-bd3f-155058fe32ad
Aniello, Leonardo
9846e2e4-1303-4b8b-9092-5d8e9bb514c3
Baldoni, Roberto
6ea5e1cc-92fe-4b9d-9ed3-0b7970553965
March 2019
Ucci, Daniele
a25d9fc6-0075-4d85-bd3f-155058fe32ad
Aniello, Leonardo
9846e2e4-1303-4b8b-9092-5d8e9bb514c3
Baldoni, Roberto
6ea5e1cc-92fe-4b9d-9ed3-0b7970553965
Ucci, Daniele, Aniello, Leonardo and Baldoni, Roberto
(2019)
Survey of machine learning techniques for malware analysis.
Computers and Security, 81, .
(doi:10.1016/j.cose.2018.11.001).
Abstract
Coping with malware is getting more and more challenging, given their relentless growth in complexity and volume. One of the most common approaches in literature is using machine learning techniques, to automatically learn models and patterns behind such complexity, and to develop technologies to keep pace with malware evolution. This survey aims at providing an overview on the way machine learning has been used so far in the context of malware analysis in Windows environments, i.e. for the analysis of Portable Executables. We systematize surveyed papers according to their objectives (i.e., the expected output), what information about malware they specifically use (i.e., the features), and what machine learning techniques they employ (i.e., what algorithm is used to process the input and produce the output). We also outline a number of issues and challenges, including those concerning the used datasets, and identify the main current topical trends and how to possibly advance them. In particular, we introduce the novel concept of malware analysis economics, regarding the study of existing trade-offs among key metrics, such as analysis accuracy and economical costs.
Text
main
- Accepted Manuscript
More information
Accepted/In Press date: 9 November 2018
e-pub ahead of print date: 24 November 2018
Published date: March 2019
Identifiers
Local EPrints ID: 426403
URI: http://eprints.soton.ac.uk/id/eprint/426403
ISSN: 0167-4048
PURE UUID: 941d9a00-e662-4656-aa50-1e24170d4032
Catalogue record
Date deposited: 27 Nov 2018 17:30
Last modified: 16 Mar 2024 07:20
Export record
Altmetrics
Contributors
Author:
Daniele Ucci
Author:
Leonardo Aniello
Author:
Roberto Baldoni
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics