The University of Southampton
University of Southampton Institutional Repository

Computational prediction of short linear motifs from protein sequences

Computational prediction of short linear motifs from protein sequences
Computational prediction of short linear motifs from protein sequences
Short Linear Motifs (SLiMs) are functional protein microdomains that typically mediate interactions between a short linear region in one protein and a globular domain in another. SLiMs usually occur in structurally disordered regions and mediate low affinity interactions. Most SLiMs are 3-15 amino acids in length and have 2-5 defined positions, making them highly likely to occur by chance and extremely difficult to identify. Nevertheless, our knowledge of SLiMs and capacity to predict them from protein sequence data using computational methods has advanced dramatically over the past decade. By considering the biological, structural, and evolutionary context of SLiM occurrences, it is possible to differentiate functional instances from chance matches in many cases and to identify new regions of proteins that have the features consistent with a SLiM-mediated interaction. Their simplicity also makes SLiMs evolutionarily labile and prone to independent origins on different sequence backgrounds through convergent evolution, which can be exploited for predicting novel SLiMs in proteins that share a function or interaction partner. In this review, we explore our current knowledge of SLiMs and how it can be applied to the task of predicting them computationally from protein sequences. Rather than focusing on specific SLiM prediction tools, we provide an overview of the methods available and concentrate on principles that should continue to be paramount even in the light of future developments. We consider the relative merits of using regular expressions or profiles for SLiM discovery and discuss the main considerations for both predicting new instances of known SLiMs, and de novo prediction of novel SLiMs. In particular, we highlight the importance of correctly modelling evolutionary relationships and the probability of false positive predictions.
short linear motifs, SLiM, motif discovery, protein-protein interactions, posttranslational modifications, intrinsically disordered proteins, regular expressions, sequence profiles, sequence motifs
89-141
Edwards, Richard J.
9d25e74f-dc0d-455a-832c-5f363d864c43
Palopoli, Nicolas
227c4548-0b1a-4dc9-b628-74cf48c878d5
Edwards, Richard J.
9d25e74f-dc0d-455a-832c-5f363d864c43
Palopoli, Nicolas
227c4548-0b1a-4dc9-b628-74cf48c878d5

Edwards, Richard J. and Palopoli, Nicolas (2015) Computational prediction of short linear motifs from protein sequences. Methods in Molecular Biology, 1268, 89-141. (doi:10.1007/978-1-4939-2285-7_6). (PMID:25555723)

Record type: Article

Abstract

Short Linear Motifs (SLiMs) are functional protein microdomains that typically mediate interactions between a short linear region in one protein and a globular domain in another. SLiMs usually occur in structurally disordered regions and mediate low affinity interactions. Most SLiMs are 3-15 amino acids in length and have 2-5 defined positions, making them highly likely to occur by chance and extremely difficult to identify. Nevertheless, our knowledge of SLiMs and capacity to predict them from protein sequence data using computational methods has advanced dramatically over the past decade. By considering the biological, structural, and evolutionary context of SLiM occurrences, it is possible to differentiate functional instances from chance matches in many cases and to identify new regions of proteins that have the features consistent with a SLiM-mediated interaction. Their simplicity also makes SLiMs evolutionarily labile and prone to independent origins on different sequence backgrounds through convergent evolution, which can be exploited for predicting novel SLiMs in proteins that share a function or interaction partner. In this review, we explore our current knowledge of SLiMs and how it can be applied to the task of predicting them computationally from protein sequences. Rather than focusing on specific SLiM prediction tools, we provide an overview of the methods available and concentrate on principles that should continue to be paramount even in the light of future developments. We consider the relative merits of using regular expressions or profiles for SLiM discovery and discuss the main considerations for both predicting new instances of known SLiMs, and de novo prediction of novel SLiMs. In particular, we highlight the importance of correctly modelling evolutionary relationships and the probability of false positive predictions.

This record has no associated files available for download.

More information

e-pub ahead of print date: 11 December 2014
Published date: 2015
Keywords: short linear motifs, SLiM, motif discovery, protein-protein interactions, posttranslational modifications, intrinsically disordered proteins, regular expressions, sequence profiles, sequence motifs
Organisations: Molecular and Cellular

Identifiers

Local EPrints ID: 378593
URI: http://eprints.soton.ac.uk/id/eprint/378593
PURE UUID: 5a863d65-ac2e-4d3f-887c-ce711803471e

Catalogue record

Date deposited: 10 Jul 2015 12:33
Last modified: 14 Mar 2024 20:25

Export record

Altmetrics

Contributors

Author: Richard J. Edwards
Author: Nicolas Palopoli

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×