Interactome-wide prediction of short, disordered protein interaction motifs in humans
Interactome-wide prediction of short, disordered protein interaction motifs in humans
Many of the specific functions of intrinsically disordered protein segments are mediated by Short Linear Motifs (SLiMs) interacting with other proteins. Well known examples include SLiMs that interact with 14-3-3, PDZ, SH2, SH3, and WW domains but the true extent and diversity of SLiM-mediated interactions is largely unknown. Here, we attempt to expand our knowledge of human SLiMs by applying in silico SLiM prediction to the human interactome. Combining data from seven different interaction databases, we analysed approximately 6000 protein-centred and 1600 domain-centred human interaction datasets of 3+ unrelated proteins that interact with a common partner. Results were placed in context through comparison to randomised datasets of similar size and composition. The search returned thousands of evolutionarily conserved, intrinsically disordered occurrences of hundreds of significantly enriched recurring motifs, including many that have never been previously identified (). In addition to True Positive results for at least 25 different known SLiMs, a striking number of "off-target" proteins/domains also returned significantly enriched known motifs. Often, this was due to the non-independence of the datasets, with many proteins sharing interaction partners or contributing interactions to multiple domain datasets. The majority of these motif classes, however, were also found to be significantly enriched in one or more randomised datasets. This highlights the need for care when interpreting motif predictions of this nature but also raises the possibility that SLiM occurrences may be successfully identified independently of interaction data. Although not as compositionally biased as previous studies, patterns matching known SLiMs tended to cluster into a few large groups of similar sequence, while novel predictions tended to be more distinctive and less abundant. Whether this is due to ascertainment bias or a true functional composition bias of SLiMs is not clear and warrants further investigation.
282-295
Edwards, Richard J
9d25e74f-dc0d-455a-832c-5f363d864c43
Davey, Norman E.
bdaded6d-ac23-4a43-b347-3113159dfb70
O'Brien, Kevin
c3a639fc-d95b-4867-ad94-7bb6d68d6eae
Shields, Denis C.
57ffee4f-0277-4b3d-9c7a-8c328637d8e6
January 2012
Edwards, Richard J
9d25e74f-dc0d-455a-832c-5f363d864c43
Davey, Norman E.
bdaded6d-ac23-4a43-b347-3113159dfb70
O'Brien, Kevin
c3a639fc-d95b-4867-ad94-7bb6d68d6eae
Shields, Denis C.
57ffee4f-0277-4b3d-9c7a-8c328637d8e6
Edwards, Richard J, Davey, Norman E., O'Brien, Kevin and Shields, Denis C.
(2012)
Interactome-wide prediction of short, disordered protein interaction motifs in humans.
Molecular BioSystems, 8 (1), .
(doi:10.1039/C1MB05212H).
(PMID:21879107)
Abstract
Many of the specific functions of intrinsically disordered protein segments are mediated by Short Linear Motifs (SLiMs) interacting with other proteins. Well known examples include SLiMs that interact with 14-3-3, PDZ, SH2, SH3, and WW domains but the true extent and diversity of SLiM-mediated interactions is largely unknown. Here, we attempt to expand our knowledge of human SLiMs by applying in silico SLiM prediction to the human interactome. Combining data from seven different interaction databases, we analysed approximately 6000 protein-centred and 1600 domain-centred human interaction datasets of 3+ unrelated proteins that interact with a common partner. Results were placed in context through comparison to randomised datasets of similar size and composition. The search returned thousands of evolutionarily conserved, intrinsically disordered occurrences of hundreds of significantly enriched recurring motifs, including many that have never been previously identified (). In addition to True Positive results for at least 25 different known SLiMs, a striking number of "off-target" proteins/domains also returned significantly enriched known motifs. Often, this was due to the non-independence of the datasets, with many proteins sharing interaction partners or contributing interactions to multiple domain datasets. The majority of these motif classes, however, were also found to be significantly enriched in one or more randomised datasets. This highlights the need for care when interpreting motif predictions of this nature but also raises the possibility that SLiM occurrences may be successfully identified independently of interaction data. Although not as compositionally biased as previous studies, patterns matching known SLiMs tended to cluster into a few large groups of similar sequence, while novel predictions tended to be more distinctive and less abundant. Whether this is due to ascertainment bias or a true functional composition bias of SLiMs is not clear and warrants further investigation.
Text
edwards_et_al_2012.pdf
- Version of Record
Text
edwards_et_al_2012-SI.pdf
- Other
More information
e-pub ahead of print date: 30 August 2011
Published date: January 2012
Organisations:
Molecular and Cellular
Identifiers
Local EPrints ID: 210851
URI: http://eprints.soton.ac.uk/id/eprint/210851
ISSN: 1742-2051
PURE UUID: 2ad6f371-faa7-4f5e-835e-2b609507b456
Catalogue record
Date deposited: 13 Feb 2012 15:06
Last modified: 14 Mar 2024 04:50
Export record
Altmetrics
Contributors
Author:
Richard J Edwards
Author:
Norman E. Davey
Author:
Kevin O'Brien
Author:
Denis C. Shields
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics