{"id":4415,"date":"2018-02-04T21:12:29","date_gmt":"2018-02-04T21:12:29","guid":{"rendered":"http:\/\/www.cytognomix.com\/?p=4415"},"modified":"2018-02-04T21:12:29","modified_gmt":"2018-02-04T21:12:29","slug":"may-4-2015-comment-on-pmid-23348723-prediction-of-mutant-mrna-splice-isoforms-by-information-theory-based-exon-definition","status":"publish","type":"post","link":"https:\/\/www.cytognomix.com\/?p=4415","title":{"rendered":"May 4, 2015. Comment on PMID 23348723. Prediction of mutant mRNA splice isoforms by information theory-based exon definition."},"content":{"rendered":"<p style=\"display: inline !important;\">Pe<a id=\"ui-ncbipopper-10\" role=\"button\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/myncbi\/peter.rogan.1\/comments\/\" class=\"broken_link\">ter Rogan<\/a>\u00a0<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23348723#cm23348723_10087\">2015 May 04 6:14 p.m.<\/a><\/p>\n<div data-md=\"The Logic and Formulation of Exon Definition for Splice and Splicing Regulatory Sites with Negative Information Content. PK Rogan, EJ Mucaki\n\nUpdate on:\n&lt;PMID: 23348723&gt; and the  [Automated Splice Site and Exon Definition Analysis server (ASSEDA)](http:\/\/mutationforecaster.com).\n\nIn &lt;PMID: 23348723&gt;, we described a method of predicting the overall strength of an exon by calculating its total information content (R_i,total_) from the sum of the R_i_ values of its donor and acceptor splice sites, adjusted for their gap surprisal (the self-information of the distance between the two sites). Differences between \u0394R_i,total_ values are predictive of the relative abundance of these exons in distinct processed mRNAs. \n\nSplice sites altered by mutations that prevent stable interaction with splicesomes are said to be abolished.  Information theory predicts abolition of binding below their minimum binding affinity, Ri,minimum, which is empirically derived. This value is slightly above zero bits, the theoretical minimum for binding at equilibrium (\u0394G = 0; &lt;PMID: 9446751&gt;). Sites with R_i_ &lt; 0  are not bound,  forming stable interactions would be endergonic (\u0394G &gt; 0). This raises the question, when predicting the change in exon strength (\u0394R_i,total_) due to a mutation that inactivates binding, whether mutant sites with varying degrees of negative information content are energetically distinguishable from one another.\n\nThe computation of R_i,total_ contains the sum of the the R_i_ values of component binding sites, irrespective of their initial or final strengths. Thus, a mutated site with R_i_ &lt;&lt; 0 would result in greater \u0394R_i,total_ compared to a site with R_i_ ~ 0. To assess whether the degree of unfavorable binding should be applied to the exon definition calculation, or if values below 0 bits should be computed similarly to a binding site at equilibrium (R_i_ ~ 0), we reevaluated experimentally validated natural and regulatory splicing mutations in our paper with both approaches.  R_i,total_ was calculated for 10 variants from Supplementary Table 2, both including and excluding the negative information (ie. R_i_ &lt; 0 vs. R_i_ = 0) of inactivated splice sites. Mutation #2 of Supplementary Table 2 [ADA:g.43249658G&gt;A] abolishes a natural donor site, from 8.8 to -9.9 bits. In applying the full decrease in strength (\u0394R_i,total_: -18.7 bits), the natural exon strength decreases from 21.0 to 2.3 bits. When the negative information content is set to zero bits, the change is significantly smaller (21.0 -&gt; 12.2 bits; \u0394R_i,total_ = -8.8 bits). When a weak natural splice site is abolished, the difference as expressed as \u0394R_i,total_ can be quite small (Mutation #9; -14.8 vs -3.1 bits). In the case of Mutation #38, the reduction in \u0394R_i,total_ leads to a partially discordant prediction where the abolished natural exon is weaker than the experimentally confirmed activated cryptic exon. Results for this mutation were concordant with the published version when the negative bit value of the mutated natural site was included in the calculation.\n\nThe impact of mutations in splicing regulatory (SR) factors can also be predicted on ASSEDA, where the Ri of the SR binding site is added to the R_i,total, as well as a secondary gap surprisal value for the particular SR protein. These sites can also be abolished. But when a SR protein binding site is no longer active, should the SR gap surprisal still be applied, or is the SR gap surprisal no longer applicable?\n\nWe tested mutations from &lt;PMID: 23348723&gt; (Supplementary Table 4), which abolish the splicing enhancer SF2\/ASF with and without the SR protein gap surprisal when Ri of the SR site is &lt; 0 bits. The removal of the gap surprisal term for Mutation #2 of Supplementary Table 4 leads to a discordant prediction, where the \u0394R_i_ is less than the SR gap surprisal at that distance and therefore the \u0394R_i,total_ is positive.  As experimental evidence shows an increase in skipping, it is a discordant prediction. Therefore, the gap surprisal is still applied in the computation of both initial and final R_i_,total values when the SR protein of interest is abolished as the site is naturally present and therefore expected for binding. Conversely, when we apply the gap surprisal to the initial R_i,total_ for a splicing factor that is being created, we are essentially applying a penalty for a site that does not normally exist. Therefore, we no longer apply the SR gap surprisal value to the initial R_i,total_ in these cases.\n\nThe revised R_i,total_ values of SR binding site mutations slightly differ from those reported in &lt;PMID: 23348723&gt; (Supplementary Table 4). This is because the gap surprisal distributions were recomputed for the following factors: SF2\/ASF, SC35 and SRp40, with updated versions of these models based on CLIP Seq data (&lt;PMID: 25416797&gt;, &lt;PMID: 21087992&gt;). This resulted in small changes to the distributions for SF2\/ASF and SC35, however changes for SRp40 were significant, and now more closely resembles the other gap surprisal functions. The updated graphs of distance vs. gap surprisal are available at: &lt;http:\/\/splice.uwo.ca\/gapsurprisals.html&gt;. While this should not significantly affect \u0394R_i,total_ values, it may affect the initial and final R_i,total_ values.\"><\/p>\n<p>The Logic and Formulation of Exon Definition for Splice and Splicing Regulatory Sites with Negative Information Content. PK Rogan, EJ Mucaki<\/p>\n<p>Update on:\u00a0<a title=\"Prediction of mutant mRNA splice isoforms by information theory-based exon definition.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23348723\/\">Mucaki EJ, 2013<\/a>\u00a0and the\u00a0<a href=\"http:\/\/mutationforecaster.com\/\">Automated Splice Site and Exon Definition Analysis server (ASSEDA)<\/a>.<\/p>\n<p>In\u00a0<a title=\"Prediction of mutant mRNA splice isoforms by information theory-based exon definition.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23348723\/\">Mucaki EJ, 2013<\/a>, we described a method of predicting the overall strength of an exon by calculating its total information content (R<em>i,total<\/em>) from the sum of the R<em>i<\/em>\u00a0values of its donor and acceptor splice sites, adjusted for their gap surprisal (the self-information of the distance between the two sites). Differences between \u0394R<em>i,total<\/em>\u00a0values are predictive of the relative abundance of these exons in distinct processed mRNAs.<\/p>\n<p>Splice sites altered by mutations that prevent stable interaction with splicesomes are said to be abolished. Information theory predicts abolition of binding below their minimum binding affinity, Ri,minimum, which is empirically derived. This value is slightly above zero bits, the theoretical minimum for binding at equilibrium (\u0394G = 0;\u00a0<a title=\"Information content of individual genetic sequences.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/9446751\/\">Schneider TD, 1997<\/a>). Sites with R<em>i<\/em>\u00a0&lt; 0 are not bound, forming stable interactions would be endergonic (\u0394G &gt; 0). This raises the question, when predicting the change in exon strength (\u0394R<em>i,total<\/em>) due to a mutation that inactivates binding, whether mutant sites with varying degrees of negative information content are energetically distinguishable from one another.<\/p>\n<p>The computation of R<em>i,total<\/em>\u00a0contains the sum of the the R<em>i<\/em>\u00a0values of component binding sites, irrespective of their initial or final strengths. Thus, a mutated site with R<em>i<\/em>\u00a0&lt;&lt; 0 would result in greater \u0394R<em>i,total<\/em>\u00a0compared to a site with R<em>i<\/em>\u00a0~ 0. To assess whether the degree of unfavorable binding should be applied to the exon definition calculation, or if values below 0 bits should be computed similarly to a binding site at equilibrium (R<em>i<\/em>\u00a0~ 0), we reevaluated experimentally validated natural and regulatory splicing mutations in our paper with both approaches. R<em>i,total<\/em>\u00a0was calculated for 10 variants from Supplementary Table 2, both including and excluding the negative information (ie. R<em>i<\/em>\u00a0&lt; 0 vs. R<em>i<\/em>\u00a0= 0) of inactivated splice sites. Mutation #2 of Supplementary Table 2 [ADA:g.43249658G&gt;A] abolishes a natural donor site, from 8.8 to -9.9 bits. In applying the full decrease in strength (\u0394R<em>i,total<\/em>: -18.7 bits), the natural exon strength decreases from 21.0 to 2.3 bits. When the negative information content is set to zero bits, the change is significantly smaller (21.0 -&gt; 12.2 bits; \u0394R<em>i,total<\/em>\u00a0= -8.8 bits). When a weak natural splice site is abolished, the difference as expressed as \u0394R<em>i,total<\/em>\u00a0can be quite small (Mutation #9; -14.8 vs -3.1 bits). In the case of Mutation #38, the reduction in \u0394R<em>i,total<\/em>\u00a0leads to a partially discordant prediction where the abolished natural exon is weaker than the experimentally confirmed activated cryptic exon. Results for this mutation were concordant with the published version when the negative bit value of the mutated natural site was included in the calculation.<\/p>\n<p>The impact of mutations in splicing regulatory (SR) factors can also be predicted on ASSEDA, where the Ri of the SR binding site is added to the R_i,total, as well as a secondary gap surprisal value for the particular SR protein. These sites can also be abolished. But when a SR protein binding site is no longer active, should the SR gap surprisal still be applied, or is the SR gap surprisal no longer applicable?<\/p>\n<p>We tested mutations from\u00a0<a title=\"Prediction of mutant mRNA splice isoforms by information theory-based exon definition.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23348723\/\">Mucaki EJ, 2013<\/a>\u00a0(Supplementary Table 4), which abolish the splicing enhancer SF2\/ASF with and without the SR protein gap surprisal when Ri of the SR site is &lt; 0 bits. The removal of the gap surprisal term for Mutation #2 of Supplementary Table 4 leads to a discordant prediction, where the \u0394R<em>i<\/em>\u00a0is less than the SR gap surprisal at that distance and therefore the \u0394R<em>i,total<\/em>\u00a0is positive. As experimental evidence shows an increase in skipping, it is a discordant prediction. Therefore, the gap surprisal is still applied in the computation of both initial and final R<em>i<\/em>,total values when the SR protein of interest is abolished as the site is naturally present and therefore expected for binding. Conversely, when we apply the gap surprisal to the initial R<em>i,total<\/em>\u00a0for a splicing factor that is being created, we are essentially applying a penalty for a site that does not normally exist. Therefore, we no longer apply the SR gap surprisal value to the initial R<em>i,total<\/em>\u00a0in these cases.<\/p>\n<p>The revised R<em>i,total<\/em>\u00a0values of SR binding site mutations slightly differ from those reported in\u00a0<a title=\"Prediction of mutant mRNA splice isoforms by information theory-based exon definition.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23348723\/\">Mucaki EJ, 2013<\/a>\u00a0(Supplementary Table 4). This is because the gap surprisal distributions were recomputed for the following factors: SF2\/ASF, SC35 and SRp40, with updated versions of these models based on CLIP Seq data (<a title=\"DoRiNA 2.0--upgrading the doRiNA database of RNA interactions in post-transcriptional regulation.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/25416797\/\">Blin K, 2015<\/a>,\u00a0<a title=\"CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins.\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21087992\/\">Khorshid M, 2011<\/a>). This resulted in small changes to the distributions for SF2\/ASF and SC35, however changes for SRp40 were significant, and now more closely resembles the other gap surprisal functions. The updated graphs of distance vs. gap surprisal are available at:\u00a0<a href=\"http:\/\/splice.uwo.ca\/gapsurprisals.html\" class=\"broken_link\">http:\/\/splice.uwo.ca\/gapsurprisals.html<\/a>. While this should not significantly affect \u0394R<em>i,total<\/em>\u00a0values, it may affect the initial and final R<em>i,total<\/em>\u00a0values.<\/p>\n<\/div>\n<ul id=\"comment_list\" data-page=\"emb\" data-pcid=\"1bmvzvyklN_0pnYv3EiJ5t0FU3W_A463fwq2XjjAezPXkst5z\">\n<li data-cmid=\"78253\">\n<div>\n<p>This article was mentioned in a comment by\u00a0<a id=\"ui-ncbipopper-11\" role=\"button\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/myncbi\/peter.rogan.1\/comments\/\" class=\"broken_link\">Peter Rogan<\/a>2018 Jan 12 2:39 p.m.<\/p>\n<p>See:<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/29280214\/#cm29280214_78253\">Thorough in silico and in vitro cDNA analysis of 21 putative BRCA1 and BRCA2 splice variants and a complex tandem duplication in BRCA2 allowing the identification of activated cryptic splice donor sites in BRCA2 exon 11.<\/a>\u00a0[Hum Mutat. 2017.]<\/div>\n<\/li>\n<li data-cmid=\"77995\">\n<div>\n<p>This article was mentioned in a comment by\u00a0<a id=\"ui-ncbipopper-12\" role=\"button\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/myncbi\/peter.rogan.1\/comments\/\" class=\"broken_link\">Peter Rogan<\/a>2017 Dec 12 09:53 a.m.<\/p>\n<p>See:<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23169495\/#cm23169495_77995\">Analysis of the effects of rare variants on splicing identifies alterations in GABAA receptor genes in autism spectrum disorder individuals.<\/a>[Eur J Hum Genet. 2013.]<\/div>\n<\/li>\n<li data-cmid=\"77962\">\n<div>\n<p>This article was mentioned in a comment by\u00a0<a id=\"ui-ncbipopper-13\" role=\"button\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/myncbi\/peter.rogan.1\/comments\/\" class=\"broken_link\">Peter Rogan<\/a>2017 Dec 07 5:24 p.m.<\/p>\n<p>See:<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/29185120\/#cm29185120_77962\">Characterization of a novel germline BRCA1 splice variant, c.5332+4delA.<\/a>\u00a0[Breast Cancer Res Treat. 2017.]<\/div>\n<\/li>\n<li data-cmid=\"74191\">\n<div>\n<p>This article was mentioned in a comment by\u00a0<a id=\"ui-ncbipopper-14\" role=\"button\" href=\"https:\/\/www.ncbi.nlm.nih.gov\/myncbi\/peter.rogan.1\/comments\/\" class=\"broken_link\">Peter Rogan<\/a>2017 Oct 01 8:57 p.m.<\/p>\n<p>See:<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/28949076\/#cm28949076_74191\">Rules and tools to predict the splicing effects of exonic and intronic mutations.<\/a>\u00a0[Wiley Interdiscip Rev RNA. 2018.]<\/div>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Peter Rogan\u00a02015 May 04 6:14 p.m. The Logic and Formulation of Exon Definition for Splice and Splicing Regulatory Sites with Negative Information Content. PK Rogan, EJ Mucaki Update on:\u00a0Mucaki EJ, 2013\u00a0and the\u00a0Automated Splice Site and Exon Definition Analysis server (ASSEDA). In\u00a0Mucaki EJ, 2013, we described a method of predicting the overall strength of an exon [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[6],"tags":[],"_links":{"self":[{"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/posts\/4415"}],"collection":[{"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4415"}],"version-history":[{"count":1,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/posts\/4415\/revisions"}],"predecessor-version":[{"id":4416,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=\/wp\/v2\/posts\/4415\/revisions\/4416"}],"wp:attachment":[{"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4415"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4415"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cytognomix.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4415"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}