{"id":178,"date":"2012-11-28T13:04:55","date_gmt":"2012-11-28T13:04:55","guid":{"rendered":"http:\/\/nextmovesoftware.com\/blog\/?p=178"},"modified":"2015-06-22T17:04:05","modified_gmt":"2015-06-22T16:04:05","slug":"text-mining-for-a-worthy-cause","status":"publish","type":"post","link":"https:\/\/nextmovesoftware.com\/blog\/2012\/11\/28\/text-mining-for-a-worthy-cause\/","title":{"rendered":"Text Mining for a Worthy Cause"},"content":{"rendered":"<p>I recently received an e-mail from the charity &#8220;<a href=\"http:\/\/www.jeansforgenesday.org\/\">jeans for genes<\/a>&#8221; introducing me to &#8220;<b>black bone disease<\/b>&#8220;, a rare genetic disease without a cure.  It is more formally known as &#8220;<a href=\"http:\/\/en.wikipedia.org\/wiki\/Alkaptonuria\">Alkaptonuria<\/a>&#8221; (<a href=\"http:\/\/omim.org\/entry\/203500\">OMIM entry<\/a>) and is a defect in the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Homogentisate_1,2-dioxygenase\">homogentisate 1,2-dioxygenase<\/a> gene (HGD) which leads to a toxic build-up of homogentisic acid in the blood, causing the symptoms of the disease.<\/p>\n<p>Interestingly a re-purposed herbicide, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Nitisinone\">nitisinone<\/a>, is currently being <a href=\"http:\/\/clinicaltrials.gov\/ct2\/show\/NCT00005909\">investigated<\/a> as a possible treatment for the disease based on its previous re-purposing as a therapy in related genetic disorder, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Type_I_tyrosinemia\">Type 1 Tyrosinemia<\/a>.<\/p>\n<p>The story starts in 1977 when a researcher in California observed that relatively few weeds were growing under the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Callistemon\">bottlebrush (Callistemon)<\/a> plants in his backyard.  Analytical chemistry of the soil fractions revealed the active compound to be the natural product <a href=\"http:\/\/en.wikipedia.org\/wiki\/Leptospermone\">Leptospermone<\/a>.  Traditional ligand based optimization of this compound led to the effective herbicides <a href=\"http:\/\/en.wikipedia.org\/wiki\/Mesotrione\">mesotrione<\/a> (Syngenta&#8217;s <a href=\"http:\/\/www.syngenta.com\/global\/corporate\/en\/products-and-innovation\/product-brands\/crop-protection\/herbicides\/Pages\/callisto.aspx\">Callisto<\/a>) and <a href=\"http:\/\/en.wikipedia.org\/wiki\/Nitisinone\">nitisinone<\/a> being synthesized and tested in 1984, with the first patents on this class of herbicides appearing in 1986 (e.g. <a href=\"http:\/\/www.google.com\/patents\/US4780127\">US 4780127<\/a>).  At the point these patents were filed\/granted, the mechanism of action and protein target weren&#8217;t yet known, although they were experimentally proven to be toxic to plants but harmless to mammals.  Much later it was discovered that these compounds worked by inhibiting the enzyme <a href=\"http:\/\/en.wikipedia.org\/wiki\/4-Hydroxyphenylpyruvate_dioxygenase\">4-hydroxyphenylpyruvate dioxygenase<\/a> (HPPD) which blocks the synthesis of chlorophyll and leads to &#8220;bleaching&#8221; and eventual plant death.<\/p>\n<p>It is the role that HPPD plays in human metabolism that make these herbicides so interesting as therapeutic agents.  The pathway diagram below describes the five enzymatic steps (arrows) in the degradation metabolism of tyrosine.<br \/>\n<a href=\"http:\/\/en.wikipedia.org\/wiki\/File:Tyrosinedegradation2.png\"><img decoding=\"async\" src=\"\/\/upload.wikimedia.org\/wikipedia\/commons\/c\/c9\/Tyrosinedegradation2.png\" width=\"640\"><\/a><\/p>\n<p>Defects in these various enzymes responsible for each step lead to a number of <a href=\"http:\/\/apps.who.int\/classifications\/icd10\/browse\/2010\/en#\/E70.2\">related diseases<\/a>: Problems with the first step, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Tyrosine_aminotransferase\">tyrosine-transaminase<\/a>, cause <a href=\"http:\/\/en.wikipedia.org\/wiki\/Tyrosinemia_type_II\">type 2 tyrosinemia<\/a>; the second step, <a href=\"http:\/\/en.wikipedia.org\/wiki\/4-Hydroxyphenylpyruvate_dioxygenase\">p-Hydroxylphenylpyruvate-dioxygenase<\/a> (HPPD) is our herbicide target for which defects cause <a href=\"http:\/\/en.wikipedia.org\/wiki\/Type_III_tyrosinemia\">type 3 tyrosinemia<\/a>; step three, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Homogentisate_1,2-dioxygenase\">homogentisate dioxygenase<\/a> (HGD) causes <b>alkaptonuria<\/b> (aka black bone disease); and step 5, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Fumarylacetoacetate_hydrolase\">4-fumaryl-acetoacetate hydrolase<\/a> causes <a href=\"http:\/\/en.wikipedia.org\/wiki\/Type_I_tyrosinemia\">type 1 tyrosinemia<\/a>.<\/p>\n<p>In the case of type 1 tyrosinemia, it was noticed that those patients with active HPPD had a more severe form of the disease, so it was hypothesized that a HPPD inhibitor may be beneficial.  At the time Zeneca worked on both pharmaceuticals and crop protection and were able to evaluate their proven-safe herbicide nitisinone directly in the clinic.  In what seems incredible by the standards of today&#8217;s pharmaceutical pipelines, their <a href=\"http:\/\/www.google.com\/patents\/US5550165\">US 5550165<\/a> patent filing describes the administration to, and recovery of, sick infants and children, where it is now more usual for a drug candidate to spend years in phase I, II and III clinical trials after a patent is granted before it gets approved by the FDA.<\/p>\n<p>HPPD inhibitors can be anticipated to treat alkaptonuria by much the same mechanism:<br \/>\nBy blocking the formation of the toxic metabolite homogentisate, and causing tyrosine<br \/>\nto be metabolised via alternate routes.<\/p>\n<p>One of the goals of modern text mining is to automatically discover links such as those between the above two patents, <a href=\"http:\/\/www.google.com\/patents\/US4780127\">US4780127<\/a> and <a href=\"http:\/\/www.google.com\/patents\/US5550165\">US5550165<\/a>.  Unfortunately, a range of technical issues complicate the process: In common with many pharmaceutical patent filings, the drug target is not known or not mentioned, so it is necessary to identify and annotate compound classes or modes of action such as &#8220;kinase inhibitor&#8221;, &#8220;beta-blocker&#8221;, &#8220;herbicide&#8221; or &#8220;antibiotic&#8221;.  The large number of synonyms and typographical variants of enzyme and disease names requires the use of synonym dictionaries or ontologies to recognize that &#8220;tyrosine transaminase&#8221; is the same entity as &#8220;tyrosine aminotransferase&#8221; is the same as &#8220;<a href=\"http:\/\/www.chem.qmul.ac.uk\/iubmb\/enzyme\/EC2\/6\/1\/5.html\">EC 2.6.1.5<\/a>&#8220;.  Finally, as revealed by the mistake &#8220;<b>tyosinemia<\/b>&#8221; in the title of the above <a href=\"http:\/\/www.google.com\/patents\/US5550165\">US 5550165<\/a>, documents in real life frequently contain spelling errors, making it impossible to find the most relevant documents when searching for a keyword like &#8220;tyrosinemia&#8221; (without automatic spelling correction).<\/p>\n<p>These are exactly the types of challenges our <a href=\"http:\/\/www.nextmovesoftware.com\/products\/LeadMine.html\">LeadMine<\/a> software attempts to tackle.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I recently received an e-mail from the charity &#8220;jeans for genes&#8221; introducing me to &#8220;black bone disease&#8220;, a rare genetic disease without a cure. It is more formally known as &#8220;Alkaptonuria&#8221; (OMIM entry) and is a defect in the homogentisate 1,2-dioxygenase gene (HGD) which leads to a toxic build-up of homogentisic acid in the blood, &hellip; <a href=\"https:\/\/nextmovesoftware.com\/blog\/2012\/11\/28\/text-mining-for-a-worthy-cause\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Text Mining for a Worthy Cause<\/span><\/a><\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/178"}],"collection":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/comments?post=178"}],"version-history":[{"count":38,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/178\/revisions"}],"predecessor-version":[{"id":1467,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/178\/revisions\/1467"}],"wp:attachment":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/media?parent=178"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/categories?post=178"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/tags?post=178"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}