{"id":1605,"date":"2015-09-24T09:40:55","date_gmt":"2015-09-24T08:40:55","guid":{"rendered":"https:\/\/nextmovesoftware.com\/blog\/?p=1605"},"modified":"2015-09-24T09:40:55","modified_gmt":"2015-09-24T08:40:55","slug":"shakespeare-through-the-eyes-of-a-chemist-part-ii","status":"publish","type":"post","link":"https:\/\/nextmovesoftware.com\/blog\/2015\/09\/24\/shakespeare-through-the-eyes-of-a-chemist-part-ii\/","title":{"rendered":"Shakespeare through the eyes of a chemist Part II"},"content":{"rendered":"<p><a href=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2015\/09\/2724589320_9ffdc9cfb6_m.jpg\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2015\/09\/2724589320_9ffdc9cfb6_m.jpg\" alt=\"Lego Shakey\" width=\"180\" height=\"240\" class=\"alignright size-full wp-image-1611\" \/><\/a>In an <a href=\"https:\/\/nextmovesoftware.com\/blog\/2013\/10\/28\/shakespeare-through-the-eyes-of-a-chemist\/\">earlier post<\/a> I looked at the chemicals found in Shakespeare&#8217;s plays. Following on from the improved text-mining of diseases described in the <a href=\"https:\/\/nextmovesoftware.com\/blog\/2015\/09\/17\/using-wikipedia-to-understand-disease-names\/\">previous post<\/a>, let&#8217;s look at diseases this time.<\/p>\n<p>First of all, I should point out that it is actually useful to us to run LeadMine on arbitary texts. It helps to find errors in the dictionaries we use, but also makes us aware that certain terms may be fine if used to mine PubMed abstracts or patents, but may produce false positives on general text.<\/p>\n<p>Here are the most common disease terms found in Shakespeare&#8217;s plays, with counts, MESH Id, then the text as it appeared in the play:<\/p>\n<pre>176 D010146 : ('pains', 91) ('pain', 66) ('painful', 8) ('sorely', 6) ('aches', 5)\r\n124 D000435 : ('drunk', 67) ('drunken', 19) ('drunkard', 13) ('drooping', 9) ('drunkards', 6) ('drunkenness', 4) ('besotted', 2) ('intemperance', 2) ('being drunk', 1) ('buzzed', 1)\r\n109 D004332 : ('drown', 74) ('drowned', 22) ('drowning', 9) ('drowns', 4)\r\n107 D010930 : ('plague', 85) ('plagues', 13) ('the plague', 9)\r\n68 D020521 : ('stroke', 41) ('strokes', 23) ('apoplexy', 4)\r\n48 D001733 : ('sting', 26) ('bites', 11) ('stings', 8) ('stinging', 3)\r\n44 D018746 : ('sirs', 44)\r\n39 D006470 : ('bleeding', 31) ('bleeds', 7) ('loss of blood', 1)\r\n34 D005076 : ('rash', 33) ('a rash', 1)\r\n33 D003221 : ('confusion', 33)\r\n32 D013217 : ('starve', 19) ('famine', 11) ('starving', 1) ('starves', 1)\r\n29 D002921 : ('scars', 15) ('scar', 10) ('cicatrice', 3) ('cicatrices', 1)\r\n28 D003141 : ('infect', 21) ('infectious', 6) ('infecting', 1)\r\n27 D018908 : ('weakness', 25) ('decrepit', 2)\r\n27 D012614 : ('scurvy', 27)\r\n27 D003288 : ('bruised', 9) ('bruise', 8) ('black and blue', 4) ('bruising', 4) ('contusions', 1) ('bruises', 1)\r\n27 D002056 : ('burns', 27)\r\n25 D034381 : ('deaf', 24) ('hard of hearing', 1)\r\n23 D005334 : ('fever', 22) ('fevers', 1)\r\n20 D004487 : ('swelling', 19) ('dropsy', 1)\r\n19 D005221 : ('wearied', 9) ('weariness', 3) ('wearies', 2) ('weariest', 1) ('wearying', 1) ('wearily', 1) ('languor', 1) ('unwearied', 1)\r\n19 D001237 : ('smother', 15) ('suffocating', 1) ('smothered', 1) ('suffocation', 1) ('smothering', 1)\r\n18 D014202 : ('trembling', 17) ('tremor', 1)\r\n18 D007239 : ('infection', 17) ('infections', 1)\r\n18 D004216 : ('distemper', 18)<\/pre>\n<p>This already has highlighted some changes that we need to make (and have already made). For example, SIRS should only be matched uppercase, &#8220;unwearied&#8221; may redirect to &#8220;wearied&#8221; on Wikipedia but it&#8217;s the opposite, &#8220;besotted&#8221; no longer means drunk (except with love) and &#8220;buzzed&#8221; is probably not a useful synonym. \ud83d\ude42<\/p>\n<p>But overall, the software seems to be in good health, although Shakespeare&#8217;s protagonists may not be. Don&#8217;t they all die at the end? [SPOILER ALERT]<\/p>\n<pre>3 D058734 : ('bleed to death', 3)\r\n3 D003645 : ('sudden death', 3)<\/pre>\n<p><b>Image credit:<\/b> <a href=\"https:\/\/www.flickr.com\/photos\/ryanrocketship\/\">Ryan Ruppe<\/a> on Flickr<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In an earlier post I looked at the chemicals found in Shakespeare&#8217;s plays. Following on from the improved text-mining of diseases described in the previous post, let&#8217;s look at diseases this time. First of all, I should point out that it is actually useful to us to run LeadMine on arbitary texts. It helps to &hellip; <a href=\"https:\/\/nextmovesoftware.com\/blog\/2015\/09\/24\/shakespeare-through-the-eyes-of-a-chemist-part-ii\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Shakespeare through the eyes of a chemist Part II<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/1605"}],"collection":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/comments?post=1605"}],"version-history":[{"count":12,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/1605\/revisions"}],"predecessor-version":[{"id":1618,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/1605\/revisions\/1618"}],"wp:attachment":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/media?parent=1605"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/categories?post=1605"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/tags?post=1605"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}