{"id":428,"date":"2013-04-18T13:50:13","date_gmt":"2013-04-18T12:50:13","guid":{"rendered":"http:\/\/nextmovesoftware.com\/blog\/?p=428"},"modified":"2015-06-22T17:01:01","modified_gmt":"2015-06-22T16:01:01","slug":"handling-biologics-a-file-format-problem","status":"publish","type":"post","link":"https:\/\/nextmovesoftware.com\/blog\/2013\/04\/18\/handling-biologics-a-file-format-problem\/","title":{"rendered":"Handling biologics: A file format problem?"},"content":{"rendered":"<p>The increasing importance of biological therapeutics, or biologics, to the pharmaceutical industry is well known. For example, <a href=\"http:\/\/www.drugs.com\/stats\/top100\/2012\/q4\/sales\">data from Drugs.com<\/a> show that of the top 15 best-selling therapies in the US in Q4 2012, six were biologics. Monoclonal <a href=\"http:\/\/dx.doi.org\/10.2210\/rcsb_pdb\/mom_2001_9\">antibodies<\/a> are a typical example; these are glycoproteins, comprising short oligosaccharides attached to a multi-chain polypeptide.<\/p>\n<p>It is clear that handling such molecules requires a different approach than that taken for small molecules. For example, here is an all-atom depiction of the peptide <a href=\"http:\/\/en.wikipedia.org\/wiki\/Crambin\">crambin<\/a>:<br \/>\n<a href=\"http:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambin.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter  wp-image-429\" alt=\"Crambin\" src=\"\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambin.gif\" width=\"420\" height=\"420\" \/><\/a><br \/>\nNo &#8211; it&#8217;s not a cyclic peptide. It just happens to have three disulfide bridges. 
A more useful depiction can be generated if we follow the IUPAC or FDA guidelines for peptide depiction; here the primary structure is much clearer as is the presence of the disulfide bonds:<\/p>\n<figure id=\"attachment_431\" aria-describedby=\"caption-attachment-431\" style=\"width: 398px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinFDA.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-431\" alt=\"FDA\" src=\"\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinFDA.png\" width=\"398\" height=\"166\" srcset=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinFDA.png 398w, https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinFDA-300x125.png 300w\" sizes=\"(max-width: 398px) 100vw, 398px\" \/><\/a><figcaption id=\"caption-attachment-431\" class=\"wp-caption-text\">FDA Style<\/figcaption><\/figure>\n<figure id=\"attachment_432\" aria-describedby=\"caption-attachment-432\" style=\"width: 400px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinIUPACb.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-432\" alt=\"IUPAC\" src=\"\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2013\/04\/crambinIUPACb.png\" width=\"400\" height=\"400\" \/><\/a><figcaption id=\"caption-attachment-432\" class=\"wp-caption-text\">IUPAC Style<\/figcaption><\/figure>\n<p>However, to create these sorts of depictions, and otherwise handle biopolymers more appropriately, we need to know the polymer structure.<\/p>\n<p>Some consider this a file format problem. 
File formats that have been developed to store or represent biopolymer structures include the <a href=\"http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci00019a017\">CHUCKLES<\/a> and <a href=\"http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci00028a012\">CHORTLES<\/a> languages from Chiron and Daylight, <a href=\"http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci3001925\">HELM<\/a> (Hierarchical Editing Language for Macromolecules) from Pfizer, <a href=\"http:\/\/www.biochemfusion.com\/doc\/\">Protein Line Notation<\/a> from Biochemfusion and <a href=\"http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci2001988\">SCSR<\/a> (Self-Contained Sequence Representation, an MDL V3000 extension) from Accelrys. Naturally, Wiswesser Line Notation has also <a href=\"http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/c160034a004\">been extended<\/a> to handle this problem.<\/p>\n<p>In particular, the HELM format has recently received support from the <a href=\"http:\/\/www.pistoiaalliance.org\/\">Pistoia Alliance<\/a>. See for example this <a href=\"http:\/\/www.pistoiaalliance.org\/blog\/2013\/03\/helm-%E2%80%93-better-encoding-for-biologics\/\">post<\/a> on the Pistoia blog which describes how HELM &#8220;gives us a single consistent way to describe macromolecules which can be used across industry and academia&#8221; so that &#8220;researchers do not have to spend time creating their own notations&#8221;.<\/p>\n<p>But is a new file format the best way to achieve this goal? 
(I can&#8217;t resist inserting the xkcd comic on standards at this point \ud83d\ude42 )<\/p>\n<figure style=\"width: 500px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" alt=\"\" src=\"\/\/imgs.xkcd.com\/comics\/standards.png \" width=\"500\" height=\"283\" \/><figcaption class=\"wp-caption-text\">From xkcd, the web comic: http:\/\/xkcd.com\/927\/<\/figcaption><\/figure>\n<p>While NextMove&#8217;s software for handling biopolymers, <a href=\"http:\/\/nextmovesoftware.co.uk\/products\/SugarNSplice.html\">Sugar &amp; Splice<\/a>, will handle popular file formats such as HELM, I will describe a different view of the problem in the <a href=\"http:\/\/nextmovesoftware.com\/blog\/2013\/05\/10\/handling-biologics-a-perception-problem\/\">follow-up blog post<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The increasing importance of biological therapeutics, or biologics, to the pharmaceutical industry is well known. For example, data from Drugs.com show that of the top 15 best-selling therapies in the US in Q4 2012, six were biologics. Monoclonal antibodies are a typical example; these are glycoproteins, comprising short oligosaccharides attached to a multi-chain polypeptide. 
&hellip; <a href=\"https:\/\/nextmovesoftware.com\/blog\/2013\/04\/18\/handling-biologics-a-file-format-problem\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Handling biologics: A file format problem?<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/428"}],"collection":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/comments?post=428"}],"version-history":[{"count":34,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/428\/revisions"}],"predecessor-version":[{"id":1460,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/428\/revisions\/1460"}],"wp:attachment":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/media?parent=428"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/categories?post=428"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/tags?post=428"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}