{"id":2609,"date":"2017-09-04T15:43:26","date_gmt":"2017-09-04T14:43:26","guid":{"rendered":"https:\/\/nextmovesoftware.com\/blog\/?p=2609"},"modified":"2017-09-04T15:43:26","modified_gmt":"2017-09-04T14:43:26","slug":"pubchem-as-a-biologics-database-the-acs-presentation","status":"publish","type":"post","link":"https:\/\/nextmovesoftware.com\/blog\/2017\/09\/04\/pubchem-as-a-biologics-database-the-acs-presentation\/","title":{"rendered":"PubChem as a biologics database: The ACS presentation"},"content":{"rendered":"<p>Keen readers of the blog will have noticed a recent <a href=\"https:\/\/nextmovesoftware.com\/blog\/2017\/05\/03\/pubchem-as-a-sequence-database-disulfide-bridging-patterns\/\">series<\/a> of <a href=\"https:\/\/nextmovesoftware.com\/blog\/2017\/05\/10\/pubchem-as-a-sequence-database-identity-search\/\">blog<\/a> <a href=\"https:\/\/nextmovesoftware.com\/blog\/2017\/05\/15\/pubchem-as-a-sequence-database-sequence-variation\/\">posts<\/a> on the topic of PubChem and sequence databases. This was in preparation for my recent ACS presentation on &#8220;PubChem as a biologics database&#8221; (see below), the goal of which was (a) to convince the audience that PubChem is actually a biologics database (with some small molecules thrown in for disguise), and (b) to show that applying structure perception of biologics (e.g. via <a href=\"https:\/\/nextmovesoftware.com\/sugarnsplice\">Sugar&amp;Splice<\/a>) on top of an existing chemical database yields useful insights.<\/p>\n<figure id=\"attachment_2615\" aria-describedby=\"caption-attachment-2615\" style=\"width: 517px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2017\/09\/PubChemAAMonomers.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-2615 size-full\" src=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2017\/09\/PubChemAAMonomers.png\" alt=\"\" width=\"517\" height=\"383\" srcset=\"https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2017\/09\/PubChemAAMonomers.png 517w, https:\/\/nextmovesoftware.com\/blog\/wp-content\/uploads\/2017\/09\/PubChemAAMonomers-300x222.png 300w\" sizes=\"(max-width: 517px) 100vw, 517px\" \/><\/a><figcaption id=\"caption-attachment-2615\" class=\"wp-caption-text\">Figure 1 &#8211; Counts of peptide monomers in PubChem<\/figcaption><\/figure>\n<p>One perennial question in the field of biologics, at least from the point of view of biologic registration systems, is how many monomers are there and how best to handle them? This depends on how you slice-and-dice them of course, and how many of the long tail of possible monomers you actually support. I show that one view considers PubChem biologics to contain ~27K monosaccharide monomers, and ~8K amino acid monomers (Figure 1).<\/p>\n<p>One of the surprising results was that there are more peptides in PubChem than in the PDB (~500K vs ~110K). This is not quite the case for oligosaccharides, but the number is not too far off the total in <a href=\"https:\/\/glytoucan.org\/\">GlyTouCan<\/a> (~67K vs ~80K). The type of peptide\/sugar can be quite different though, as the entries in PubChem are those that chemists have gotten their hands on, and it&#8217;s full of reaction intermediates with a whole host of protecting groups not usually observed <em>in vivo<\/em>. Similarly, as I discussed in an earlier <a href=\"https:\/\/nextmovesoftware.com\/blog\/2017\/05\/15\/pubchem-as-a-sequence-database-sequence-variation\/\">blog post<\/a>, when one looks at sequence variation in PubChem, you&#8217;re not getting a view of the million-year process of evolution but rather the sites of variation that a chemist determined might best modulate activity.<\/p>\n<p><iframe loading=\"lazy\" style=\"border: 1px solid #CCC; border-width: 1px; margin-bottom: 5px; max-width: 100%;\" src=\"\/\/www.slideshare.net\/slideshow\/embed_code\/key\/K41vOAdulz2J5T\" width=\"595\" height=\"485\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" allowfullscreen=\"allowfullscreen\"> <\/iframe><\/p>\n<div style=\"margin-bottom: 5px;\"><strong> <a title=\"PubChem as a Biologics Database\" href=\"\/\/www.slideshare.net\/NextMoveSoftware\/pubchem-as-a-biologics-database\" target=\"_blank\" rel=\"noopener\">PubChem as a Biologics Database<\/a> <\/strong> from <strong><a href=\"https:\/\/www.slideshare.net\/NextMoveSoftware\" target=\"_blank\" rel=\"noopener\">NextMove Software<\/a><\/strong><\/div>\n<p>Thank you for clicking through my slides, and I&#8217;d be happy to take any questions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Keen readers of the blog will have noticed a recent series of blog posts on the topic of PubChem and sequence databases. This was in preparation for my recent ACS presentation on &#8220;PubChem as a biologics database&#8221; (see below), the goal of which was (a) to convince the audience that PubChem is actually a biologics &hellip; <a href=\"https:\/\/nextmovesoftware.com\/blog\/2017\/09\/04\/pubchem-as-a-biologics-database-the-acs-presentation\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">PubChem as a biologics database: The ACS presentation<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/2609"}],"collection":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/comments?post=2609"}],"version-history":[{"count":11,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/2609\/revisions"}],"predecessor-version":[{"id":2621,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/posts\/2609\/revisions\/2621"}],"wp:attachment":[{"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/media?parent=2609"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/categories?post=2609"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nextmovesoftware.com\/blog\/wp-json\/wp\/v2\/tags?post=2609"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}