NextMove Software
  • Home
  • Blog
  • News
  • Talks
  • Events
  • About Us
  • Careers
  • ELNs & Reactions
  • Patents/TextMining
  • Biologics
  • Similarity & Search
 
General Inquiries: info@nextmovesoftware.com
Support: support@nextmovesoftware.com

Patsy

Version 0.9 beta [201211]

Efficient Matching of Multiple Chemical Patterns

Although many chemical informatics toolkits provide mechanisms for matching substructure patterns, such as MDL queries, SMARTS and SMIRKS, against a target molecule, none currently treat the patterns as first-class objects, allowing them to be manipulated, compared, combined, optimized, canonicalized and analyzed.

NextMove Software's Patsy suite of tools is designed to support exactly such "chemical pattern" informatics. Amongst the features supported is the ability to compile one (or more) substructure patterns into C, C++, Java or Python code for very efficient matching at run-time. Typical use cases include pre-processing the chemical validity filters often used in preparing virtual library screening databases, and efficient implementations of feature-based fingerprinting.

  • A presentation describing Patsy presented at the 9th International Conference on Chemical Structures (ICCS) in Noordwijkerhout, The Netherlands, (ICCS) American Chemical Society (ACS) National Meeting June 2011
Arthor provides fast state-of-the-art substructure and chemical similarity search capabilities for ultra-large databases of hundreds of millions of compounds, using SMARTS optimization, Just-In-Time compilation and/or GPUs.
CaffeineFix is used to rapidly match chemical names or terms against a dictionary or grammar (e.g. a grammar for IUPAC names). As well as use in text-mining, it can be used to provide autocomplete functionality and spell-correction.
Casandra is a server for delivering real time safety warnings of experimental hazards straight to the pharmaceutical electronic laboratory notebooks (ELNs).

HazELNut is a suite of tools used to extract, normalize and analyse information in Electronic Lab Notebooks (ELNs). This can be used to implement a search interface, find/eliminate duplicates, find similar reactions and so on.
LeadMine extracts chemical names and terms from text. It incorporates NextMove's CaffeineFix technology to find terms that match appropriate dictionaries or grammars. It has enhanced functionality to handle the patent literature.
Matsy is a set of tools for creating and analysing Matched Molecular Series (the general form of Matched Molecular Pairs). In particular, it can be used to suggest what compound to make next in a Medicinal Chemistry program.
MPSearch rapidly searches a database to find Matched Pairs related to a query molecule. This type of search is used to explore previous medicinal chemistry strategies.
NameRXN is used to classify and name reactions. It is particular useful in the context of ELN analysis but also as a plugin to chemical drawing software. NameRXN builds on NextMove Software's Patsy technology.
Patsy is used to speed up SMARTS pattern matching by creating optimized SMARTS patterns or source code. Speed gains are particularly large when multiple SMARTS patterns are matched against a single structure.
Pistachio is a reaction dataset browser providing loading, querying, and analytics of chemical reactions. With over 9 million chemical reactions extracted from US & EPO patents, it demonstrates an AI interface to faceted (structure) search
SmallWorld is an index of chemical space based on more than 230 billion molecular substructures. It can be used to measure similarity based on graph-edit distance, find the MCS of two or more molecules, analyse HTS results and much more.
Sugar & Splice can be used to perceive and depict biopolymer structure. It makes it easy to interconvert between small-molecule representations (e.g. SMILES, MOL) and biopolymer representations (HELM, IUPAC line notation).
©2023 NextMove Software. All rights reserved.