You are reading the documentation for the development version of OpenStructure.
Connectivity

Motivation

The connectivity of atoms is notoriously difficult to come by for biological macromolecules. PDB files, the de facto standard exchange format for structural information, allow bonds to be specified in CONECT records. However, these records are not mandatory. Many programs, especially those that do not rely on the connectivity of atoms, do not write CONECT records. As a result, programs and structural biology frameworks can’t rely on connectivity information being present, and it has to be derived in the program itself.

Loader heuristics are great if you are the one who implemented them, but they are problematic if you are just a user of software that has them. As time goes on, these heuristics become buried in thousands of lines of code and are often hard, if not impossible, to trace back.

Different clients of the framework have different requirements. Visualisation software wants to read in a PDB file as is, without making any changes. A script in an automated pipeline, however, wants to either strictly reject files that are incomplete or fill in missing structural features. All these aspects are implemented in the conop module, separated from the loading of the PDB file, giving clients fine-grained control over the loading process. The conop logic can thus be reused in code requiring the presence of connectivity information.

The conop module defines a processor interface.

Processors

The exact behaviour of a processor is implementation-specific. So far, two classes implement the processor interface: a heuristic and a rule-based processor. The processors mainly differ in the source of their connectivity information. The HeuristicProcessor uses a hard-coded heuristic connectivity table for the 20 standard amino acids as well as nucleotides. For other compounds such as ligands, the HeuristicProcessor runs a distance-based connectivity algorithm that connects two atoms if they are closer than a certain threshold.
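The distance-based step described above can be illustrated with a minimal, self-contained sketch. This is not OpenStructure's implementation: the function name `connect_by_distance`, the `(name, position)` tuple layout and the 1.9 Å cutoff are all assumptions made for illustration, since the text does not state the threshold that is actually used.

```python
import math
from itertools import combinations

# Hypothetical cutoff in Angstrom. The threshold actually used by the
# HeuristicProcessor is not given in the text, so this value is an assumption.
BOND_CUTOFF = 1.9


def connect_by_distance(atoms, cutoff=BOND_CUTOFF):
    """Return index pairs (i, j) with i < j for atoms closer than cutoff.

    atoms is a list of (name, (x, y, z)) tuples.
    """
    bonds = []
    for (i, (_, pos_i)), (j, (_, pos_j)) in combinations(enumerate(atoms), 2):
        # Connect the two atoms if their Euclidean distance is below the cutoff.
        if math.dist(pos_i, pos_j) < cutoff:
            bonds.append((i, j))
    return bonds


# Toy ligand made of three heavy atoms (coordinates invented for this example).
ligand = [
    ("C1", (0.00, 0.00, 0.00)),
    ("C2", (1.52, 0.00, 0.00)),
    ("O1", (2.00, 1.35, 0.00)),
]

bonds = connect_by_distance(ligand)
print(bonds)
```

A pairwise loop like this scales quadratically with the number of atoms, which is acceptable for small ligands. A single global cutoff is also a simplification: it ignores element-specific bond lengths.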
The RuleBasedProcessor uses a connectivity library containing all molecular components present in the PDB files on PDB.org. The library can easily be extended with custom connectivity information, if required.
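To contrast the rule-based approach with distance-based guessing, the sketch below looks up bonds in a small dictionary keyed by compound name and then extends that dictionary with a custom entry. It is only a rough illustration of the idea: the dictionary layout, the function `connect_by_rule` and the compound name `LIG` are hypothetical and do not reflect the format of OpenStructure's connectivity library or the RuleBasedProcessor API.

```python
# Toy connectivity library: compound name -> list of bonded atom-name pairs.
# Hypothetical layout and heavily abridged entries, for illustration only.
LIBRARY = {
    "GLY": [("N", "CA"), ("CA", "C"), ("C", "O")],
    "HOH": [],
}


def connect_by_rule(compound, atom_names, library=LIBRARY):
    """Return the library bonds whose two atoms are both present.

    Raises KeyError if the compound is not in the library, so that the
    caller can decide how to treat unknown compounds.
    """
    present = set(atom_names)
    return [(a, b) for a, b in library[compound] if a in present and b in present]


# A complete glycine yields all bonds listed in the library.
gly_bonds = connect_by_rule("GLY", ["N", "CA", "C", "O"])

# With the carbonyl carbon missing, only the N-CA bond can be created.
partial_bonds = connect_by_rule("GLY", ["N", "CA", "O"])

# Extending a copy of the library with a custom compound ("LIG" is made up).
custom_library = dict(LIBRARY)
custom_library["LIG"] = [("C1", "O1")]
lig_bonds = connect_by_rule("LIG", ["C1", "O1"], custom_library)

print(gly_bonds, partial_bonds, lig_bonds)
```

Because the bonds come from a table rather than from coordinates, the result does not depend on distorted geometry, and atoms that are missing become easy to detect.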