Database Support
This module contains features for accessing databases containing protein-related data.
Pfam
The following functions can be used to search and retrieve Pfam data:
fetchPfamMSA() - download MSA files
searchPfam() - search for domain families of a protein
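As a rough illustration of how the two Pfam functions can be combined, the sketch below searches for the families matching a protein and downloads the seed alignment of each. The helper names pfam_accessions and download_seed_alignments are hypothetical and not part of ProDy. The sketch assumes that searchPfam() returns a dictionary keyed by Pfam accession and that fetchPfamMSA() accepts alignment="seed" and returns the path of the downloaded file. The ProDy import is deferred because the download step needs ProDy installed and network access.

```python
def pfam_accessions(matches):
    """Return the Pfam accessions (the dict keys) of a search result, sorted."""
    return sorted(matches)


def download_seed_alignments(query):
    """Search Pfam for `query` and download the seed MSA of each matching family.

    Requires ProDy and network access. `query` is assumed to be an identifier
    that searchPfam() accepts, such as a UniProt accession.
    """
    # Deferred import: the helper above stays usable without ProDy installed.
    from prody import searchPfam, fetchPfamMSA

    matches = searchPfam(query)  # assumed: dict keyed by Pfam accession
    # Assumed: fetchPfamMSA() returns the path of the file it downloaded.
    return [fetchPfamMSA(acc, alignment="seed") for acc in pfam_accessions(matches)]
```

Calling download_seed_alignments() with an identifier of interest should then give a list of local MSA file paths, one per matching family.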
UniProt
The following functions and class can be used to search and retrieve UniProt data:
queryUniprot() - query UniProt and parse the results as a dictionary
UniprotRecord - a wrapper for UniProt data with methods that include parsing PDB structures
searchUniprot() - search UniProt and return a UniprotRecord
CATH
The following class and its functions can be used to search and retrieve CATH data:
CATHDB - parse, handle and navigate the tree-like structure of the CATH database
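To make the "tree-like structure" concrete: CATH node identifiers are dotted numbers whose parts correspond to the Class, Architecture, Topology and Homologous superfamily levels, so each identifier is a path from the root. The sketch below is a plain-Python illustration of that hierarchy only. It does not use or reproduce the CATHDB API, and the function names and the small list of identifiers are chosen for illustration.

```python
# The four levels of a CATH node identifier, in order.
CATH_LEVELS = ("class", "architecture", "topology", "homologous superfamily")


def build_cath_tree(node_ids):
    """Build a nested dict from dotted CATH identifiers such as '3.40.50.300'."""
    tree = {}
    for node_id in node_ids:
        branch = tree
        for part in node_id.split("."):
            branch = branch.setdefault(part, {})
    return tree


def children(tree, node_id):
    """Return the sorted identifiers of the direct children of `node_id`."""
    branch = tree
    for part in node_id.split("."):
        branch = branch[part]
    return sorted(f"{node_id}.{child}" for child in branch)


def level_name(node_id):
    """Name the hierarchy level of an identifier with one to four parts."""
    return CATH_LEVELS[node_id.count(".")]


# Illustrative identifiers only; a real tree comes from the CATH database files.
tree = build_cath_tree(["3.40.50.300", "3.40.50.720", "3.30.70.100", "1.10.8.10"])
print(children(tree, "3.40.50"))  # ['3.40.50.300', '3.40.50.720']
print(level_name("3.40"))         # architecture
```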
DALI
The following class and functions can be used to search and retrieve data using the DALI structure alignment server:
searchDali() - search for similar structures using DALI
DaliRecord - fetch and handle outputs from DALI searches
daliFilterMultimers() - filter DALI results to obtain multimers of a particular size
QuartataWeb
The following classes and functions can be used to search and retrieve data using the QuartataWeb server:
QuartataWebBrowser - class based on the Splinter web browser package to search QuartataWeb
QuartataChemicalRecord - class to handle the outputs of QuartataWeb searches
searchQuartataWeb() - perform QuartataWeb searches and return the output in a QuartataChemicalRecord
Gene Ontology Annotation (GOA)
The following class and functions can be used to search and retrieve data from the EBI GOA database:
queryGOA() - query GOA using a PDB ID
GOADictList - class to handle data from GOA queries
parseOBO() - parse an OBO file containing the Gene Ontology
parseGAF() - parse a Gene Association File (GAF)
showGoLineage() - visualize the GO tree
calcGoOverlap() - calculate the overlap between GO terms from their distance in the graph
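For readers unfamiliar with the GAF format mentioned above: a GAF 2.x file is tab-separated text with 17 columns per annotation, and lines starting with "!" are header comments. The sketch below reads such lines with the standard library only. It illustrates the file format and is not how parseGAF() is implemented. The names read_gaf and GafRecord are hypothetical, and the single data row is made up for the example.

```python
from collections import namedtuple

# The 17 columns of a GAF 2.x annotation line, in order.
GAF_COLUMNS = [
    "db", "db_object_id", "db_object_symbol", "qualifier", "go_id",
    "db_reference", "evidence_code", "with_from", "aspect",
    "db_object_name", "db_object_synonym", "db_object_type", "taxon",
    "date", "assigned_by", "annotation_extension", "gene_product_form_id",
]
GafRecord = namedtuple("GafRecord", GAF_COLUMNS)


def read_gaf(lines):
    """Turn GAF lines into GafRecord tuples, skipping comments and blank lines."""
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("!"):
            continue
        fields = line.split("\t")
        # Pad short rows so trailing optional columns become empty strings.
        fields += [""] * (len(GAF_COLUMNS) - len(fields))
        records.append(GafRecord(*fields[:len(GAF_COLUMNS)]))
    return records


# Made-up annotation row for illustration; identifiers are placeholders.
example = [
    "!gaf-version: 2.2\n",
    "\t".join([
        "UniProtKB", "P00000", "EXMP1", "enables", "GO:0005515",
        "PMID:0000000", "IPI", "", "F", "Example protein", "",
        "protein", "taxon:9606", "20200101", "UniProt", "", "",
    ]) + "\n",
]
records = read_gaf(example)
print(records[0].go_id, records[0].aspect)  # GO:0005515 F
```

The aspect column takes the values F, P or C for molecular function, biological process and cellular component, which is often the first thing to filter on.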
InterPro
The following function can be used to search and retrieve InterPro data:
searchInterpro() - search for domain families of a protein
BioExcel-CV19
The following function can be used to retrieve BioExcel-CV19 data:
fetchBioexcelPDB() - fetch PDB files for the starting structures of trajectories