The objective of this research is to develop bioinformatics methods to predict and identify different classes of calcium binding sites in proteins based on key factors that contribute to calcium-binding affinity and the selectivity of proteins and calcium-dependent conformational change.

In collaboration with faculty in the Georgia State University Department of Mathematics and Statistics, algorithms for predicting calcium binding sites based on structural and genomic information have been established using geometric, graphing, and statistical algorithms along with a web server containing up-to-date sequence, structure and literature information about calcium and its binding proteins in chemical, biological, and biomineralization. The Calcium Bank also provides putative calcium binding sites predicted in different genomes.


Calcium Pattern Search

CaPS(Calcium Pattern Search provides a list of Ca(II)-binding A sequence patterns that indicate the presence of a Ca(II)-binding motif. To date, we have developed sequence patterns, or signatures, for canonical EF-hand, pseudo EF-hand, and baterial EF-hand. We expect to significantly expand this list in the future to indlude non-contiguous (binding motif includes A residues distant from each other in the primary structure) patterns. To check a sequence for the presence of a Ca(II)-binding signature click on the CaPS link and enter your sequence for analysis.

MUGπ

MUGπ is a web-based algorithm for processing x-ray and NMR protein data using bio-python. MUGπ creates a protein topology network using bio-python and NetworkX. A NetworkX topology of oxygen and carbon maximum cliques is used to filter protein features for calcium-binding site identification.

This is the latest iteration of software developed in our labs to identify Ca2+-binding sites in protein structure files. The first iteration of the program, GG, was a graph-theoretic and geometric analysis program developed by Hai Deng, Guan Tao Chen, Wei Yang, and Jenny J Yang. The program was used for predicting Ca(II)-binding sites in protein based on the geometric information. GG was later extended in the MUG (Multiple Geometries) program, a prediction algorithm capable of identifying calcium-binding sites in proteins with atomic resolution. MUG worked by first identifying all possible oxygen clusters from maximal cliques, and then calculating a calcium center (CC) for each cluster corresponding to the potential Ca2+ position, located to maximally regularize the structure of the (cluster, CC) pair. The structure was then inspected by geometric filters. Any unqualified (cluster, CC) pairs were further handled by recursively removing oxygen atoms and relocating the CC until its structure was either qualified or contained fewer than four ligand atoms. Ligand coordination was then determined for qualified structures. Immediately following the release of MUG, MUGC was developed to recognize calcium-binding sites without explicit reference to side-chain oxygen ligand coordinates. By using second shell carbon atoms, main-chain oxygen atoms, center of mass of side-chain, and graph theory, it was able to predict calcium-binding sites exhibiting insignificant conformational change with high sensitivity and selectivity based on X-ray structures or NMR structures.

The new, advanced MUGπ extends the original algorithms efforts to identify oxygen clusters, combined with the identification of carbon clusters, into a single application. MUGπ was developed by M.G. Mashiku under the supervision of Dr. Suranga Pathirannehelage and Dr. Jenny J. Yang.

Related Resources

Click here to see a list of related resources, including databases and prediction algorithms.

Acknowledgments, Publications

We would like to acknowledge the efforts of the developers, graduate students and faculty who contributed to the development of these algorithms. The list of related publications is summarized below.

Publications

Yubin Zhou, Shenghui Xue, and Jenny J. Yang. Calciomics: integrative studies of Ca2+ binding proteins and their interactomes in biological systems. Metallomics, (2013) Jan;5(1):29-42. doi: 10.1039/c2mt20009k. PMID: 23235533

Kun Zhao, Xue Wang, Michael Kirberger, Hing Wong, Guantao Chen, and Jenny J. Yang, Predicting Calcium-binding Site Using Second Shell Carbon Atoms. Proteins (2012).

Xue Wang, Michael Kirberger, Guantao Chen, and Jenny J. Yang, Towards Predicting Ca(2+)–binding Sites with Different Coordination Numbers in Proteins with Atomic Resolution. Proteins (2009).

Yubin Zhou, Wei Yang, Michael Kirberger, Hsiau-Wei Lee, Gayatri Ayalasomayajula, and Jenny J. Yang. Prediction of EF-hand Calcium Binding Proteins and Analysis of Bacterial EF-hand Proteins. Proteins (2006).

Hai Deng, Guantao Chen, Wei Yang, Jenny J. Yang. Predicting calcium-binding sites in proteins - a graph theory and geometry approach. Proteins (2006).

Wei Yang, Hsiau-wei Lee, Homme Hellinga and Jenny J. Yang. Identification and Design of Metal-Binding Proteins. Proteins (2002).

Wei Yang, Hsiauwei Lee, Homme Hellinga , Michelle Pu and Jenny Jie Yang. Design Calcium Binding Sites by Computer Algorithm. Computational Studies, Nanotechnology , and Solution Thermodynamics of Polymer Systems . Kluwer Academic/Plenum Publishers (2000).

Mashiku, M., & Edirisinghe, N. (2019, July). Serverless Science Gateway Development for Ca2+ binding site prediction on Amazon Web Services: Case Study. In Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning) (p. 56). ACM.