Classification and Analysis of Regulatory Pathways Using Graph Property, Biochemical and Physicochemical Property, and Functional Property

PLoS ONE - Volume 6, Issue 9 - Page e25297
Tao Huang [1,2,3], Lei Chen [4], Yu‐Dong Cai [5,1], Kuo‐Chen Chou [5]
[1] Institute of Systems Biology, Shanghai University, Shanghai, People’s Republic of China
[2] Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China
[3] Shanghai Center for Bioinformation Technology, Shanghai, People’s Republic of China
[4] College of Information Engineering, Shanghai Maritime University, Shanghai, People’s Republic of China
[5] Gordon Life Science Institute, San Diego, California, United States of America

References

M Kanehisa, 1997, A database for post-genome analysis., Trends in genetics: TIG, 13, 375, 10.1016/S0168-9525(97)01223-7

M Kanehisa, 2000, KEGG: Kyoto encyclopedia of genes and genomes., Nucleic acids research, 28, 27, 10.1093/nar/28.1.27

H Ogata, 1999, KEGG: Kyoto encyclopedia of genes and genomes., Nucleic acids research, 27, 29, 10.1093/nar/27.1.29

M Kanehisa, 2004, The KEGG resource for deciphering the genome., Nucleic acids research, 32, D277, 10.1093/nar/gkh063

A Bairoch, 1994, The ENZYME data bank., Nucleic acids research, 22, 3626, 10.1093/nar/22.17.3626

I Schomburg, 2002, BRENDA: a resource for enzyme data and metabolic information., Trends in biochemical sciences, 27, 54, 10.1016/S0968-0004(01)02027-8

I Schomburg, 2002, BRENDA, enzyme data and metabolic information., Nucleic acids research, 30, 47, 10.1093/nar/30.1.47

C Krieger, 2004, MetaCyc: a multiorganism database of metabolic pathways and enzymes., Nucleic acids research, 32, D438, 10.1093/nar/gkh100

M Kanehisa, 2008, KEGG for linking genomes to life and the environment., Nucleic Acids Res, 36, D480, 10.1093/nar/gkm882

C Klukas, 2007, Dynamic exploration and editing of KEGG pathway diagrams., Bioinformatics, 23, 344, 10.1093/bioinformatics/btl611

R Caspi, 2006, MetaCyc: a multiorganism database of metabolic pathways and enzymes., Nucleic Acids Res, 34, D511, 10.1093/nar/gkj128

R Caspi, 2008, The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases., Nucleic Acids Res, 36, D623, 10.1093/nar/gkm900

P Pharkya, 2003, Review of the BRENDA Database., Metab Eng, 5, 71, 10.1016/S1096-7176(03)00008-9

JM Dale, 2010, Machine learning methods for metabolic pathway prediction., BMC Bioinformatics, 11, 15, 10.1186/1471-2105-11-15

L Chen, 2010, Analysis of protein pathway networks using hybrid properties., Molecules, 15, 8177, 10.3390/molecules15118177

H Peng, 2005, Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy., IEEE Transactions on pattern analysis and machine intelligence, 1226, 10.1109/TPAMI.2005.159

S Salzberg, 1992, Predicting protein secondary structure with a nearest-neighbor algorithm., Journal of molecular biology, 227, 371, 10.1016/0022-2836(92)90892-N

T Denoeux, 1995, A k-nearest neighbor classification rule based on Dempster-Shafer theory., IEEE Transactions on Systems Man and Cybernetics, 25, 804, 10.1109/21.376493

J Platt, 1998, Fast training of support vector machines using sequential minimal optimization

SS Keerthi, 2001, Improvements to Platt's SMO algorithm for SVM classifier design., Neural Computation, 13, 637, 10.1162/089976601300014493

RR Bouckaert, 2004, Bayesian network classifiers in Weka.

KC Chou, 1995, Critical Reviews in Biochemistry and Molecular Biology, 30, 275

KC Chou, 2011, Some remarks on protein attribute prediction and pseudo amino acid composition (50th Anniversary Year Review)., Journal of Theoretical Biology, 273, 236, 10.1016/j.jtbi.2010.12.024

KC Chou, 2001, Prediction of protein cellular attributes using pseudo amino acid composition., PROTEINS: Structure, Function, and Genetics (Erratum: ibid, 2001, Vol. 44, 60), 43, 246

H Mohabatkar, 2010, Prediction of cyclin proteins using Chou's pseudo amino acid composition., Protein & Peptide Letters, 17, 1207, 10.2174/092986610792231564

M Esmaeili, 2010, Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses., Journal of Theoretical Biology, 263, 203, 10.1016/j.jtbi.2009.11.016

YH Zeng, 2009, Using the augmented Chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach., Journal of Theoretical Biology, 259, 366, 10.1016/j.jtbi.2009.03.028

C Chen, 2009, Prediction of protein secondary structure content by using the concept of Chou's pseudo amino acid composition and support vector machine., Protein & Peptide Letters, 16, 27, 10.2174/092986609787049420

H Ding, 2009, Prediction of cell wall lytic enzymes using Chou's amphiphilic pseudo amino acid composition., Protein & Peptide Letters, 16, 351, 10.2174/092986609787848045

DN Georgiou, 2009, Use of fuzzy clustering technique and matrices to classify amino acids and its impact to Chou's pseudo amino acid composition., Journal of Theoretical Biology, 257, 17, 10.1016/j.jtbi.2008.11.003

H Mohabatkar, 2011, Prediction of GABA(A) receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine., Journal of Theoretical Biology, 281, 18, 10.1016/j.jtbi.2011.04.017

L Yu, 2010, SecretP: Identifying bacterial secreted proteins by fusing new features into Chou's pseudo-amino acid composition., Journal of Theoretical Biology, 267, 1, 10.1016/j.jtbi.2010.08.001

Q Gu, 2010, Prediction of G-Protein-Coupled Receptor Classes in Low Homology Using Chou's Pseudo Amino Acid Composition with Approximate Entropy and Hydrophobicity Patterns., Protein & Peptide Letters, 17, 559, 10.2174/092986610791112693

JD Qiu, 2010, Using the concept of Chou's pseudo amino acid composition to predict enzyme family classes: an approach with support vector machine based on discrete wavelet transform., Protein & Peptide Letters, 17, 715, 10.2174/092986610791190372

KC Chou, 2009, Pseudo amino acid composition and its applications in bioinformatics, proteomics and system biology., Current Proteomics, 6, 262, 10.2174/157016409789973707

K Chou, 1980, A new schematic method in enzyme kinetics., European Journal of Biochemistry, 113, 195, 10.1111/j.1432-1033.1980.tb06155.x

GP Zhou, 1984, An extension of Chou's graphical rules for deriving enzyme kinetic equations to system involving parallel reaction pathways., Biochemical Journal, 222, 169, 10.1042/bj2220169

KC Chou, 1989, Graphic rules in steady and non-steady enzyme kinetics., Journal of Biological Chemistry, 264, 12074, 10.1016/S0021-9258(18)80175-2

K Chou, 1990, Review: Applications of graph theory to enzyme kinetics and protein folding kinetics: Steady and non-steady-state systems., Biophysical chemistry, 35, 1, 10.1016/0301-4622(90)80056-D

J Andraos, 2008, Kinetic plasticity and the determination of product ratios for kinetic schemes leading to multiple products without rate laws: new methods based on directed graphs., Canadian Journal of Chemistry, 86, 342, 10.1139/v08-020

K Chou, 2010, Graphic rule for drug metabolism systems., Current Drug Metabolism, 11, 369, 10.2174/138920010791514261

I Althaus, 1993, Steady-state kinetic studies with the non-nucleoside HIV-1 reverse transcriptase inhibitor U-87201E., Journal of Biological Chemistry, 268, 6119, 10.1016/S0021-9258(18)53227-0

I Althaus, 1993, The quinoline U-78036 is a potent inhibitor of HIV-1 reverse transcriptase., Journal of Biological Chemistry, 268, 14875, 10.1016/S0021-9258(18)82414-0

I Althaus, 1993, Kinetic studies with the non-nucleoside HIV-1 reverse transcriptase inhibitor U-88204E., Biochemistry, 32, 6548, 10.1021/bi00077a008

C Chen, 2009, Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine., Protein & Peptide Letters, 16, 27, 10.2174/092986609787049420

KC Chou, 1997, Disposition of amphiphilic helices in heteropolar environments., PROTEINS: Structure, Function, and Genetics, 28, 99, 10.1002/(SICI)1097-0134(199705)28:1<99::AID-PROT10>3.0.CO;2-C

GP Zhou, 2011, The disposition of the LZCC protein residues in wenxiang diagram provides new insights into the protein-protein interaction mechanism., Journal of Theoretical Biology, 284, 142, 10.1016/j.jtbi.2011.06.006

ZC Wu, 2010, 2D-MH: A web-server for generating graphic representation of protein sequences based on the physicochemical properties of their constituent amino acids., J Theor Biol, 267, 29, 10.1016/j.jtbi.2010.08.007

D Chakrabarti, 2005, Tools for large graph mining

A Barabasi, 2004, Network biology: understanding the cell's functional organization., Nature Reviews Genetics, 5, 101, 10.1038/nrg1272

U Stelzl, 2005, A human protein-protein interaction network: a resource for annotating the proteome., Cell, 122, 957, 10.1016/j.cell.2005.08.029

L Chen, 2009, Multiple Classifier Integration for the Prediction of Protein Structural Classes., Journal of Computational Chemistry, 30, 2248, 10.1002/jcc.21230

Y Qi, 2008, Protein complex identification by supervised graph local clustering., Bioinformatics, 24, i250, 10.1093/bioinformatics/btn164

E Camon, 2003, The gene ontology annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro., Genome Research, 13, 662, 10.1101/gr.461403

K Chou, 2007, Recent progress in protein subcellular location prediction., Analytical Biochemistry, 370, 1, 10.1016/j.ab.2007.07.006

KC Chou, 2008, Cell-PLoc: A package of Web servers for predicting subcellular localization of proteins in various organisms (updated version: Cell-PLoc 2.0: An improved package of web-servers for predicting subcellular localization of proteins in various organisms, Natural Science, 2010, 2, 1090–1103)., Nature Protocols, 3, 153

KC Chou, 2011, iLoc-Euk: A Multi-Label Classifier for Predicting the Subcellular Localization of Singleplex and Multiplex Eukaryotic Proteins., PLoS One, 6, e18258, 10.1371/journal.pone.0018258

K Chou, 2006, Predicting Protein-Protein interactions from sequences in a hybridization space., J Proteome Res, 5, 316, 10.1021/pr050331g

L Chen, 2009, Identifying Protein Complexes Using Hybrid Properties., Journal of Proteome Research, 8, 5212, 10.1021/pr900554a

L Chen, 2010, Predicting the network of substrate-enzyme-product triads by combining compound similarity and functional domain composition., BMC bioinformatics, 11, 293, 10.1186/1471-2105-11-293

T Huang, 2010, Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks., PLoS ONE, 5, e10972, 10.1371/journal.pone.0010972

T Huang, 2011, Analysis and prediction of translation rate based on sequence and functional features of the mRNA., PLoS ONE, 6, e16036, 10.1371/journal.pone.0016036

I Dubchak, 1995, Prediction of protein folding class using global description of amino acid sequence., Proceedings of the National Academy of Sciences of the United States of America, 92, 8700, 10.1073/pnas.92.19.8700

I Dubchak, 1999, Recognition of a protein fold in the context of the SCOP classification., Proteins: Structure, Function, and Bioinformatics, 35, 401, 10.1002/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.0.CO;2-K

D Frishman, 1997, Seventy-five percent accuracy in protein secondary structure prediction., Proteins: Structure, Function, and Bioinformatics, 27, 329, 10.1002/(SICI)1097-0134(199703)27:3<329::AID-PROT1>3.0.CO;2-8

J Cheng, 2005, SCRATCH: a protein structure and structural feature prediction server., Nucleic acids research, 33, W72, 10.1093/nar/gki396

G Pollastri, 2002, Prediction of coordination number and relative solvent accessibility in proteins., Proteins: Structure, Function, and Bioinformatics, 47, 142, 10.1002/prot.10069

KC Chou, 1995, A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space., Proteins: Structure, Function & Genetics, 21, 319, 10.1002/prot.340210406

P Carmona-Saez, 2007, GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists., Genome Biol, 8, R3, 10.1186/gb-2007-8-1-r3

T Huang, 2010, Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties., PLoS ONE, 5, e11900, 10.1371/journal.pone.0011900

T Huang, 2011, Computational Analysis of HIV-1 Resistance Based on Gene Expression Profiles and the Virus-Host Interaction Network., PLoS ONE, 6, e17291, 10.1371/journal.pone.0017291

Z He, 2010, Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features., PLoS ONE, 5, e9603, 10.1371/journal.pone.0009603

Y Cai, 2008, Predicting n-terminal acetylation based on feature selection method., Biochemical and biophysical research communications, 372, 862, 10.1016/j.bbrc.2008.05.143

Y Cai, 2010, Predicting subcellular location of proteins using integrated-algorithm method., Molecular Diversity, 14, 551, 10.1007/s11030-009-9182-4

L Lu, 2009, GalNAc-transferase specificity prediction based on feature selection method., Peptides, 30, 359, 10.1016/j.peptides.2008.09.020

L Lu, 2010, Protein sumoylation sites prediction based on two-stage feature selection., Molecular Diversity, 14, 81, 10.1007/s11030-009-9149-5

T Huang, 2009, Prediction of pharmacological and xenobiotic responses to drugs based on time course gene expression profiles., PLoS ONE, 4, e8126, 10.1371/journal.pone.0008126

IH Witten, 2005, Data Mining: Practical machine learning tools and techniques., Morgan Kaufmann Pub

L Chen, 2010, Prediction of Interactiveness Between Small Molecules and Enzymes by Combining Gene Ontology and Compound Similarity., Journal of Computational Chemistry, 31, 1766, 10.1002/jcc.21467

Y Cai, 2003, Nearest neighbour algorithm for predicting protein subcellular location by combining functional domain composition and pseudo-amino acid composition., Biochemical and biophysical research communications, 305, 407, 10.1016/S0006-291X(03)00775-7

GF Cooper, 1992, A Bayesian method for the induction of probabilistic networks from data., Machine learning, 9, 309, 10.1007/BF00994110

W Buntine, 1996, A guide to the literature on learning probabilistic networks from data., IEEE Transactions on Knowledge and Data Engineering, 8, 195, 10.1109/69.494161

J Cheng, 1999, Comparing Bayesian network classifiers., 101

N Friedman, 1997, Bayesian network classifiers., Machine learning, 29, 131, 10.1023/A:1007465528199

KC Chou, 1995, Review: Prediction of protein structural classes., Critical Reviews in Biochemistry and Molecular Biology, 30, 275, 10.3109/10409239509083488

H Lin, 2008, The modified Mahalanobis discriminant for predicting outer membrane proteins by using Chou's pseudo amino acid composition., Journal of Theoretical Biology, 252, 350, 10.1016/j.jtbi.2008.02.004

X Xiao, 2011, A multi-label classifier for predicting the subcellular localization of gram-negative bacterial proteins with both single and multiple sites., PLoS One, 6, e20592, 10.1371/journal.pone.0020592

GY Zhang, 2008, Predicting the cofactors of oxidoreductases based on amino acid composition distribution and Chou's amphiphilic pseudo amino acid composition., Journal of Theoretical Biology, 253, 310, 10.1016/j.jtbi.2008.03.015

XB Zhou, 2007, Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes., Journal of Theoretical Biology, 248, 546, 10.1016/j.jtbi.2007.06.001

CF Gao, 2011, A Novel Fuzzy Fisher Classifier for Signal Peptide Prediction., Protein & Peptide Letters, 18, 831, 10.2174/092986611795713916

F Chiti, 2006, Protein misfolding, functional amyloid, and human disease., Annu Rev Biochem, 75, 333, 10.1146/annurev.biochem.75.101304.123901

YS Lobanova, 2007, Mechanism of estrogen-induced apoptosis in breast cancer cells: role of the NF-kappaB signaling pathway., Biochemistry (Mosc), 72, 320, 10.1134/S0006297907030108

M Chang, 2011, Dual roles of estrogen metabolism in mammary carcinogenesis., BMB Rep, 44, 423, 10.5483/BMBRep.2011.44.7.423

N Chazal, 2003, Virus entry, assembly, budding, and membrane rafts., Microbiol Mol Biol Rev, 67, 226, 10.1128/MMBR.67.2.226-237.2003