Practical application of a Bayesian network approach to poultry epigenetics and stress

BMC Bioinformatics - Tập 23 - Trang 1-16 - 2022
Emiliano A. Videla Rodriguez1, Fábio Pértille2,3,4, Carlos Guerrero-Bosagna2,4, John B. O. Mitchell5, Per Jensen4, V. Anne Smith1
1School of Biology, University of St Andrews, St Andrews, Fife, UK
2Environmental Toxicology Program, Institute of Organismal Biology, Uppsala University, Uppsala, Sweden
3Department of Biomedical & Clinical Sciences (BKV), Linköping University, Linköping, Sweden
4AVIAN Behavioural Genomics and Physiology Group, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden
5EaStCHEM, School of Chemistry, University of St. Andrews, St. Andrews, Fife, UK

Tóm tắt

Relationships among genetic or epigenetic features can be explored by learning probabilistic networks and unravelling the dependencies among a set of given genetic/epigenetic features. Bayesian networks (BNs) consist of nodes that represent the variables and arcs that represent the probabilistic relationships between the variables. However, practical guidance on how to make choices among the wide array of possibilities in Bayesian network analysis is limited. Our study aimed to apply a BN approach, while clearly laying out our analysis choices as an example for future researchers, in order to provide further insights into the relationships among epigenetic features and a stressful condition in chickens (Gallus gallus). Chickens raised under control conditions (n = 22) and chickens exposed to a social isolation protocol (n = 24) were used to identify differentially methylated regions (DMRs). A total of 60 DMRs were selected by a threshold, after bioinformatic pre-processing and analysis. The treatment was included as a binary variable (control = 0; stress = 1). Thereafter, a BN approach was applied: initially, a pre-filtering test was used for identifying pairs of features that must not be included in the process of learning the structure of the network; then, the average probability values for each arc of being part of the network were calculated; and finally, the arcs that were part of the consensus network were selected. The structure of the BN consisted of 47 out of 61 features (60 DMRs and the stressful condition), displaying 43 functional relationships. The stress condition was connected to two DMRs, one of them playing a role in tight and adhesive intracellular junctions in organs such as ovary, intestine, and brain. We clearly explain our steps in making each analysis choice, from discrete BN models to final generation of a consensus network from multiple model averaging searches. The epigenetic BN unravelled functional relationships among the DMRs, as well as epigenetic features in close association with the stressful condition the chickens were exposed to. The DMRs interacting with the stress condition could be further explored in future studies as possible biomarkers of stress in poultry species.

Tài liệu tham khảo

Tarca AL, Carey VJ, Chen XW, Romero R, Drǎghici S. Machine learning and its applications to biology. PLoS Comput Biol. 2007;3:e116. Guzmán DA, Flesia AG, Aon MA, Pellegrini S, Marin RH, Kembro JM. The fractal organization of ultradian rhythms in avian behavior. Sci Rep. 2017;7:684. Milns I, Beale CM, Smith VA. Revealing ecological networks using Bayesian network inference algorithms. Ecology. 2010;91:1892–9. Comin A, Jeremiasson A, Kratzer G, Keeling L. Revealing the structure of the associations between housing system, facilities, management and welfare of commercial laying hens using additive Bayesian networks. Prev Vet Med. 2019;164:23–32. Mitchell EG, Wallace MI, Smith VA, Wiesenthal AA, Brierley AS. Bayesian network analysis reveals resilience of the jellyfish Aurelia aurita to an Irish sea regime shift. Sci Rep. 2021;11:3707. Greene CS, Tan J, Ung M, Moore JH, Cheng C. Big data bioinformatics. J Cell Physiol. 2014;229:1896–900. Friedman N. Inferring cellular networks using probabilistic graphical models. Science. 2004;303:799–805. Li H, Wu G, Zhang J, Yang N. Identification of the heart-type fatty acid-binding protein as a major gene for chicken fatty acid metabolism by bayesian network analysis. Poult Sci. 2010;89:1825–33. Agrahari R, Foroushani A, Docking TR, Chang L, Duns G, Hudoba M, et al. Applications of Bayesian network models in predicting types of hematological malignancies. Sci Rep. 2018;8:6951. Needham CJ, Bradford JR, Bulpitt AJ, Westhead DR. A primer on learning in Bayesian networks for computational biology. PLoS Comput Biol. 2007;3:e129. Jansen R, Yu H, Greenbaum D, Kluger Y, Krogan NJ, Chung S, et al. A Bayesian networks approach for predicting protein-protein interactions from genomic data. Science. 2003;302:449–53. Nagarajan R, Scutari M, Lèbre S. Bayesian networks in R: with applications in systems biology. New York: Springer; 2013. Heckerman D, Geiger D, Chickering DM. Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn. 1995;20:197–243. Pearl J. Probabilistic reasoning in intelligent systems: networks of plausible inference. San Mateo: Morgan Kaufmann; 1988. Aliferis CF, Statnikov A, Tsamardinos I, Mani S, Koutsoukos XD. Local causal and Markov blanket induction for causal discovery and feature selection for classification part I: algorithms and empirical evaluation. J Mach Learn Res. 2010;11:171–234. Felipe VPS, Silva MA, Valente BD, Rosa GJM. Using multiple regression, Bayesian networks and artificial neural networks for prediction of total egg production in European quails based on earlier expressed phenotypes. Poult Sci. 2014;94:772–80. Fallahsharoudi A, de Kock N, Johnsson M, Bektic L, Ubhayasekera SJKA, Bergquist J, et al. Genetic and targeted eQTL mapping reveals strong candidate genes modulating the stress response during chicken domestication. G3. 2017;7:497–504. Ducatelle R, Goossens E, De Meyer F, Eeckhaut V, Antonissen G, Haesebrouck F, et al. Biomarkers for monitoring intestinal health in poultry: present status and future perspectives. Vet Res. 2018;49:43. Affara M, Sanders D, Araki H, Tamada Y, Dunmore BJ, Humphreys S, et al. Vasohibin-1 is identified as a master-regulator of endothelial cell apoptosis using gene network analysis. BMC Genom. 2013;14:23. Guo L, Zhao G, Xu JR, Kistler HC, Gao L, Ma LJ. Compartmentalized gene regulatory network of the pathogenic fungus Fusarium graminearum. New Phytol. 2016;211:527–41. Ricard N, Scott RP, Booth CJ, Velazquez H, Cilfone NA, Baylon JL, et al. Endothelial ERK1/2 signaling maintains integrity of the quiescent endothelium. J Exp Med. 2019;216:1874–90. Shinde S, Zerbs S, Collart FR, Cumming JR, Noirot P, Larsen PE. Pseudomonas fluorescens increases mycorrhization and modulates expression of antifungal defense response genes in roots of aspen seedlings. BMC Plant Biol. 2019;19:4. Vasilescu C, Dragomir M, Tanase M, Giza D, Purnichescu-Purtan R, Chen M, et al. Circulating miRNAs in sepsis - a network under attack: An in-silico prediction of the potential existence of miRNA sponges in sepsis. PLoS One. 2017;12:e0183334. Ciaccio MF, Wagner JP, Chuu CP, Lauffenburger DA, Jones RB. Systems analysis of EGF receptor signaling dynamics with microwestern arrays. Nat Methods. 2010;7:148–55. Kumuthini J, Bessant C, Wilson P, Crowther D. Gene regulatory network of human adipocyte differentiation. BMC Syst Biol. 2007;1:P57. Matthäus F, Smith VA, Gebicke-Haerter PJ. Some useful mathematical tools to transform microarray data into interactive molecular networks. In: Tretter F, Gebicke-Haerter PJ, Mendoza ER, Winterer G, editors. Systems biology in psychiatric research: from high-throughput data to mathematical modeling. Hoboken: Wiley; 2010. p. 277–300. Aghdam R, Ganjali M, Zhang X, Eslahchi C. CN: a consensus algorithm for inferring gene regulatory networks using the SORDER algorithm and conditional mutual information test. Mol BioSyst. 2015;11:942–9. Natori K, Uto M, Nishiyama Y, Kawano S, Ueno M. Constraint-based learning Bayesian networks using Bayes factor. In: Suzuki J, Ueno M (editors) Advanced Methodologies for Bayesian Networks. AMBN 2015. Lecture Notes in Computer Science, vol 9505. Cham: Springer; 2015; p. 15–31. Scutari M, Graaand CE, Gutiérrez JM. Who learns better Bayesian network structures: constraint-based, score-based or hybrid algorithms? Proc Mach Learn Res. 2018;72:416–27. Scanagatta M, Salmerón A, Stella F. A survey on Bayesian network structure learning from data. Progr Artif Intell. 2019;8:425–39. Yu J, Smith VA, Wang PP, Hartemink AJ, Jarvis ED. Advances to Bayesian network inference for generating causal networks from observational biological data. Bioinformatics. 2004;20:3594–603. Yu J. Developing Bayesian network inference algorithm to predict casual functional pathways in biological systems. PhD thesis, Duke University, Durham. 2005. Scutari M. Learning Bayesian networks with the bnlearn R package. J Stat Softw. 2010;35:1–22. Vogogias A, Kennedy J, Archambault D, Bach B, Smith VA, Currant H. Bayespiles: visualisation support for Bayesian network structure learning. ACM Trans Intell Syst Technol. 2018;10:5. Stephens CS, Johnson PA. Occludin expression and regulation in small follicles of the layer and broiler breeder hen. Gen Comp Endocrinol. 2017;248:106–13. Zhang B, Shao Y, Liu D, Yin P, Guo Y, Yuan J. Zinc prevents Salmonella enterica serovar Typhimurium-induced loss of intestinal mucosal barrier function in broiler chickens. Avian Pathol. 2012;41:361–7. Furuse M, Tsukita S. Claudins in occluding junctions of humans and flies. Trends Cell Biol. 2006;16:181–8. Du Y, Xu JT, Jin HN, Zhao R, Zhao D, Du SH, et al. Increased cerebral expressions of MMPs, CLDN5, OCLN, ZO1 and AQPs are associated with brain edema following fatal heat stroke. Sci Rep. 2017;7:1691. Schuster MK, Schmierer B, Shkumatava A, Kuchler K. Activin A and follicle-stimulating hormone control tight junctions in avian granulosa cells by regulating occludin expression. Biol Reprod. 2004;70:1493–9. Sekelova Z, Stepanova H, Polansky O, Varmuzova K, Faldynova M, Fedr R, et al. Differential protein expression in chicken macrophages and heterophils in vivo following infection with Salmonella Enteritidis. Vet Res. 2017;48:35. Sperandio M, Gleissner CA, Ley K. Glycosylation in immune cell trafficking. Immunol Rev. 2009;230:97–113. Lai E, Teodoro T, Volchuk A. Endoplasmic reticulum stress: Signaling the unfolded protein response. Physiologys. 2007;22:193–201. Jung J, Eggleton P, Robinson A, Wang J, Gutowski N, Holley J, et al. Calnexin is necessary for T cell transmigration into the central nervous system. JCI Insight. 2018;3:e98410. Yu Y, Hoffhines AJ, Moore KL, Leary JA. Determination of the sites of tyrosine O-sulfation in peptides and proteins. Nat Methods. 2007;4:583–8. Bin OY, Moore KL. Molecular cloning and expression of human and mouse tyrosylprotein sulfotransferase-2 and a tyrosylprotein sulfotransferase homologue in Caenorhabditis elegans. J Biol Chem. 1998;273:24770–4. McEver RP, Cummings RD. Role of PSGL-1 binding to selectins in leukocyte recruitment. J Clin Invest. 1997;100(11 Suppl):S97-103. Fallahsharoudi A, de Kock N, Johnsson M, Bektic L, Ubhayasekera SJKA, Bergquist J, et al. QTL mapping of stress related gene expression in a cross between domesticated chickens and ancestral red junglefowl. Mol Cell Endocrinol. 2017;446:52–8. Webster J. Animal welfare: freedoms, dominions and “a life worth living.” Animals. 2016;6:35. Renaudeau D, Collin A, Yahav S, de Basilio V, Gourdine JL, Collier RJ. Adaptation to hot climate and strategies to alleviate heat stress in livestock production. Animal. 2012;6:707–28. Lara L, Rostagno M. Impact of heat stress on poultry production. Animals. 2013;3:356–69. Bélteky J, Agnvall B, Bektic L, Höglund A, Jensen P, Guerrero-Bosagna C. Epigenetics and early domestication: differences in hypothalamic DNA methylation between red junglefowl divergently selected for high or low fear of humans. Genet Sel Evol. 2018;50:13. Bélteky J, Agnvall B, Johnsson M, Wright D, Jensen P. Domestication and tameness: brain gene expression in red junglefowl selected for less fear of humans suggests effects on reproduction and immunology. R Soc Open Sci. 2016;3:160033. Elfwing M, Nätt D, Goerlich-Jansson VC, Persson M, Hjelm J, Jensen P. Early stress causes sex-specific, life-long changes in behaviour, levels of gonadal hormones, and gene expression in chickens. PLoS One. 2015;10:e0125808. Pértille F, Ibelli AMG, El SM, Poleti MD, Fröhlich AS, Rezaei S, et al. Putative epigenetic biomarkers of stress in red blood cells of chickens reared across different biomes. Front Genet. 2020;11:508809. Goerlich VC, Nätt D, Elfwing M, Macdonald B, Jensen P. Transgenerational effects of early experience on behavioral, hormonal and gene expression responses to acute stress in the precocial chicken. Horm Behav. 2012;61:711–8. Rezaei S, Uffenorde J, Gimm O, Hosseinpour Feizi MA, Miemczyk S, Lehmann Coutinho L, et al. GBS-MeDIP: A protocol for parallel identification of genetic and epigenetic variation in the same reduced fraction of genomes across individuals. STAR Protoc. 2022;3:101202. Pértille F, Brantsæter M, Nordgreen J, Coutinho LL, Janczak AM, Jensen P, et al. DNA methylation profiles in red blood cells of adult hens correlate with their rearing conditions. J Exp Biol. 2017;220:3579–87. Feng J, Liu T, Qin B, Zhang Y, Liu XS. Identifying ChIP-seq enrichment using MACS. Nat Protoc. 2012;7:1728–40. Niazi U, Geyer KK, Vickers MJ, Hoffmann KF, Swain MT. DISMISS: detection of stranded methylation in MeDIP-Seq data. BMC Bioinformatics. 2016;17:295. Cavalcante RG, Qin T, Sartor MA. Novel bioinformatics methods for toxicoepigenetics. In: McCullough SD, Dolinoy DC, editors, Toxicoepigenetics: core principles and applications. London: Academic Press. 2019; p. 265–288. Nagarajan R, Scutari M, Lèbre S. Bayesian networks in R. New York: Chapman and Hall/CRC; 2013. Smith VA, Yu J, Smulders TV, Hartemink AJ, Jarvis ED. Computational inference of neural information flow networks. PLoS Comput Biol. 2006;2:e161.