PROTEOMAS: a workflow enabling harmonized proteomic meta-analysis and proteomic signature mapping
Tóm tắt
Toxicological evaluation of substances in regulation still often relies on animal experiments. Understanding the substances’ mode-of-action is crucial to develop alternative test strategies. Omics methods are promising tools to achieve this goal. Until now, most attention was focused on transcriptomics, while proteomics is not yet routinely applied in toxicology despite the large number of datasets available in public repositories. Exploiting the full potential of these datasets is hampered by differences in measurement procedures and follow-up data processing. Here we present the tool PROTEOMAS, which allows meta-analysis of proteomic data from public origin. The workflow was designed for analyzing proteomic studies in a harmonized way and to ensure transparency in the analysis of proteomic data for regulatory purposes. It agrees with the Omics Reporting Framework guidelines of the OECD with the intention to integrate proteomics to other omic methods in regulatory toxicology. The overarching aim is to contribute to the development of AOPs and to understand the mode of action of substances. To demonstrate the robustness and reliability of our workflow we compared our results to those of the original studies. As a case study, we performed a meta-analysis of 25 proteomic datasets to investigate the toxicological effects of nanomaterials at the lung level. PROTEOMAS is an important contribution to the development of alternative test strategies enabling robust meta-analysis of proteomic data. This workflow commits to the FAIR principles (Findable, Accessible, Interoperable and Reusable) of computational protocols.
Tài liệu tham khảo
OECD (2018) Users' Handbook supplement to the Guidance Document for developing and assessing Adverse Outcome Pathways. https://doi.org/10.1787/5jlv1m9d1g32-en
Halappanavar S et al (2020) Adverse outcome pathways as a tool for the design of testing strategies to support the safety assessment of emerging advanced materials at the nanoscale. Part Fibre Toxicol 17:16. https://doi.org/10.1186/s12989-020-00344-4
Villeneuve DL et al (2014) Adverse outcome pathway (AOP) development I: strategies and principles. Toxicol Sci 142:312–320. https://doi.org/10.1093/toxsci/kfu199
Krewski D et al (2014) A framework for the next generation of risk science. Environ Health Perspect 122:796–805. https://doi.org/10.1289/ehp.1307260
Stagljar I (2016) The power of OMICs. Biochem Biophys Res Commun 479:607–609. https://doi.org/10.1016/j.bbrc.2016.09.095
Ayers D, Day PJ (2015) Systems medicine: the application of systems biology approaches for modern medical research and drug development. Mol Biol Int. https://doi.org/10.1155/2015/698169
Simões T, Novais S, Natal-da-Luz T, Devreese B, de Boer T, Roelofs D, Sousa JP, van Straalen NM, Lemos MFL (2018) An integrative omics approach to unravel toxicity mechanisms of environmental chemicals: effects of a formulated herbicide. Sci Rep. https://doi.org/10.1038/s41598-018-29662-6
Sauer UG et al (2017) The challenge of the application of ’omics technologies in chemicals risk assessment: background and outlook. Regul Toxicol Pharmacol 91:14–26. https://doi.org/10.1016/j.yrtph.2017.09.020
Harrill JA et al (2021) Progress towards an OECD reporting framework for transcriptomics and metabolomics in regulatory toxicology. Regulatory Toxicol Pharmacol 125:105020. https://doi.org/10.1016/j.yrtph.2021.105020
Saarimäki LA, Melagraki G, Afantitis A, Lynch I, Greco D (2022) Prospects and challenges for FAIR toxicogenomics data. Nat Nanotechnol 17:17–18. https://doi.org/10.1038/s41565-021-01049-1
OECD (2021)Transcriptomic Reporting Framework (TRF). www.oecd.org/chemicalsafety/testing/transcriptomic-reporting-framework.pdf
OECD (2021) Metabolomic Reporting Framework (MRF). www.oecd.org/chemicalsafety/testing/metabolomics-reporting-framework.pdf
Jeliazkova N et al (2021) Towards FAIR nanosafety data. Nat Nanotechnol 16:644–654. https://doi.org/10.1038/s41565-021-00911-6
Cox J, Mann M (2008) MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26:1367–1372. https://doi.org/10.1038/nbt.1511
Cox J et al (2014) Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics 13:2513–2526. https://doi.org/10.1074/mcp.M113.031591
Karpievitch YV, Dabney AR, Smith RD (2012) Normalization and missing value imputation for label-free LC-MS analysis. BMC Bioinformatics 13:S5. https://doi.org/10.1186/1471-2105-13-S16-S5
Jin L et al (2021) A comparative study of evaluating missing value imputation methods in label-free proteomics. Sci Rep 11:1760. https://doi.org/10.1038/s41598-021-81279-4
Lazar C, Gatto L, Ferro M, Bruley C, Burger T (2016) Accounting for the multiple natures of missing values in label-free quantitative proteomics data sets to compare imputation strategies. J Proteome Res 15:1116–1125. https://doi.org/10.1021/acs.jproteome.5b00981
Troyanskaya O et al (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17:520–525. https://doi.org/10.1093/bioinformatics/17.6.520
Stekhoven DJ, Bühlmann P (2011) MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28:112–118. https://doi.org/10.1093/bioinformatics/btr597
Charrad M, Ghazzali N, Boiteau V, Niknafs A (2014) NbClust: an r package for determining the relevant number of clusters in a data set. J Stat Softw 61:1–36. https://doi.org/10.18637/jss.v061.i06
Li L et al (2014) Integrated omic analysis of lung cancer reveals metabolism proteome signatures with prognostic impact. Nat Commun 5:5469. https://doi.org/10.1038/ncomms6469
Tomin T et al (2018) Deletion of adipose triglyceride lipase links triacylglycerol accumulation to a more-aggressive phenotype in A549 Lung carcinoma cells. J Proteome Res 17:1415–1425. https://doi.org/10.1021/acs.jproteome.7b00782
Margalit A, Kavanagh K, Carolan JC (2020) Characterization of the proteomic response of A549 cells following sequential exposure to Aspergillus fumigatus and Pseudomonas aeruginosa. J Proteome Res 19:279–291. https://doi.org/10.1021/acs.jproteome.9b00520
European Union (2011) EU 2011/696 Commission Recommendation on the definition of nanomaterial. https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32011H0696&from=EN.
Nikota J et al (2016) Meta-analysis of transcriptomic responses as a means to identify pulmonary disease outcomes for engineered nanomaterials. Part Fibre Toxicol 13:25. https://doi.org/10.1186/s12989-016-0137-5
Kohonen P et al (2017) A transcriptomics data-driven gene space accurately predicts liver cytopathology and drug-induced liver injury. Nat Commun 8:15932. https://doi.org/10.1038/ncomms15932
Serra A et al (2019) INSIdE NANO: a systems biology framework to contextualize the mechanism-of-action of engineered nanomaterials. Sci Rep 9:179. https://doi.org/10.1038/s41598-018-37411-y
Saarimäki LA et al (2021) Manually curated transcriptomics data collection for toxicogenomic assessment of engineered nanomaterials. Sci Data 8:49. https://doi.org/10.1038/s41597-021-00808-y
Labib S et al (2016) Nano-risk science: application of toxicogenomics in an adverse outcome pathway framework for risk assessment of multi-walled carbon nanotubes. Part Fibre Toxicol 13:15–15. https://doi.org/10.1186/s12989-016-0125-9
Jagiello K et al (2021) Transcriptomics-based and AOP-informed structure-activity relationships to predict pulmonary pathology induced by multiwalled carbon nanotubes. Small 17:2003465. https://doi.org/10.1002/smll.202003465
Johansson H, Albrekt A-S, Borrebaeck CAK, Lindstedt M (2013) The GARD assay for assessment of chemical skin sensitizers. Toxicol In Vitro 27:1163–1169. https://doi.org/10.1016/j.tiv.2012.05.019
Zeller KS et al (2017) The GARD platform for potency assessment of skin sensitizing chemicals. Altex 34:539–559. https://doi.org/10.14573/altex.1701101
Wilkinson MD et al (2016) The FAIR guiding principles for scientific data management and stewardship. Sci Data 3:160018. https://doi.org/10.1038/sdata.2016.18
Jeliazkova N et al (2021) Towards FAIR nanosafety data. Nat Nanotechnol 16:644–654. https://doi.org/10.1038/s41565-021-00911-6
Perez-Riverol Y et al (2018) The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 47:D442–D450. https://doi.org/10.1093/nar/gky1106
Croux C, Filzmoser P, Oliveira MR (2007) Algorithms for projection-pursuit robust principal component analysis. Chemom Intell Lab Syst 87:218–225. https://doi.org/10.1016/j.chemolab.2007.01.004
Todorov V, Filzmoser P (2009) An object-oriented framework for robust multivariate analysis. J Stat Softw 32:1–47. https://doi.org/10.18637/jss.v032.i03
Korotkevich G et al (2021) Fast gene set enrichment analysis. bioRxiv. https://doi.org/10.1101/060012
Liberzon A et al (2015) The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst 1:417–425. https://doi.org/10.1016/j.cels.2015.12.004
Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30. https://doi.org/10.1093/nar/28.1.27
Gillespie M et al (2022) The reactome pathway knowledgebase 2022. Nucleic Acids Res 50:D687-d692. https://doi.org/10.1093/nar/gkab1028
Wang Y-T et al (2017) Phosphoproteomics reveals HMGA1, a CK2 substrate, as a drug-resistant target in non-small cell lung cancer. Sci Rep 7:44021. https://doi.org/10.1038/srep44021
Poschmann G, Brenig K, Lenz T, Stühler K (2021) Comparative secretomics gives access to high confident secretome data: evaluation of different methods for the determination of bona fide secreted proteins. Proteomics 21:e2000178. https://doi.org/10.1002/pmic.202000178
Wiredja DD et al (2017) Phosphoproteomics profiling of nonsmall cell lung cancer cells treated with a novel phosphatase activator. Proteomics. https://doi.org/10.1002/pmic.201700214
Stewart PA et al (2017) Relative protein quantification and accessible biology in lung tumor proteomes from four LC-MS/MS discovery platforms. Proteomics. https://doi.org/10.1002/pmic.201600300
Kuenzi BM et al (2017) Polypharmacology-based ceritinib repurposing using integrated functional proteomics. Nat Chem Biol 13:1222–1231. https://doi.org/10.1038/nchembio.2489
Großerueschkamp F et al (2017) Spatial and molecular resolution of diffuse malignant mesothelioma heterogeneity by integrating label-free FTIR imaging, laser capture microdissection and proteomics. Sci Rep 7:44829
Mossina A et al (2017) Cigarette smoke alters the secretome of lung epithelial cells. Proteomics. https://doi.org/10.1002/pmic.201600243
Kammerl IE et al (2019) Dissecting the molecular effects of cigarette smoke on proteasome function. J Proteomics 193:1–9. https://doi.org/10.1016/j.jprot.2018.12.015
Wang P et al (2020) A cross-talk between epithelium and endothelium mediates human alveolar-capillary injury during SARS-CoV-2 infection. Cell Death Dis 11:1042–1042. https://doi.org/10.1038/s41419-020-03252-9
Dalskov L et al (2020) SARS-CoV-2 evades immune detection in alveolar macrophages. EMBO Rep 21:e51252–e51252. https://doi.org/10.15252/embr.202051252
Seddigh P et al (2017) Quantitative analysis of proteome modulations in alveolar epithelial type II cells in response to pulmonary Aspergillus fumigatus infection. Mol Celi Proteom 16:2184–2198. https://doi.org/10.1074/mcp.ra117.000072
Ruprecht B et al (2020) A mass spectrometry-based proteome map of drug action in lung cancer cell lines. Nat Chem Biol 16:1111–1119. https://doi.org/10.1038/s41589-020-0572-3
Deliyannis G et al (2021) TLR2-mediated activation of innate responses in the upper airways confers antiviral protection of the lungs. JCI Insight. https://doi.org/10.1172/jci.insight.140267
Xing Q-Q et al (2019) Serum proteomics analysis based on label-free revealed the protective effect of Chinese herbal formula Gu-Ben-Fang-Xiao. Biomed Pharmacother 119:109390. https://doi.org/10.1016/j.biopha.2019.109390
Billing AM et al (2020) Fast and robust proteome screening platform identifies neutrophil extracellular trap formation in the lung in response to cobalt ferrite nanoparticles. ACS Nano 14:4096–4110. https://doi.org/10.1021/acsnano.9b08818
Gallud A et al (2020) Multiparametric profiling of engineered nanomaterials: unmasking the surface coating effect. Adv Sci 7:2002221
Seidel C et al (2021) Inhaled multi-walled carbon nanotubes differently modulate global gene and protein expression in rat lungs. Nanotoxicology 15:238–256. https://doi.org/10.1080/17435390.2020.1851418
Phuyal S et al (2018) Characterization of the proteome and lipidome profiles of human lung cells after low dose and chronic exposure to multiwalled carbon nanotubes. Nanotoxicology 12:138–152. https://doi.org/10.1080/17435390.2018.1425500
Alswady-Hoff M et al (2021) Long-term exposure to nanosized TiO2 triggers stress responses and cell death pathways in pulmonary epithelial cells. Int J Mol Sci. https://doi.org/10.3390/ijms22105349