Scientific data
2052-4463
Anh Quốc
Cơ quản chủ quản: NATURE PORTFOLIO , Nature Publishing Group
Lĩnh vực:
Statistics and ProbabilityComputer Science ApplicationsInformation SystemsEducationLibrary and Information SciencesStatistics, Probability and Uncertainty
Các bài báo tiêu biểu
Charting the complete elastic properties of inorganic crystalline compounds Abstract The elastic constant tensor of an inorganic compound provides a complete description of the response of the material to external stresses in the elastic limit. It thus provides fundamental insight into the nature of the bonding in the material, and it is known to correlate with many mechanical properties. Despite the importance of the elastic constant tensor, it has been measured for a very small fraction of all known inorganic compounds, a situation that limits the ability of materials scientists to develop new materials with targeted mechanical responses. To address this deficiency, we present here the largest database of calculated elastic properties for inorganic compounds to date. The database currently contains full elastic information for 1,181 inorganic compounds, and this number is growing steadily. The methods used to develop the database are described, as are results of tests that establish the accuracy of the data. In addition, we document the database format and describe the different ways it can be accessed and analyzed in efforts related to materials discovery and design.
Tập 2 Số 1
The traded water footprint of global energy from 2010 to 2018 Abstract The energy-water nexus describes the requirement of water-for-energy and energy-for-water. The consumption of water in the production and generation of energy resources is also deemed virtual water. Pairing the virtual water estimates for energy with international trade data creates a virtual water trade network, facilitating analysis of global water resources management. In this database, we identify the virtual water footprints for the trade of eleven different energy commodities including fossil fuels, biomass, and electricity. Additionally, we provide the necessary scripts for downloading and pairing trade data with the virtual water footprints to create a virtual water trade network. The resulting database contains country-to-country virtual water trade from 2010–2018, broken down by commodity. The purpose of this data descriptor is to provide detailed methods and validation of the dataset beyond the complementary research publication. The resulting database provides opportunities to understand global energy-related water demands and advance future global water resources research.
Tập 8 Số 1
China’s environmental policy intensity for 1978–2019 Abstract Improving the measurement of environmental policy intensity would affect not only the selection of variables in environmental policy research but also the research conclusions when evaluating policy effects. Because direct evaluation is lacking, the existing research usually applies data such as pollutant emission data, or the number of policies to construct proxy variables. However, these proxy variables are affected by many assumptions and different selection criteria, and they are inevitably accompanied by endogeneity problems. In this study, China’s environmental policy is comprehensively collected for the first time, and a machine learning algorithm is applied to evaluate the policy intensity. We provide all the policies issued by the Chinese government from 1978 to 2019 and the quantified intensity for each policy. We also distinguish all policies into three types according to their attributes. This dataset can help researchers to further understand China’s environmental policy system. In addition, it provides a valuable dataset for related research on evaluating environmental policy and recommending actions for further improvement.
Tập 9 Số 1
Holocene global mean surface temperature, a multi-method reconstruction approach Abstract An extensive new multi-proxy database of paleo-temperature time series (Temperature 12k) enables a more robust analysis of global mean surface temperature (GMST) and associated uncertainties than was previously available. We applied five different statistical methods to reconstruct the GMST of the past 12,000 years (Holocene). Each method used different approaches to averaging the globally distributed time series and to characterizing various sources of uncertainty, including proxy temperature, chronology and methodological choices. The results were aggregated to generate a multi-method ensemble of plausible GMST and latitudinal-zone temperature reconstructions with a realistic range of uncertainties. The warmest 200-year-long interval took place around 6500 years ago when GMST was 0.7 °C (0.3, 1.8) warmer than the 19th Century (median, 5th , 95th percentiles). Following the Holocene global thermal maximum, GMST cooled at an average rate −0.08 °C per 1000 years (−0.24, −0.05). The multi-method ensembles and the code used to generate them highlight the utility of the Temperature 12k database, and they are now available for future use by studies aimed at understanding Holocene evolution of the Earth system.
Tập 7 Số 1
First-principles data set of 45,892 isolated and cation-coordinated conformers of 20 proteinogenic amino acids Abstract We present a structural data set of the 20 proteinogenic amino acids and their amino-methylated and acetylated (capped) dipeptides. Different protonation states of the backbone (uncharged and zwitterionic) were considered for the amino acids as well as varied side chain protonation states. Furthermore, we studied amino acids and dipeptides in complex with divalent cations (Ca2+ , Ba2+ , Sr2+ , Cd2+ , Pb2+ , and Hg2+ ). The database covers the conformational hierarchies of 280 systems in a wide relative energy range of up to 4 eV (390 kJ/mol), summing up to a total of 45,892 stationary points on the respective potential-energy surfaces. All systems were calculated on equal first-principles footing, applying density-functional theory in the generalized gradient approximation corrected for long-range van der Waals interactions. We show good agreement to available experimental data for gas-phase ion affinities. Our curated data can be utilized, for example, for a wide comparison across chemical space of the building blocks of life, for the parametrization of protein force fields, and for the calculation of reference spectra for biophysical applications.
Tập 3 Số 1
Climatologies at high resolution for the earth’s land surface areas Abstract High-resolution information on climatic conditions is essential to many applications in environmental and ecological sciences. Here we present the CHELSA (Climatologies at high resolution for the earth’s land surface areas) data of downscaled model output temperature and precipitation estimates of the ERA-Interim climatic reanalysis to a high resolution of 30 arc sec. The temperature algorithm is based on statistical downscaling of atmospheric temperatures. The precipitation algorithm incorporates orographic predictors including wind fields, valley exposition, and boundary layer height, with a subsequent bias correction. The resulting data consist of a monthly temperature and precipitation climatology for the years 1979–2013. We compare the data derived from the CHELSA algorithm with other standard gridded products and station data from the Global Historical Climate Network. We compare the performance of the new climatologies in species distribution modelling and show that we can increase the accuracy of species range predictions. We further show that CHELSA climatological data has a similar accuracy as other products for temperature, but that its predictions of precipitation patterns are better.
Tập 4 Số 1
Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset Abstract Monitoring vegetation phenology is critical for quantifying climate change impacts on ecosystems. We present an extensive dataset of 1783 site-years of phenological data derived from PhenoCam network imagery from 393 digital cameras, situated from tropics to tundra across a wide range of plant functional types, biomes, and climates. Most cameras are located in North America. Every half hour, cameras upload images to the PhenoCam server. Images are displayed in near-real time and provisional data products, including timeseries of the Green Chromatic Coordinate (Gcc), are made publicly available through the project web page (https://phenocam.sr.unh.edu/webcam/gallery/ ). Processing is conducted separately for each plant functional type in the camera field of view. The PhenoCam Dataset v2.0, described here, has been fully processed and curated, including outlier detection and expert inspection, to ensure high quality data. This dataset can be used to validate satellite data products, to evaluate predictions of land surface models, to interpret the seasonality of ecosystem-scale CO2 and H2 O flux data, and to study climate change impacts on the terrestrial biosphere.
Tập 6 Số 1
Tracking vegetation phenology across diverse North American biomes using PhenoCam imagery Abstract Vegetation phenology controls the seasonality of many ecosystem processes, as well as numerous biosphere-atmosphere feedbacks. Phenology is also highly sensitive to climate change and variability. Here we present a series of datasets, together consisting of almost 750 years of observations, characterizing vegetation phenology in diverse ecosystems across North America. Our data are derived from conventional, visible-wavelength, automated digital camera imagery collected through the PhenoCam network. For each archived image, we extracted RGB (red, green, blue) colour channel information, with means and other statistics calculated across a region-of-interest (ROI) delineating a specific vegetation type. From the high-frequency (typically, 30 min) imagery, we derived time series characterizing vegetation colour, including “canopy greenness”, processed to 1- and 3-day intervals. For ecosystems with one or more annual cycles of vegetation activity, we provide estimates, with uncertainties, for the start of the “greenness rising” and end of the “greenness falling” stages. The database can be used for phenological model validation and development, evaluation of satellite remote sensing data products, benchmarking earth system models, and studies of climate change impacts on terrestrial ecosystems.
Tập 5 Số 1
A global moderate resolution dataset of gross primary production of vegetation for 2000–2016 Abstract Accurate estimation of the gross primary production (GPP) of terrestrial vegetation is vital for understanding the global carbon cycle and predicting future climate change. Multiple GPP products are currently available based on different methods, but their performances vary substantially when validated against GPP estimates from eddy covariance data. This paper provides a new GPP dataset at moderate spatial (500 m) and temporal (8-day) resolutions over the entire globe for 2000–2016. This GPP dataset is based on an improved light use efficiency theory and is driven by satellite data from MODIS and climate data from NCEP Reanalysis II. It also employs a state-of-the-art vegetation index (VI) gap-filling and smoothing algorithm and a separate treatment for C3/C4 photosynthesis pathways. All these improvements aim to solve several critical problems existing in current GPP products. With a satisfactory performance when validated against in situ GPP estimates, this dataset offers an alternative GPP estimate for regional to global carbon cycle studies.
Tập 4 Số 1
The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments Abstract The development of magnetic resonance imaging (MRI) techniques has defined modern neuroimaging. Since its inception, tens of thousands of studies using techniques such as functional MRI and diffusion weighted imaging have allowed for the non-invasive study of the brain. Despite the fact that MRI is routinely used to obtain data for neuroscience research, there has been no widely adopted standard for organizing and describing the data collected in an imaging experiment. This renders sharing and reusing data (within or between labs) difficult if not impossible and unnecessarily complicates the application of automatic pipelines and quality assurance protocols. To solve this problem, we have developed the Brain Imaging Data Structure (BIDS), a standard for organizing and describing MRI datasets. The BIDS standard uses file formats compatible with existing software, unifies the majority of practices already common in the field, and captures the metadata necessary for most common data processing operations.
Tập 3 Số 1