Scientific data

Featured scientific publications

* Data are for reference only

An EEG & eye-tracking dataset of ALS patients & healthy people during eye-tracking-based spelling system usage
Scientific data - Volume 11 - Pages 1-11 - 2024
Thi Duyen Ngo, Hai Dang Kieu, Minh Hoa Nguyen, The Hoang-Anh Nguyen, Van Mao Can, Ba Hung Nguyen, Thanh Ha Le
This research presents a dataset consisting of electroencephalogram and eye tracking recordings obtained from six patients with amyotrophic lateral sclerosis (ALS) in a locked-in state and one hundred seventy healthy individuals. The ALS patients exhibited varying degrees of disease progression, ranging from partial mobility and weakened speech to complete paralysis and loss of speech. Despite these physical impairments, the ALS patients retained good eye function, which allowed them to use a virtual keyboard for communication. Data from ALS patients was recorded multiple times at their homes, while data from healthy individuals was recorded once in a laboratory setting. For each data recording, the experimental design involved nine recording sessions per participant, each corresponding to a common human action or demand. This dataset can serve as a valuable benchmark for several applications, such as improving spelling systems with brain-computer interfaces, investigating motor imagination, exploring motor cortex function, monitoring motor impairment progress in patients undergoing rehabilitation, and studying the effects of ALS on cognitive and motor processes.
An open database of productivity in Vietnam's social sciences and humanities for public use
Scientific data - Volume 5, Issue 1 - Pages 1-15
Vuong, Quan-Hoang, La, Viet-Phuong, Vuong, Thu-Trang, Ho, Manh-Toan, Nguyen, Hong-Kong T., Nguyen, Viet-Ha, Pham, Hiep-Hung, Ho, Manh-Tung
This study presents a description of an open database on scientific output of Vietnamese researchers in social sciences and humanities, one that corrects for the shortcomings in current research publication databases such as data duplication, slow update, and a substantial cost of doing science. Here, using scientists’ self-reports, open online sources and cross-checking with Scopus database, we introduce a manual system and its semi-automated version of the database on the profiles of 657 Vietnamese researchers in social sciences and humanities who have published in Scopus-indexed journals from 2008 to 2018. The final system also records 973 foreign co-authors, 1,289 papers, and 789 affiliations. The data collection method, highly applicable for other sources, could be replicated in other developing countries while its content could be used in cross-section, multivariate, and network data analyses. The open database is expected to help Vietnam revamp its research capacity and meet the public demand for greater transparency in science management.
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Helium, inorganic and organic carbon isotopes of fluids and gases across the Costa Rica convergent margin
Scientific data - Volume 6, Issue 1
Peter H. Barry, Mayuko Nakagawa, Donato Giovannelli, J. Maarten de Moor, Matthew O. Schrenk, Alan Seltzer, Elena Manini, Daniele Fattorini, Marta Di Carlo, Francesco Regoli, Katherine Fullerton, Karen G. Lloyd
Abstract

In 2017, fluid and gas samples were collected across the Costa Rican Arc. He and Ne isotopes, C isotopes as well as total organic and inorganic carbon concentrations were measured. The samples (n = 24) from 2017 are accompanied by (n = 17) samples collected in 2008, 2010 and 2012. He-isotopes ranged from arc-like (6.8 R_A) to crustal (0.5 R_A). Measured dissolved inorganic carbon (DIC) δ¹³C_VPDB values varied from 3.55 to −21.57‰, with dissolved organic carbon (DOC) following the trends of DIC. Gas phase CO2 only occurs within ~20 km of the arc; δ¹³C_VPDB values varied from −0.84 to −5.23‰. Onsite, pH, conductivity, temperature and dissolved oxygen (DO) were measured; pH ranged from 0.9 to 10.0, conductivity from 200 to 91,900 μS/cm, temperatures from 23 to 89 °C and DO from 2 to 84%. Data were used to develop a model which suggests that ~91 ± 4.0% of carbon released from the slab/mantle beneath the Costa Rican forearc is sequestered within the crust by calcite deposition with an additional 3.3 ± 1.3% incorporated into autotrophic biomass.

The global compendium of Aedes aegypti and Ae. albopictus occurrence
Scientific data - Volume 2, Issue 1
Moritz U. G. Kraemer, Marianne Sinka, Kirsten A. Duda, Adrian Mylne, Freya M. Shearer, Oliver J. Brady, Jane P. Messina, Christopher M. Barker, Chester G. Moore, Roberta Gomes Carvalho, Giovanini Evelim Coelho, Wim Van Bortel, Guy Hendrickx, Francis Schaffner, William Wint, Iqbal Elyazar, Hwa‐Jen Teng, Simon I Hay
Abstract

Aedes aegypti and Ae. albopictus are the main vectors transmitting dengue and chikungunya viruses. Although these viruses are pathogens of global public health importance, knowledge of their vectors’ global distribution remains patchy and sparse. A global geographic database of known occurrences of Ae. aegypti and Ae. albopictus between 1960 and 2014 was compiled. Herein we present the database, which comprises occurrence data linked to point or polygon locations, derived from peer-reviewed literature and unpublished studies including national entomological surveys and expert networks. We describe all data collection processes, as well as geo-positioning methods, database management and quality-control procedures. This is the first comprehensive global database of Ae. aegypti and Ae. albopictus occurrence, consisting of 19,930 and 22,137 geo-positioned occurrence records respectively. Both datasets can be used for a variety of mapping and spatial analyses of the vectors and, by inference, the diseases they transmit.

Global-scale phylogenetic linguistic inference from lexical resources
Scientific data - Volume 5, Issue 1
Gerhard Jäger
Abstract

Automatic phylogenetic inference plays an increasingly important role in computational historical linguistics. Most pertinent work is currently based on expert cognate judgments. This limits the scope of this approach to a small number of well-studied language families. We used machine learning techniques to compile data suitable for phylogenetic inference from the ASJP database, a collection of almost 7,000 phonetically transcribed word lists over 40 concepts, covering two thirds of the extant world-wide linguistic diversity. First, we estimated Pointwise Mutual Information scores between sound classes using weighted sequence alignment and general-purpose optimization. From this we computed a dissimilarity matrix over all ASJP word lists. This matrix is suitable for distance-based phylogenetic inference. Second, we applied cognate clustering to the ASJP data, using supervised training of an SVM classifier on expert cognacy judgments. Third, we defined two types of binary characters, based on automatically inferred cognate classes and on sound-class occurrences. Several tests are reported demonstrating the suitability of these characters for character-based phylogenetic inference.
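
As a rough illustration of the Pointwise Mutual Information idea mentioned in this abstract, the sketch below computes PMI(a, b) = log(p(a, b) / (p(a) p(b))) from a list of already-aligned symbol pairs. It is not the paper's pipeline: the function name `pmi_scores`, the toy input, and the lack of any alignment or optimization step are all assumptions made for illustration.

```python
import math
from collections import Counter


def pmi_scores(aligned_pairs):
    """Return PMI for every ordered symbol pair observed in aligned_pairs.

    aligned_pairs is a list of (left_symbol, right_symbol) tuples, assumed to
    come from some earlier alignment step that is not shown here.
    """
    n = len(aligned_pairs)
    pair_counts = Counter(aligned_pairs)
    left_counts = Counter(a for a, _ in aligned_pairs)
    right_counts = Counter(b for _, b in aligned_pairs)

    scores = {}
    for (a, b), count in pair_counts.items():
        p_ab = count / n               # joint probability of the aligned pair
        p_a = left_counts[a] / n       # marginal probability on the left side
        p_b = right_counts[b] / n      # marginal probability on the right side
        scores[(a, b)] = math.log(p_ab / (p_a * p_b))
    return scores


# Hypothetical toy data: "t" always aligns with "t", so its PMI is positive.
toy_pairs = [("p", "p"), ("p", "b"), ("t", "t"), ("t", "t")]
toy_scores = pmi_scores(toy_pairs)
```

A positive score means two symbols align more often than chance would predict; a score of zero means they co-occur exactly as often as independent symbols would.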

Satellite-based time-series of sea-surface temperature since 1981 for climate applications
Scientific data - Volume 6, Issue 1
Christopher J. Merchant, Owen Embury, Claire E. Bulgin, T. Block, Gary K. Corlett, Emma Fiedler, Simon Good, Jonathan P. D. Mittaz, Nick A Rayner, David I. Berry, Steinar Eastwood, Michael Taylor, Yoko Tsushima, A. M. Waterfall, Ruth Wilson, Craig Donlon
Abstract

A climate data record of global sea surface temperature (SST) spanning 1981–2016 has been developed from 4 × 10¹² satellite measurements of thermal infra-red radiance. The spatial area represented by pixel SST estimates is between 1 km² and 45 km². The mean density of good-quality observations is 13 km⁻² yr⁻¹. SST uncertainty is evaluated per datum, the median uncertainty for pixel SSTs being 0.18 K. Multi-annual observational stability relative to drifting buoy measurements is within 0.003 K yr⁻¹ of zero with high confidence, despite maximal independence from in situ SSTs over the latter two decades of the record. Data are provided at native resolution, gridded at 0.05° latitude-longitude resolution (individual sensors), and aggregated and gap-filled on a daily 0.05° grid. Skin SSTs, depth-adjusted SSTs de-aliased with respect to the diurnal cycle, and SST anomalies are provided. Target applications of the dataset include: climate and ocean model evaluation; quantification of marine change and variability (including marine heatwaves); climate and ocean-atmosphere processes; and specific applications in ocean ecology, oceanography and geophysics.
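
To make the 0.05° gridding mentioned in this abstract concrete, the sketch below bins point observations into a regular 0.05° latitude-longitude grid and averages them per cell. The grid origin (south-west corner at −90°, −180°), the row and column ordering, the function names and the plain unweighted mean are all assumptions for illustration; the actual product also handles quality control, uncertainty and gap-filling, none of which is shown.

```python
import math
from collections import defaultdict

CELLS_PER_DEGREE = 20                 # 1 / 0.05 degrees
N_ROWS = 180 * CELLS_PER_DEGREE       # 3600 latitude rows
N_COLS = 360 * CELLS_PER_DEGREE       # 7200 longitude columns


def cell_index(lat, lon):
    """Map a latitude/longitude in degrees to a (row, col) cell on the grid."""
    # Clamp so that lat == 90 falls in the last row instead of one past it.
    row = min(int(math.floor((lat + 90.0) * CELLS_PER_DEGREE)), N_ROWS - 1)
    # Wrap so that lon == 180 maps to the same column as lon == -180.
    col = int(math.floor((lon + 180.0) * CELLS_PER_DEGREE)) % N_COLS
    return row, col


def grid_mean(observations):
    """Average (lat, lon, value) observations that fall in the same cell."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for lat, lon, value in observations:
        key = cell_index(lat, lon)
        sums[key] += value
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


# Hypothetical observations in kelvin; the first two share one grid cell.
sample = [(0.01, 0.01, 290.0), (0.02, 0.03, 292.0), (10.0, 10.0, 300.0)]
gridded = grid_mean(sample)
```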

Australian sea-floor survey data, with images and expert annotations
Scientific data - Volume 2, Issue 1
Michael Bewley, Ariell Friedman, Renata Ferrari, Nicole Hill, Renae Hovey, NS Barrett, Ezequiel M. Marzinelli, Oscar Pizarro, Will F. Figueira, Lisa Meyer, Russell C. Babcock, Lynda M. Bellchambers, Maria Byrne, Stefan B. Williams
Abstract

This Australian benthic data set (BENTHOZ-2015) consists of an expert-annotated set of georeferenced benthic images and associated sensor data, captured by an autonomous underwater vehicle (AUV) around Australia. This type of data is of interest to marine scientists studying benthic habitats and organisms. AUVs collect georeferenced images over an area with consistent illumination and altitude, and make it possible to generate broad scale, photo-realistic 3D maps. Marine scientists then typically spend several minutes on each of thousands of images, labeling substratum type and biota at a subset of points. Labels from four Australian research groups were combined using the CATAMI classification scheme, a hierarchical classification scheme based on taxonomy and morphology for scoring marine imagery. This data set consists of 407,968 expert labeled points from around the Australian coast, with associated images, geolocation and other sensor data. The robotic surveys that collected this data form part of Australia's Integrated Marine Observing System (IMOS) ongoing benthic monitoring program. There is reuse potential in marine science, robotics, and computer vision research.

A lake data set for the Tibetan Plateau from the 1960s, 2005, and 2014
Scientific data - Volume 3, Issue 1
Wei Wan, Di Long, Yang Hong, Yingzhao Ma, Yuan Yuan, Pengfeng Xiao, Hongtao Duan, Zhongying Han, Xingfa Gu
Abstract

Long-term datasets of number and size of lakes over the Tibetan Plateau (TP) are among the most critical components for better understanding the interactions among the cryosphere, hydrosphere, and atmosphere at regional and global scales. Due to the harsh environment and the scarcity of data over the TP, data accumulation and sharing become more valuable for scientists worldwide to make new discoveries in this region. This paper, for the first time, presents a comprehensive and freely available data set of lakes’ status (name, location, shape, area, perimeter, etc.) over the TP region dating back to the 1960s, including three time series, i.e., the 1960s, 2005, and 2014, derived from ground survey (the 1960s) or high-spatial-resolution satellite images from the China-Brazil Earth Resources Satellite (CBERS) (2005) and China’s newly launched GaoFen-1 (GF-1, which means high-resolution images in Chinese) satellite (2014). The data set could provide scientists with useful information for revealing environmental changes and mechanisms over the TP region.

Design Type(s): time series design • observation design • data integration objective
Measurement Type(s): lake topography
Technology Type(s): remote sensing
Factor Type(s):
Sample Characteristic(s): Tibetan Plateau • Qaidam Basin • Amu Darya • Brahmaputra River • River Ganges • Hexi District • Indus River • Mekong River • Salween River • Tarim Basin • Yangtze River • Yellow River • endorheic lake • exorheic lake

Machine-accessible metadata file describing the reported data (ISA-Tab format)

China CO2 emission accounts 1997–2015
Scientific data - Volume 5, Issue 1
Yuli Shan, Dabo Guan, Heran Zheng, Jiamin Ou, Yuan Li, Jing Meng, Zhifu Mi, Zhu Liu, Qiang Zhang
Abstract

China is the world’s top energy consumer and CO2 emitter, accounting for 30% of global emissions. Compiling an accurate accounting of China’s CO2 emissions is the first step in implementing reduction policies. However, no annual, officially published emissions data exist for China. The current emissions estimated by academic institutes and scholars exhibit great discrepancies. The gap between the different emissions estimates is approximately equal to the total emissions of the Russian Federation (the 4th highest emitter globally) in 2011. In this study, we constructed the time-series of CO2 emission inventories for China and its 30 provinces. We followed the Intergovernmental Panel on Climate Change (IPCC) emissions accounting method with a territorial administrative scope. The inventories include energy-related emissions (17 fossil fuels in 47 sectors) and process-related emissions (cement production). The first version of our dataset presents emission inventories from 1997 to 2015. We will update the dataset annually. The uniformly formatted emission inventories provide data support for further emission-related research as well as emissions reduction policy-making in China.
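
The abstract does not spell out the accounting formula. As a minimal sketch, the basic IPCC-style approach multiplies activity data (fuel consumed) by an emission factor and sums over fuels and sectors. The function name, the fuel and sector labels, and every number below are hypothetical placeholders, not values from the dataset, and the paper's own factors and intermediate terms may differ.

```python
def total_emissions(activity, emission_factors):
    """Sum activity * emission factor over every (fuel, sector) entry.

    activity maps (fuel, sector) -> amount of fuel consumed.
    emission_factors maps fuel -> CO2 emitted per unit of fuel.
    Units are whatever the caller uses consistently; none are assumed here.
    """
    total = 0.0
    for (fuel, sector), amount in activity.items():
        total += amount * emission_factors[fuel]
    return total


# Hypothetical inputs, chosen only to make the arithmetic easy to follow.
example_activity = {
    ("coal", "power"): 100.0,
    ("natural gas", "residential"): 50.0,
}
example_factors = {"coal": 2.0, "natural gas": 1.5}

example_total = total_emissions(example_activity, example_factors)
```

Keeping the factors keyed by fuel and the activity keyed by (fuel, sector) mirrors the fuels-by-sectors layout the abstract describes, so per-sector totals can be obtained by filtering the same dictionary.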

Coastal sea level anomalies and associated trends from Jason satellite altimetry over 2002–2018
Scientific data - Volume 7, Issue 1
Jérôme Benveniste, Florence Birol, Francisco M. Calafat, Anny Cazenave, H. B. Dieng, Yvan Gouzènes, Jean François Legeais, Fabien Léger, Fernando Niño, Marcello Passaro, Christian Schwatke, Andrew Shaw
Abstract

Climate-related sea level changes in the world coastal zones result from the superposition of the global mean rise due to ocean warming and land ice melt, regional changes caused by non-uniform ocean thermal expansion and salinity changes, and by the solid Earth response to current water mass redistribution and associated gravity change, plus small-scale coastal processes (e.g., shelf currents, wind & waves changes, fresh water input from rivers, etc.). So far, satellite altimetry has provided global gridded sea level time series up to 10–15 km to the coast only, preventing estimation of sea level changes very close to the coast. Here we present a 16-year-long (June 2002 to May 2018), high-resolution (20-Hz), along-track sea level dataset at monthly interval, together with associated sea level trends, at 429 coastal sites in six regions (Northeast Atlantic, Mediterranean Sea, Western Africa, North Indian Ocean, Southeast Asia and Australia). This new coastal sea level product is based on complete reprocessing of raw radar altimetry waveforms from the Jason-1, Jason-2 and Jason-3 missions.
