Implementation of a workflow for publishing citeable environmental data: successes, challenges and opportunities from a data centre perspective

Springer Science and Business Media LLC - Volume 18 - Pages 133-143 - 2016
Kathryn A. Harrison (1), Daniel G. Wright (1), Philip Trembath (1)
(1) Centre for Ecology and Hydrology, Lancaster Environmental Centre, Lancaster, UK

Abstract

In recent years, the development and implementation of a robust way to cite data have encouraged many previously sceptical environmental researchers to publish the data they create, thus ensuring that more data than ever are now open and available for re-use within and between research communities. Here, we describe a workflow for publishing citeable data in the context of the environmental sciences—an area spanning many domains and generating a vast array of heterogeneous data products. The processes and tools we have developed have enabled rapid publication of quality data products including datasets, models and model outputs which can be accessed, re-used and subsequently cited. However, there are still many challenges that need to be addressed before researchers in the environmental sciences fully accept the notion that datasets are valued outputs and time should be spent in properly describing, storing and citing them. Here, we identify current challenges such as citation of dynamic datasets and issues of recording and presenting citation metrics. In conclusion, whilst data centres may have the infrastructure, tools, resources and processes available to publish citeable datasets, further work is required before large-scale uptake of the services offered is achieved. We believe that once current challenges are met, data resources will be viewed similarly to journal publications as valued outputs in a researcher’s portfolio, and therefore both the quality and quantity of data published will increase.
