Implementation of a workflow for publishing citeable environmental data: successes, challenges and opportunities from a data centre perspective
Tóm tắt
In recent years, the development and implementation of a robust way to cite data have encouraged many previously sceptical environmental researchers to publish the data they create, thus ensuring that more data than ever are now open and available for re-use within and between research communities. Here, we describe a workflow for publishing citeable data in the context of the environmental sciences—an area spanning many domains and generating a vast array of heterogeneous data products. The processes and tools we have developed have enabled rapid publication of quality data products including datasets, models and model outputs which can be accessed, re-used and subsequently cited. However, there are still many challenges that need to be addressed before researchers in the environmental sciences fully accept the notion that datasets are valued outputs and time should be spent in properly describing, storing and citing them. Here, we identify current challenges such as citation of dynamic datasets and issues of recording and presenting citation metrics. In conclusion, whilst data centres may have the infrastructure, tools, resources and processes available to publish citeable datasets, further work is required before large-scale uptake of the services offered is achieved. We believe that once current challenges are met, data resources will be viewed similarly to journal publications as valued outputs in a researcher’s portfolio, and therefore both the quality and quantity of data published will increase.
Tài liệu tham khảo
Lawrence, B., Jones, C., Matthews, B., Pepler, S., Callaghan, S.: Citation and peer review of data: moving towards formal data publication. Int. J. Digit. Curation 6(2), 4–37 (2011)
Klump, J., Bertelmann, R., Brase, J., Diepenbroek, M., Grobe, H., Hock, H., Lautenschalger, M., Schindler, U., Sens, I., Wachter, J.: Data publication in the open access initiative. Data Sci. J. 5, 79–83 (2006)
CODATA-ICSTI, Socha, Y. (ed.): Out of cite, out of mind: the current state of practice, policy, and technology for the citation of data. Data Sci. J. 12(0), CIDCR1–CIDCR7 (2013). doi:10.2481/dsj.OSOM13-043
Arzberger, P., Schroeder, P., Beaulieu, A., Bowker, G., Casey, K., Laaksonen, L., Moorman, D., Uhlir, P., Wouters, P.: Promoting access to public research data for scientific, economic, and social development. Data Sci. J. 3, 135–152 (2004)
Mayernik, M.: Bridging data lifecycles: Tracking data use via data citations workshop report. NCAR Technical Note NCAR/TN-494+PROC (2013). doi:10.5065/D6PZ56TX
Assante, M., Candela, L., Castelli, D., Manghi, P., Pagano, P.: Science 2.0 repositories: time for a change in scholarly communication. D-Lib Mag. 21(1/2) (2015). doi:10.1045/january2015-assante
Kratz, J.E., Strasser, C.: Making data count. Sci. Data 2, 150039 (2015). doi:10.1038/sdata.2015.39
Duerr, R., Downs, R., Tilmes, C., Barkstrom, B., Lenhardt, W., Glassy, J., Bermudez, L., Slaughter, P.: On the utility of identification schemes for digital earth science data: an assessment and recommendations. Earth Sci. Inform. 4, 139–60 (2011). doi:10.1007/s12145-011-0083-6
Callaghan, S., Donegan, S., Pepler, S., et al.: Making data a first class scientific output: data citation and publication by NERC’s environmental data centres. Int. J. Digit. Curation 7(1), 107–113 (2012)
ESIP (Federation of Earth Science Information Partners): Data citation guidelines for data providers and archives. In: Parsons, M.A., Barkstrom, B., Downs, R.R., Duerr, R., Tilmes, C., ESIP Preservation and Stewardship Committee (eds.) ESIP Commons (2012). doi:10.7269/P34F1NNJ
Klump, J., Huber, R., Diepenbroek, M.: DOI for geoscience data-how early practices shape present perceptions. Earth Sci. Inform. 9, 123–136 (2016). doi:10.1007/s12145-015-0231-5
Ball, A., Duke, M.: How to Cite Datasets and Link to Publications. DCC How-to Guides. Digital Curation Centre, Edinburgh (2015). http://www.dcc.ac.uk/resources/how-guides
Beresford, N.A., Barnett, C.L., Howard, B.J., Howard, D.C., Tyler, A.N., Bradley, S., Copplestone, D.: Observations of Fukushima Fallout in Great Britain. NERC Environmental Information Data Centre (2011). doi:10.5285/1a91c7d1-ec44-4858-9af2-98d80f169bbd
Beresford, N.A., Barnett, C.L., Howard, B.J., Howard, D.C., Wells, C., Tyler, A.N., Bradley, S., Copplestone, D.: Observations of Fukushima fallout in Great Britain. J. Environ. Radioact. 114, 48–53 (2012). doi:10.1016/j.jenvrad.2011.12.008
Haxton, T., Crooks, S., Jackson, C.R., Barkwith, A.K.A.P., Kelvin, J., Williamson, J., Mackay, J.D., Wang, L., Davies, H., Young, A., Prudhomme, C.: Future flows hydrology data. NERC Environ. Inf. Data Centre (2012). doi:10.5285/f3723162-4fed-4d9d-92c6-dd17412fa37b
Prudhomme, C., Haxton, T., Crooks, S., Jackson, C., Barkwith, A., Williamson, J., Kelvin, J., Mackay, J., Wang, L., Young, A., Watts, G.: Future flows hydrology: and ensemble of a daily river flow and monthly groundwater levels for use for climate change impact assessment across Great Britain. Eath Syst. Sci. Data 5, 101–107 (2013). doi:10.5194/essd-5-101-2013
Royan, A., Prudhomme, C., Hannah, D.M., Reynolds, S.J., Noble, D.G., Sadler, J.P.: Climate-induced changes in river flow regimes will alter future bird distributions. Ecosphere 6(4), 50 (2015). doi:10.1890/ES14-00245.1
British Library, DataCite: Working with the British Library and DataCite—a guide for Higher Education Institutions in the UK. British Library (2013). http://www.bl.uk/aboutus/stratpolprog/digi/datasets/WorkingWithDataCite_2013.pdf
Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: collated indices 2011. NERC Environmental Information Data Centre (2012). doi:10.5285/ff55462e-38a4-4f30-b562-f82ff263d9c3
Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Species Trends 2011. NERC Environmental Information Data Centre (2013). doi:10.5285/cad2af6c-0c97-414c-8d5f-992741b283cf
Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Collated Indices 2012. NERC Environmental Information Data Centre (2013). doi:10.5285/7949cc99-76c4-4a3e-8c33-41a35b8b7777
Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Species Trends 2012. NERC Environmental Information Data Centre (2013). doi:10.5285/5afbbd36-2c63-4aa1-8177-695bed98d7a9
Rauber, A., Pröll, S.: Scalable dynamic data citation—RDA-WG-DC position paper (2015). https://rd-alliance.org/groups/data-citation-wg/wiki/scalable-dynamic-data-citation-rda-wg-dc-position-paper.html. Accessed 25 June 2015