An excellent question came up on ecoinformatics list today on data citation from Kyle Kwaiser at the Michigan Biological Station
I am working with graduate students this summer to archive their work at our field station. I want to tell them to cite their datasets on their CV’s but I know this is not yet the norm.
Any general thoughts on how close we are to including datasets on CV’s? Can you suggest recent papers that argue (decisively) for this practice? Here are two relevant but slightly tangential examples:
- Reichman, O. J., M. B. Jones, and M. P. Schildhauer. 2011. “Challenges and Opportunities of Open Data in Ecology.” Science 331 (6018) (February): 703-705.
- Vision, Todd J. 2010. “Open Data and the Social Contract of Scientific Publishing.” BioScience 60 (5) (May): 330-331.
I gave a look over my Mendeley archives on the topic to come up with a few more. As this question seemed of interest to the FriendFeed groups ((yeah, so I was probably supposed to post this in G+, but not everyone can access that yet, right?)) I thought I’d take the liberty to share there. Unfortunately comments can’t be formatted, so I’m adding my list here. Tried to include the relevant quote from the article unless that would mean quoting the whole thing. Will try and polish a bit later.
References arguing for Data Citation
- Mons et al’s piece is essentially an argument for data citation (Mons et. al. 2011)
- Birney et al: “another would be to track the usage and citation of data sets using electronic systems similar to those used for traditional publications” and cite this in support: Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility (Wellcome Trust, 2003); available here
- “Providing a secure but flexible cyberinfrastructure while promulgating best practices such as data citation and metadata reuse, will help build confidence in data sharing”(Tenopir et. al. 2011)
- Rod discusses data citation quite a bit in print (Page, 2010)
- “By ensuring that data remain curated at the source, and by showing the importance of data sharing to promote data citation and usage, we have grown past our original technology implementation and are ready to move into a long-term production environment that departs from the original model.”(Constable et. al. 2010)
- These three make mention of data citation, mostly in reference to increased citation rates of papers(MOORE et. al. 2010),(Whitlock et. al. 2010),(Whitlock, 2011)
- Mark Parson’s talk: https://ands.org.au/guides/data-citation-awareness.html
- Heather’s excellent summary of resources on data citation principles
References
Reichman O, Jones M and Schildhauer M (2011). “Challenges And Opportunities of Open Data in Ecology.” Science, 331. ISSN 0036-8075, https://dx.doi.org/10.1126/science.1197962.
Vision T (2010). “Open Data And The Social Contract of Scientific Publishing.” Bioscience, 60. ISSN 0006-3568, https://dx.doi.org/10.1525/bio.2010.60.5.2.
Mons B, van Haagen H, Chichester C, Hoen P, den Dunnen J, van Ommen G, van Mulligen E, Singh B, Hooft R, Roos M, Hammond J, Kiesel B, Giardine B, Velterop J, Groth P and Schultes E (2011). “The Value of Data.” Nature Genetics, 43. ISSN 1061-4036, https://dx.doi.org/10.1038/ng0411-281.
unknown u (2009). “Prepublication Data Sharing.” Nature, 461. ISSN 0028-0836, https://dx.doi.org/10.1038/461168a.
Tenopir C, Allard S, Douglass K, Aydinoglu A, Wu L, Read E, Manoff M, Frame M and Neylon C (2011). “Data Sharing by Scientists: Practices And Perceptions.” Plos One, 6. https://dx.doi.org/10.1371/journal.pone.0021101.
Page R (2010). “Enhanced Display of Scientific Articles Using Extended Metadata.” Web Semantics: Science, Services And Agents on The World Wide Web, 8. ISSN 15708268, https://dx.doi.org/10.1016/j.websem.2010.03.004.
Constable H, Guralnick R, Wieczorek J, Spencer C and Peterson A (2010). “Vertnet: A New Model For Biodiversity Data Sharing.” Plos Biology, 8. https://dx.doi.org/10.1371/journal.pbio.1000309.
MOORE A, Mcpeek M, RAUSHER M, RIESEBERG L and WHITLOCK M (2010). “The Need For Archiving Data in Evolutionary Biology.” Journal of Evolutionary Biology, 23. ISSN 1010061X, https://dx.doi.org/10.1111/j.1420-9101.2010.01937.x.
Whitlock M, McPeek M, Rausher M, Rieseberg L and Moore A (2010). “Data Archiving.” The American Naturalist, 175. ISSN 0003-0147, https://dx.doi.org/10.1086/650340.
Whitlock M (2011). “Data Archiving in Ecology And Evolution: Best Practices.” Trends in Ecology & Evolution, 26. ISSN 01695347, https://dx.doi.org/10.1016/j.tree.2010.11.006.