|Are you interested in contributing to HLWIKI International? contact
To browse other articles on a range of HSL topics, see the A-Z index.
- This entry is out of date, and will not be updated, July 2017
See also Bioinformatics | Data management portal | Data visualization | e-Science | Open data | Research Portal for Academic Librarians | Semantic web | Text-mining
International projects & websites
4 stars denotes librarian-selected, high quality information. Starred sites are great places to begin your research.
Managing data is central to health care
- CSAIL looks at the issue of big data as "fundamentally multi-disciplinary"; the MIT team includes faculty and researchers across related technology areas, including algorithms, architecture, data management, machine learning, privacy and security, user interfaces, and visualization; as well as domain experts in finance, medical, smart infrastructure, education and science
- RDCs (Research data centres) provide short descriptions and details about data sets available at the RDC. The program provides analytical and methodological research tools to assist researchers.
- Databib is a tool for helping people identify and locate online repositories of research data
- helping you to find, access and use data
- DataCite Canada's services are offered in cooperation with DataCite, an international consortium of national-scale libraries and research organizations committed to increasing access to research data on the Internet
- DataCite Canada is DataCite's DOI allocation agent for Canada
- DataCite promotes the value of data archiving, citation and discoverability within Canada
- table lists NIH-supported data repositories that accept submissions of appropriate data from NIH-funded investigators (and others). Also included are resources that aggregate information about biomedical data and information sharing systems
- DMPTool adheres to National Institutes of Health (NIH) data sharing requirements
- DMPTool provides step-by-step guidance to help users create ready-to-use data management plans and meet funder data management requirements. While anyone can create an account and use this resource, many institutions have partnered with the DMPTool to allow login through their home institution, and, in some cases have provided customized help and support
- Dryad is an international repository of data underlying peer-reviewed articles in the basic and applied biosciences. Dryad enables scientists to validate published findings, explore new analysis methodologies, repurpose data for research questions unanticipated by the original authors, and perform synthetic studies. Dryad also aims to make data archiving as simple as possible via a suite of services not necessarily provided by publishers or institutional websites.
- a collaborative project devoted to educating science and medical librarians on e-Science, the portal was initiated at the University of Massachusetts Medical School through funding from the National Network of Libraries of Medicine
- a vision of pioneering computer scientist Jim Gray for a new fourth paradigm of discovery based on data-intensive science; this extensive monograph offers insights into how it can be fully realized
- U.S. federal government initiatives to make data more accessible for monitoring, assessment and policy development
- access to high quality data improves understanding of a community’s health status and determinants
- provide a single, user-friendly, source for national, state, and community health indicators
- minimize duplication of effort in provision of digital preservation training and education programmes
- describe, promote and contextualize current training and education offerings
- identify and exploit collaborative training and education opportunities
- maximize inter-disciplinary training and education opportunities
- develop a shared digital preservation training infrastructure to enable reuse of training and education materials
- ensure synergy and complementarity between emerging curation and preservation education programmes with professional development training courses
- a research and teaching unit at Harvard University dedicated to exploring and expanding the frontiers of networked culture in the arts and humanities
- a social web site for researchers sharing research objects such as scientific workflows
- aims to solve name ambiguity problem in scholarly communications by creating a registry of persistent unique identifiers for individual researchers and an open and transparent linking mechanism between ORCID, other ID schemes, and research objects such as publications, grants, and patents
- aimed at helping researchers share biomedical data and models; PhysiomeSpace has just completed its beta implementation and is open to users
- centralized, standards compliant, public repository for proteomics data; developed to provide proteomics community with a repository for protein and peptide identification with evidence supporting it; details of post-translational modifications coordinated relative to peptides in which they have been found also
- an excellent pathfinder at Tulane University for American public health data sets
- selective links representing a sample of available information. Items are selected for their quality, authority of authorship, uniqueness, and appropriateness.
- Need to create a data plan for a grant proposal? Find out what to include & see examples.
- Wolfram Alpha provides access to a world of factual data, without searching, calling itself the first computational knowledge engine. On the web, there is increased emphasis on repositories of data maintained by national or international agencies, organizations and individuals. Wolfram Alpha now hosts the Wolfram Data Summit to bring together those responsible for data repositories and to develop innovative concepts for the future.
- provide all users with improved access to World Bank data and to make that data easy to find and use
Data Information Literacy at Purdue
In partnership with librarians at the University of Minnesota, University of Oregon and Cornell University, the Purdue University Libraries received $250,000 from IMLS to develop programs for the next generation of scientists to enable them to find, organize and share data. The program is intended for graduate students in science working their way toward careers as research scientists. In 2012, technology makes it easier to share research data beyond the lab. In many cases, data is not administered in ways that enable it to be easily discovered, understood, or re-purposed by others. This training is vital to scientists as they look to secure research funding. The National Science Foundation issued a report in 2007 on the need to build public collections of research data; since 2011, it has required scientists to include data management plans in their grant applications.
The Data Information Literacy effort will be carried out over two-years by five teams. Two teams, consisting of a data librarian, subject librarian and faculty researcher, are based at Purdue, with one team each at the other institutions. Teams are constructed to represent various subjects from computer engineering to landscape architecture so commonalities and differences in data curation can be explored. Each team will conduct an assessment of data needs for their discipline, including interviewing and observing researchers. Teams will develop and implement targeted instruction and assess the impact of that instruction in developing the data information literacy skills of graduate students.
More information on the data information literacy project is available at http://wiki.lib.purdue.edu/display/ste
See also Indiana University-Purdue University Indianapolis. Data Services Program
Data storage costs and data curation in libraries
- Purdue’s pricing: https://purr.purdue.edu/about/pricing
- Princeton’s pricing: http://dataspace.princeton.edu/jspui/about/DataSpacePnG.pdf
- The 4C project announced the beta version of the Curation Costs Exchange (CCEx) website. CCEx is an online community platform for the exchange of curation cost information. The goal is to help organizations make smarter investments in digital curation by enabling knowledge transfer and cost comparisons between organizations of all types. The value of the project will depend on the willingness to share cost data and on benefits that sharing will bring about. CCEx is a crowd-sourced database and library of curation cost information. It uses costs data to provide automatic generation of results for self-assessment, cost comparisons with peers and insights into the financial accounting and activity of other organizations. 4C Project’s vision is to create a better understanding of digital curation costs through collaboration.
- ACRL Academic Libraries and Research Data Services: current practices and plans for the future. An ACRL White Paper, 2012. Carol Tenopir, Ben Birch, Suzie Allard.
- ALA Connect. The fourth paradigm: data-intensive research, digital scholarship and implications for libraries
- Bailey CW. Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works, 2012.
- Baker K, Yarmey L. Data stewardship: environmental data curation and a web-of-repositories. Int J Digital Curation. 2009;4(2).
- Ball A. Review of the state of the art of the digital curation of research data. ERIM Project. University of Bath; 2010.
- Beagrie N. Digital preservation: setting the course for a decade of change. 2007.
- Canadian Association of Research Libraries. Research data: unseen opportunities an awareness toolkit commissioned by CARL; 2009.
- Cech T. Sharing publication-related data and materials: responsibilities of authorship in the life sciences. Washington, DC: National Academies Press; 2003.
- Cox A, Verbaan E, Sen B. Upskilling liaison librarians for research data management. Ariadne. 2012;70.
- Cragin MH, Palmer CL, Heidorn PB, Smith LC. An educational program on data curation. American Library Assocation, Science and Technology Section; 2007.
- Cragin MH, Palmer CL, Heidorn PB. Extending the data curation curriculum to practicing LIS professionals. DigCCurr2009: Digital Curation: Practice, Promise & Prospects; 2009.
- De Roure D, Goble C, Aleksejevs S. The myExperiment Open Repository for scientific workflows. Open Repositories. 2009.
- Delserone LM. At the watershed: preparing for research data management and stewardship at the University of Minnesota Libraries. Library Trends. 2008;57(2):202–210.
- Giarlo MJ. Academic libraries as data quality hubs. J Libr Scholarly Commun. 2013;1(3):eP1059.
- Gore SA. e-Science and data management resources on the web. Med Ref Serv Q. 2011;30(2):167–77.
- Heidorn PB. The emerging role of libraries in data curation and e-science. J Libr Admin. 2011;51(7-8):662–672.
- Hey T, Tansley S, Tolle K. The fourth paradigm: data-intensive scientific discovery. Microsoft Research. Redmond, Washington, 2009.
- Humphrey C. Preserving research data: a time for action. In: Canadian Conservation Institute. Preservation of electronic records: new knowledge and decisionmaking. Ottawa, ON: 2004.
- Interagency Working Group on Digital Data. Science of the National Science and Technology Council. Harnessing the Power of Digital Data for Science and Society. Washington, DC; 2009.
- Lewis SC, Rodrigo Z, Hermida A. Content analysis in an era of big data: a hybrid approach to computational and manual methods. J Broadcast Elec Media. 2013;57(1):34-52.
- LIBER Working Group. Ten recommendations for libraries to get started with research data management. Final Report on E-Science, 2012.
- Mallon M. Data curation. Public Services Q. 2012;8(4) :326-337.
- Martin R. What do data services librarians do? J eSci Libr. 2012;1(3):Article 3.
- Miller HE. Big-data in cloud computing: a taxonomy of risks. Info Res. 2013;18(1):paper 571.
- Ohno-Machado L. A hybrid open-access model to bridge the publishing divide and reach out to a broader community. JAMIA. 2011;18(3):210–1.
- Piwowar HA, Day RS, Fridsma DB. Sharing detailed research data is associated with increased citation rate. PLoS ONE. 2007;2(3):e3082.
- Rani M, Buckley BS. Systematic archiving and access to health research data: rationale, current status and way forward. Bull World Health Organ. 2012;90:932–939.
- Rothenberg J. Ensuring the longevity of digital documents. Sci Am. 1995;272(1):24-29.
- Rusbridge C. Tomorrow, and tomorrow, and tomorrow: poor players on the digital curation stage. In: Digital convergence – libraries of the future. London: Springer; 2008.
- Scaramozzino JM, Ramirez M, McGaughey K. Managing the data deluge: understanding scientists' need for data curation services; 2010.
- Research Data Strategy Working Group. Stewardship of research data in Canada: a gap analysis; 2008.
- Ross JS, Krumholz HM. Ushering in a New Era of Open Science Through Data Sharing. JAMA. 2013;():1-2.
- Simons N. Implementing DOIs for research data. D-Lib Magazine. May/June 2012;18(5/6).
- Stahl-Timmins W. Information graphics in health technology assessment. PhD thesis, University of Exeter, UK. 2011.
- Stuart D. Programming skills could transform librarians' roles. Research Information; 2010.
- Tenopir C, Sandusky RJ, Allard S, Birch B. Academic librarians and research data services: preparation and attitudes IFLA Journal. March 2013;39:70-78.
- Walters TO. Data curation program development in US universities: the Georgia Institute of Technology example. Int J Digital Curation. 2009;4(3):83–92.