Open data

From HLWIKI Canada
"If people put data on the web – government data, scientific data, community data – it will be used by other people to do wonderful things in ways that they could never have imagined." – Sir Tim Berners-Lee

Last Update

  • 7 November 2014

Introduction

See also Data glossary | Data management portal | Finding medical / health care statistics online | Metadata | Open source | Semantic web

"Access to data is fundamental if researchers are to reproduce, verify and build on results that are reported in the literature … The presumption must be that, unless there is a strong reason otherwise, data should be fully disclosed and made publicly available. In line with this principle, where possible, data associated with all publicly funded research should be made widely and freely available..." – Walport, 2011, in The Lancet

Open data is a 21st-century concept referring to data that is open and free to (re)use and examine without barriers or restrictions due to copyright, patents or other controls. According to OpenDefinition.org, open data is "...data that can be freely used, reused and redistributed by anyone", subject at most to the requirement to attribute and share alike, conditions familiar from the Creative Commons movement. (See the Open Data Commons Attribution License.) The Open Knowledge Foundation has produced a helpful Open Data Handbook that introduces the legal, social and technical aspects of open data.

Open data is similar to other information trends that have emerged in the Internet era, and is related philosophically to open source and open access. Some health librarians have started to participate in discussions about data through their involvement in e-Science and data curation. An associated discourse within the open data movement is the transparency and accountability that come from making clinical datasets widely available; this is particularly important in government-funded research (e.g., research funded by CIHR or NIH). A vast amount of data produced by governments (see also civic media), researchers and universities is not shared or made easily accessible on the web. Where does this data go? There is a movement to make public institutions' data more widely available and to allow researchers to remix, reuse and repurpose it. For a more open exchange of data to occur, data warehouses need to be made accessible to everyone.

  • In 2013, it was announced that Wikidata, an offshoot of Wikipedia that serves as a centralized repository for data and facts, now feeds information to Wikipedia. For clinicians interested in tracking down "missing data", see Missing Data UK.

What is Open Data?

Open data is both a philosophical orientation and a practice that seeks to make data available for free (in machine-readable formats) to all without restriction. Open data formats run the gamut from MS Excel files to extremely large bioinformatics datasets. In being free and open, open data is similar conceptually to open access and other open movements. Open data is concerned with making data more freely accessible, especially where this might lead to innovation but also to more transparency in government; it is linked to social technologies in the cloud (e.g. social media) and to the semantic web. To be truly open, data needs to be present on platforms with no restrictions, controls or barriers.

Academic libraries can perform some archival functions by curating data and by making the metadata in their catalogues free and open. Future roles for librarians include explaining these trends. Open research data refers to information that results from conducting research such as clinical trials, and may include datasets, microarray data, numerical data, textual records, images or multimedia. Such data is critical for researchers because it helps to validate findings, observations and hypotheses. Making data open encourages knowledge production, which is recognized as critical to solving society's most difficult problems.
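To illustrate what "machine-readable formats" means in practice, here is a minimal Python sketch. It assumes a small, hypothetical CSV dataset (the column names `region`, `year` and `clinic_visits` and all the figures are invented for illustration and do not come from any real portal). The point is only that data published in a structured format can be reused by a program without manual re-keying.

```python
import csv
import io

# Hypothetical open dataset, inlined so the example is self-contained.
# A real dataset would be downloaded from an open data portal instead.
SAMPLE_CSV = """region,year,clinic_visits
North,2013,1200
South,2013,950
North,2014,1310
South,2014,1005
"""


def total_by_year(csv_text):
    """Sum the clinic_visits column for each year in a CSV string."""
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        year = int(row["year"])
        totals[year] = totals.get(year, 0) + int(row["clinic_visits"])
    return totals


if __name__ == "__main__":
    # Print the yearly totals computed from the sample data.
    print(total_by_year(SAMPLE_CSV))
```

The same few lines would not work on a scanned table or a PDF report, which is one reason open data advocates stress structured formats.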

Other definitions

Creative Commons plays a key role in promoting openness in science. Events such as this one in Auckland demonstrate the concern about open science that the community shares with Creative Commons.

Pros & cons

  • open data may be complex, hard to understand, and packaged in a way that makes it inaccessible
  • there are few good apps to find and assemble data
  • there are no incentives and ecosystems to make data usage practical and sustainable
  • some questions, such as "who updates the data?" and "what can they really use it for?", go unanswered
  • open data is a great trend in open access; visuals are cool, statements are bold, but what's in it for information people?
  • how does open data help us do our work more efficiently?

Select open data projects & initiatives

4 stars denote librarian-selected, high-quality information. Starred sites are great places to begin your research.
  • Databib is a tool for helping people identify and locate online repositories of research data
  • DataCite Canada's services are offered in cooperation with DataCite, an international consortium of national-scale libraries and research organizations committed to increasing access to research data on the Internet
  • DataCite Canada is DataCite's DOI allocation agent for Canada
  • DataCite promotes the value of data archiving, citation and discoverability within Canada
  • Research Data Canada is a collaborative effort to address the challenges and issues surrounding access to and preservation of data arising from Canadian research. This multi-disciplinary group of universities, institutes, libraries, granting agencies and individual researchers shares a recognition of the pressing need to deal with Canadian data management issues from a national perspective.

Select examples of other open data

See also Open access

Canadian perspective

Health Data: Pushing the Boundaries was the theme of the Health Data Users conference, which was sponsored by Statistics Canada and the Canadian Institute for Health Information (CIHI). The conference showcased the breadth and depth of health data for researchers, planners, academics and decision-makers, and provided opportunities to share information and expand knowledge. Its program consisted of two tracks (Methods and techniques / Data informing decisions), each targeting a different perspective on using health data.
