Helping science succeed
Helping science succeed

Report from the Open Impacts Workgroup

Download

Abstract / OSI2016 Workgroup Question

This report summarizes the discussion of the Open Impacts workgroup held at the Open Scholarship Initiative meeting in Fairfax, VA, April 19-22, 2016. The workgroup tackled the following questions at OSI2016: How fast is open access growing? Is this fast enough? Why or why not? What are the impacts of currently evolving open systems? For instance, are over­all costs being reduced for scholarly libraries? Is global access to scholarly information increasing? What about in the Global South? What is the impact in this region of increasing adoption of the author-pays system?

Introduction

Society would be better off if the future were one of 100 percent open scholarship, including immediate open access to publica­tions and to data. That hope, which is shared by most, must be allowed to shape how we act today if we are going to achieve such ambition. We must feel empowered to create change, but such change will require setting targets and developing the strategies, studies, and reflection to get there. A house cannot be built without first understanding the soil that it will rest upon and how the environ­ment will be impacted.

Foundational, then, is that we have accu­rate and ample data in order to support the lofty goals of open scholarship. We believe such data is necessary to under­stand the effects that any strategies and targets for open scholarship will have upon all stakeholders.

Each discipline and stakeholder will have its own idiosyncratic needs and reflexes, thus there must be a specific data-gath­ering approach by and for each field to understand the impact of moving toward open scholarship.

To create the open impacts measures that we believe are needed to achieve such change, we wish to target reward and evalua­tion systems, the transparency of academic publishing costs, support for new scholarly communication mediums, and improved text and data mining.

There are already numerous case studies on these issues. We lack milestones, goals, and widely accepted routine measures. In short, we need to know where we’ve been and where we are in order to go where we want to be.

We know the “why” of data capturing, but to arrive at the who, how, and what re­quires study and debate between and among all stakeholders. Therefore, we have aimed to set forth a launch pad with this document for others to build upon in determining the impact(s) of becoming open. We set out three high-level areas in a framework to create an agenda for going forward: measuring openness, measuring the use of open, and understanding the economic impacts of open. Finally, we conclude with a target to fill the knowledge gaps.

As the initial workgroup for “Open Im­pacts,” we have gone in circles, found our way out again, only to then paint ourselves back into a corner once or twice. A look back on where we’ve already been may be helpful to future groups looking to extend this work.

The initial question we asked ourselves was “what should we be working on or solving?” Preliminary discussions of the workgroup’s remit fell into four loosely defined categories:

 

Table 1: Proposed categories discussed for what should be the initial workgroup’s remit and outputs

Remit Outputs
We should set spe­cific targets
  • By [X year] scholars should be able to incorporate 2x as much research in their work as they do now.
  • Businesses and jobs based on access to scholarly literature should measurably increase year on year.
  • Discipline by discipline we should arrive at a point where a broad consensus exists by [X year] that scholars and institu­tions anywhere in world can get by with only what’s available for free on the Internet.
  • By [X year] at least 60% of academic libraries have “flipped” from pay-wall to open access.
  • The majority of scholarly output will be allowed to be ma­chine-readable by 2020.
We should define the stakeholders & impacts
  • Map out who the stakeholders are and explore the kinds of impacts that should be better understood for each of those groups.
  • Is there a set of measures and recommendations we can come to that might apply to all stakeholders?
We should gather existing data
  • What % of literature is currently open access?
  • What are the costs of open scholarship?
  • Which fields are already open enough?
  • What are the impacts of openness on libraries, the public, current systems, etc.?
  • Where and how can we get the data in a con­sistent/systematic way to know where we are and how to measure progress?
We should deter­mine which data to capture
  • Desire for good measures.
  • Approaches to measurements.
  • How do we measure progress in open scholarship?

 

Given our short timeframe, we deter­mined in the end that specific targets (Table 1) related to open scholarship were out of scope for our workgroup. We also concluded that an attempt to gather exist­ing data within the workgroup’s limited hours would be of minimal value. Such an exercise could, however, be part of a fu­ture workgroup or more likely a task for organizations created or enlisted and de­voted to such.

Similarly, even defining who all of the stakeholders are and the impacts upon them seemed to be both too granular and too expansive for an initial workgroup gathering. It seemed neither realistic nor productive to develop consensus on measures since we were working in the abstract. We will not be able to come up with a concrete plan for measurement without a budget, timeline and other real world constraints.

Therefore, we decided to focus on devel­oping a general framework for understanding what data to gather on the impacts of open scholarship.

Three areas for an Open Impacts framework

The team ended up with three different high-level areas of foci to create the agenda for understanding open impact (Fig. 1):

  1. Measuring openness
  2. Utilization measures
  3. Understanding economic impacts of open

Figure 1: Three areas of the Open Impact framework

open-impacts-graphic

Measuring openness is a near-term oppor­tunity to establish key baseline measures for how open scholarship is deployed in different fields and at different institu­tions. It would also be useful for policy makers (e.g. funders, institutions) to have insight into the effectiveness of an open access policy. Finally, such measures can help all stakeholders (publishers, funders, scientists, institutions, etc.) measure their progress towards openness in their field.

Measuring Openness

To determine how open scholarship may be for particular fields we found it helpful to define the possible measures and group them by domain (Table 2). Challenges for future workgroups abound:

  • There is a need to firm up common measures for what will be counted and how. This will require stakeholder con­sensus.
  • Who will do this work? Existing pro­jects need to be coordinated on a global scale, resources secured, data col­lected and curated, and so on.
  • Which incentives can be recognized and developed to drive participation?
  • How will any pilot projects scale?

 

Table 2: Measuring Openness: metrics devise to address openness

Products License measures Availability measures Permanence measures Format measures
  • Articles
  • Monographs
  • Data
  • Software, etc.
  • Creative Commons
  • Free to read
  • Free to mine
  • Embargoed and embargo length
  • Pay-walled
Different ways to meas­ure availability [needs more research]. Exam­ples:

  • Metadata quality,
  • Discoverability,
  • Crawling
  • Machine readability
  • Links to other re­sources
  • Public access to usage data
Official certification

  • Yes
  • No
  • No but committed to long-term preser­vation
Per file formats (e.g., PDF, PDF-A, HTML, embedded figures, tables, csv, xls, json, xml)

 

Recommendation: Work and organiza­tions already exist, that can be leveraged to coordinate this work. Examples in­clude, but are in no way limited to VIVO, SHARE Notify, OpenAIRE, SPARC, ORCiD, CrossRef, and USUS.

Our recommendation in this area is to create an “Openness Score” based upon Table 2. This score could apply to specific research objects, aggregated by object to disciplines, funders, institutions, etc. An Openness Score Summit could be orga­nized. Such a summit would involve all stakeholders, including those collecting data. This meeting’s initial focus would be to establish a plan for developing the infra­structure, funding, and sustainability needed. It would also focus on aligning with existing infrastructures. The key out­put should be a strategic, organizational, and technical plan for how the data above would be collected, organized, coordi­nated and managed.

Utilization measures

This is the “who” and “what” for the vari­ous types of use. Leveraging the available information, a key consideration is whether a common approach can be estab­lished. We defined categories of current uses of open scholarship and a limited set of possible metrics for each (Table 3).

 

Table 3: Utilization measures

Current use Possible Metrics
To grow global and national economies New job sector growth, invention-to-prod­uct time
To increase literature access Page reads, downloads, citations, derivative works (translations, reuse of figures), num­ber of publications, machine usage, time on site, altmetrics, geo-location, etc.
To engage a broader audience Page reads (number, distribution, reader ID), downloads, references in non-academic sources, increased diversity in readers, crowd sourcing/funding
To gain access to more data Access frequency, distribution, user ID, citations, crowd sourcing/funding.
To support education (K-12, higher ed) Access by learners, references in educational material, programs, lesson plans, syllabi
To accelerate innovation Use in patent applications, number of pa­tents arising, investment growth
To facilitate archiving and curation Frequency of use, growth of archives

 

Recommendation: We were split in our recommendation. Some delegates sug­gested that several working groups should be created in order to tackle this effort in coordination. Such working groups could include the National Information Stand­ards Organization (NISO) and the Research Data Alliance (RDA) to propose standards for open access usage metrics to be adopted by the community at large that would include published material provided via publishers, content in institutional repositories, domain specific repositories, and funder repositories like PubMed, and other sources of research output. Follow­ing this path we would strongly encourage that any U.S. initiative be aligned with existing initiatives: Impact Story for alt­metrics, IRUS UK, OpenAIRE analytics, DE OA Statistik, and COAR working group on repository usage stats.

Other delegates would prefer to entrust others to develop the metrics we need rather than going through the extra step of creating standards for metrics first.

Measuring Economic Impact

What are the economic impacts of open Scholarship, and what is considered to be worth paying to provide access in the long term (i.e., to preserve)? Much like the early discussion of the workgroup’s scope, many concrete measurements were consid­ered: increased economic growth, democratization of innovation, increase in cross domain research, change of peer-review, disruption to the academic publish­ing industry, job placement at universities, institutional ranking, etc.

Recommendation: We agreed that eco­nomic impact is too broad to address through specific measures and needs to be analyzed via ongoing inquiries. We recom­mend that one or more funders explore this domain in the areas listed above and more. We recommend that as part of this effort the Open Scholarship Initiative (OSI) establish a research agenda for open scholarship if an existing program does not yet exist somewhere else in the schol­arly community. Subsequent OSI meetings could include a program aiming to address the economic impact. Finally, we suggest gathering a diverse stakeholder group to define reasonable metrics for the publishing industry, as well as identify the socioeconomic impacts of open access policies.

OSI2016 Open Impacts Workgroup

  • John Dove, library and publishing consultant
  • Jean-Gabriel Bankier, President, bepress
  • Jason Hoyt, CEO, PeerJ
  • Rebecca Kennison, Principal, K|N Consultants
  • Natalia Manola, Director, OpenAIRE
  • Trevor Owens, Senior Program Officer, Institute of Museum and Library Services (IMLS)
  • Christopher Thomas, Administrator, Defense Technical Information Center (DTIC), US Department of Defense
  • Jack Schultz, Director, Christopher S. Bond Life Science Center, University of Missouri
  • Neil Thakur, Special Assistant to the Deputy Director for Extramural Research, NIH, and program manager for the NIH Public Access Policy