In between the mega macro level of the universe of scholarly communication and the micro level of an individual scholar's interests and projects lie a number of mediating layers of culture and context, and in these the European perspective differs somewhat from that in North America. European governmental subsidization of higher education has a number of exceptionally positive effects on society and education, not least among them the leveling of the financial playing field for access to university study. It also, however, creates a system of administration and oversight that transcends not just the individual department or disciplinary unit but all of the institutions in the country. The need to distribute resources “fairly” creates the need to measure and compare outputs, not just between the sciences and the humanities, but between large and small universities, rural and urban centers, and research versus teaching-focused institutions. There is open recognition that institutional missions differ, that disciplinary norms of quality and impact differ, and that the presence or absence of quantifiable and comparable output measures of a scholarly communications ecosystem varies widely between the disciplines. Yet at the end of the day a certain exact number of Euros or Pounds must be allocated to each player in the system, and while there is no good way of making this determination, there are lots and lots of bad ones.
Although Eugene Garfield, the father of such approaches to scientific impact assessment as citation analysis and journal impact factors, was US-based, it is not surprising that his ideas and inventions have been embraced by European government departments of education and research councils alike. Tools such as the “H-Index” provide an elegantly simple way of saying precisely where an individual scholar sits in the hierarchy of production, placing a simple numeric value on her contribution to her field, which can easily be compared with the numeric values of her colleagues and competitors. The problem is that outside of a few relatively restricted areas, these simple indices are not representative of any accepted definition of either the quality or the impact of the work in question. The fact that much of the bibliometrics system is built upon citation analysis, and that citation analysis is neither supported to the same level by such database tools as Thompson Reuters ISI or Elsevier’s Scopus for all areas, nor is as relevant for all areas as for, e.g., particle physics, means that our lovely simple system of numeric indices ranking scholarly work is as dodgy a basis for funding allocation across a national system as is the political connection of one’s local minister of state or, for that matter, a dart board.
The arts and humanities have largely stayed on the margins of this debate. In fact, when the United Kingdom decided to scrap its hugely influential and much maligned Research Assessment Exercise (RAE) in favor of a metrics-based system, a special expert group had to be convened to determine how the arts and humanities would be treated. This group came up with a very long list of possible forms of research output that could be measured (although the report did not always define or specify how one might capture these measurements) and determined that a basket of these measures, including a “peer review lite” approach, could be chosen closer in time to the first round of the new assessment exercise in 2013. This report was largely an admission of failure in the face of a task for which the system had no easy or accessible tools.
This is not to say that there have not been attempts to develop frameworks to enable a more systematic quality assessment of research outputs in the arts and humanities. The European Science Foundation funded the development of a project called the European Reference Index for the Humanities. The ERIH was intended to provide a relatively uncomplicated impact mark for the journal in which an article was published on a discipline-by-discipline basis. In spite of the project leaders’ insistence that the A, B, and C rankings given to journals were intended to indicate their degree of internationalization and profile, not quality (whatever that is), the use of the simple letter grading became a source of great controversy and discontent with the project. The real threat this index posed to smaller publishers and national journals, which were very appropriate places for certain kinds of research to appear, resulted in the project being largely abandoned after its initial development funding ran out.
But even if ERIH had been embraced by the community, what then? It didn’t include the publishers, so had no capacity to aid in the evaluation of monographs, chapters, editions, or articles in edited collections. It was a misleading, unauthoritative tool, and although the scholarly community reviled the project, there were still enough cases seen of research councils and institutions attempting to use it that these bodies’ desire for some easy way of making even an initial judgment of quality or impact was clearly perceived.
This extended anecdote is intended to illustrate an environment that holds both opportunities and threats for the development of scholarly communications in Europe. Institutional autonomy is low, and the pressure to contribute to the system in certain particular ways is very real. “Is it RAE-able” was the question that always used to be asked about potential project outputs in the UK, and it is doubtful that the question will change much under the new system; only the terminology used to describe acceptability to the higher powers will shift. On the other side, however, the existence of national systems with gatekeepers who are unsure how to evaluate scholarship most effectively also means that these gatekeepers may also be willing to work with their community to define reasonable metrics and approaches to assessment. If the traditional analog scholarly communications formats are to be transcended, then success will be guaranteed only if this transcendence can be translated into terms comparable with those applied to the known scholarly output forms. There have been calls for years for the creation of “kite markers” and other ways of recognizing the scholarly contribution made by non-traditional output forms. Certainly in Europe, one has the sense that if the scholarly community took the lead in establishing norms and measures for best practice where these norms don't already exist, then their lead would be both appreciated and followed.
If the opportunities are so clear, then why are the responses so unsatisfactory, as with the case of ERIH? It is easy to point the finger at the faults in the tried and true system of peer review as the source of difficulties. Yes, it is a system that is difficult to manage and expensive to maintain, but the glory of Garfield’s citation analysis is that it manages to capture peer review happening, as it were, in the wild, at that point when another scholar references a piece of work as influential for his own, thereby making explicit a judgment call about the importance of another scholar’s work. Surely the digital humanities community can find a similar juncture, more subtle than hit counts but more easily captured than full-blown reviews, because the community of scholars should be able to agree upon what constitutes a quality output, whatever its format.
Unfortunately, it appears that we cannot do this even in the analog realm. A recent workshop held in conjunction with the 2010 Modern Language Association Conference in assessing digital humanities scholarship created in non-traditional formats (thematic research collections, e-lit, e-scholarly editions, etc) suggested the opposite. Over thirty academics, from Department Chairs to graduate students, attended. The workshop was developed around a series of case studies. The first case study was an e-edition of the work of a little-known poet. While the edition was replete with contextual information, scholarly apparatus, and technical background, the first comment by a workshop participant was that this was not scholarship, it was service. The creation of a scholarly edition was a service activity, not a scholarly one regardless of the medium of presentation. The workshop facilitators immediately realized that the battle lines were far from fixed and that having a work in digital form only served to reinforce certain existing prejudices rather than allow for a widened scholarly horizon.
Given such challenges, is it possible we could focus, writ large, on developing new modes of peer review for digital production, as occurs within the NINES community? If so, can we convince institutions and funding agencies to accept the results of these modes? The answers to both of these questions surely must be yes, but the ever-broadening focus of European digital humanities scholars has not yet come to rest on these most political and procedural issues within the ecosystem of scholarly communications and production. The problem may be deeper entrenched than we expect, but doesn't that also mean that the opportunities for real innovation are greater?
Many of the reasons for the disagreements among us are cultural, but others may have to do with the nature of the resources themselves, in particular when we shift our focus back purely to the digital realm. For it is here that we find ourselves yet again handling that old chestnut crossed with a hot potato, sustainability. If we do not yet “trust” digital resources the way we do analog ones, at least part of the reason for this is that we have not had them around for hundreds of years, and do not yet feel secure that we will have them around for many years or decades more. How can we validate a scholarly career on the basis of something that may be perceived as ephemeral?





