How is traditional physical infrastructure paradigmatic for defining and turning to cyberinfrastructure? We are used to considering roads, sewers, and power as infrastructure and we have expectations about their funding and maintenance; how do those known types of infrastructure turn our imagination when it comes to virtual infrastructure?
Connections Between, as in Roads
Visible infrastructure like roads that make modern life possible seem an obvious candidate paradigm for what information infrastructure should be. When we buy a house, we expect the state to create and maintain the infrastructure that connects our house to others so we can walk or drive to and fro. We use the roads to move bits of physical stuff, including ourselves from our space to work, play and others, over roads. That’s what we pay taxes for.
By analogy, the Internet would seem to be a road system for moving virtual stuff, comparable to what we use for the physical stuff. Nicholas Negroponte in being digital played with this switch from atoms to bits. That the Internet was called the “Information Superhighway” in the 1980s and 90s developed the analogy. Al Gore is supposed to have promoted the view that just as his father, former US Senator Albert Gore, promoted the development of an interstate highway system, so he was a champion of the new Information Superhighway that would benefit commerce, education and communication. His signature initiative was, not surprisingly, called the “National Information Infrastructure.” Road jargon has woven itself thoroughly into network jargon with terms like “traffic” and “onramp.” The very visible and useful road system made a perfect analogy for explaining the invisible Internet and its need in the modern state to make connections. Who, after all, could imagine a state without roads?
Applying this infrastructure paradigm to research, we can see how infrastructure is the connective tissue maintained to allow us to collaborate and exchange information. Just as roads are absolutely necessary for movement and economic development, so are information highways needed for virtual movement and electronic business. You can also see where the articulation between infrastructure and research computing is located. The infrastructure connects researchers and other entities like corporations and governments. Anything that is needed to connect more than one person, project, or entity is infrastructure. Anything used exclusively by a project is not.
It is worth pointing out an important difference and that is that the Internet is, in fact, not run like the highway system. Governments do not maintain the Internet, though they regulate it; and despite all the talk about how it is a system designed to bypass interruption, certain larger ISPs like Cogent can effectively block others, leading to blackouts such as when, in March 2008, parts of Canada could not access parts of Sweden thanks to a commercial dispute between Cogent and Telia.8
Utilities that Service, as in Power
We also tend to think of less visible services like electricity and water as infrastructure. Obviously the transmission lines, the water mains, and the sewers are infrastructure in the sense of what is between, but they are not between individuals. They are between us and service providers and generally include the generating stations and the sewage plants. The infrastructure really is the service that provides electricity, provides water and disposes our waste.
While we can’t imagine civilization without roads of some sort, we can imagine alternatives to government provision of services like power and water. Roads by definition have to be shared to connect. Utilities don’t need to be shared and are thus more likely to be privatized, though there still seems to be an obvious efficiency in maintaining one power grid and one water/sewage system. We could each dispose of our own sewage, generate our own electricity and get our own water, but these functions are better provided as services on a large and efficient scale. The question with such utilities is at what scale and how should regional utilities interoperate.
Services as infrastructure serve as a second paradigm for research infrastructure, though a more complicated one. There are a number of computing services that we have come to expect as infrastructure beyond the provision of the physical Internet. There are the services like DNS that are needed to make the Internet work; there are the services like e-mail that work over the Internet that we have also come to expect; and there are services like digital libraries that are more efficiently provided centrally, but have not become expectations yet. We can think of a digital library or data service as a utility infrastructure that fuels research rather than as the virtual reflection of the Library as building. Just as our machines need electricity, so our minds need information. Just as library services are a form of expected research infrastructure, so digital library services make sense as research infrastructure.
Much of the turn towards cyberinfrastructure focuses on the development of these large information services. The age of small research projects developing scholarly electronic editions is passing. We can all see the value of shifting from individual editor-run projects maintaining information services to a model where research data services are managed as infrastructure with centralized providers and a professional staff dedicated to the infrastructure. That would let researchers move on, just as they do after publishing a monograph. We expect publishers and libraries to maintain our scholarship after the research, so why not have equivalent service infrastructure to maintain our virtual scholarship?
That we don’t have national digital research data/text archives or libraries despite the decades of development is one of the hurdles that might explain the turn to infrastructure, though I worry the time may have passed for such a utility, as it may be perceived as unnecessary given large-scale commercial services like Google Books. For a sustained discussion of digital library (DL) developments and infrastructure, see Carl Jay Lagoze’s dissertation on Lost Identity: The Assimilation of Digital Libraries into the Web.
A legitimate question to ponder is why the “imposition from above” model was successful in the context of the Internet, but not in DLs. A look at the history on the Internet reveals a key factor that initial deployment and ramp-up occurred within a tightly scoped community, academic institutions and (primarily defense-related) research labs. The infrastructure had a long percolation period in this context before its subsequent mass popularization. This is quite different than the DL infrastructure work, which from the beginning was motivated by visions of widespread grassroots dissemination inspired by scenarios such as that articulated by then Vice President Gore in his “schoolchild in Carthage, Tennessee plugs into the Library of Congress” speeches (http://www.ibiblio.org/icky/speech2.html). 9
Organizations that Run, like Governments
When we look closely at civic infrastructure, we see that the physical infrastructure and service infrastructure are dependent on organizations for maintenance and operation. In fact, if it is important that infrastructure last and be open, then the organization that maintains it is more important than the item itself. A good organization that builds and maintains bridges is more important than any one bridge. A bridge might be built, but it won’t be safe to cross if there aren’t regular safety checks and engineering support. It thus follows that good infrastructure includes the management, staffing, ongoing budgets, and support equipment that keep it all working. If we think of the Library as a traditional form of research infrastructure, we can see the importance of professional organization. The buildings and the books are important, but the Library can’t work as infrastructure without professional staff organized and funded to maintain services.
That said, describing organizations as infrastructure seems to push the definition. We tend to think of infrastructure as what you can touch and use, not the maintenance organization. One can see this in the ongoing politics of physical infrastructure renewal which are stable entertainment for those interested in municipal politics and stimulus packages. On a regular basis there are calls for infrastructure renewal like the dramatic and “hard-hitting” 1983 America In Ruins which has the ruins of a Roman forum on the cover. The cover says it all: the American Empire will fall apart as the Roman one did if there isn’t the political will to invest in infrastructure renewal. The report, while documenting the state of national infrastructure in the US, starts mostly with political recommendations to create the sustained organization and attention needed.
We might ask why calls for renewal are needed? The reason is that funding bodies like to build new infrastructure, but don’t like to budget for its ongoing maintenance. What funders can see is appreciated; maintaining infrastructure that is so expected that it becomes transparent is a thankless job. Funding new stuff looks progressive; maintaining infrastructure doesn’t impress.
Sustainability and governance is likewise an issue for cyberinfrastructure. When applying to CFI it is actually not the researchers who apply, but the universities that apply (with a researcher as a project leader). CFI requests ongoing maintenance plans, expects the university to take ownership, and does provide some additional funding, though most feel it is not enough. Edwards et al., in a must-read report that came out of a workshop bringing historians and social scientists to bear on CI, argue the importance of the social to infrastructure:
It is also possible that a tech-centered approach to the challenge of data sharing inclines us toward failure from the beginning, because it leaves untouched underlying questions of incentives, organization, and culture that have in fact always structured the nature and viability of distributed scientific work. Questions of trust loom large here, and run both ways. (Understanding Infrastructure: Dynamics, Tensions, and Design, p. 32)
It is therefore important to think of infrastructure realistically as some mix of hard visible components, softer services, and professionals that operate and maintain the two. I suspect that the largest part of the costs for the cyberinfrastructure proposed for the humanities will go to people, not hardware or buying services. This is despite the perception that when you invest in infrastructure you are buying the hard stuff, like roads.
Policies that Make Interchange, like Standards
Digging another level down, one finds that essential to certain types of cyberinfrastructure are the standards, policies, and procedures that allow us to run the infrastructure. It matters that electricity is provided at a standard and advertised voltage. Governments have zoning laws, policies, and procedures for handling construction both of the infrastructure they will maintain and for those who build new developments on infrastructure.
In computing we see the importance of standards in technologies like the World Wide Web. What makes virtual infrastructure like the web work is not one cable or one web browser, but the W3C standards that let different tools work together. The story we tell about the web as infrastructure is that all it took is HTTP and HTML to spark the collaborative and open development of information infrastructure. This is the lightest type of infrastructure, where there is no material or service base to maintain, but a base of definitions and standards on which others build layers. This is the most attractive paradigm for infrastructure for funders, as it is the least expensive to maintain. Perhaps things like the Text Encoding Initiative Guidelines are the real infrastructure of humanities computing, and consortia like the TEI are the future for light and shared infrastructure maintenance.









