Digital Historiography & the Archives [4/5]

papers delivered at the 2014 AHA conference.

Table of Contents:

Digital Historiography and the Archives, 2014 AHA Conference, Washington, DC, Friday, 3 January 2014

Part 1: Katharina Hering, Joshua Sternfeld, Kate Theimer, and Michael J. Kramer, “Introduction”

Part 2: Joshua Sternfeld, “Historical Understanding in the Quantum Age”

Part 3: Katharina Hering, “Contextualizing digital collections based on the principle of provenance and the tradition of source criticism”

Part 4: Kate Theimer, “A Distinction worth Exploring: ‘Archives’ and ‘Digital Historical Representations'”

Part 5: Michael J. Kramer, “Going Meta on Metadata”

—

Kate Theimer, “A Distinction worth Exploring: ‘Archives’ and ‘Digital Historical Representations'”

kate.theimer@gmail.com, twitter: @archivesnext, www.archivesnext.com

As noted in the text, I had read drafts of Josh and Katja’s remarks before writing mine, which was very helpful in deciding what aspect of my perspective as an archivist I should focus on. I spoke without slides, and this is an only slightly modified version of the text from which I spoke. I’ve added links to the sites I reference and other sources that might be useful.

In approaching this session and this topic, I had trepidations, as I often do, about how the other speakers and the audience would be framing their conception of “archives.” In preparing my talk I read an article Josh had written for an archival journal in 2011 [“Archival Theory and Digital Historiography: Selection, Search, and Metadata as Archival Processes for Assessing Historical Contextualization,” American Archivist Fall/Winter 2011] and was pleased to see his careful usage of the phrase “digital historical representations” as an umbrella term covering some of the products created by archives, as well as a range of products created by other sources.

In approaching the subject of archives with historians and other humanities scholars, I often feel somewhat pedantic in my continual emphasis on the meaning of words. But after all, words represent concepts and perceptions of reality, and if those words aren’t clearly communicating what we intend, then it’s hard to achieve meaningful progress. What I’d like to talk about in the time I have, and hopefully as part of the discussion, is to illustrate the points Josh and Katja have made about the importance of questioning, understanding, and articulating the context of creation of digital historical representations by discussing the differences between different types of digital information sources created and used by historians—many if not most of which are often all referred to as “archives.” In all of these cases the context of the creation of the information sources is critical to understanding the problems that may be inherent in that source and which the researcher should take into consideration. I am not a historian, but I would think that understanding why and how an information resource was created—that is to say, its context—is more valid than ever in digital historiography.

Everyone here is familiar with what for lack of a better term I’ll call “traditional” archives—that is, primarily paper-based (or non-digital) largely unique materials, brought together in repositories in aggregations either created by the originating organization or person, or by a third party, such as a scholar, manuscript dealer, or the repository itself (as in special collections). Appraisal and selection of such materials is a multi-dimensional process, as you might imagine, with many factors involved, including sometimes political influence, censorship on the part of the creator/collector, resource limitations on the part of the repository, random chance and “acts of God.” How and why the materials on our shelves end up there is not always a straightforward story and one that is usually not captured in detail in the public description of the materials. How the materials were aggregated and for what purpose is usually described at some level in the finding aid, but documentation in this area is sometimes sporadic. I would guess most archivists believe—rightly or wrongly—that fields like “Custodial History,” “Appraisal, Destruction and Scheduling Information,” and “Administrative/Biographical History” (which applies to creators of aggregates) are not valued by most users. To be honest, I’m not sure how often it’s even of interest to historians, or at least how often they ask the archivist about more information if the finding aid is skimpy in this regard. Anecdotal evidence from my colleagues and user studies indicate that it is not widely valued or used by users.

Again, that’s “traditional” physical archival materials, represented digitally by descriptions in online finding aids, catalog records, etc. For these materials, what has changed for historians in the modern digital age, I think is the increased expectation—and reality—that more descriptive information about materials will be made available online, and also the ability to easily create their own digital copies with digital cameras and smart phones.

Next we have collections of digitized analog historical materials—sometimes called “digital archives.” These may be topically based—assembled from holdings of many repositories, like the William Blake Archive or the Wilson Center Digital Archive, which is focused on documents related to international relations. Or they may be all from one repository—as in the recently launched FRANKLIN site, which provides online access to digitized collections from the Franklin D. Roosevelt Presidential Library and Museum. These collections may be created by archivists, librarians, historians, passionate amateurs, nonprofit organizations or for-profit companies. Because these digital historical representations, to use Josh’s term, are created by such a wide range of sources, it’s critical to know about the context of these collections—including who assembled them, what their purpose was, and what criteria they used.

Often when historians are talking about archives, when I probe to see what they mean, it is these kinds of collections they are referring to. Katja’s point that it’s important to know where the individual original materials are located and where they fit in their archival context is a valid one, but it’s also important to understand where they fit in the context of the new digital collection. On what basis were items added to this collection? Why were some items excluded? To what extent is what’s being presented a subset of what’s available? Where does the metadata come from? How was it created and reviewed? As with online finding aids for physical collections, what you’re accessing in this kind of digital collection is a surrogate—a description of that object or aggregate created by a person to represent it. Even the scan is a surrogate—although hopefully an accurate one. Descriptions and metadata can be subjective and also subject to errors.

It seems to me as if these kinds of collection—or “digital archives” as they’re commonly called, would raise a host of questions in terms of digital historiography—some similar to those presented by online information for “traditional” archives, but many others that are different.

Yet a different kind of aggregate, also sometimes called “digital archives” are groups of born-digital materials as opposed to the digital surrogates of analog originals I just talked about. These types of aggregates, kept together because they come from a single source or creator, reside primarily within archives and special collections repositories, and consist of records created or received by an organization in the course of business, maintained by them and transferred to their associated archival repository. For example, the electronic records created by the Census Bureau and transferred to the National Archives. You can also have the equivalent of the “papers” of a person or family, such as Salman Rushdie collection at Emory, which contains the contents of his personal computers. For these kinds of aggregates archives have most of the same kinds of issues with selection, appraisal, and custodial history as they do with non-digital materials, but with additional issues raised by their digital format, as Katja noted, related to reliability and authenticity as well as how to provide access.

And last but not least, you can have assembled collections of born-digital materials—yet another category of what are termed “digital archives.” The September 11 Digital Archive created by the Center for History and New Media is a good example of this type of collection. In this case—and also with the Internet Archive—the collection serves a critical function: acquiring born-digital materials that might not otherwise survive. Many born-digital materials are more fragile than their analog counterparts for various reasons, and so some of these collections are similar in function to special collections libraries, which pull together valuable individual items for preservation. It’s also worth noting that in digital collections, copies of materials can reside in more than one collection. For example, in the September 11 collection there are copies of documents created by the New York City Fire Department (Incident Action Plans). Presumably there are also copies of these born-digital records being transferred to the official repository for the municipal records of New York City. These kinds of “digital archives” combine the issues related to assembled collections—that is, the necessity of exploring who is creating them, for what purpose and using what methods— and those concerns related to born-digital materials as far as preservation and authenticity.

Coming back to Josh’s use of the term “digital historical representations,” I’m happy to see this broader term being used in discussions about “archives” and digital historiography. For me, many products that come under this term—like databases and sources like Google Books—would be removed one step (or more than one step) too far to be categorized as “archives.” I would consider these as separate intellectual products created from archival sources. And, indeed, in a way, so are any of the collections in which copies of archival materials are removed from their original context and “re-mixed” to be part of a new creation—a new “digital archives” like Valley of the Shadow, to use a classic example. In fact, in a pre-digital era analogous versions of the scholarly products I’ve talked about here (other than databases) would still have existed, I think, and been called something other than “archives”—they would have taken the form of exhibits, edited volumes of letters or printed collections of documents, assembled and edited by historians or other sources. The question of why the word “archives” has been adopted to refer to collections of materials is one for a different discussion, but I do think it’s worth noting that this co-opting of the word does seem to be a rather recent development.

I hope the efforts being discussed today encouraging more rigorous assessment of digital historical representations will result in a greater understanding and appreciation of what makes archives distinct from these other kinds of products. I often fear that this appreciation and understanding is being lost as fewer historians work with “old-fashioned” physical archival collections, and do most of their work online, where it is easy to think that all digital collections are the same. The value of the collections of materials preserved in archives often lies in the relationship of the records to each other—what’s called the archival bond—which means that the whole is greater than the sum of the parts. As a whole, the materials provide evidence about the activities of the creator.

In considering the topic of this session, I’d like all of us to consider this as two way street. It’s heartening to see archival concepts such as appraisal and provenance being discussed at an AHA session and so information flow from the archival literature to this audience, and hopefully this will continue. On a related topic, I’m always interested in hearing how much historians actually know about either archival theory or practice. Anecdotal evidence provided by many of my archivist colleagues suggests that such knowledge is, shall we say, uneven. So that’s another topic that might be worth discussing—how much do historians know about archives and what more would be helpful or necessary to assist in their work.

But I also want to see information flow the other way, and I hope we can get into this a bit in the discussion that follows. That is, I’m interested in learning what digital historiography, that is the study of the interaction of digital technology with historical practice—what can this new field of study and you as historians tell the archival profession—and me specifically. How has the way you do your work changed? And how can archives and archivists do things differently to assist in that?

Today’s conversation is about how digital technology has changed the way you do your work as historians, and certainly it has also effected the way archivists do our work as well. Among the most significant of those ways is in the increased workload to create descriptions and digital copies to post online, find ways to collect and preserve digital materials, and of course, actively connect with the public via the ever widening world of digital tools and social media. Digital technology has increased the user base for archival resources, meaning that the connection between our historian users and archivists is more diluted than it was in the past. In prioritizing our work and establishing our practices, archivists are trying to meet the needs of the broadest range of users. In so doing, it’s possible that the more specialized needs of historians—if indeed they are different from other users—are not being met. We need to keep an ongoing dialog between our two professions to ensure that we’re all working together as effectively as possible to support the historical enterprise.

I look forward to discussing both archival theory and practice, and hopefully historical practice as well, in the discussion that follows, and in many subsequent conversations.

—