TITLE: We Built it. Can They?: Text Encoding and the Humanities Scholar

AUTHOR: Hope Greenberg

AFFILIATION: University of Vermont

E-MAIL: hope.greenberg@uvm.edu

KEYWORDS: TEI humanities etext

CONTACT ADDRESS:
Academic Computing
238 Waterman Building
University of Vermont
Burlington, VT 05405

FAX NUMBER: 802-656-0872

PHONE NUMBER: 802-656-1176

EQUIPMENT NEEDS: Mac or Win PC with network connection and web browser. Also an overhead display if done as a presentation.

--------------------

In The Diffusions of Innovation, Everett Rogers characterizes the adopters of new technologies as innovators, early adopters, early and late majority, and laggards. Building on this idea, and on Moore's conclusion that a chasm exists between the first two groups and subsequent groups, Geoghegan suggests that early adopters have a high comfort level with technology, value innovation, and are willing to experiment. The majority, however, values minimal risk, requires clear and compelling pedagogical value before attempting a new technology, does not wish current practices to be constrained, and eschews technology for technology's sake.

Since its first 'official' appearance in 1994, the Text Encoding Initiatives Guidelines for Electronic Text Encoding and Interchange has had a profound impact on the innovators and early adopters of the electronic text world. Large text collections and scholarly projects already versed in SGML, like the University of Virginia's Electronic Text Center and The Rosetti Archive, and other projects at the Institute for Advanced Technology in the Humanities, have adopted it. Others, like the Model Editions Partnership and the Making of America project, are exploring ways to apply it to historical documents. The MLA recommends the TEI as the method of choice in its Guidelines for Electronic Scholarly Editions. Text collections that focus on a particular topic or era, such as the Brown Women Writers, Victorian Women Writers, and the ORLANDO projects, as well as several thesaurus projects, have found that it supports building a collection that might not otherwise be possible in the print world.

A quick perusal of the list of projects using the TEI, as collected at their web site, shows that many are large scale projects hosted by libraries or consortia, and specifically funded. Given the technical difficulties associated with creating these documents, it is not surprising that early adopters tend to be those with resources to devote exclusively to these projects. Nor is it surprising that several groups are attempting to provide materials to assist scholars and students who are interested in undertaking projects of this type (cf. University of Virginia Electronic Text Center, Brown University Scholarly Technologies Group, etc.), providing information on their encoding practices (cf. British Women Romantic Poets Project, Victorian Women Writers Project, etc.), or formulating best practices for creating digital resources in general (cf. Arts and Humanities Data Service Guides).

Individual scholars or initially small-scale projects are not entirely absent. The fine work begun by Dr. Stuart Lee as a web-based project on the poetry of Isaac Rosenberg and growing into the Virtual Seminars for Teaching Literature, as well as projects like The Walt Whitman Archive and others supported by IATH, all point to the benefits to be obtained from scholars creating these texts for specific pedagogical needs.

But can the chasm between the TEI early adopters and the majority of scholars, be bridged? Is there a compelling pedagogical benefit to the individual humanities scholar in creating TEI-encoded electronic texts? Is this a job best left to large projects, consortia, publishers, and libraries? Should individual scholars be creators or simply consumers?

The University of Vermont is exploring these questions. Before compelling benefits can be derived from the creation of individual projects, we must determine if the creation of such projects is feasible given limited time and resources. That is, can the creation of electronic texts fit within the requirements for adoption by the majority as described by Geoghagen?

Several collections are being created, each of which focuses on one or more aspects of electronic text creation. The Special Collections Department of the University's Bailey/Howe Library is digitizing their Finding Aids using the EAD DTD. In addition, they are building a page image and transcription collection of selected works in the collection. Experimental individual projects underway include a page image backed by indexed OCR'd text edition of Godey's Lady's Book, the popular 19th century American magazine, as well as a more deeply encoded edition of the short fiction of Alice B. Neal Haven, one of Godey's most prolific and respected authors. These projects will contain ancillary documents related to the Book's contents, supporting documents describing the encoding, and a tutorial for creating similar documents. Another project involves moving an early web-based collection of images of Ovid's Metamorphosis to a more robust image and transcription TEI-encoded edition.

In choosing to create these collections, several issues were considered:

  • Issues of preservation and archiving: It is not enough to have a digital version, even a TEI or EAD-encoded platform-independent digital version if plans are not in place for storing and perpetuating the resulting files. In the first step of a plan for long-term preservation, the documents will be moving from two personal servers to the university cluster in spring of 1999.

  • Issues of economics: The collections have been created using free, donated or low cost software and low-end hardware. The indexing and display software is DynaText and DynaWeb, which were acquired through Inso Corporation's educational grant program. The hidden costs have been in the time devoted to their creation and in the upcoming use of disk space on the university cluster. The University of Vermont has planned for space for these projects as well as upcoming faculty or student projects.

  • Issues of usability: The web provides a wonderful vehicle for disseminating and sharing digital projects. However, as David Chesnutt suggests in his work with the Model Editions Partnership, an infrastructure that facilitates cross-edition searching and indexing would be a boon to scholars. Nor does creation of TEI-encoded documents automatically ensure complete interoperability between texts. As Perry Willett points out, searching for "compatible encoding practices, in order to allow for future interoperability of collections" is another laudable goal. The University of Vermont collections are following these and other projects with the object of developing in concert with them.

  • Issues of scalability: While the contents of the digital collections is and will continue to be a growing resource for interested scholars, its parallel purpose, the creation process itself acting as a model for other individually-based electronic text projects, cannot be forgotten. Concurrent with the development of the collections is an effort to introduce faculty and students to the TEI and text encoding through presentations (cf. Text + Context = Etext: Scholarship in the Information Age) and hosting guest speakers. Further developments of this process will be discussed during the presentation.

At each phase in creating the electronic text collections at the University of Vermont, the questions kept at the forefront are: can this model be duplicated by individuals or small groups with limited resources while remaining in concert with, and informed by, the broader text encoding world? And, are the scholarly and pedagogical benefits derived from the creation, rather than just the consumption, of these texts sufficient to offset the difficulty of undertaking such projects? Unless the answers are yes, the chasm between early adopters of digital resource creation and the majority of humanities scholars may well remain unbridgeable.

Works cited:

Chesnutt, David. The Model Editions Partnership--Towards a National Database. Paper presented at the ACH/ALLC Joint Conference, 1997. Available online: http://www.qucis.queensu.ca/achallc97/papers/p036.html

Geoghegan, William H.Whatever Happened to Instructional Technology? Paper presented at the 22nd Annual Conference of the International Business Schools Computing Association, Baltimore, MD. 1994.

Moore, Geoffrey A.Crossing the Chasm. New York : HarperBusiness, 1991.

Rogers, Everett M. The Diffusion of Innovations. NY: The Free Press, 1995.

Willett, Perry. Issues in Project Cooperation II: Markup Issues. Presented at the ACH/ALLC Joint Conference, 1998. Available online: http://lingua.arts.klte.hu/allcach98/abst/jegyzek.htm

URLs for sites listed:

Arts and Humanities Data Service Guides: http://ahds.ac.uk/public/guides.html
British Women Romantic Poets Project: http://sunsite.berkeley.edu/MLA/guidelines.html
Brown University Scholarly Technologies Group: http://www.stg.brown.edu
Brown Women Writers Project: http://www.stg.brown.edu/projects/wwp/wwp_home.html
University of Virginia's Electronic Text Center: http://etext.lib.virginia.edu/
The Rosetti Archive: http://jefferson.village.virginia.edu/rossetti/rossetti.html
Institute for Advanced Technology in the Humanities: http://www.iath.virginia.edu/
Making of America: http://moa.umdl.umich.edu/moa/
MLA Guidelines for Electronic Scholarly Editions: http://sunsite.berkeley.edu/MLA/guidelines.html
Model Editions Partnership: http://mep.cla.sc.edu/
ORLANDO: http://www.ualberta.ca/ORLANDO
Text + Context = Etext: Scholarship in the Information Age: http://www.uvm.edu/~hag/presentations/etext
Text Encoding Initiative: http://www-tei.uic.edu/orgs/tei/
Victorian Women Writers: http://www.indiana.edu/~letrs/vwwp
Virtual Seminars for Teaching Literature: http://info.ox.ac.uk/jtap
Walt Whitman Archive: http://www.iath.virginia.edu/whitman

University of Vermont Electronic Text Collections: currently at http://132.198.103.233:6336/dynaweb and http://sageunix.uvm.edu:6336/dynaweb. Moving to http://etext.uvm.edu in late spring of 1999.