<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://thomas.kiehnefamily.us"  xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>infoSpace - Metadata</title>
 <link>http://thomas.kiehnefamily.us/taxonomy/term/6/0</link>
 <description>Anything concerning metadata languages, structures, protocols, and implementations.</description>
 <language>en</language>
<item>
 <title>Guarding the Guards: Archiving the Electronic Records of Hypertext Author Michael Joyce</title>
 <link>http://thomas.kiehnefamily.us/guarding_the_guards_archiving_the_electronic_records_of_hypertext_author_michael_joyce</link>
 <description>&lt;p&gt;In June of 2006, Thomas Kiehne and Catherine Stollar were selected to present the results of work performed the previous year at the Harry Ransom Center in Austin, Texas to a &lt;a href=&quot;http://rpm.lib.az.us/newskills&quot;&gt;colloquium&lt;/a&gt; assembled by the Society of American Archivists and the National Archives and Records Administration.  The following is the case study that was presented.&lt;/p&gt;
&lt;p&gt;Update:  This text was re-published by SAA in the &lt;a href=&quot;http://www.archivists.org/news/NewSkillsForADigitalEra.pdf&quot;&gt;proceedings from the &quot;New Skills for a Digital Era&quot; colloquium&lt;/a&gt;.&lt;/p&gt;
&lt;!--break--&gt;&lt;!--break--&gt;&lt;h2&gt;Abstract&lt;/h2&gt;
&lt;p&gt;In 2005, the Harry Ransom Center at the University at Austin acquired the fonds of hypertext author Michael Joyce.  The major emphasis of the Ransom Center&#039;s collections is the study of literature and culture in the late 20th and early 21st century of the United States, Great Britain, and France.  Michael Joyce&#039;s groundbreaking work in hypertext poetry and fiction make his papers a desirable addition to the Ransom Center holdings.&lt;/p&gt;
&lt;p&gt;The Michael Joyce Papers are mostly composed of electronic records with an additional 60 manuscript boxes of paper -based materials  This is the first mostly electronic archive the Ransom Center has acquired and new strategies for preserving digital content were employed.  This case study discusses the techniques and skills utilized to preserve the electronic records of Michael Joyce as a model for processing future digital manuscripts at the Ransom Center.&lt;/p&gt;
&lt;h2&gt;Scenario&lt;/h2&gt;
&lt;p&gt;Established in 1957 by University of Texas Vice President and Provost Harry Huntt Ransom, the Harry Ransom Humanities Research Center at The University of Texas at Austin incorporated a strategy for collecting older rare books and manuscript collections with a new initiative to collect literary, photographic, and theatrical works by modern artists.  Some of the authors whose works are included in the Ransom Center&#039;s collections are Norman Bel Geddes, Don DeLillo, T.S. Eliot, James Joyce, Ernest Hemingway, Norman Mailer, D.H. Lawrence, Ezra Pound, Anne Sexton, Isaac Bashevis Singer, and Tennessee Williams.  Michael Joyce&#039;s work as perhaps the most influential hypertext poet and author fits nicely into the Ransom Center&#039;s contemporary author collecting policy. &lt;/p&gt;
&lt;p&gt;Our case study to preserve Michael Joyce&#039;s digital manuscripts resulted from collaboration between the School of Information at the University of Texas at Austin and the Harry Ransom Center.  Three students, Thomas Kiehne, Vivian Spoliansky, and Catherine Stollar, from Dr. Patricia Galloway&#039;s Problems in Permanent Retention of Electronic Records course offered at the School of Information undertook a semester long project to develop a strategy for archiving an initial accession of electronic materials saved on 371 3.5&quot; floppy disks (totaling 211 KB) from author Michael Joyce.  Upon completion of the project, a second accession of electronic and paper-based materials, including the contents of three hard drives (totaling 8.38 GB) and 60 manuscript boxes, was acquired by the Harry Ransom Center and is currently being processed by staff archivist Catherine Stollar according to the strategy developed during the class project.  Our case study discusses strategies for file recovery, migration, preservation, arrangement, and description developed working with both accessions of Joyce&#039;s materials. The electronic records are currently maintained in a DSpace repository administered by the School of Information, however, in the future the Joyce records will move to a DSpace repository controlled by the Ransom Center and the General Libraries of the University of Texas.  &lt;/p&gt;
&lt;p&gt;The lack of Ransom Center staff with skills in digital archivy provided the impetus for the Ransom Center to partner with the School of Information on the Michael Joyce Papers.  Although the Ransom Center employs talented archivists and IT professionals, no staff member possessed skills necessary for archiving digital manuscripts.  The Ransom Center sought advice from Professor Galloway and agreed to use the Joyce materials as a case study in the Problems in Permanent Retention of Electronic Records course.  &lt;/p&gt;
&lt;p&gt;Some audio and video migration preservation projects were already in progress at the Ransom center in the Department of Photography and Visual Collections to preserve audio and video works, but there were no concerted efforts to preserve born digital manuscripts.  Policies and Procedures for migration of audio and video content to new media were unsuited for born digital manuscript preservation and policies for preserving digital manuscripts were inadequate to capture the complete behavior of the original digital record.  Previously, the few electronic manuscripts and correspondence already in the Ransom Center&#039;s manuscript collections were printed and organized in boxes like paper records.  Because digital records are entirely unlike paper-based records, a preservation strategy based in printing records preserved very little of the original document.  Electronic media were saved, but researchers were prevented from viewing original disks and no access copies were created.  &lt;/p&gt;
&lt;p&gt;The main component of our preservation strategy is to ingest electronic records and associated metadata into an institutional repository.  DSpace, created from a joint project between MIT and Hewlett-Packard, is the institutional repository we used and will continue to use for electronic record preservation.  At the heart of DSpace, like most open archival information systems (OAIS), is a database populated by individual digital objects supported by content, context, and structure metadata.  We used DSpace, instead of FEDORA or another institutional repository, because it was already the established as the repository of choice for the School of Information.  Although we had issues with the web user-interface for ingesting, viewing, and accessing materials within the repository, we plan to work with a talented ISchool student with Java programming skills to make our installation of DSpace more user-friendly.&lt;/p&gt;
&lt;p&gt;Partnerships were key components to our case study&#039;s productivity and success.  The initial group of students participating in the first part of the case study each represented a different background.  Thomas Kiehne brought a wealth of Information Technology skills to the project, including programming and operating systems knowledge. Catherine Stollar shared her knowledge of archival theory and practice during the case study.  Vivian Spoliansky viewed the case study through the lens of preservation and shed light on aspects of authenticity and desired levels of service for object preservation.  Working with a variety of subject specialists on the project enabled participants to learn key skills from the others that will be useful on future digital record preservation projects.        &lt;/p&gt;
&lt;h2&gt;Processing as Digital Archeology&lt;/h2&gt;
&lt;p&gt;A number of digital materials, including emails and published articles, within the archive had a paper-based counterpart, demonstrating that he created both digital and analog records while performing the same activities.  Both formats of records were created synchronously, and at an institution like the Ransom Center that preserves not only works that have influenced the arts and humanities fields, but also preserves the context in which those works were created, we determined it would be desirable to reflect synchronous creation in the arrangement.  We did not originally understand relationships between Joyce&#039;s digital and paper materials because our first portion of the case study only dealt with electronic records from the first accession of 371 floppy disks. One of the more unique aspects of this project involved the processing of 371 3.5â€ floppy discs that contained the digital objects of the first accession.  The provided floppy discs were mostly from the Macintosh â€œclassicâ€ era, some of which date as far back as the mid to late 1980s.  The assumption at this stage is that the original storage media is not stable or reliable and the information that they hold must be moved quickly and efficiently.  Otherwise, little was known about what to expect in terms of specific technological issues or challenges.&lt;/p&gt;
&lt;p&gt;At the outset of the project, we had only a general idea of the process of moving the digital files from the source media to a repository, and as such, we could not express specific requirements for software tools and utilities that might be needed.  In order to minimize project overhead in terms of time and resources, we desired to use only open source, shareware, or freeware tools that are readily available in order to assist with the extraction process.  This approach allowed us to assess the suitability of tools that are currently available and their ability to interoperate.  In the absence of suitable free tools, we intended to find commercial software or create our own programs or scripts to perform the required tasks as we identified them.  In the course of processing the first accession of discs, we quickly elucidated a more detailed procedural framework that can be abstracted and applied to future projects.&lt;/p&gt;
&lt;p&gt;The general process implemented during the processing of the discs is as follows:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Receive and identify physical media&lt;/li&gt;
&lt;li&gt;Catalog the physical media&lt;/li&gt;
&lt;li&gt;Copy files to newer physical media&lt;/li&gt;
&lt;li&gt;Perform initial file processing&lt;/li&gt;
&lt;li&gt;Create an item-level index of all recovered files&lt;/li&gt;
&lt;li&gt;Create and process working copies of all files while retaining the original bitstream copies&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Technical metadata is collected at each step in the process not only to facilitate the work in progress, but to support provenance and authenticity.  Each operation performed on the bitstream â€“ every copy and access â€“ provides the opportunity for inadvertent loss or alteration, so careful recordkeeping is as essential as careful handling.  Additionally, all personnel involved in processing must thoroughly understand the procedures involved in order to prevent duplication of effort or discontinuities in results.  In many cases, software can automate these processes, thus reducing the chance of errors, but the extent to which software can mitigate such risks is limited by the assumptions made by the creators of the software and how well the personnel making use of this software understand these limitations.&lt;/p&gt;
&lt;p&gt;Given that time was of the essence, we opted to use text entries in Microsoft Excel spreadsheets to create the initial disc catalog and the associated metadata.  This approach allowed us to leverage existing proficiency with spreadsheets and the availability of the software to eliminate the time needed to create a custom database application or to learn project management software.  Unfortunately, the conspicuous absence of relational or workflow aspects in the spreadsheet format made us vulnerable to recordkeeping errors, making quality control a primary concern.&lt;/p&gt;
&lt;p&gt;The copy functionality of the computer operating systems involved were sufficient to perform the movement of digital files from floppies to hard rives and removable media.  Unfortunately, the differences between Macintosh and Windows in the management of file system metadata became significant.  Creation dates are handled differently between these two operating systems such that a copy made in Windows takes on the date of the copy operation, not the creation date of the original from which it was made.  Additionally, file system metadata for Macintosh files are stored as separate, invisible resource forks that are notorious for becoming corrupted.  As a result, we often could not trust the dates ascribed by the operating system and had to refer to external resources, such as Michael Joyce&#039;s curriculum vitae, to confirm or provide date metadata at a later time.  Issues with Macintosh resource forks also affected file downloads from DSpace after ingest. &lt;/p&gt;
&lt;p&gt;At many points during the processing, we encountered technical difficulties in the form of file or disc errors.  These errors can occur for a number of reasons, including damaged media, exposure to magnetic or other hazards, dirty data surface areas, and so on.  In the case of dirty surface areas, several attempts were needed to overcome a copy error.  It is suggested to have a drive cleaning kit available and use it periodically to prevent build up of debris on the drive head.  For other errors, it was necessary to have available software utilities that can attempt to recover from file copying errors.  Windows provides such capabilities within the operating system (e.g.: Scandisk), but Macintosh does not.  For our purposes we were able to discover an older version of a commercial program, Norton Utilities, which allowed us to recover many files that could not be copied initially.  Virus checking was also a preeminent concern.  Errors and crashes must be met with persistence as they are often surmountable, which implies at least a minimum degree of technical knowledge.&lt;/p&gt;
&lt;p&gt;In moving the digital files to other media, we created a filesystem hierarchy that mimicked the physical arrangement of the discs.  Such hierarchical arrangement allowed us to use file system tools to generate some of the metadata automatically.  There are a number of freeware, shareware, and commercial applications for Macintosh that will catalog a file volume and produce reports.  We used a shareware utility called &lt;a href=&quot;http://www.mindspring.com/~shdtree/newsite/id9.html&quot;&gt;CatFinder&lt;/a&gt; to index the copied files and export a report to a delimited format that was imported into Excel. This report formed the basis of our item-level metadata, including fields for filename, file size, kind (document or folder), Macintosh file type (analogous to the Windows file extension), Macintosh creator code, creation date, and modification date. To this basic report we added a comments field for use during appraisal and to collect technical notes.&lt;/p&gt;
&lt;p&gt;MD5 file hashes were also generated for each file.  Having an MD5 hash for each file allowed us to do two important things: to identify and/or eliminate redundant files, and to support provenance auditing during the repository ingest process.  A freeware PERL application called &lt;a href=&quot;http://therockquarry.com/integrity.htm&quot;&gt;Integrity&lt;/a&gt; automatically created MD5 hash calculations and exported the results to a delimited text file.  Unfortunately, integrating the MD5 hashes into the CatFinder index was not trivial due to differences between the two applications in file name recursion and handling of hidden files.  &lt;/p&gt;
&lt;p&gt;Having created a unified index of filesystem metadata, augmented with processing notes and MD5 hashes, we were able to more accurately assess the extent of the digital files and facilitate arrangement and appraisal. Unfortunately, the index was in no way tied to the digital files and presented us with a significant information management problem.  For example, any movement of files was not automatically noted in the index, nor was any change or deletion in the index reflected in the filesystem.  We can envision a workflow-oriented system that stands between the filesystem and a metadata database that would greatly increase the speed and reliability of processing large bodies of digital documents.&lt;/p&gt;
&lt;h2&gt;Arrangement&lt;/h2&gt;
&lt;p&gt;(Note: More detail about our project can be found in a forthcoming article by Catherine Stollar about processing the Michael Joyce Papers in &lt;i&gt;Provenance&lt;/i&gt;.)&lt;/p&gt;
&lt;p&gt;After recovering most of the unique digital content from the first accession of floppy disks, we began the process of archival arrangement.  In the beginning, we asked ourselves some questions.  Can and should digital files be arranged like paper-based records?  Should we heed traditional archival arrangement practices or follow newer theories of arrangement based on item-level metadata?  Do electronic records have a natural hierarchy that can be expressed in a traditional arrangement?  Should physical housing for digital materials be kept? If so, where?  Our answers to these questions are not definitive, but we came to a compromise incorporating basic tenets of archival theory with features of on-demand, flexible file arrangement using item-level metadata.  &lt;/p&gt;
&lt;p&gt;A number of digital materials, including emails and published articles, within the archive had a paper-based counterpart, demonstrating that Michael Joyce created both digital and analog records while performing the same activities.  Both formats of records were created synchronously, and at an institution like the Ransom Center that preserves not only works that have influenced the arts and humanities fields, but also preserves the context in which those works were created, we determined it would be desirable to reflect synchronous creation in the arrangement.  We did not originally understand relationships between Joyce&#039;s digital and paper materials because our first portion of the case study only dealt with electronic records from the first accession of floppy disks.   We initially arranged the files into 5 series: Works, Academic Career, Correspondence, Storyspace, Third-party Works, and Personal.  After surveying the paper-based materials and the second accession of electronic materials, we had to alter our originally arrangement to include the newly accessioned materials.  The final arrangement we created is Works and Related Materials, Academic Career, Correspondence, Storyspace, Journals and Appointment Books, Personal, Works by Other Authors, and Published Materials.  &lt;/p&gt;
&lt;p&gt;Institutional repositories like DSpace can facilitate digital object arrangement into our specified series by using the community, sub-community, collection, sub-collection, and item level hierarchies.  DSpace&#039;s hierarchies relate to traditional archival hierarchical levels: communities equate to archival fonds, sub-communities to series and sub-series, collections as other layers of granularity within a series, and item-level entries relate to digital objects.  In an additional level of granularity, items composed of multiple sub-components or related files, i.e.: websites with multiple linked HTML files can be ingested as bundled files.&lt;/p&gt;
&lt;p&gt;After determining how to arrange the paper and digital materials, we decided how to arrange the physical housing (jewel cases, magnetic media, paper holders, plastic cases, etc.) from Joyce&#039;s electronic works.  Previous policies and procedures at the Ransom Center dictated that electronic media should be physically housed in Hollinger boxes separate from the rest of the paper-based materials.  This separation policy apparently arose out of concern for potential damage to other materials caused by degrading electronic media and to limit access to the electronic materials by researchers.  No studies on electronic media degradation have found any instances of off-gassing or other damaging effects of filing electronic media with paper-based materials, so we determined physically integrating paper-based material and digital media would be the best policy for physically arranging the Michael Joyce Papers.  The Ransom Center will still limit access to files saved on original media because researchers will have access to the files via DSpace.  &lt;/p&gt;
&lt;p&gt;Although we integrated Joyce&#039;s digital objects into a functional group arrangement similar to his paper-based records, we also took advantage of the flexible, non-linear nature of digital object arrangement by enabling on-demand, user-controlled arrangement by item-level metadata.  Preservation of digital objects depends on item-level metadata used to document, migrate, emulate, and preserve the objects.  Item-level metadata recorded for preservation in DSpace&#039;s database also enables flexible arrangement of digital objects.  Digital arrangement allows archivists, and users, multiple options for organizing objects depending on the parameters set by the user interface, such as file name, title, author, date created, subject, or other metadata element.  Arrangement is limited only by the skills of the programmer developing the user interface used to access the OAIS database and the precision of metadata recorded for each object. &lt;/p&gt;
&lt;p&gt;Arrangement is also affected by how we ingested objects into DSpace because our method of ingest affected what metadata fields we included.  Although manual metadata assignment of all files within the Joyce archive was laborious, certain metadata fields were impossible to record automatically.  Content metadata, such as subject and title of work, had to be entered by hand because automatic tools to accurately extract content were not available.&lt;sup&gt;1&lt;/sup&gt;  We found it difficult to use file names within the archive to associate files with published titles because the file names were not specific or standardized.  &lt;/p&gt;
&lt;p&gt;We incorporated methods for traditional archival arrangement and strategies for on-demand item-level arrangement while processing digital objects within the Michael Joyce Papers.  Together, both methods allow users to browse records according to functional series and create new arrangements based on any metadata available for individual objects.&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;1&lt;/sup&gt; Literary text comparison tools designed for use with small numbers of digital works were not sufficient for our large collection of files.  Apparently text-mining tools could serve our purposes to compare large bodies of records with each other.   We have not utilized any text-mining tools to date. &lt;/small&gt;&lt;/p&gt;
&lt;h2&gt;Challenges&lt;/h2&gt;
&lt;p&gt;In addition to the challenges we encountered developing a strategy to preserve Joyce&#039;s text and graphic files, we faced unique challenges associated with preserving Joyce&#039;s most influential creative works written using specialized software called Storyspace.  Storyspace, created by Michael Joyce, Jay David Bolter, and John B. Smith, as a format type presented (and continues to present) challenges for media migration, ingest, and file use.  Hypertext works written in Storyspace are composed of multi-faceted texts linked by guard fields (words within texts that enable direct links to other nodes, usually under specific conditions) and can only be viewed using Storyspace software.  To complicate matters, we originally thought the latest version of Storyspace was backwards compatible and could read works written in the first version of Storyspace.  Unfortunately, this is not entirely the case as older Storyspace documents do not degrade gracefully.  For example, the text from files written in Storyspace 1.5 can be read in Storyspace 2, but the individual nodes and links are missing, making the Storyspace 2 rendering of a older work vastly different from the original.&lt;/p&gt;
&lt;h2&gt;New Skills&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;A thorough grounding in the various operating systems.  The profusion of technical difficulties and operating system inconsistencies required an intuition about the various platforms that can only be gained by direct experience.  While processing digital files it is essential to have an understanding of the environment in which they were created.  Computer literacy in more than one platform and with networked environments will be ideal traits for archivists of the future.&lt;/li&gt;
&lt;li&gt;A basic understanding of the structure of digital documents.  Knowing how a digital file is created and stored, including such basics as the difference between binary file formats and textual formats such as ACSII and UTF helps provide an understanding of what happens during processing.  Furthermore, an intimate knowledge of the types of formats and how they might be identified (e.g.: file type extensions or creator codes), accessed, and converted is essential.  Understanding digital formats offers clues to where to find item-level metadata (e.g.: document properties embedded in word processing files, ID3 tags embedded in MP3 audio files, etc.) and suggests migration paths for long-term preservation.&lt;/li&gt;
&lt;li&gt;Proficiency with and trust in new tools.  Familiar means of handling physical documents are not present with digital documents.  Software tools and operating systems augment the functions of our senses in the digital world and mitigate some of this loss, but not completely.  Integrated toolkits and processing systems are needed and must be developed so that they can be trusted to conform to the expectations of archival practice.&lt;/li&gt;
&lt;li&gt;Establishment of new workflows and procedures.  The intangible nature of digital information makes documentary evidence crucial to processing.  Many institutions have established procedures for document processing, including audio/visual materials, but these cannot be assumed to be sufficient for digital objects.  Operating systems alone cannot document processes, so new systems that function according to sound processing policies are necessary.&lt;/li&gt;
&lt;li&gt;Ability to monitor current trends in digital preservation including metadata standards, crosswalks between encoding standards, available tools, storage systems, file format repositories, national and international research initiatives, user expectations, and published best practices guides.&lt;/li&gt;
&lt;li&gt;A thorough understanding of traditional archival theory and practice.  Archivists who work with digital records should be able to extrapolate traditional theory and apply it to electronic record preservation, but must be flexible enough to create new standards for archival practice.   What we do as archivists will change (practice), but why we do it will not (theory).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Discussion Questions&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Would automatic content management be more time consuming or less time consuming that manually arranging the digital manuscripts?   Would it result in a better arrangement?&lt;/li&gt;
&lt;li&gt;Is it feasible to devise a one-size fits all processing toolkit?&lt;/li&gt;
&lt;li&gt;How can authors, who may deposit their materials in an institution like the Ransom Center, implement a digital preservation strategy at home, closer to the point of document creation?&lt;/li&gt;
&lt;li&gt;How desirable is it to keep most files in proprietary formats that are the current de facto standard? (i.e. Microsoft Word, Adobe PDF, etc.)&lt;/li&gt;
&lt;li&gt;Should files be arranged at all or should they be indexed and sorted using search engines using item-level metadata?&lt;/li&gt;
&lt;li&gt;Is DSpace a viable option for smaller repositories and organizations?&lt;/li&gt;
&lt;li&gt;How will DSpace integrate into existing points of access?  (i.e. OPACs, website, EAD consortium sites)&lt;/li&gt;
&lt;li&gt;How do archivists best obtain the skills we are advocating they have? Classes? Projects? Workshops? Conferences?&lt;/li&gt;
&lt;/ul&gt;
&lt;table id=&quot;attachments&quot; class=&quot;sticky-enabled&quot;&gt;
 &lt;thead&gt;&lt;tr&gt;&lt;th&gt;Attachment&lt;/th&gt;&lt;th&gt;Size&lt;/th&gt; &lt;/tr&gt;&lt;/thead&gt;
&lt;tbody&gt;
 &lt;tr class=&quot;odd&quot;&gt;&lt;td&gt;&lt;a href=&quot;http://thomas.kiehnefamily.us/thomas_files/4_Stollar_Kiehne.pdf&quot;&gt;4_Stollar_Kiehne.pdf&lt;/a&gt;&lt;/td&gt;&lt;td&gt;134.9 KB&lt;/td&gt; &lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
</description>
 <comments>http://thomas.kiehnefamily.us/guarding_the_guards_archiving_the_electronic_records_of_hypertext_author_michael_joyce#comments</comments>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/digital_archives">Digital Archives</category>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/metadata">Metadata</category>
 <enclosure url="http://thomas.kiehnefamily.us/thomas_files/4_Stollar_Kiehne.pdf" length="138136" type="application/pdf" />
 <pubDate>Fri, 28 Jul 2006 18:27:45 +0000</pubDate>
 <dc:creator>tkiehne</dc:creator>
 <guid isPermaLink="false">31 at http://thomas.kiehnefamily.us</guid>
</item>
<item>
 <title>An OAIS Ingest Metadata Specification</title>
 <link>http://thomas.kiehnefamily.us/an_oais_ingest_metadata_specification</link>
 <description>&lt;p&gt;&lt;b&gt;Problem Definition&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	For this exercise, we will prepare a digital object for submission to a digital archive for long term preservation.  The digital object in question is an HTML text with an in-line image and links to several other HTML texts.  The objects must be readable, but the specific look and feel of the rendered text is not important.  We are to generate a metadata set that will conform to ingest requirements for an Open Archival Information System (OAIS) according to a Submission Information Package (SIP) agreement.  Additionally, we will consider the process for converting a SIP into an Archival Information Package (AIP) and extend the metadata set with additional elements for the conversion process, as needed.&lt;/p&gt;
&lt;!--break--&gt;&lt;!--break--&gt;&lt;p&gt; &lt;b&gt;OAIS Ingest and SIP&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	The OAIS model outlines a system-agnostic framework to ensure reliable, long-term preservation of information.  The process begins with the submission of an information package from a producer of information (publisher, author, researcher, etc.) to the archive.  The information is submitted as one or more SIP objects that conform to the archive&#039;s SIP agreement.  For a digital archive, the information is sent electronically along with descriptive metadata.  The components of an OAIS information package are shown in Figure 1 (CCSDS, 2002, &amp;sect;4).&lt;/p&gt;
&lt;p&gt;&lt;span id=&quot;Frame1&quot; dir=&quot;LTR&quot; style=&quot;float: left; border: none; padding: 0in; background: #ffffff&quot;&gt;&lt;br /&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;img src=&quot;/thomas_files/inf389k-final_assignment_html_7cc6006a.png&quot; name=&quot;Graphic3&quot; align=&quot;left&quot; width=100% border=0 /&gt;&lt;br clear=&quot;left&quot; /&gt;&lt;i&gt;Figure 1: OAIS information package (CCSDS, 2002, p. 4-31)&lt;/i&gt;&lt;/p&gt;
&lt;p&gt; &lt;/p&gt;&lt;/span&gt;&lt;br clear=&quot;left&quot; /&gt; &lt;/p&gt;
&lt;p&gt;	As shown above, the information package is comprised of Content Information (CI) and Preservation Description Information (PDI).  In the current exercise, the content information consists of the HTML code and image file, or pointers to these resources, and the representation information necessary to decode the content of the digital objects.  The Packaging Information binds the CI and PDI by some means, including universal identifiers, encoding (such as used for a CD-ROM), or some other system-specific method.&lt;/p&gt;
&lt;p&gt;	Descriptive information is derived from the PDI or declared prior to submission.  PDI contains the metadata needed for conversion into an AIP and further storage within and access from the repository.  Four functional areas are defined in the PDI as summarized in Table 1&lt;/p&gt;
&lt;table width=&quot;100%&quot; border=&quot;1&quot; bordercolor=&quot;#000000&quot; cellpadding=&quot;4&quot; cellspacing=&quot;0&quot; style=&quot;page-break-after: avoid&quot;&gt; 	&lt;col width=&quot;64*&quot; /&gt; 	&lt;col width=&quot;64*&quot; /&gt; 	&lt;col width=&quot;64*&quot; /&gt; 	&lt;col width=&quot;64*&quot; /&gt;&lt;br /&gt;
&lt;thead&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;th width=&quot;25%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Provenance&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;25%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Content&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;25%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Reference&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;25%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Fixity&lt;/p&gt;
&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;25%&quot;&gt;
&lt;p&gt;&lt;u&gt;Source&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Description 					and pointer to original object(s)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Copyright/legal 					restrictions&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Access 					restrictions&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Authority 					to modify representation information or migrate&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Agreements 					with external organizations&lt;/p&gt;
&lt;/li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;&lt;/p&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;u&gt;History&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Change 					history&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Pointers 					to other versions&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;Custody 					since origination&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td width=&quot;25%&quot;&gt;
&lt;p&gt;&lt;u&gt;Relationship to other 				objects&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Description&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Pointers 					to other objects&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;
&lt;/p&gt;
&lt;p&gt;&lt;u&gt;Purpose/Reason for creation&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Reason 					for creation&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Reason 					for archiving&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;
&lt;/p&gt;
&lt;p&gt;&lt;u&gt;Encoding environment&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Software&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Languages&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Character 					set(s)&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td width=&quot;25%&quot;&gt;
&lt;p&gt;&lt;u&gt;Unique identifier(s)&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;URI, 					ID Number, etc.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;
&lt;/p&gt;
&lt;p&gt;&lt;u&gt;Bibliographic 				description&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Creator(s), 					organization(s)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Date 					of creation&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Title(s)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Etc.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;25%&quot;&gt;
&lt;p&gt;&lt;u&gt;Authenticity indicators&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Checksum, 					CRC, MD5 hash&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Digital 					signatures&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Encryption&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;
&lt;/p&gt;
&lt;p&gt;&lt;u&gt;Quality of service 				requirements&lt;/u&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Specification 					of integrity preserving mechanisms&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Error 					protection specifications&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;i&gt;Table 1: Elements of PDI (CCSDS, 2002, pp. 4-27 &amp;ndash; 4-29)&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;  &lt;b&gt;A Model SIP&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	When approaching the problem of metadata creation, two basic approaches may be used.  First, we may generate a new metadata set that conforms to the archives requirements.  Generating a new set allows the specific project requirements to be addressed in detail, but requires a significant amount of labor to produce the DTD or schema and the tools to use them.  Furthermore, a new metadata set does not leverage existing standards and practices for interchange.  The second, and preferred, approach is to use or extend an existing metadata specification.&lt;/p&gt;
&lt;p&gt;	A review of existing metadata specifications reveals that the Metadata Encoding and Transmission Standard (METS) provides a framework for many of the necessary elements for a SIP (Library of Congress, 2004).  The first two columns of Table 2 show a  basic mapping of METS container elements to OAIS ingest information.  Since METS does not provide detailed descriptive metadata elements, other metadata schemes must be used to complete the SIP.   Source metadata schemes and sets for detailed metadata are shown in the third column of Table 2.  External metadata sets can be linked from the METS container using &amp;lt;mdRef&amp;gt;, or embedded within using one of the metadata types allowed in &amp;lt;mdWrap&amp;gt;.   &lt;/p&gt;
&lt;table width=&quot;100%&quot; border=&quot;1&quot; bordercolor=&quot;#000000&quot; cellpadding=&quot;4&quot; cellspacing=&quot;0&quot; style=&quot;page-break-after: avoid&quot;&gt; 	&lt;col width=&quot;85*&quot; /&gt; 	&lt;col width=&quot;85*&quot; /&gt; 	&lt;col width=&quot;85*&quot; /&gt;&lt;br /&gt;
&lt;thead&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;OAIS Ingest Information&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;METS Element Location(s)&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Source for Specific Metadata&lt;/p&gt;
&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;u&gt;&lt;b&gt;Preservation 				Description Information&lt;/b&gt;&lt;/u&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;all elements nested under 				&amp;lt;mets&amp;gt; root&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;b&gt;Provenance&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Description of original object(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;                 &lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;sourceMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;relatedItem 				&lt;span style=&quot;font-weight: medium&quot;&gt;type=&amp;rdquo;original&amp;rdquo;&lt;/span&gt;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Rights management information&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;rightsMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Rights 				Declaration Extension Schema:&lt;/p&gt;
&lt;p style=&quot;text-decoration: none&quot;&gt;&amp;lt;RightsDeclarationMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Access restrictions&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;rightsMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Rights 				Declaration Extension Schema:&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;&amp;lt;RightsDeclarationMD&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;Context&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Agreements with creator(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;rightsMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Rights 				Declaration Extension Schema:&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;&amp;lt;RightsDeclarationMD&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;Context&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Agreements with external 				organizations&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;rightsMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;Rights 				Declaration Extension Schema:&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; text-decoration: none&quot;&gt;&amp;lt;RightsDeclarationMD&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;Context&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Change history&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;originInfo&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;dateModified&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Pointers to other 				version(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;relatedItem 				&lt;span style=&quot;font-weight: medium&quot;&gt;type=&amp;rdquo;otherVersion&amp;rdquo;&lt;/span&gt;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Custody since origination&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;note 				type=&amp;rdquo;bibliographic history&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;b&gt;Content&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Relationship to related objects&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;relatedItem 				&lt;span style=&quot;font-weight: medium&quot;&gt;type=&amp;rdquo;XXX&amp;rdquo;&lt;/span&gt;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Pointers to related 				objects&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;relatedItem 				&lt;span style=&quot;font-weight: medium&quot;&gt;xlink=&amp;rdquo;XXX&amp;rdquo;&lt;/span&gt;&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;location&lt;/span&gt;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Reason for creation&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;note 				type=&amp;rdquo;bibliographic history&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Reason for archiving&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;note 				type=&amp;rdquo;conservation  history&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Original encoding/technical environment&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;techMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;Schema 				for Technical Metadata for Text:&lt;/p&gt;
&lt;p&gt;&amp;lt;byte_order&amp;gt;, 				&amp;lt;charset&amp;gt;, &amp;lt;encoding&amp;gt;, &amp;lt;markup_language&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;b&gt;Reference&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Unique identifier(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;&amp;lt;dmdSec&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;identifier&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Bibliographic description&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;&amp;lt;dmdSec&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-left: 0.01in; margin-right: 0in; text-indent: 0in&quot;&gt; 				&amp;lt;titleInfo&amp;gt;, &amp;lt;name&amp;gt;, 				&amp;lt;subject&amp;gt;, &amp;lt;language&amp;gt;, &amp;lt;typeOfResource&amp;gt;, 				&amp;lt;genre&amp;gt;, &amp;lt;abstract&amp;gt;, &amp;lt;originInfo&amp;gt;, 				&amp;lt;physicalDescription&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;b&gt;Fixity&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Authenticity indicators&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;fileGrp&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;file 				CHECKSUM=&amp;rdquo;XXX&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;N/A&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Quality of service requirements&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;techMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;Schema 				for Technical Metadata for Text:  				&lt;/p&gt;
&lt;p&gt;&amp;lt;viewingRequirements&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;u&gt;&lt;b&gt;Content Information&lt;/b&gt;&lt;/u&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;all elements nested under 				&amp;lt;mets&amp;gt; root&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Data object(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;fileGrp&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;file&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;FLocat&lt;/span&gt;&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;FContent&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;binData&lt;/span&gt;&amp;gt; 				| &amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;xmlData&lt;/span&gt;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;N/A&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;&lt;u&gt;&lt;b&gt;Packaging Information&lt;/b&gt;&lt;/u&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt;all elements nested under 				&amp;lt;mets&amp;gt; root&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot; bgcolor=&quot;#e6e6e6&quot;&gt;
&lt;p&gt; 				&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Relationships between data 				objects&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;structMap&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;div&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;mptr&lt;/span&gt;&amp;gt; 				| &amp;lt;&lt;span style=&quot;font-weight: medium&quot;&gt;fptr&lt;/span&gt;&amp;gt;&lt;br /&gt;                 &amp;lt;structLink&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;N/A&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;i&gt;Table 2: Basic OAIS ingest and METS container mapping (derived from: Tingle, 2004; Library of Congress, 2003; Library of Congress, 2004b)&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;   &lt;b&gt;Augmenting the SIP to Facilitate Conversion to an AIP&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	Figure 2 illustrates the complete OAIS ingest process.  Once all of the component SIPs have been received and checked for conformance to the archive&#039;s ingest specifications, the SIPs are enhanced and reformatted according to the archive&#039;s technical standards, as necessary.  Workflow procedures may require an administrative audit of the submitted information package.  Once these processes are complete, the AIP is created, descriptive metadata and &amp;ldquo;browse products&amp;rdquo; are extracted, then the AIP is sent to data management.   &lt;/p&gt;
&lt;p&gt; 
&lt;/p&gt;&lt;p&gt;&lt;span id=&quot;Frame2&quot; dir=&quot;LTR&quot; style=&quot;float: left; border: none; padding: 0in; background: #ffffff&quot;&gt;&lt;br /&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;img src=&quot;/thomas_files/inf389k-final_assignment_html_m7712c7c2.png&quot; name=&quot;Graphic2&quot; align=&quot;left&quot; width=&quot;100%&quot; border=&quot;0&quot; /&gt;&lt;br clear=&quot;left&quot; /&gt;&lt;i&gt;Figure 2: OAIS ingest process (CCSDS, 2002, p. 4-5)&lt;/i&gt;&lt;/p&gt;
&lt;p&gt; &lt;/p&gt;&lt;/span&gt; &lt;/p&gt;
&lt;p&gt; &lt;br clear=&quot;left&quot; /&gt;&lt;br /&gt; 
&lt;/p&gt;&lt;p&gt;	In the process of converting SIPs to an AIP, new metadata is created.  To ease the conversion process, the SIP format should include element definitions for these items of information.  Additional metadata for the SIP to AIP conversion are shown in Table 3 along with the METS container location and prescribed metadata element source.&lt;/p&gt;
&lt;table width=&quot;100%&quot; border=&quot;1&quot; bordercolor=&quot;#000000&quot; cellpadding=&quot;4&quot; cellspacing=&quot;0&quot; style=&quot;page-break-after: avoid&quot;&gt; 	&lt;col width=&quot;85*&quot; /&gt; 	&lt;col width=&quot;85*&quot; /&gt; 	&lt;col width=&quot;85*&quot; /&gt;&lt;br /&gt;
&lt;thead&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Additional Information&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;METS Element Location(s)&lt;/p&gt;
&lt;/th&gt;
&lt;th width=&quot;33%&quot; bgcolor=&quot;#e6e6ff&quot;&gt;
&lt;p&gt;Source for Specific Metadata&lt;/p&gt;
&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Ingest history&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;note 				type=&amp;rdquo;acquisition&amp;rdquo;&amp;gt;&lt;br /&gt;                 &amp;lt;note 				type=&amp;rdquo;conservation history&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Quality assurance (QA) results&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;techMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Schema for Technical 				Metadata for Text &amp;lt;processingNote&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;History  of formatting and 				encoding changes made during conversion process&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;note 				type=&amp;rdquo;conservation history&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Administrative audit reports&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;note 				type=&amp;rdquo;admin&amp;rdquo;&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Additional descriptive or 				classification data&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;&amp;lt;dmdSec&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;classification&amp;gt;, 				&amp;lt;subject&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Relationship to archive objects or collections&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;amdSec&amp;gt;&lt;br /&gt;                 &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;digiprovMD&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;relatedItem&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;Internal identifier(s)&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p&gt;&amp;lt;dmdSec&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;33%&quot;&gt;
&lt;p class=&quot;hanging-indent&quot; style=&quot;margin-bottom: 0in&quot;&gt;Metadata 				Object Description Schema:&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;&amp;lt;identifier&amp;gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;i&gt;Table 3: Additional metadata required for SIP to AIP conversion (derived from: Tingle, 2004; Library of Congress, 2003; Library of Congress, 2004b).&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;   &lt;b&gt;Example SIP Profile&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	The &lt;a href=&quot;/thomas_files/mets_sip_example.xml&quot;&gt;appendix&lt;/a&gt; contains a METS Profile Schema document describing the requirements enumerated in Tables 2 and 3.  At the end of the profile is an &amp;lt;Appendix&amp;gt; element containing an example ingest package encoded according to the profile specification.   &lt;/p&gt;
&lt;p&gt;   &lt;b&gt;References&lt;/b&gt;&lt;br /&gt;
&lt;/p&gt;&lt;p class=&quot;hanging-indent&quot;&gt;Consultative Committee for Space Data Systems (CCSDS) (2002).  Reference model for an Open Archival Information System (OAIS).  CCSDS 650.0-B-1 BLUE BOOK.  Retrieved on 28 November, 2004, from &lt;a href=&quot;http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/pdf/CCSDS-650.0-B-1.pdf&quot;&gt;http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/pdf/CCSDS-650.0-B-1.pdf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Library of Congress (2003).  METS news and announcements: Draft rights declaration schema is ready for review.  Retrieved on 11 December, 2004, from &lt;a href=&quot;http://www.loc.gov/standards/mets/news080503.html&quot;&gt;http://www.loc.gov/standards/mets/news080503.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Library of Congress (2004). METS: An overview &amp;amp; tutorial.  Retrieved on 7 December, 2004, from &lt;a href=&quot;http://www.loc.gov/standards/mets/METSOverview.v2.html&quot;&gt;http://www.loc.gov/standards/mets/METSOverview.v2.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Library of Congress (2004).  Metadata Object Description Schema (MODS).  Retrieved on 11 December, 2004, from &lt;a href=&quot;http://www.loc.gov/standards/mods&quot;&gt;http://www.loc.gov/standards/mods&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;McDonough, J. (2003).  Schema for Technical Metadata for Text.  Retrieved on 11 December, 2004, from &lt;a href=&quot;http://dlib.nyu.edu/METS/textmd.html&quot;&gt;http://dlib.nyu.edu/METS/textmd.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Tingle, B. (2004).  METS 1.3 schema documentation. Retrieved on 9 December, 2004, from &lt;a href=&quot;http://ark.cdlib.org/mets/schema_documentation&quot;&gt;http://ark.cdlib.org/mets/schema_documentation&lt;/a&gt;.&lt;/p&gt;
&lt;!-- end content --&gt;&lt;!-- end content --&gt;&lt;table id=&quot;attachments&quot; class=&quot;sticky-enabled&quot;&gt;
 &lt;thead&gt;&lt;tr&gt;&lt;th&gt;Attachment&lt;/th&gt;&lt;th&gt;Size&lt;/th&gt; &lt;/tr&gt;&lt;/thead&gt;
&lt;tbody&gt;
 &lt;tr class=&quot;odd&quot;&gt;&lt;td&gt;&lt;a href=&quot;http://thomas.kiehnefamily.us/thomas_files/mets_sip_example.xml&quot;&gt;mets_sip_example.xml&lt;/a&gt;&lt;/td&gt;&lt;td&gt;18.98 KB&lt;/td&gt; &lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
</description>
 <comments>http://thomas.kiehnefamily.us/an_oais_ingest_metadata_specification#comments</comments>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/digital_archives">Digital Archives</category>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/metadata">Metadata</category>
 <enclosure url="http://thomas.kiehnefamily.us/thomas_files/mets_sip_example.xml" length="19433" type="text/xml" />
 <pubDate>Tue, 14 Dec 2004 06:40:48 +0000</pubDate>
 <dc:creator>tkiehne</dc:creator>
 <guid isPermaLink="false">8 at http://thomas.kiehnefamily.us</guid>
</item>
<item>
 <title>TEI Lite History and Evaluation</title>
 <link>http://thomas.kiehnefamily.us/tei_lite_history_and_evaluation</link>
 <description>&lt;p&gt;	New and disparate ways of digitally encoding texts were developed as computing became available to scholars of the humanities in the 1980s.  The encoding of textual objects into a digital form creates opportunities for examining old and rare texts simultaneously and without the risk of wear or damage to the original object.  Additionally, an encoded object permits new ways of interacting with the text, such as concurrent views of different versions and viewing subsequent editorial or annotations.  The lack of standard methods for encoding and describing texts made it difficult for researchers to exchange objects and diminished the benefits that the digital format offers.   &lt;/p&gt;
&lt;!--break--&gt;&lt;!--break--&gt;&lt;p&gt;	 The Text Encoding Initiative (TEI) was conceived in this disjointed digitization environment.  TEI is a successful and influential metadata encoding standard that is primarily concerned with the encoding of textual objects, but is flexible enough to apply to many other types of information objects.  The standard is customizable and extensible.  One such customization is TEI Lite, a subset of the TEI specification.  In this essay we will examine the development and history of TEI Lite as well as the role it plays in documenting the lifecycle of digital objects.  TEI Lite&#039;s relationship to other metadata initiatives will also be explored.  Finally, an evaluation will be made of how well TEI achieves its purpose and some of the problems the specification faces.&lt;/p&gt;
&lt;p&gt; &lt;b&gt;History of TEI Lite&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	TEI Lite shares its formative history with its superset, TEI.  Work on TEI formally began in 1987 with the meeting of a group of 32 scholars from North America, Europe, and Asia held at Vassar College in Poughkeepsie, NY.  The initial meeting was convened by the Association for Computers in the Humanities and funded by the National Endowment for the Humanities with the purpose of beginning work on the problems facing digital text encoding (Mylonas &amp;amp; Renear, 1999, pp. 3-4).   &lt;/p&gt;
&lt;p&gt;	At the close of the conference, the group issued a closing statement to provide direction for the development of guidelines.  The statement, known as the &amp;ldquo;Poughkeepsie Principles,&amp;rdquo; directed that the forthcoming guidelines should (Burnard &amp;amp; Sperberg-McQueen, 2002, p. 1):  &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;suffice to represent the textual features needed for 	research;&lt;/p&gt;
&lt;/li&gt;&lt;li&gt;
&lt;p&gt;be simple, clear, and concrete;&lt;/p&gt;
&lt;/li&gt;&lt;li&gt;
&lt;p&gt;be easy for researchers to use without special-purpose 	software;&lt;/p&gt;
&lt;/li&gt;&lt;li&gt;
&lt;p&gt;allow the rigorous definition and efficient processing of 	texts;&lt;/p&gt;
&lt;/li&gt;&lt;li&gt;
&lt;p&gt;provide for user-defined extensions;&lt;/p&gt;
&lt;/li&gt;&lt;li&gt;
&lt;p&gt;conform to existing and emergent standards.&lt;/p&gt;
&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;	After the meeting in 1987, three organizations participated in forming the guidelines:  the Association for Computers in the Humanities, the Association for Literary and Linguistic Computing, and the Association for Computational Linguistics (Mylonas &amp;amp; Renear, 1999, p. 3).  Draft versions of the TEI Header and Guidelines were completed and distributed in 1990 (MIT Libraries, 2004).  After several years of refinement, the final draft (version P3) was released in 1994.  &amp;ldquo;Guidelines for the encoding and interchange of Machine-Readable Texts&amp;rdquo; spanned 1300 pages and defined over 600 elements of Standardized General Markup Language (SGML).  The TEI specifications defined an extensible set of elements that could be customized by  user communities for their specific needs.  One of these customizations is TEI Lite, which defines a subset of TEI meant to serve as a &amp;ldquo;starter set&amp;rdquo; of core elements to assist in learning the extensive TEI set (Burnard, 2000).   &lt;/p&gt;
&lt;p&gt;	The P3 guidelines underwent several minor revisions between 1994 and 2001, mostly to clarify varying interpretations and practices (Burnard &amp;amp; Popham, 1999, p. 39).  During this time, however, the success of TEI as a metadata specification informed and influenced the development of the eXtensible Markup Language (XML) (DeRose, 1999).  TEI was subsequently converted to XML and released as version P4 in Summer 2001 (MIT Libraries, 2004).  Development of TEI Lite continues in parallel with TEI, the next revision of which (P5) is expected to be released at the end of 2004 (TEI Consortium, 2003, How to participate - Next version).&lt;/p&gt;
&lt;p&gt;	Formal structures were developed to guide future development as participation increased.  An Executive Committee formed in the mid-nineties that included representatives from each of the three sponsoring associations and two influential researchers, Michael Sperberg-McQueen (University of Illinois at Chicago) and Lou Burnard (Oxford University).  By 1996, a Technical Review Committee was established to conduct the development and maintenance of the guidelines in a manner similar to the International Standards Organization (ISO) (Burnard &amp;amp; Light, 1996, pp. 25-26).&lt;/p&gt;
&lt;p&gt;	In 1999,  the Executive Committee was petitioned to create an international membership organization that could better handle the TEI&#039;s increasing administration and development responsibilities.  The petition resulted in the formation of a non-profit corporation (Burnard, 2000).  Membership in the consortium includes dozens of agencies from the humanities, education, computing, linguistics, and librarianship.  Members elect a technical council that oversees development of the guidelines and funding for the organization.  The consortium&#039;s first Council was elected in 2001 and met for the first time in 2002.  Members may also participate in the various special interest groups or workforces that develop the guidelines (TEI Consortium, 2004, How to participate).  The Consortium relies on its members to expand TEI&#039;s user base and has chartered a special interest group for training to support their efforts (TEI Consortium, 2004, How to participate &amp;ndash; Special interest groups).&lt;/p&gt;
&lt;p&gt;	TEI is hosted by four universities and is sponsored by the three associations originally responsible for initial development of the guidelines.  Significant support is provided by the U.S. National Endowment for the Humanities (NEH), Directorate XIII of the Commission of the European Communities (CEC/DG-XIII), the Andrew W. Mellon Foundation, and the Social Science and Humanities Research Council of Canada (TEI Consortium, 2004).&lt;/p&gt;
&lt;p&gt; &lt;b&gt;The Functional Role and Structure of TEI Lite&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	TEI and TEI Lite intend to define a framework for the encoding of texts that facilitates the interchange of digital objects.  The specification defines a common and extensible language that different software platforms can understand and use to render the digital object in consistent ways. Although the development of TEI has focused on the encoding of texts, particularly capturing non-digital texts, the framework is applicable to the description of non-text objects such as images and sound (Burnard &amp;amp; Sperberg-McQueen, 2002, p. 1).&lt;/p&gt;
&lt;p&gt;	The description and interchange goals of TEI implies fidelity to the structure and content of the object being encoded.  As such, much of the focus of TEI is on the structural description of textual objects, while the TEI header supports most of the lifecycle metadata functions (see Table 1).  Elements of the TEI header provide creation, appraisal and descriptive metadata and, to a lesser extent, transfer/authenticity and preservation metadata.  Accession and usage metadata are much less apparent, but may be augmented by the information system that stores the digital object.  Rights metadata is simply represented in regard to the original object.  In fact, the distinction between metadata about the digital encoding and metadata about the original object is difficult to discern from the element definitions and likely results in differing practices.  In general, the file description (&lt;i&gt;fileDesc&lt;/i&gt;) describes attributes of the original text while the encoding description (&lt;i&gt;encodingDesc&lt;/i&gt;) concentrates on aspects of the digital implementation&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt; &lt;/p&gt;
&lt;table width=&quot;625&quot; border=&quot;1&quot; bordercolor=&quot;#000000&quot; cellpadding=&quot;2&quot; cellspacing=&quot;0&quot; style=&quot;page-break-after: avoid&quot;&gt; 	&lt;col width=&quot;152&quot; /&gt; 	&lt;col width=&quot;155&quot; /&gt; 	&lt;col width=&quot;150&quot; /&gt; 	&lt;col width=&quot;150&quot; /&gt;&lt;br /&gt;
&lt;tr valign=&quot;bottom&quot;&gt;
&lt;td width=&quot;152&quot; height=&quot;12&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Creation&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;155&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Appraisal&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Transfer/Authenticity&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Accession&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;top&quot;&gt;
&lt;td width=&quot;152&quot; height=&quot;93&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.28in; margin-bottom: 0in&quot;&gt; 			 &lt;b&gt;&amp;lt;titleStmt&amp;gt;&lt;/b&gt;(all)&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;profileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.2in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;creation&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot;&gt;&lt;b&gt;&amp;lt;revisionDesc&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/td&gt;
&lt;td width=&quot;155&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.24in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;editionStmt&amp;gt;&lt;/b&gt;&lt;span style=&quot;font-weight: medium&quot;&gt;(all)&lt;/span&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.24in; margin-bottom: 0in; font-weight: medium&quot;&gt; 			&lt;b&gt;&amp;lt;seriesStmt&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.24in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;sourceDesc&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;encodingDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;projectDesc&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;samplingDecl&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in&quot;&gt; 			&lt;b&gt;&amp;lt;editorialDecl&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.26in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;extent&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.26in; margin-bottom: 0in&quot;&gt; 			&amp;lt;publicationStmt&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.49in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;publisher&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.49in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;distributor&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.49in&quot;&gt; 			&lt;b&gt;&amp;lt;authority&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.24in; margin-bottom: 0in&quot;&gt; 			&amp;lt;publicationStmt&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.49in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;authority&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.24in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;notesStmt&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt; 			&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;font-weight: medium&quot;&gt;(also 			defined by containing system)&lt;/div&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=&quot;bottom&quot;&gt;
&lt;td width=&quot;152&quot; height=&quot;13&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Descriptive&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;155&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Preservation&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Usage&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;p align=&quot;center&quot;&gt;&lt;b&gt;Rights&lt;/b&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr valign=top&gt;
&lt;td width=&quot;152&quot; height=&quot;106&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in; font-weight: medium&quot;&gt; 			&amp;lt;titleStmt&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.98in; margin-right: 0in; text-indent: -0.39in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;title&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in; font-weight: medium&quot;&gt; 			&amp;lt;editionStmt&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.98in; margin-right: 0in; text-indent: -0.38in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;edition&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in; font-weight: medium&quot;&gt; 			&lt;b&gt;&amp;lt;seriesStmt&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;sourceDesc&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;profileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.19in&quot;&gt; 			&lt;b&gt;&amp;lt;textClass&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/td&gt;
&lt;td width=&quot;155&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;extent&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;encodingDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.2in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;tagsDecl&amp;gt;&lt;/b&gt;(all)&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.2in; margin-bottom: 0in&quot;&gt; 			&lt;b&gt;&amp;lt;refsDecl&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;profileDesc&amp;gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.17in&quot;&gt; 			&lt;b&gt;&amp;lt;langUsage&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/div&gt;&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&lt;b&gt;&amp;lt;revisionDesc&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt; 			&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;font-weight: medium&quot;&gt;(also 			defined by containing system)&lt;/div&gt;
&lt;/td&gt;
&lt;td width=&quot;150&quot;&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-bottom: 0in&quot;&gt;&amp;lt;fileDesc&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.49in; margin-right: 0in; text-indent: -0.21in; margin-bottom: 0in&quot;&gt; 			&amp;lt;publicationStmt&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in; margin-bottom: 0in&quot;&gt; 			&amp;lt;publisher&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in; margin-bottom: 0in&quot;&gt; 			  &lt;b&gt;&amp;lt;availability&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in; margin-bottom: 0in&quot;&gt; 			&amp;lt;distributor&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in; margin-bottom: 0in&quot;&gt; 			  &lt;b&gt;&amp;lt;availability&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in; margin-bottom: 0in&quot;&gt; 			&amp;lt;authority&amp;gt;&lt;/div&gt;
&lt;div align=&quot;left&quot; style=&quot;margin-left: 0.99in; margin-right: 0in; text-indent: -0.47in&quot;&gt; 			  &lt;b&gt;&amp;lt;availability&amp;gt;&lt;/b&gt;&lt;/div&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;i&gt;Table 1: Metadata life-cycle roles of TEI header elements. (derived from Burnard &amp;amp; Sperberg-McQueen, 2002)&lt;/i&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in&quot;&gt;	A complete TEI Lite document contains a header and text body (see Figure 1).  The header, as indicated above, contains metadata related to the digital object and the original information object.  The header is separable from the encoded body, which allows it to serve as a description for non-text objects stored separately from the header.  The TEI header is analogous to the title page of a text. It has up to four parts: a description of the electronic file (&lt;i&gt;fileDesc&lt;/i&gt;), an encoding description (&lt;i&gt;encodingDesc&lt;/i&gt;), a non-bibliographic description of the text (&lt;i&gt;profileDesc&lt;/i&gt;), and a revision history (&lt;i&gt;revisionDesc&lt;/i&gt;)(Burnard &amp;amp; Sperberg-McQueen, 2002, p. 6).  Of these, only the file description is required, the elements of which can be related directly to MAchine Readable Cataloging (MARC) fields.  Unlike MARC, elements of the TEI header are not required to conform to a controlled vocabulary such as described by the Anglo American Cataloging Rules (AACR), although such rules may be applied at the encoder&#039;s discretion (Pouchard, 1998).&lt;/p&gt;
&lt;p&gt; &lt;br /&gt;
&lt;/p&gt;&lt;p style=&quot;margin-bottom: 0in&quot;&gt;&lt;span id=&quot;Frame1&quot; dir=&quot;LTR&quot; style=&quot;float: left; border: none; padding: 0in; background: #ffffff&quot;&gt;&lt;br /&gt;
&lt;p style=&quot;margin-top: 0.08in&quot;&gt;&lt;img src=&quot;/thomas_files/TEI-Lite_essay_html_m54aa5587.gif&quot; name=&quot;Graphic1&quot; align=&quot;left&quot; border=&quot;0&quot; /&gt;&lt;br clear=&quot;left&quot; /&gt;&lt;i&gt;Figure 1: Structure of a TEI Lite document (HTML Writers Guild, 2001)&lt;/i&gt;&lt;/p&gt;
&lt;p&gt; &lt;/p&gt;&lt;/span&gt; &lt;/p&gt;
&lt;p&gt;&lt;br clear=&quot;left&quot; /&gt;&lt;br /&gt; 
&lt;/p&gt;&lt;p&gt;	For textual objects, structural encoding is defined within the &lt;i&gt;text&lt;/i&gt; element.  A TEI text may contain a single, unitary work, or a group of works as realized in a series or anthology.  For the latter case, the text element may contain an arbitrary number of &lt;i&gt;group&lt;/i&gt; elements, each containing a text body with optional front and back matter.  Additionally, multiple TEI objects may be grouped as a corpus, analogous to a collection of texts (Burnard &amp;amp; Sperberg-McQueen, 2002, p. 7).&lt;/p&gt;
&lt;p&gt;	The range of elements and the relatively relaxed markup rules allow for varying granularity depending on the intended usage.  The body of an encoded text is structured by &lt;i&gt;p&lt;/i&gt; and &lt;i&gt;div&lt;/i&gt; elements, similar to those in HyperText Markup Language (HTML), that represent chapters, sections, and subsections of a text.  Text within these structures may be further encoded using a myriad of markup that indicate layout and appearance.  Furthermore, elements are available for defining alternate appearances or versions of text and editorial markup or annotations as applied to the original object.  Additionally, elements such as &lt;i&gt;unclear&lt;/i&gt; allow for the indication of unintelligible or damaged areas of text (Burnard &amp;amp; Sperberg-McQueen, 2002). Such elements enhance textual analysis by allowing the encoding of multiple version of a text within the same electronic file.&lt;/p&gt;
&lt;p&gt; &lt;b&gt;Relationship to Other Metadata Initiatives&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	TEI was one of the first metadata initiatives, predated only by MARC and the International Standard Bibliographic Description (ISBD), AACR, and SGML standards (Burnard &amp;amp; Light, 1996).  The TEI header and the descriptive fields of later versions of MARC closely resemble the functional structure of the ISBD.  Despite the structural similarities with MARC and ISBD, however, TEI does not require the use of controlled vocabulary and as such does not readily convert to either standard.  Early TEI development eschewed strict cataloging requirements in the expectation that non-catalogers would use the specification.  The decision to conform to standard cataloging practices is left to the creating agency (SCHEMAS Registry, 2002).  Such a flexible approach favors ease of use over uniformity in order to facilitate a wider adoption of the standard (MIT Libraries, 2004) - an approach that Dublin Core has also uses.  &lt;/p&gt;
&lt;p&gt;	Metadata initiatives developed subsequent to TEI have benefited from TEI&#039;s success and derive structures from TEI Lite.  Encoded Archival Description (EAD) borrowed TEI&#039;s header concept (Burnard &amp;amp; Light, 1996, p. 13).  Other metadata initiatives are domain specific applications of TEI.  The Consortium for the Computer Interchange of Museum Information (CIMI) uses the TEI framework for the description of museum resources (Burnard &amp;amp; Light, 1996, p. 15).  Another derivation is the Spoken Text Markup Language (STML), a text to speech markup language inspired by TEI (Sproat, 1997).  Similarly, the Music Encoding Initiative (MEI) was based on TEI (Roland, 2002).&lt;/p&gt;
&lt;p&gt;	Most notable of TEI influences was in the development of XML.  TEI represented the first and  most precise SGML implementation at the time of XML&#039;s development.  As a result, developers of TEI were closely involved in defining XML.  Especially useful to the nascent XML specification was TEI&#039;s extended pointer language which served as a prototype for XLink and Xpointer (DeRose, 1999).&lt;/p&gt;
&lt;p&gt; &lt;b&gt;Evaluation of TEI Lite&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	TEI Lite was created to present a useful subset of TEI that provides the elements necessary for most common encodings.  The 140 elements of TEI Lite represent only a fraction of the hundreds of elements available in TEI and its extensions.  The majority of the subset, besides those in the header, define basic structural and perceptual attributes necessary for textual objects, but not so many as to become overly granular.  Additionally, the use of a lesser number of elements restricts the size of the metadata vocabulary that different agencies need to have in order to understand conventions used during encoding and markup.  The subset represents a lowest common denominator of sorts that is compliant with and upgradeable to the full TEI specification.  &lt;/p&gt;
&lt;p&gt;	There are a number of criticisms with TEI encodings.  First, as mentioned previously, the header lacks a controlled vocabulary for bibliographic elements.  There is a compromise between usability from the perspective of creation and accessibility in terms of resource location.  Free text bibliographic descriptions, however, could prove to be more useful for scholars of ancient texts which, by their unique character, require more detailed descriptions than those afforded in library cataloging (Pouchard, 1998).  The upcoming P5 version of TEI will allow external metadata and namespaces to be included in TEI documents (TEI Consortium, 2004, Guidelines - P5 status).  Embedding MARC encoded data may offer a solution to controlled vocabulary problem, although it is uncertain how such features will cascade into TEI Lite.&lt;/p&gt;
&lt;p&gt;	Second, texts may overlap semantic and organizational structures.  XML and TEI are hierarchical languages that require inelegant procedures to represent such overlapping structures.  The overlap problem is especially pertinent to representing variant structures beyond the word or character level such as macro-level versions and variations (Smith, 1999).&lt;/p&gt;
&lt;p&gt;	Third, the reduced set of elements available in TEI Lite reduces the chance of over-granular structure, but divergent encoding practices are still possible.  The basic structural elements (&lt;i&gt;p&lt;/i&gt; and &lt;i&gt;div&lt;/i&gt;) and their attributes may be used differently and result in confusion when encoded documents are exchanged.  Numerous &amp;ldquo;best practices&amp;rdquo; standards have been created to help alleviate variation within institutions (TEI Consortium, 2004, Tutorials).  The loosely prescribe structuring rules, however, demand that TEI rendering tools be just as flexible and not beholden to a particular encoding practice.&lt;/p&gt;
&lt;p&gt;	 Finally, the basic assumptions underlying the use of structural elements creates problems for representing the physical structure of a work.  TEI is based primarily on encoding the intellectual structures of a text, such as chapters, acts, volumes, and other semantic containers.  Such assumptions preempt encoding structures based on physical attributes of the container, such as the sequence of formes in early printed texts (Bauman &amp;amp; Catapano, 1999).  The scope of this problem may be beyond the capabilities of TEI Lite and require use of the larger element set of TEI.&lt;/p&gt;
&lt;p&gt;	Despite these criticisms, TEI Lite successfully achieves its goal of providing a readily adaptable point of entry to TEI.  Furthermore, TEI Lite sufficiently addresses the domain problems that TEI was meant to solve.  We can judge the 1987 Poughkeepsie Principles in terms of the current implementation of TEI Lite:  TEI Lite provides for simple, clear, and concise representations of textual objects;  Expression in XML allows for efficient processing, the use of non-specialized software, and conforms to existing standards;  Structural definitions in TEI Lite are not as rigorous as TEI, and the user may not extend TEI Lite freely, however, upward compatibility between the specifications provides a solution.&lt;/p&gt;
&lt;p&gt;	In addition to basic principles, we may judge success by the degree to which TEI Lite has been adopted.  The Oxford Text Archive and the Electronic Text Centers at the University of Virginia and the University of Michigan use TEI Lite to encode their holdings. The TEI Consortium uses TEI Lite in its technical documentation (Burnard &amp;amp; Sperberg-McQueen, 2002, p. 2).  Additionally, a significant number of the projects listed on the consortium Web site use TEI Lite (TEI Consortium, 2004, Projects using TEI) and a cursory Web and journal search reveals that TEI Lite is frequently used for encoding projects and research.&lt;/p&gt;
&lt;p&gt; &lt;b&gt;Conclusion&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;	TEI Lite is an introductory subset of TEI, one of the earliest metadata initiatives.  The encoding standard blends a flexible implementation with established descriptive principles.  The result is a metadata set that is easy to apply and capable of describing many types of objects.  The success of TEI, representing the efforts of scholars worldwide, has informed the development of many subsequent metadata standards and influenced the development of XML.  Development of the standard continues as does its increased use in projects for a variety of domains.&lt;/p&gt;
&lt;p&gt; &lt;b&gt;References&lt;/b&gt;&lt;br /&gt; (see also &lt;a href=&quot;/thomas_files/TEI-Lite_pathfinder.pdf&quot;&gt;pathfinder &amp;amp; annotated bibliography&lt;/a&gt;)&lt;br /&gt;
&lt;/p&gt;&lt;p class=&quot;hanging-indent&quot;&gt;Bauman, S. &amp;amp; Catapano, T. (1999).  TEI and the encoding of the physical structure of books. &lt;i&gt;Computers and the Humanities, 33&lt;/i&gt;(1/2), 113&amp;ndash;127.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Burnard, L. &amp;amp; Sperberg-McQueen, C. (1995, updated 2002).  &lt;i&gt;TEI Lite: An introduction to text encoding for interchange&lt;/i&gt;.  Retrieved on 18 September, 2004, from &lt;a href=&quot;http://www.tei-c.org/Lite/teiu5_en.pdf&quot;&gt;http://www.tei-c.org/Lite/teiu5_en.pdf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Burnard, L. &amp;amp; Light, R. (1996).  &lt;i&gt;Three SGML metadata formats: TEI, EAD, and CIMI:  A Study for BIBLINK Work Package 1.1&lt;/i&gt;.  Retrieved on 18 September, 2004, from &lt;a href=&quot;http://www.ifla.org/documents/libraries/cataloging/metadata/biblink2.pdf&quot;&gt;http://www.ifla.org/documents/libraries/cataloging/metadata/biblink2.pdf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Burnard, L. &amp;amp; Popham, M. (1999).  Putting our headers together: A report on the TEI header meeting 12 September 1997.  &lt;i&gt;Computers and the Humanities, 33&lt;/i&gt;(1/2), 39&amp;ndash;47.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Burnard, L. (2000).  Text encoding for interchange: A new consortium.  &lt;i&gt;Ariadne, 24&lt;/i&gt;(21 June 2000).  Retrieved on 16 September, 2004, from &lt;a href=&quot;http://www.ariadne.ac.uk/issue24/tei/&quot;&gt;http://www.ariadne.ac.uk/issue24/tei/&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;DeRose, S. (1999).  XML and the TEI.  &lt;i&gt;Computers and the Humanities, 33&lt;/i&gt;(1/2), 11&amp;ndash;30.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;HTML Writers Guild (2001).  An introduction to the Text Encoding Initiative (TEI), DTD. Retrieved on 26 November, 2004, from &lt;a href=&quot;http://gutenberg.hwg.org/teidtds.html&quot;&gt;http://gutenberg.hwg.org/teidtds.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;MIT Libraries (2004).  &lt;i&gt;MIT metadata reference guide: TEI (Text Encoding Initiative) metadata&lt;/i&gt;.  Retrieved on 16 September, 2004, from &lt;a href=&quot;http://libraries.mit.edu/guides/subjects/metadata/standards/tei.html&quot;&gt;http://libraries.mit.edu/guides/subjects/metadata/standards/tei.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Mylonas, E. &amp;amp; Renear, A. (1999).  The Text Encoding Initiative at 10: Not just an interchange format anymore &amp;ndash; But a new research community.  &lt;i&gt;Computers and the Humanities, 33&lt;/i&gt;(1/2), 1&amp;ndash;9.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Pouchard, L. (1998).  Cataloging for digital libraries: The TEI scheme and the TEI header.  &lt;i&gt;Katharine Sharp Review, 6&lt;/i&gt;(Winter 1998).  Retrieved on 18 September, 2004, from &lt;a href=&quot;http://alexia.lis.uiuc.edu/review/6/pouchard.html&quot;&gt;http://alexia.lis.uiuc.edu/review/6/pouchard.html&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Roland, P. (2002).  &lt;i&gt;The Music Encoding Initiative (MEI)&lt;/i&gt;.  &lt;span style=&quot;font-style: normal&quot;&gt;Musical Applications using XML (MAX) 2002 Conference.&lt;/span&gt;  Retrieved on 21 November, 2004, from &lt;a href=&quot;http://dl.lib.virginia.edu/bin/dtd/mei/maxpaper.pdf&quot;&gt;http://dl.lib.virginia.edu/bin/dtd/mei/maxpaper.pdf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;SCHEMAS Registry (2002).  &lt;i&gt;Activity reports: Text Encoding Initiative&lt;/i&gt;.  Retrieved on 16 September, 2004, from &lt;a href=&quot;http://www.schemas-forum.org/registry/desire/activityreports.php3?field=filename&amp;amp;value=TEI_D29D35(RDF).rtf&quot;&gt;http://www.schemas-forum.org/registry/desire/activityreports.php3&lt;br /&gt;?field=filename&amp;amp;value=TEI_D29D35(RDF).rtf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Smith, D. (1999).  Textual variation and version control in the TEI.  &lt;i&gt;Computers and the Humanities, 33&lt;/i&gt;(1/2), 103&amp;ndash;112.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;Sproat, R., Taylor, P., Tanenblatt. M. &amp;amp; Isard, A. (1997).  &lt;i&gt;A markup language for text-to-speech synthesis&lt;/i&gt;.  5th European Conference on Speech Communication and Technology, Rhodes, Greece, September 22-25, 1997.  Retrieved on 21 November, 2004, from &lt;a href=&quot;http://www.talkingheads.computing.edu.au/resources/documents/serge/Sproat/A%20Markup%20Language%20for%20TTS%20Synthesis-Sproat.pdf&quot;&gt;http://www.talkingheads.computing.edu.au/resources/documents/serge/&lt;br /&gt;Sproat/A%20Markup%20Language%20for%20TTS%20Synthesis-Sproat.pdf&lt;/a&gt;.&lt;/p&gt;
&lt;p class=&quot;hanging-indent&quot;&gt;TEI Consortium (2004).  &lt;i&gt;Text Encoding Initiative&lt;/i&gt;.  Retrieved on 16 September, 2004, from &lt;a href=&quot;http://www.tei-c.org&quot;&gt;http://www.tei-c.org&lt;/a&gt;.&lt;/p&gt;
&lt;!-- end content --&gt;&lt;!-- end content --&gt;&lt;table id=&quot;attachments&quot; class=&quot;sticky-enabled&quot;&gt;
 &lt;thead&gt;&lt;tr&gt;&lt;th&gt;Attachment&lt;/th&gt;&lt;th&gt;Size&lt;/th&gt; &lt;/tr&gt;&lt;/thead&gt;
&lt;tbody&gt;
 &lt;tr class=&quot;odd&quot;&gt;&lt;td&gt;&lt;a href=&quot;http://thomas.kiehnefamily.us/thomas_files/TEI-Lite_pathfinder.pdf&quot;&gt;TEI-Lite_pathfinder.pdf&lt;/a&gt;&lt;/td&gt;&lt;td&gt;235.58 KB&lt;/td&gt; &lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
</description>
 <comments>http://thomas.kiehnefamily.us/tei_lite_history_and_evaluation#comments</comments>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/digital_archives">Digital Archives</category>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/digital_libraries">Digital Libraries</category>
 <category domain="http://thomas.kiehnefamily.us/page_subjects/metadata">Metadata</category>
 <enclosure url="http://thomas.kiehnefamily.us/thomas_files/TEI-Lite_pathfinder.pdf" length="241236" type="application/pdf" />
 <pubDate>Tue, 30 Nov 2004 05:57:09 +0000</pubDate>
 <dc:creator>tkiehne</dc:creator>
 <guid isPermaLink="false">3 at http://thomas.kiehnefamily.us</guid>
</item>
</channel>
</rss>
