Archives

May 28, 2020

MetaArchive Member Profile: Oregon State University Libraries

By: Michael Boock, Associate Professor/Scholarly Communication Librarian

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

Oregon State University Libraries has been firmly committed to the long-term preservation of the scholarship of the university and its unique digital assets as far back as 2008, when Terry Reese was appointed to an endowed position with responsibilities for building the digital preservation infrastructure of the Libraries. During his tenure, the Libraries began using LOCKSS for preserving journal content and joined the MetaArchive Cooperative as a sustaining member. Our digital preservation operations were vastly improved after 2012 with the hire of Brian Davis as Digital Production Unit Head, who developed format-specific identification, validation, characterization, and fixity checking of digitized content. The Libraries further committed to digital preservation in a 2012-2017 strategic plan that called for the creation of a “robust and flexible digital preservation and curation infrastructure” and “a long-term preservation system for university scholarship and digital collections developed and curated by OSU Libraries and Press.”

Looking ahead, what are you excited about, or what’s on the horizon for your program?

Brian and I presented a report to library leadership in 2017 that described the current state of the library’s digital preservation efforts and recommended next steps for preserving the Libraries’ digital objects. Emblematic of how quickly things are changing in the digital preservation space, many of the report recommendations have shifted over the last couple of years, but I am thrilled to say that some of the recommendations have been implemented, in particular upgrading the library’s backup and storage systems to include monthly and incremental daily backups and the increased use of Archivematica for processing digital objects before repository ingest.

“Aside from our use of MetaArchive to preserve substantial amounts of our most important digital content, I value MetaArchive for its community of experts. It is immensely valuable to be able to learn from leaders in our field.”
Pictured, Top Row, L-R: Michael Boock, Associate Professor/Scholarly Communication Librarian; Brian Davis, Digital Production Unit Supervisor. Bottom Row, L-R: Hui Zhang, Associate Professor/Digital Services Librarian; Margaret Mellinger, Associate Professor/Director, Emerging Technologies and Services

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

As noted, digitized objects go through a breadth of preservation work to ensure content validation and fixity. The master, preservation-level files are then moved onto ZFS storage systems using BagIt packaging. For born-digital scholarship housed in the Samvera/Fedora-based ScholarsArchive@OSU institutional repository, file integrity is checked with a checksum tool as part of file ingestion. Dr. Hui Zhang, digital services librarian, uses a script that traverses the hierarchy of repository objects in the institutional repository to locate and export binary files with the RDF metadata from specific repository collections. The generated bags are then moved to temporary Amazon Web Services storage for MetaArchive harvesting.
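As a rough illustration of the packaging step described above, the following minimal sketch builds a bag by hand using only the Python standard library. It is not the script used at OSU: the function names `sha256_of` and `make_bag`, the directory arguments, and the choice of SHA-256 are assumptions made for the example. The layout (a `data/` payload directory, a `bagit.txt` declaration, and a `manifest-sha256.txt` listing one checksum per payload file) follows the BagIt specification.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_bag(source_dir: Path, bag_dir: Path) -> Path:
    """Copy source_dir into bag_dir/data and write the BagIt tag files.

    Hypothetical helper for illustration; returns the manifest path.
    """
    data_dir = bag_dir / "data"
    # copytree creates bag_dir and data_dir; it fails if data_dir already exists.
    shutil.copytree(source_dir, data_dir)

    # Bag declaration required by the BagIt specification.
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )

    # Payload manifest: one "<checksum>  <relative path>" line per file.
    lines = []
    for path in sorted(p for p in data_dir.rglob("*") if p.is_file()):
        relative = path.relative_to(bag_dir).as_posix()
        lines.append(f"{sha256_of(path)}  {relative}\n")

    manifest = bag_dir / "manifest-sha256.txt"
    manifest.write_text("".join(lines), encoding="utf-8")
    return manifest
```

Calling `make_bag(Path("export/collection"), Path("bags/collection"))` (both paths are placeholders) would leave a self-describing directory whose checksums can be re-verified after every transfer. Production workflows would more likely rely on an established tool rather than hand-rolled code.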

What types of digital collections are you focusing on for preservation in MetaArchive? What will preserving those collections for the long-term mean for their users or your institution? How are some of those collections used now?

The Libraries first used MetaArchive to replicate the university’s corpus of Electronic Theses and Dissertations. Theses and dissertations represent the breadth of significant research and scholarship conducted at the university over its entire history, and also serve as an important historical record of OSU’s research and teaching interests. MetaArchive is also used to replicate all of the University’s Extension and Experiment Station Communication Publications (EESC). As noted in this editorial from the Corvallis Gazette-Times, published after the EESC collection of over 6,000 technical reports was digitized and made available in the IR, many of the same issues that were important to Oregon residents 100 years ago continue to be important today. Preserving this content with MetaArchive’s robust Private LOCKSS network helps to ensure that it will be available to citizens today and long into the future.

An Editorial from the Corvallis Gazette-Times: After 100 years, Extension still valuable.

An editorial from the Corvallis Gazette-Times, published after the EESC collection of over 6,000 technical reports was digitized and made available in the IR, noted that many of the same issues that were important to Oregon residents 100 years ago continue to be important today.

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

I have served as OSU’s representative on the MetaArchive Steering Team since 2015 and am this year’s Chair of the Steering Team. When I joined the Steering Team five years ago, I had a strong interest in digital preservation, but I had very little idea about how to do it. As noted above, OSU Libraries has invested in staff and resources to improve digital preservation operations, but this work should not be done in a vacuum. Aside from our use of MetaArchive to preserve substantial amounts of our most important digital content, I value MetaArchive for its community of experts. It is immensely valuable to be able to learn from leaders in our field such as Katherine Skinner, Matt Schultz, and Sam Meister (former MetaArchive Community Manager), and to learn from colleagues from a variety of different library and museum types about their preservation work.

Tell us a bit about your experience participating in the Changing for Continued Impact Series? What have been some of your key takeaways from the series thus far?

As Katherine Skinner (Executive Director of Educopia) noted to members last year, MetaArchive, as the world’s longest-tenured distributed digital preservation solution, has been in place and operating within the same technology base and governance structure since its inception. As part of this Series, our community has had an opportunity to hear from experts in the field about alternative technological approaches. It has been invaluable to me to learn from those experts, and from MetaArchive’s own experts like Nathan Tallman (Penn State) and Zach Vowell (Cal Poly), that there are alternative solutions that are ripe for further exploration by the community. Another key takeaway for me is that MetaArchive will remain viable as a preservation network only so long as we are prepared to transition to meet the needs of the community. Fortunately, the transparency of the network and its governance structure helps to ensure that the community’s needs will continue to be met.

Editorial note: “Since late 2019 the MetaArchive community has been undergoing a series of intensive evaluations of both their organizational model as well as their technical approaches to distributed digital preservation (DDP). This is the Changing for Continued Impact (CFCI) Series, a facilitated framework led by Educopia that engages the MetaArchive members in a series of focused-discussions and work-sessions. This generative and co-creative process got underway in earnest this past Fall 2019, and will continue through Spring 2020 leading up to the next Annual MetaArchive Membership Meeting.”


April 15, 2020

MetaArchive Member Profile: Virginia Tech University Libraries

By: Alex Kinnaman, Digital Preservation Coordinator, and Nathan Hall, Director of Digital Imaging and Preservation

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

Virginia Tech University Libraries was a founding member of the MetaArchive Cooperative and has hosted a LOCKSS cache since 2007. Our preservation system has evolved since then, including the addition of a second distributed digital preservation service with APTrust, the hiring of two digital preservation faculty members in 2017, and the ongoing development of a preservation-centric Digital Library Platform. The preservation system is managed by the Director of Digital Imaging and Preservation Services, the Digital Preservation Coordinator, and the Digital Preservation Technologist, and it is implemented by the Digital Library Development team in the Library. This group is responsible for developing and maintaining policies, overseeing workflows, and collaborating with our content producers. We are currently working on our Digital Preservation Program Priorities and Deliverables, outlining policies, services, and automations to be integrated with our new platform in development.

Looking ahead, what are you excited about, or what’s on the horizon for your program?

We have recently received a grant to digitize the Virginia Tech Insect Collection in 3D using photogrammetry in collaboration with the Entomology Department. 3D objects are complex and dynamic objects that present a preservation challenge, and we are investigating how our preservation workflow for these objects will differ from workflows for simpler objects. We are also developing a more robust Digital Humanities support system in the Libraries and are collaborating with VT Publishing to develop preservation levels for the variety of DH projects we hope to host. Ultimately, we are excited to have an automated preservation system built into the Digital Library Platform that communicates directly with MetaArchive.

“MetaArchive is integral to our preservation system, particularly since the MetaArchive network is where we store our most unique and valuable content.”
L-R: Nathan Hall, Director of Digital Imaging and Preservation; Alex Kinnaman, Digital Preservation Coordinator; and Luke Menzies, Digital Preservation Technologist

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

Our current workflows are under revision as our new Digital Library Platform is in its beta form. In the past we performed our MetaArchive ingests manually; we are working on a MetaArchive Automation Service to better streamline our preservation system. MetaArchive currently holds all of our digitized bound theses and dissertations prior to 2017. While we have not ingested content into MetaArchive during our preservation system development, we have maintained an active role in hosting our cache and staying active within the community.

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

Virginia Tech University Libraries has been an active member in MetaArchive since 2004, both as a cache host and in the community, including participation in the Steering Committee and other committees. The Changing for Continued Impact Series has enabled us to engage more in the community and offer feedback. We have often relied on this community for advice or discussion in making our preservation decisions. MetaArchive is integral to our preservation system, particularly since the MetaArchive network is where we store our most unique and valuable content.

Tell us a bit about your experience participating in the Changing for Continued Impact Series? What have been some of your key takeaways from the series thus far?

We have been active in the Changing for Continued Impact Series, and appreciate the expanded interest in community needs and comprehensive engagement. One of the most valuable outcomes thus far has been the MetaArchive-LOCKSS Sustainability Evaluations provided by Penn State and Cal Poly, as they are in line with our needs at Virginia Tech as we develop an automated system.



March 25, 2020

MetaArchive Member Profile: University of Louisville Archives & Special Collections

By: Kyna Herzinger, Archivist for Record Management, and Rachel Howard, Digital Initiatives Librarian

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

Our colleague, Rare Books Curator Delinda Buie, happened to be in the right place at the right time when, at an ARL (Association of Research Libraries) meeting in 2003, Martin Halbert and others discussed applying for an NDIIPP (National Digital Information Infrastructure and Preservation Program) grant to explore distributed digital preservation. At the time, UofL did not have a formal digitization program, but the Special Collections department, in which Delinda worked, had been doing ad hoc digitization for customer orders and exhibits for several years. The successful NDIIPP grant evolved into the MetaArchive Cooperative, and locally led to the creation of the Digital Initiatives program, in which Rachel Howard has served since 2006, overseeing Digital Collections of cultural heritage materials and an institutional repository of university scholarship. UofL’s digital preservation efforts initially focused on digitized images and oral histories. In 2017, after Kyna Herzinger had joined the team, UofL took steps to develop a framework for a digital preservation program, drafting policies, exploring tools, and documenting workflows. At that time, UofL’s digital preservation expanded to include born-digital university records, oral histories, and community collections.

Looking ahead, what are you excited about, or what’s on the horizon for your program?

In terms of content, we are looking forward to preserving our electronic theses and dissertations, which are currently backed up in the cloud. We plan to establish a workflow to have them harvested into the MetaArchive network. In terms of maturing our overall program, we have identified two areas of focus. Having no single position that is responsible for handling born-digital content, we are still ensuring that our curators can accession and process their own born-digital collections, which means fine-tuning workflows. We are also starting to shift focus toward improving access to born-digital content, both in terms of discovery and researcher support.

“We both enjoy knowing a welcoming community of people who are engaged in similar work and are always willing to share advice or lend an ear. As we assess the resources at our disposal, we are especially cognizant of the role that MetaArchive plays as our most robust storage option. It provides what we could not have done ourselves: secure, distributed, bit-level preservation.”
In photo, L-R: Rachel Howard, Kyna Herzinger

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

For digitized content, after creating master and access files and metadata and launching a digital collection to the public, Rachel would copy master files and an XML file of the metadata to a staging server and organize them into archival units (AUs) of acceptable size for ingest into the MetaArchive network. The size of those AUs grew over time as we tested network capabilities, so that, for example, our yearbooks, whose master files ballooned to as much as 50 GB per yearbook, could each be treated as a single AU, thus requiring less “data wrangling”. She would then create a manifest page and (as was required in the early days) a plugin, document the locations of those files and the AUs in the MetaArchive Conspectus database, and then work with MetaArchive partners to test and then ingest the collection into the preservation storage network.
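The grouping of staged files into AUs of acceptable size can be sketched as a simple greedy pass. This is only an illustration under stated assumptions, not UofL’s actual procedure: the `ArchivalUnit` class, the `group_into_aus` function, the `au-001` naming scheme, and the size cap are all hypothetical, and file sizes are assumed to be known in bytes.

```python
from dataclasses import dataclass, field


@dataclass
class ArchivalUnit:
    """Hypothetical container for the files assigned to one AU."""
    name: str
    files: list = field(default_factory=list)  # (path, size_in_bytes) pairs

    @property
    def size(self) -> int:
        return sum(size for _, size in self.files)


def group_into_aus(files, max_bytes, prefix="au"):
    """Greedily pack (path, size) pairs into AUs no larger than max_bytes.

    Files are taken in sorted path order so related files stay together.
    A single file larger than max_bytes still gets an AU of its own.
    """
    units = []
    current = None
    for path, size in sorted(files):
        # Start a new AU when the current one would overflow.
        if current is None or (current.files and current.size + size > max_bytes):
            current = ArchivalUnit(f"{prefix}-{len(units) + 1:03d}")
            units.append(current)
        current.files.append((path, size))
    return units
```

Raising `max_bytes` is the code-level analogue of the change described above: with a cap large enough to hold a whole yearbook, each yearbook’s master files land in a single AU instead of being split across several.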

Now, with born-digital content, we use the BagIt profile specification, and recently participated in MetaArchive’s SuperNode Pilot project, testing BagIt + OwnCloud and Exactly + SFTP to ingest content into the network.

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

We both enjoy knowing a welcoming community of people who are engaged in similar work and are always willing to share advice or lend an ear. Working together with this group has also provided us with opportunities for research and professional leadership and service at an international level. As we assess the resources at our disposal, we are especially cognizant of the role that MetaArchive plays as our most robust storage option. It provides what we could not have done ourselves: secure, distributed, bit-level preservation.

Tell us a bit about your experience participating in the Changing for Continued Impact Series? What have been some of your key takeaways from the series thus far?

It has been reenergizing to connect in a more focused way with the partners as we talk about the past, present, and future of the Cooperative. The series has provided reassurance that growing pains are normal, that challenges are opportunities for growth, and that it is better to be proactive about change than to wait until circumstances demand an immediate reaction. We appreciate being part of a community in which we have a say in its future.



February 3, 2020

MetaArchive Member Profile: Indianapolis Public Library

By: William Knauth, Indianapolis Marion County Public Library, Digital Indy

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

The Digital Indy project has been a member of InDiPres since 2017. Before then, there had been concerns about the integrity and longevity of the digital archival collections being created by the project, and an analysis found that the level of preservation and cost associated with InDiPres was the best available. The primary work of preparing and transferring digital collections to the InDiPres server, as well as communication with the InDiPres and MetaArchive groups, is part of the role of the Metadata Specialist. I regularly attend and participate in meetings of these organizations and report developments back to the team at the library. As far as goals and visions for our involvement with this project, I would be very pleased if we are able to preserve 100% of our large digital collections in the MetaArchive network by 2021. I would also like to see the ongoing SuperNode efforts materialize into an efficient, streamlined ingest system that would attract new members to InDiPres and MetaArchive.

Looking ahead, what are you excited about, or what’s on the horizon for your program?

We are presently working on getting more of our collections data ingested into MetaArchive, as well as setting up firm and effective workflows for sending data to the InDiPres staging server after some technical issues placed this on hold. I am excited to see how this will be made more efficient by some of the projects being worked on at MetaArchive.

“We are presently working on getting more of our collections data ingested into MetaArchive as well as setting up firm and effective workflows for sending data to the InDiPres staging server after some technical issues have placed this on hold. I am excited to see how this will be made more efficient by some of the projects being worked on at MetaArchive.”

Pictured, L-R: William Knauth, Victoria Duncan, Beth Franklin, and Meaghan Fukunaga (formerly of InDiPres)

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

Our team has not had to significantly alter the established workflows in the initial areas of organizing and describing collections. The current standards we use are sufficiently robust as to create results that are effective for preservation purposes. We have had to make some additions to the workflows for successful ingest. This has involved processing collections through data integrity programs like Bagger and Exactly, setting up online transfer protocols, and creating documentation for preservation status of collections.

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

I have had a positive experience meeting and working with the MetaArchive community in the several years of my involvement with the organization. I have found the membership to be very informed about both their own digital preservation situation and the state of this field of expertise in general. It has been useful and beneficial to have a group of individuals facing similar challenges to share ideas and solutions with.


January 14, 2020

MetaArchive Member Profile: Purdue University

By: Sandi Caldrone and Michael Witt

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

The Purdue University Research Repository, also known as PURR (insert cat joke here), is one of a couple of Purdue University Libraries and School of Information Studies repositories which utilize MetaArchive for preservation storage. PURR is a university core research facility provided by the Libraries, the Office of the Executive Vice President for Research and Partnerships, and Information Technology at Purdue. It provides an online, collaborative working space, data sharing, and publication platform for Purdue researchers and their collaborators. PURR also provides preservation support for published datasets and the MetaArchive Cooperative is a huge part of that preservation support.

Looking ahead, what are you excited about, or what’s on the horizon for your program?

We’ve recently started to talk with faculty members who create virtual reality (VR) environments and objects as part of their research. VR preservation is an exciting and challenging new area for us and we are looking into how our platform and preservation workflows can support the preservation of VR objects and what new features or support we might need to develop down the road.

“We’ve recently started to talk with faculty members who create virtual reality (VR) environments and objects as part of their research. VR preservation is an exciting and challenging new area for us and we are looking into how our platform and preservation workflows can support VR preservation and what new features we might need to develop down the road.”

Pictured back row L-R: Standa Pejša, Carly Dearborn, Matthew Kroll, Michael Witt. Front row L-R: Clair Stirm, Anthony Fuentes, Sandi Caldrone, and Yanqun Kuang.

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

We were lucky to still be developing PURR when the Libraries joined the MetaArchive Cooperative, so we were able to develop our preservation infrastructure with a distributed model in mind. We use BagIt bags to package our datasets and metadata for preservation.

We also regularly try to think through a “fire drill” scenario: what would we do if we experienced partial loss of content in our repository? This has proven to be a great way for us to interrogate the construction of our archival units and determine if we have embedded the necessary metadata to rebuild our local repository from our backups in MetaArchive.
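One part of such a fire drill can be automated: comparing what is currently on local disk against the checksums recorded in a bag’s manifest to see which files would need to be restored. The sketch below is an illustration only, not PURR’s tooling. The function names `read_manifest` and `fire_drill`, the report structure, and the assumption that the bag carries a `manifest-sha256.txt` with lines of the form `<checksum>  <relative path>` are all choices made for the example.

```python
import hashlib
from pathlib import Path


def read_manifest(manifest_path: Path) -> dict:
    """Parse a BagIt-style manifest into {relative_path: checksum}."""
    entries = {}
    for line in manifest_path.read_text(encoding="utf-8").splitlines():
        if line.strip():
            # Split on the first run of whitespace so paths may contain spaces.
            checksum, rel_path = line.split(maxsplit=1)
            entries[rel_path] = checksum
    return entries


def fire_drill(manifest_path: Path, local_root: Path) -> dict:
    """Classify every manifest entry as ok, missing, or altered locally."""
    report = {"ok": [], "missing": [], "altered": []}
    for rel_path, expected in sorted(read_manifest(manifest_path).items()):
        local_file = local_root / rel_path
        if not local_file.is_file():
            report["missing"].append(rel_path)
        elif hashlib.sha256(local_file.read_bytes()).hexdigest() != expected:
            report["altered"].append(rel_path)
        else:
            report["ok"].append(rel_path)
    return report
```

Everything listed under `missing` or `altered` is a candidate for recovery from the preserved copy. The check says nothing about whether the bag’s metadata is sufficient to rebuild repository records, which is the harder question a fire drill is meant to surface.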

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

Digital Preservation is hard work, and MetaArchive has a demonstrated track record of success with the biggest challenges of digital preservation, which aren’t related to storage or technology, but governance and sustainability. It is so valuable to have a built-in community to troubleshoot the various issues that arise in digital processing, preservation planning, and everything in between. The MetaArchive Cooperative represents a mature solution and community—it isn’t a flash in the pan.


December 19, 2019

MetaArchive Member Profile: Atlanta University Center Robert W. Woodruff Library

By: Josh Hogan on behalf of the Historically Black Colleges and Universities Library Alliance (HBCU LA)

MetaArchive Member Profiles

Tell us a bit about the digital preservation program at your organization?

Since 2010, the Atlanta University Center (AUC) Woodruff Library has served as the technical lead and host of the LOCKSS server on behalf of the HBCU Library Alliance’s membership in the MetaArchive Cooperative. Digital preservation at the AUC Woodruff Library is implemented by the Digital Preservation Working Group (DPWG), a collaborative team with members from the Archives Research Center, the Digital Services Department, Records Management, and the IT Department. The DPWG is responsible for identifying, acquiring, and providing the means to preserve and ensure ongoing access to selected digital assets and associated metadata in accordance with AUC Woodruff Library’s collection development policies. For the past three years, we have pursued a three-year plan to develop our policies, workflows, and priorities.

Looking ahead, what are you excited about, or what’s on the horizon for your program?

We are excited about recently completing a revision of our digital collection development policy, providing clarity to our collecting areas related to born digital materials. We are also pleased to have wrapped up our first three-year plan, completing all of our goals for the period. We are eager to tackle the development of the new three-year plan in the coming months with an eye toward taking the program to the next level.

“We are excited about recently completing a revision of our digital collection development policy, providing clarity to our collecting areas related to born digital materials. We are also pleased to have wrapped up our first three-year plan, completing all of the goals for the period. We’re eager to tackle the development of the new three-year plan in the coming months with an eye toward taking the program to the next level.”

Pictured back row L-R: Cliff Landis, Jessica Leming, Robert Fallen, Josh Hogan. Front row L-R: Alex Dade, Aletha Carter, Suteera Apichatabutra, Christine Wiseman

Tell us a bit about your local workflow. How has the MetaArchive preservation storage service been incorporated?

Our local workflow identifies three broad categories of material to be preserved: born digital archival material, digitized archival material, and born digital institutional photographs and records. In addition to these categories, there are two tiers related to the priority of preserving the object or collection. The first tier objects are those of the highest priority, and these will be the ones that we will seek to ingest into robust preservation networks such as MetaArchive. Second tier objects and collections will be preserved in at least two different geographical areas and stored on Amazon Glacier.

Tell us about your experience in participating in the MetaArchive community. How has it influenced you or your work?

The AUC Woodruff Library has long participated in MetaArchive as a member of the HBCU Library Alliance. Most of the material we have ingested has been digitized copies of the founding documentation of the participating HBCUs, and we have been the host site of that initiative since 2008.

We recently participated in the SuperNode Pilot Project, playing the role of one of the ingesting institutions. This participation helped us ingest a significant portion of the digital material that we have identified as tier one, and it helped us evaluate the use of Exactly and OwnCloud as tools for use in our program. We hope that the feedback provided to the MetaArchive Steering Committee will be useful in determining the future path of this initiative, which could reduce barriers to digital preservation for smaller institutions.