Archive for the 'Digital Library' Category

Computer Forensics and Born-Digital Content in Cultural Heritage Collections

Laurie N. Taylor on Feb 21st 2010

The report won’t be out until after the meeting this May, but Computer Forensics and Born-Digital Content in Cultural Heritage Collections should be exciting and timely reading for everyone involved in supporting cultural heritage collections.

Filed in Digital Library, cultural heritage, digital collections, digitalarchive | No responses yet

Jamaica Journal Historical Collection Online in the Digital Library of the Caribbean

Laurie N. Taylor on Nov 11th 2009

Celebrate the official launch of the Jamaica Journal online in the Digital Library of the Caribbean:

Jamaica Journal Historical Collection Online
November 15, 2009, 4-6pm
Florida International University; Green Library 220

Event will feature Jamaican writer Donna Weir-Soley, reading from her book, Eroticism, Spirituality, and Resistance in Black Women’s Writings. Tour an exhibit featuring the Jamaica Journal with presentations from the Institute of Jamaica.

Admission is FREE and open to the public. Refreshments will be served. Subscriptions to the Jamaica Journal and books by Weir-Soley will be available for purchase and signing.

Co-Sponsors Include: Air Jamaica, Jamaican Information Service, Jamaica Tourist Board, and the Latin American and Caribbean Center

This event celebrates the collaborate work of the Jamaica Journal and the Digital Library of the Caribbean. With the cooperation of the publisher, a long run of the Jamaica Journal from the holdings of Digital Library of the Caribbean partners has now been made available in the Digital Library of the Caribbean. These issues are now online here.

One issue of special note is that of Volume 10, No. 1 (1976), dedicated to the folk music of Jamaica. When originally distributed, this issue was accompanied by a vinyl record of examples of traditional Jamaican music. The supplemental recordings were converted to digital audio, and may be heard online. This serves as an example of how the Digital Library of the Caribbean can bring together information from different formats (print, audio, etc.) in one online repository.

Jamaica Journal Online: http://www.dloc.com/ufdc/?m=hdBI2&b=UF00090030

Digital Library of the Caribbean: http://www.dloc.com

Filed in Digital Library, dloc, events | No responses yet

Building Digital Archives That Last

Laurie N. Taylor on May 21st 2009

In practice, development doesn’t stop. Recognise this and deal with it.

“The coolest thing…” The biggest risk is that premature proscription prevents the coolest thing.

Is this a problem? Only if the system is considered as a whole. Decompose system into independent components that are tractable.

- Neil Jefferies, “Persistent IT Architectures: Building Digital Archives That Last” from the Digital Repositories Workshop: Tools and Infrastructure,23 April 2009.

Filed in Digital Library, standards, technologies | No responses yet

Job Posting: Digital Collections Curator, The Pennsylvania State University Libraries

Laurie N. Taylor on Feb 11th 2009

The Pennsylvania State University Libraries seeks a Digital Collections Curator to play a key role in the further development of our electronic content stewardship and publishing programs. These programs will be developed through a strategic and dynamic partnership between the Penn State Libraries and Information Technology Services (ITS). The Digital Collections Curator will lead the Libraries’ efforts to develop and plan user focused services that enable the effective creation, sharing, discovery, and use of digital content in support of research, teaching and learning. The Digital Collections Curator collaborates extensively with colleagues throughout the Libraries and ITS to achieve his or her objectives. The Curator will report to the Assistant Dean for Scholarly Communications who also oversees Digitization and Preservation, Scholarly Communications Services, and the Office of Digital Scholarly Publishing. This is a tenure track appointment.

Responsibilities will include:

  • Lead development of an inclusive, user-focused agenda for digital scholarly content stewardship.
  • Investigate, recommend, and develop plans for user-focused and repository- based services to effectively manage the sustainable creation, collection and distribution of high-value digital scholarly content.
  • Manage a broad set of existing digital collections and repository content, including: reformatted materials (images, books, newspapers, manuscripts, etc), publication related content (journals, conference proceedings, monographs, hybrid formats, post & pre-prints, working papers, etc), as well as the potential and emerging needs for data collections in a wide array of disciplines.
  • Research and develop in-depth knowledge of new and emerging technologies, relevant national standards, and best practices, in order to assess and promote their integration into local operations as appropriate.
  • Serve on standing working groups and committees related to web functionality and digital content creation and management.
  • Communicate effectively with internal stakeholders in the areas of collections & public services, technical services, information technologies, and scholarly communications.
  • Promote and report on Penn State’s activities through conference and workshop presentations, written publications
  • Represent Penn State in relevant professional contexts and engage with national and consortial peers to identify and/or carry out mutually beneficial partnerships.

Requirements:

  • Master’s degree in library and/or information science, or advanced degree in relevant academic field.
  • Should have 3 years work related to the creation, management, and provision of electronic data resources in a higher education environment.
  • Should demonstrate strong organizational and/or process management abilities.
  • Should demonstrate familiarity with developing trends in higher education information management, including, but not limited to: Cyberinfrastructure development, data curation and preservation, electronic publishing, digital scholarship and non-traditional scholarly communications
  • Ability to lead and work collaboratively in an evolving and decentralized environment.
  • Commitment to user focused design, development, and service provision.
  • Communication skills that will support work with both technology experts and novices.
  • Facility with common standards and practices in contemporary digital library management. Experience with XSLT, Perl or other scripting languages, and/or experience with major repository platforms is desirable.

Environment:

As an outcome of joint strategic planning, the Penn State Libraries and Information Technology Services (ITS) are collaborating in the development of this Content Stewardship program to meet extant and emerging digital content and asset management needs in areas such as digital library collections, scholarly communications, electronic record archiving, and e-science/e-research. Building on existing services and infrastructure, this program will put in place a cohesive and extensible suite of data access, management, and preservation services that will support the creation and distribution of digital scholarship. Additionally, the Penn State Press and the Libraries jointly operate the Office of Digital Scholarly Publishing to explore and incubate publishing services that support the Penn State community.

Penn State, a land-grant institution, is a member of the CIC (Big 10) academic consortium. The Penn State University Libraries currently rank 8th in North America among private and public research universities, based on Association for Research Libraries Investment Index. The Libraries hold membership in ARL, OCLC, CRL and the Digital Library Federation. Collections exceed 6.5 million volumes, including more than 68,000 current serial subscriptions.

The University Libraries are located at University Park and 23 other campuses throughout Pennsylvania, with approximately 6,000 faculty and 42,000 students at University Park, and more than 82,000 students system wide. The University Park campus is set in the State College metropolitan area, a university town located in the heart of central Pennsylvania. State College offers a vibrant community with outstanding recreational facilities, a low crime rate, and excellent public schools. The campus is within a half-day drive to Washington, DC, Baltimore, Philadelphia, New York City and Pittsburgh. For more information, please visit http://www.libraries.psu.edu and http://www.cbicc.org/

Application Instructions:

Send a letter of application, resume, and the names and contact information of three references to Search Committee, The Pennsylvania State University, Box DCC-PSUA, 511 Paterno Library, University Park, PA 16802, via email to lhrsearches@psulias.psu.edu, or fax to 814-863-5592. Review of applications will begin March 2, 2009 and continue until the position is filled.

Penn State is committed to affirmative action, equal opportunity, and the diversity of its workforce.

Filed in Academia, Digital Library, digital collections, digitalarchive, digitalcollections, digitalhumanities, job | No responses yet

New York Times Article, Digital Archivists, Now in Demand

Laurie N. Taylor on Feb 9th 2009

The New York Times has an article “Digital Archivists, Now in Demand” and it’s great to see the field recognized and growing! I’m not sure where they got the salary information though, but perhaps the field pays more in some areas than others. Regardless of the financial compensation, working with issues of digital preservation and with the actual materials to-be-digitized is incredibly rewarding and there’s always more to learn.

Filed in Digital Library, digitalarchive, preservation | No responses yet

Usage Statistics

Laurie N. Taylor on Jan 23rd 2009

The UFDC usage stats are now online in great detail. The stats include overall, collections by date, items by date, collection history, items by collection, and definitions to sort out what all of this means. The items by collection is particularly interesting where you can see how many hits are on particular items, like this page for the Baldwin. Some collections, like the newspapers, are by title and issue, so I assumed the hits would skew to the first issue of any of the titles and they do in some cases, but in others the 157th issue is the most hit.

We’re still interpreting the data, but we’re already learning from it. For instance, Lourdes noticed that one of the University Archives Photos had significantly more hits than any of the others and in trying to find what made it special, she noticed it was the highlight image on the landing page for that collection. We’ve now added more highlight images for the collection that will alternate on reloads to help distribute the hits and provide more data to users. We normally try to provide multiple highlight images, but it hasn’t been a significant priority in the past and now we know it needs to be.

Filed in Digital Library, digital collections, statistics | No responses yet

ARL’s Call for National Support for Large Scale Digitization Initiatives

Laurie N. Taylor on Jan 19th 2009

The Association of Research Libraries (ARL) issued a call for President Obama’s administration to support large-scale digitization initiatives. The brief call from the ARL Newsletter is online as is the full letter.

As an addendum to to ARL’s call for “a large-scale initiative to digitize public domain collections,” let’s also make sure these initiatives include all holdings that are in the public domain in however many selected US institutions, including the millions upon millions of pages published in other countries and collected by the US. The US has so many collections that would benefit the US and so many collections that, if shared openly, would benefit the world and international relations overall. Presenting and sharing these materials, especially in a way that makes sure they’re sustainable, will create new national resources and new sources for global collaboration.

My personal dream would be to see the public domain documents acquired through foreign cooperative acquisitions plans digitized as part of the larger US institution holdings where they can be found. In the Farmington Plan different US institutions collected materials by area or country so that the materials would be accessible in the US and preserved. The Farmington plan was formulated with the fear of data loss (paper data loss) from war and the first official Farmington meeting was in the 1940s in Farmington Connecticut. The Farmington Plan was official in 1948, but it’s official operations didn’t come with the necessary official funding so it only formalized work that had already been going on.

The cooperative collection plans formalized in the Farmington Plan date back far earlier. The reason I know any of this history is because the University of Florida, for instance, had been collecting Caribbean material for decades prior to the Farmington Plan (see this article) and continues to do so today. UF became the official institution responsible for the Caribbean in 1952 when a “modification of the subject basis for assignment was suggested when it was recommended that libraries accept total responsibility for publications issued by a given country or area not presently covered by the Plan. Thus, the Caribbean area was accepted by the University of Florida” (source). According to all of the documentation I’ve found (and this is still new research for me), the University of Florida had been collecting Caribbean materials and so UF was simply asked and added to the Farmington Plan for what it was already doing. Because of the existing relationships in the Caribbean, UF was able to acquire copies of documents–in print and in microfilm through “mobile microfilming units” (meaning barges with microfilm cameras that traveled the Caribbean and made microfilm copies of important documents and books)–and in at least one case, UF’s microfilm was the last copy in the world (hurricanes and tropical weather are a constant danger to archives now and were much more so before air conditioning). In the last copy instance, UF was able to digitize the materials through the Digital Library of the Caribbean (dLOC) and return the cultural materials to their rightful owners while also sharing the materials with the world. The last copy became one of many copies and became inaccessible to easily accessible.

The Digital Library of the Caribbean (dLOC) is a wonderful and downright amazing project in terms of technology, history, and significance. It’s successful because it took the cooperative collection plan and made it work digitally. Yet there are so many other existing projects that are or could also build on parts of earlier cooperative collection plans. Funding is needed though and these materials are needed and are not otherwise accessible. The purpose of cooperative collection plans was to ensure that someone would have a copy in the US and to avoid purchasing multiple copies and possibly overspending on always underfunded budgets (and this is going back past the 1940s discussions and earlier). The majority of materials in research libraries are unique because libraries couldn’t afford to get extra copies. Digitizing full collections is wonderful, but we need to digitize everything in all of them or very nearly. The uniqueness means that each library collection only has a tiny portion of the whole. Google scanning entire university libraries is only beginning to hint at scratching the surface and we need so much more.

ARL’s call for a large-scale digitization initiative is so right because the need is so huge, the benefits so great, and the possibilities so enormous. As ARL states, the initiative “will lay a foundation for innovation and national competitiveness in the decades ahead.” To that I would add “and a spirit of international cooperation and collaboration” to ensure that past brilliance and innovation are included. While the Farmington Plan wasn’t funded as it needed to be, it’s hard to imagine how much funding would be needed to support essentially an Internet of data made of microfilm, so many copies, mailed to institutions for all to access. The costs remain high, but are much lower and the potential rewards so much greater.

Filed in Digital Library, Library, archives, digital collections, dloc | 2 responses so far

PDF to TIFF Extraction tools and Auto-PDF methods

Laurie N. Taylor on Jan 7th 2009

Because so many people have PDFs that they’d like mounted online and because so many people want PDFs and because PDFs are imperfect for preservation, I’m looking for a free or inexpensive (for non-profit, public educational institutions) software solution to convert from PDF to TIFF and TIFF to PDF. I know ImageMagick will do this, but I need the PDFs-made-using-TIFFs-in-hand to be compressed and to have as few artifacts/problems as possible and I haven’t seen how to best do that with ImageMagick. For TIFFs-extracted-from-PDFs-in-hand, I need the extracted TIFFs to be settable as RGB or Grayscale, 300 dpi, no compression, and single page (so a 12 page PDF results in 12 TIFFs).

I’d prefer to have a batch process that watches a PDF file and does the conversion automatically, saving the converted files to the same folder where the original PDF/TIFF file is. The main emphasis here is on automation–we don’t have people time for it, so we need to script it and have it work. By having it work, we also need to not have problems (or not many) with the compression size (for PDFs made from TIFFs) and the image quality. If anyone has existing scripts for ImageMagick or OpenOffice or any recommendations in general, please share them!

Filed in Digital Library, processing | One response so far

New Open Access Monograph: Economics and Usage of Digital Libraries: Byting the Bullet (Press Release)

Laurie N. Taylor on Oct 11th 2008

************(Press Release)************
The Scholarly Publishing Office of the University of Michigan Library is pleased to announce the availability of a new open access monograph, Economics and Usage of Digital Libraries: Byting the Bullet, edited by Wendy Pradt Lougee (University Librarian, University of Minnesota) and Jeffrey K. MacKie-Mason (Arthur W. Burks Collegiate Professor of Information and Computer Science, School of Information, University of Michigan). In the late 1990’s, researchers and digital  library production staff at the University of Michigan collaborated on deploying the Pricing Economic Access to Knowledge project (PEAK), a full-scale production-quality digital access system to enable usage of content  from all of Elsevier’s (then about 1200) scholarly journals, and at the same  time to conduct a field experiment to answer various questions about the  interplay between pricing models and usage. The experiment culminated in a lively conference that engaged scholars, library practitioners and publishers. This volume captures some of the most interesting and provocative discussions to come out of that conference. PEAK was a ground-breaking effort in its day, and references to the project have continued over time. It raised important questions about the potential for highly functional journal content and new economic models of publishing. In today’s context of socially-enabled systems and open-access publishing, the motivating questions of PEAK remain relevant.

This monograph is part of the SPO Scholarly Monograph Series, an interdisciplinary collection of original, open-access scholarly monographs and essays. The University of Michigan Library, through its Scholarly Publishing Office, provides academic publishing services that are responsive to the needs of both producers and users, that foster a sustainable economic model for academic  publishing, and that support institutional control of intellectual assets.

Filed in Digital Library, open access | No responses yet

LOC Press Release: Federal Agencies Collaborate on Guidelines for Digitization

Laurie N. Taylor on Oct 3rd 2008

LOC Press Release:

The Library of Congress is among a dozen federal agencies launching an initiative to establish a common set of guidelines for digitizing historical materials. Basing its efforts on a combination of collaborative research and combined experience, the Federal Agencies Digitization Guidelines Initiative will address a variety of issues related to the complex activities involved in the digitization of cultural heritage items.

Two working groups have been formed, one addressing content that can be captured in still images, the other involved with content categorizing sound, video, or motion-picture film. The initiative includes a just-launched Web site, www.digitizationguidelines.gov.

The Federal Agencies Still Image Digitization Working Group will focus its efforts on content such as books, manuscripts, maps, and photographic prints and negatives. Its members include the Library of Congress, the National Agricultural Library, the National Archives and Records Administration, the National Gallery of Art, the National Library of Medicine, the National Technical Information Service, the National Transportation Library, the Smithsonian Institution, the U.S. Geological Survey, and the U.S. Government Printing Office. An Advisory Board of technical experts from industry and academia will also contribute to the initiative.

The Federal Agencies Audio-Visual Working Group, which will address standards and practices for sound, video, and motion picture film, includes the Defense Visual Information Directorate of the Department of Defense, the Library of Congress, the National Agricultural Library, the National Archives and Records Administration, the National Library of Medicine, the Smithsonian Institution, the Government Printing Office and the Voice of America.

The agencies began meeting in 2007 to identify common practices for digitizing cultural heritage materials in a sustainable way. Establishing guidelines is expected to increase the quality and consistency of digitized documents and media that are made available to the public, streamline workflows and reduce costs, promote the exchange of research, and encourage collaboration across agencies. The guidelines will also provide common benchmarks for digitization service providers and manufacturers.

The Web site currently features two documents developed by the Still Image Digitization Working Group that are open for comment until mid-November. The first proposes a minimal set of embedded TIFF metadata for use in historical and cultural heritage digital imaging. The second two-part document presents a taxonomy of digital image characteristics and provides corresponding metrics and criteria to describe and validate imaging performance and quality.

The Web site also provides a glossary of digitization terms and concepts, and presents digitization-related news and events on the subject from the participating agencies.

This collaborative effort initially formed under the auspices of the National Digital Information Infrastructure and Preservation Program (NDIIPP), a Library of Congress-led program initiated by Congress in December 2000 to develop a national strategy to collect and preserve digital content. For more information on NDIIPP visit www.digitalpreservation.gov.

Filed in Digital Library, LOC, LibraryofCongress, digitalarchive, digitalcollections, standards, technologies | No responses yet

Next »