Archive for June, 2008

OCLC Microfilm to Digitization Roadshow

Laurie N. Taylor June 29th, 2008

Le Matin, digitized from microfilm and available online in dLOCI’m at ALA (still today and through some of tomorrow before a red eye flight home) and this morning I attended and presented within the OCLC Sponsored “Microfilm to Digitization Roadshow.” This included presentations from Kelly Barrall and Joan DaShiel on the ins and outs of their microfilm and microfilm digitization processing and Katherine Walter from the University of Nebraska-Lincoln on her work with the Nebraska Public Documents project. Katherine is the Co-Director for the Center for Digital Research in the Humanities and Chair of the Digital Initiatives & Special Collections Department, and my presentation on digitizing from microfilm for the Digital Library of the Caribbean (dLOC). These were great presentations, but it was especially great to chat with the presenters and attendees both before and after. One of the projects I’m working on is an “adopt a reel” option for donors to fund microfilm digitization by reel, and learning more about Preservation Resources and processing is helping me build that project proposal which will hopefully be successful and lauch before the end of 2008. While there’s a lot to be done before that can happen, this was one more step in the process building toward that project and toward other projects as well.

Print on Demand (POD) for Libraries, from ALA/ALCTS/PARS

Laurie N. Taylor June 28th, 2008

“Staying Alive: Books through Print on Demand Technology,” an ALA/ALCTS/PARS Program  (Saturday, June 29, 2008, ACC Room 304a-b)

Presenters include:

  • Brian from Bridgeport National Bindery
  • Lynne Terhune, Wiley & Sons, Print on Demand
  • Beth, New York Public Library, head of access, espresso book machine
  • University Conservator from the University of Iowa, and that will be posted on the ALA wiki.

Brian from Bridgeport National Bindery
Brian began by speaking with the importance of the printing press in the history of inventions, and the lose-ability of books. With digitization, how print on demand works. Conceptually, take a collection of print files, order them, have them printed. Print file can come from many sources (author, digitization, existing electronic files), to get people interested make the lists from bookstores, catalogs, company websites, others. Printing and binding the books has to be done quickly, hours and not days. POD, customer normally doesn’t know where the book comes from.

Keys to successful POD:  rapid and high quality copies

Strengths of POD:

  • Low inventories
  • No/low production costs until order is placed
  • Highly dynamic; allows rapid changes to content
  • Allows nearly all titles to be ‘in print’

Weaknesses

  • Production costs per title are more expensive
  • Not as effective for instantaneous wide distribution (Harry Potter stock everywhere at once).
  • Some limits in size, print quality, and binding options

What Option to Choose: Choose by cost effectiveness

Now librarians don’t need to buy a rare book when finding it. In his experience with Brittle and SlavCopy at the University of Kansas, would make multiple preservation photocopies of books using a list-serve to see who was interested.

Bridgeport doing microfilm digitization, also doing printing and binding of ETDs, and doing retrospective dissertation scanning, POD will get larger as more people want print from digital more easily.

Lynne, Wiley & Sons, Print on Demand
Lynne Terhune, spoke on Wiley’s Global POD/USR Program, and they want all items available through it. POD is not inventoried nor returnable, but USR materials are. Print on demand/Ultra-short Run Library to fill orders, materials all available, no more out of stock, order ships the same day just like it was on the shelf. Some books have increased sales when put online, cash flow has improved, working with authors, no obsolescence, green advantage. Sales are doubling or tripling by year for the items in POD, from 2004-2007 went from 5 to 50 thousand. Industry-costing means that POD is generally a penny and up per page.

Beth, New York Public Library, head of access, espresso book machine
Beth spoke on NYPL’s experiment printing books from OpenLibrary using an Espresso machine for patrons. On Demand is the company that sells the Espresso Machine. When the NYPL team visited On Demand, they knew they’d need faq sheets and they’d need to pre-select titles to keep from overwhelming the public. Ultimately, they offered 13 books, 11 from Open Content Alliance, and two contemporary books were from authors who allowed their books to be printed this way. They also limited by book length, and titles were limited because almost all needed some tweaking, and of course quality going in determines quality coming out. Before the machine is usable, need to make upfront decisions and digital files need editing. The 1.5 machine espresso version is around $150,000, but version 2.0 may be cheaper, however it’s still expensive to digitize and format materials for print.

ALA Annual Conference 2008

Laurie N. Taylor June 25th, 2008

The ALA Annual Conference begins tomorrow in Anaheim. I’ve been too busy to put together my schedule thus far, but I’ll soon have my days planned out and I’ll add them here. In case anyone else is still reading and re-reading the schedules (and for my own quick reference), LITA has a quick list here and ALCTS has a quick list here.

RSS Feeds for the University of Florida’s Digital Collections

Laurie N. Taylor June 24th, 2008

In our ongoing work to improve the findability of books in the UF Digital Collections (UFDC), we now have an RSS page with feeds for each of the collections. The RSS feed page is http://www.uflib.ufl.edu/ufdc2/rss/.

Please sign up for a feed or two to learn about the great materials added daily, and please share the RSS feeds with others!

1.92 Million, with More Loading Tonight

Laurie N. Taylor June 20th, 2008

Photo taken in Haiti for dLOCOn June 6, I posted about the University of Florida Digital Collection (UFDC) statistics. At that time, UFDC included 55,072 titles with 74,341 items and 1,896,811 pages. UFDC has been growing by an average of 100,000 pages pages a month, and I was hopeful that we could–even if by only a small margin–surpass that rate from April 20 to June 20. I’m happy to note that we’ve done it!

As of April 20, the UF Digital Collections had 1.718 million pages. With today’s load on June 19, even before the evening loading occurs, we have 1.92 million pages.

The University of Florida Digital Collections now include 55,352 titles with 74,893 items and 1,925,734 pages!

We’re fighting to get to the 2 million page mark as quickly as possible, and we hope to make it by July 15.

Even if we don’t make it by July 15, we’ll make it soon. In the process, we hope to load new materials that we’re scanning on a daily basis and older materials like the photo to left taken over a year ago, and many early materials digitized for preservation that were not loaded online. Since UF has been digitizing to meet preservation needs since 1999, it means we have quite the archives to finalize for online-digital access instead of just CD-digital access. It’s exciting to see so much new and old, and I can’t wait to see and to share what all our digital-offline archives hold!

History of Florida Colleges and Universities

Laurie N. Taylor June 19th, 2008

With the new Florida College System developing where a handful of current community colleges will soon begin offering four year degrees, maybe changes for colleges and universities in the state of Florida. Florida needs the Florida College System to serve the public, but even in positive budget years major change is a great deal of work. Far too many factors are at play to know exactly what elements will have the greatest impact or what that impact will be on how the Florida College System will shape the future of Florida’s higher ed. Knowing the past, though, can offer insight into complicated situations and books like Robert B. Gentry’s A College Tells Its Story, an oral history of Florida Community College at Jacksonville could help light the way.

RSS Feeds, Coming Soon!

Laurie N. Taylor June 18th, 2008

In addition to our UFDC search engine optimization, we’re working on RSS feeds for all new items and for new items from each of the collections. Our RSS feed page will be here: http://www.uflib.ufl.edu/ufdc2/rss/ but it’s still in development right now. RSS feeds take advantage of the power of the web to syndicate and share content and the methods search engines use for ranking content. While this has been arguably problematic as traditional media takes its time in changing, using RSS feeds makes sense and especially so for sites that the University of Florida Digital Collections where we want to share content as widely and completely as possible. Hopefully this post will soon be followed by others on the active RSS feed!

Search Engine Optimization

Laurie N. Taylor June 17th, 2008

Now that the University of Florida Digital Collections is optimized for internal coding, we’re trying to start optimizing for search engines. We currently use robots.txt to request that search engines do not crawl our site. Doing so was a hard choice because we want our materials to be accessible and used. However, we were forced to stop the search engines because they were crashing our server.  We had a number of overzealous search engines that crawled and re-crawled, and crawled in strange ways. With our JPG2000 images, the over-crawling and overly quick crawling ate too much memory and we couldn’t do it and remain functional. This overcrawling happened even with a site map and all of the proper webmaster configurations. Because the normal right way wasn’t working, we’ve chosen a secondary right way. We hope that this method works until we can make the normal right way work.

We’re currently in the process of building a separate single-page for every item in the collection, and we’ll create these weekly until the normal search indexing works. These pages will live on www.uflib.ufl.edu/ufdc2 as opposed to our real site www.uflib.ufl.edu/ufdc. These pages will have the basic information for each item and the links will go over to the main site (UFDC). By allowing search engines to crawl and index the information on UFDC2, we hope that the search engines will include our information so that site will be more findable without creating huge server memory drains.

We’re not sure what the search engine problems were exactly, just that the engines (from multiple companies) were overcrawling. The University of Florida has an internal Google  search appliance. Theoretically - and I haven’t read anything on this, but I would appreciate more information if anyone can help - Google’s main bots and UF’s instance could have simultaneously crawled, driving up their apparent traffic. However, this doesn’t explain why multiple search engines were overcrawling even with a validated sitemap in use.

Most of the information online explains issues with deep folder hierarchies, dynamic URLs, and masses of pages, but there doesn’t seem to be an easy solution. We’re hoping UFDC2 serves as a solution for now. In the meantime, if anyone has recommendations for other options that have worked for search engine optimization of deep websites, and especially for digital libraries with millions of pages, please let me know (via comments or email).

Also in the meantime, search engines should start crawling UFDC2, and the static pages will be finishing building later today. We’re hoping this works!

Digital Library of the Caribbean at ACURIL, Correction

Laurie N. Taylor June 11th, 2008

In my last post on the Digital Library of the Caribbean presenting at ACURIL, the title for Brooke Wooldridge and Marilyn Ochoa’s presentation was incorrectly listed as “dLOC Toolkit and Usability Testing: A User-­Centered Approach to Improve Electronic Resource Design” when it should have been “A User-Centered Approach to Improve Electronic Resource Design.” More importantly, I failed to list (or even realize) that Mark Sullivan from the University of Florida presented twice on the dLOC Toolkit, “dLOC Toolkit: Create Your Own Electronic Resources.”

Mark’s presentation will soon be online within dLOC here and an earlier presentation, “dLOC Technical Overview,” is already online within dLOC here.

Cartoon Art at the Special Collections Research Center (Syracuse University Library)

Laurie N. Taylor June 10th, 2008

The press release is below, and this is great news for the many growing comics programs across the country. as we edge ever closer to critical mass for full, mainstream recognition of the importance of comics studies and collections.

—–

The Special Collections Research Center (SCRC), Syracuse University Library has been awarded a grant of $79,440 by the National Historical Publications and Records Commission to support the arrangement and description of the library’s 134 unprocessed collections of original cartoon art. The funds will help support a full-time project archivist for a period of two years. The award to Syracuse was one of six “Detailed Processing Grants” awarded by NHPRC and the Archivist of the United States. Other recipients included Princeton University and the University of Chicago.

Syracuse’s collection of original cartoon art is among the most comprehensive in America. It includes original work by approximately 173 artists (more than 20,000 items) and comprises more than 1,000 linear feet of material. Spanning the course of the 20th century, it includes both serial and editorial cartoons. Among the serial cartoonists represented are: Bud Fisher, whose Mutt and Jeff was the earliest successful daily comic strip; Mort Walker, whose Beetle Bailey anticipated the changing notions of American masculinity and militarism during the Cold War; Hal Foster, whose lavishly illustrated Prince Valiant elevated the artistic ambitions of the genre; and Morrie Turner whose Wee Pals was the first comic strip to chronicle the lives of racial and ethnic minorities in American life. The editorial and political cartoonists represented in the collection include: William Gropper, whose leftist political cartoons in the Daily Worker raised working class consciousness during World War II; F.O. Alexander, whose everyman alter-ego “Joe Doakes” experienced the turbulence of the 1960s in the pages of the Philadelphia Bulletin; and Carey Orr, whose editorial cartoons appeared in the Chicago Tribune for nearly fifty years straight.

The physical cartoons in Syracuse’s collection are as wide-ranging and diverse as the artists that created them, assuming countless shapes, sizes, and media including pencil, pen, and gouache on paper. Over the next two years, the project archivist will take steps to ensure that the cartoons are housed in archival-quality containers. He or she will also draft online, searchable finding aids so that curious individuals all over the world can access them. The NHPRC grant is exciting news for scholars who specialize in the genre, casual fans, and, of course, for Syracuse University, which has held many of these collections since the 1960s.

About the Special Collections Research Center, Syracuse University Library

With more than 100,000 printed works and 2,000 manuscript and archival collections, SCRC holds some of Syracuse University’s most precious treasures, including early printed editions of Gutenberg, Galileo, and Sir Isaac Newton as well as the library of 19th century German historian Leopold von Ranke (1795-1886). SCRC’s holdings are particularly strong in the 20th century; they include the personal papers and manuscripts of such luminaries as artist Grace Hartigan (1922- ), inspirational preacher Norman Vincent Peale (1898-1993), author Joyce Carol Oates (1938- ), photojournalist Margaret Bourke White (1904-1971), and architect Marcel Breuer (1902-1981). SCRC strives to be a “humanities laboratory” where librarians and scholars collaborate with the artifacts of history in an ongoing and vital learning process. Home to a new, state-of-the-art instructional seminar room, SCRC also regularly hosts exhibitions, lectures and classes focusing on its collections.

Next »