Archive for the 'access' Category

International Publishers and Librarians Agree to Enhance The Debate on Open Access

Laurie N. Taylor on May 22nd 2009

International Publishers and Librarians Agree to Enhance The Debate on Open Access

Geneva/The Hague 20 May 2009 - For immediate release

A joint statement released today by the International Publishers Association, the International Association of Scientific Technical and Medical (STM) Publishers, and the International Federation of Library Associations and Institutions (IFLA) calls for a more rational, evidence based debate on open access. It encourages experimentation and piloting of new concepts and ideas, whilst acknowledging that the differences in the different academic disciplines and publishing traditions may lead to differentiated approaches and business models in support of authors.

The joint statement is intended to move the oftentimes heated and polarised debate about open access as a model for scholarly communication towards a more measured and nuanced discourse.

Says IPA President Herman P. Spruijt “The debate about open access is important and publishers welcome it. Publishing is never at a standstill and we should not fear change. Now that more experience has been gained with open access publishing and now that data is available on its success, the open access debate should be able to move away from emotional accusations and oversimplification. Our discussions with IFLA on this topic are always spirited, but have become more insightful and less polarised as we moved towards facts, evidence and differentiated arguments. There is a lesson here to be learned for the public debate on this issue.”

Says IFLA Working Group co-chairman Ingrid Parent: “IFLA is pleased to announce the joint declaration on open access with IPA. This statement shows that both our associations share the important objective of providing the broadest possible access to information. IFLA and IPA believe publishers and librarians have a lot to gain by supporting innovation, experimentation and pilot projects in developing open access to scholarly publications.”

Notes for Editors:
The full text of the statement is available here.

More about IPA:
The International Publishers Association (IPA) is an international industry federation representing all aspects of book and journal publishing. Established in 1896, IPA’s mission is to promote and protect publishing and to raise awareness for publishing as a force for economic, cultural and political development. Around the world IPA actively fights against censorship and promotes copyright, literacy and freedom to publish.

More about IFLA:
The International Federation of Library Associations and Institutions (IFLA) is the leading international body representing the interests of library and information services and their users. It is the global voice of the library and information profession. IFLA promotes the principles of freedom of access to information, ideas and works of imagination and freedom of expression. The delivery of high quality and equitable library and information services helps guarantee that access and improve the social, educational, cultural, democratic and economic well-being of those communities and organizations libraries serve. IFLA has 1600 Members in approximately 150 countries around the world.

Filed in Academia, access, copyright, ifla, open access | No responses yet

Newspaper Archives

Laurie N. Taylor on Apr 19th 2009

The American Historical Association has a recent blog post over the problems caused by the lack of access to certain newspapers during transition from “Paper of Record” to Google’s news archives. The blog post notes:

Regrettably, this proves yet again Roy Rosenzweig’s warning to the profession six years ago about the “the fragility of evidence in the digital era.” While it may be beyond our capacity to adjust copyright laws and the behavior of large corporations (however well meaning), as a profession we can and perhaps should develop new habits for working with digital materials—by copying down information when we see it online, and not becoming overly dependent on any one data source or having illusions about its permanence.

Seeing the problems from the Paper of Record transitioning to Google as a call to “develop new habits for working with digital materials—by copying down information when we see it online, and not becoming overly dependent on any one data source or having illusions about its permanence,” is essentially a call to develop personal copies of existing archives and it’s a poor solution to the larger problem.*

In this particular instance, there are several concerns related to technology, trust, and the public good. For technology, the transition is a normal instance of downtime (which is still normal for any technology related transition, and its normalcy is why so many of the tech folks were amazed at the speed and elegance of the most recent Whitehouse.gov transition that overcame the normal problems). However, technical issues are a parallel to the very real potential for loss if digital records are not supported and the very real problem of lost access if digital records are not supported as a need for the public good. One of the respondents to the blog post notes that perhaps newspapers should be moved into the public domain, which is a concern because copyright is often an obstacle to access, but even papers in the public domain still need financial support to ensure access to them whether in digital or physical form.

Even after covering the initial costs for requesting permissions, digitization, and hosting, new costs emerge. For instance, the University of Florida Digital Collections (UFDC) has grown by leaps and bounds in the past two years and now has over 664,269 pages of Florida newspapers alone. These newspapers include historic newspapers and current newspapers. The Digital Library Center has successfully requested and received permissions to digitize over 60 current newspapers, newspapers that in many cases were microfilmed and that are now being digitized for online access and longterm preservation (and we’re also slowly digitizing earlier years from the microfilm and will continue to do so until all of the microfilm holdings are digital).

All of the collections in UFDC, including the Florida Digital Newspaper Library, continue to grow and that growth encourages a growth in usage that, in turn, requires UFDC have more resources to support the higher usage rates. In March 2009, UFDC had 618,148 unique hits and that many hits along with the knowledge that the hits are only going to increase means that the UF Libraries have to implement additional programming to ensure the server memory usage can handle the increased load without problems for users. Other digital collections will have similar needs as they grow, and that will require support from users and the public.

Rather than attempting to copy existing resources (which would reduce the resource to a single item photocopy instead of a point within the full context and content of the database), the emphasis should be on building and supporting trusted digital archives to ensure access. The Florida Digital Newspaper Library presents one of many models, housing historic and current newspapers for open online access for all in perpetuity (and it was luck enough to build the digital model from that same model for microfilm, allowing it to utilize the existing support infrastructure that was already available).  Many archives already offer the same promises for access in perpetuity, albeit for physical access to items not yet digital, and those archives will need support to ensure they place the same importance on access and preservation for their digital collections.

Digital collections and archives need support for new and existing digital collections to build and sustain the infrastructure needed to ensure open access in perpetuity. As La Asociación Mexicana de Historia Económica (AMHE) explains in their protest to the lack of access to Mexican newspapers, the newspapers on Paper of Record are essential reference materials for research. The removal of access–even if only a delay for technical reasons–does harm. The public needs to have trust in their archival institutions, and ensuring access to physical and digital archives is a necessity to build and maintain that trust.

*{Copying single items or even attempting to copy masses of materials without infrastructure is still like photocopying. The materials would not be structured (or minimally so) and would not benefit from organization and identification. If a physical archive was in danger and photocopying was the only option, then photocopying the resource makes sense. This is not to say that photocopying is a bad solution in all cases–researches regularly photocopy materials from archives and those photocopies are then copied and shared and, in some cases, those are the only available copies for access. Photocopying is a poor solution to the overall problem, but for researchers who need access to the materials right now and who cannot wait for a new trusted archive to built over years of advocacy and funding, photocopying style solutions are wise temporary options. Internet Archive’s Wayback Machine maintains copies of many web sites and pages for just this reason.}

Filed in access, newspapers | No responses yet

Sec. 6. Revocation. Executive Order 13233 of November 1, 2001, is revoked.

Laurie N. Taylor on Jan 22nd 2009

President Barack Obama has already begun implementing important changes, including restoring public access to presidential records by revoking the Bush administration’s Executive Order 13233. The text for President Obama’s executive order is available on the Whitehouse website.

Filed in access, archives, open access | No responses yet

The Internet Before the Internet

Laurie N. Taylor on Dec 31st 2008

Before the Internet made information access faster and easier (and it continues to improve), libraries were already mass-sharing information through interlibrary loan. Interlibrary loan is such a simple concept–libraries share books with other libraries–but it was and continues to be carefully planned and implemented to ensure availability and access through cooperative collection plans, lists of records and methods for disseminating them (National Union Catalog, publishing bibliographies of what books were where), and agreements to make sure users know about the materials in order to request them.

Thanks to interlibrary loan systems everywhere for making information available and accessible. Making information findable, available, and usable is always something to celebrate, especially when they’ve been doing it for so very long. The original interlibrary groups have expanded, merged, and reformed, but some carry on under the same names like Florida’s interlibrary loan network, FLIN (The Florida Library Information Network) which turns 40 this year. Over those years FLIN has shared 6.6 million items, or 167,000 items a year! Congratulations to FLIN! And, congratulations to all of the interlibrary loan networks celebrating another year or another decade of service!

The Internet is now the main information source for many, but making the Internet really work (with information on where to find information, the information wanted) begins with the infrastructure for information access. Information architectures, systems for finding and accessing information, and making sure that information is in the best form possible has been a long tradition within interlibrary loan and with the subsequent technologies it employed, including facsimiles, microfilm (or microphotography), electronic, and digital. Without the systems for interlibrary loan, we wouldn’t be able to access many books in print and our digital-only systems wouldn’t have had the benefit of the painstaking work done through postal/train/car/horse/shoe/sneaker/net of interlibrary loan.

As this year comes to a close, thanks to all of the interlibrary loan services who have shared so much!

Filed in Library, access, findability, interlibraryloan, open access | No responses yet

Why Google Gets It

Laurie N. Taylor on Sep 10th 2008

I’ve stolen the title of this post from Shawn Rider’s article “Why Nintendo Gets It” because the title explains the whole point of this post and because of the parallels between Google and Nintendo. Nintendo gets it because they understand that games are about playability more so than technological innovation and because they understand that innovation can be  evolutionary or sustaining as well as disruptive. Evolutionary or sustaining innovations build incrementally on existing structures, but disruptive innovation changes the whole landscape.

The 8-bit NES to the Super Nintendo was an evolutionary or sustaining innovation, largely technological, but that technology enabled longer and deeper games. The current console gaming market changed in response to the Sony PlayStation 2 both because of the system and because so many had grown up with games. In the last console release, however, Nintendo showed how they got it by releasing the Wii and inviting all non-players and casual players to get into gaming and inviting existing players to learn to play in new ways. Nintendo used a disruptive technology to their advantage–investing in its development instead of in the best graphics card on the market and instead of pushing an ever-increasing polygon count, they focused on playability and leveraged it for an even greater market share and for a community of Nintendo followers.

Google announced yesterday that they’re scanning microfilm to digitize historical newspapers, which is just the latest of their work to get more content online. This could be seen as an evolutionary innovation, where Google has digitized books and now they’re working on newspapers. However, Google gets it because they make interoperable and open content. Google is digitizing whatever it can and indexing whatever it can to ensure that it has access to the most data for use by Google’s search engine and for Google’s paid services like advertisements. Google isn’t simply adding newspapers into this collective vat of information, though. Google has shown time and again that they’re adding and indexing content so that it can be faceted–for searching only by news or only by places with mapped locations–and that they’re allowing those facets to be connected together in context.

Placing content in context is an enormous task, especially when context means historical, spatial, cultural, social, and personal. Some of the existing components in traditional library records (if complete) can be extended and mined to create a basic infrastructure that can then be further enhanced, mined, and adapted for further use and this is what Google has done. This enhancement, mining, and adaptation are also what UF’s Digital Library Center has been doing for several years beginning in earnest with the Ephemeral Cities Project. The Ephemeral Cities Project began before I came to the Digital Library Center and its goals are only now beginning to be fully realized with the Map It! feature for items in the UF Digital Collections, enabled through KML becoming an Open Standard in 2008 leading to our use of the Google Maps API.

We’ve also been digitizing newspapers for the Florida Digital Newspaper Library and the Caribbean Newspaper Imaging Project, the same reasons Google is interested. Newspapers tell the stories of history in the making, connecting the current social and personal concerns to the larger cultural and historical movements and eras, and newspapers tell the local stories of their areas, along with the larger national and international stories of their days.

What surprises me most is not that Google gets it in terms of seeing the immediate need and the long tail future goals for massive amounts of interoperable data, but that there are so many people who got it and were working toward so much earlier than I’d have expected. In UF’s Digital Library Center alone, Director Erich Kesse first proposed the Ephemeral Cities Project in 2003 and Mark Sullivan (our wonderful programmer at the time who’s still with us as well) began developing the digital library software for users to access such data and for the digital library staff to most easily create the necessary metadata within the digitization process. I can’t say that I got it in 2003, but I’m glad so many others did so that the infrastructure is in place to help support the wonderful projects to come.

I’m also extremely happy that Google gets it in particular because they have the business infrastructure to make the incredibly tedious and expensive work of digitizing materials in context affordable and sustainable through ads which have a return on investment value. Universities return investments from society in the form of knowledge, a more educated and capable workforce and community, and through the infrastructure necessary for other advances, but in difficult economic times the investment itself becomes more difficult. Luckily for all, Google gets the full context of their investment and knows that digitized materials have more value when they can easily be used, thus ensuring greater usage. The smart business plan for Google requires keeping materials open and usable by as many others as possible,making it good business for Google to do what’s already in the public interest. Of course, Google is facing monopolistic concerns and smart business models can go bad with changes in leadership, so its smartest public institutions like universities to continue getting it and ensuring that the digital revolution brings as many benefits as it can for accessing, using, and understanding information while building the infrastructure for the next innovations be they sustaining or disruptive.

Filed in access, gis, google, history, innovation, interface, interoperability, newspapers, nintendo, virtualworlds, visualization | One response so far

“A Snapshot of Urban History at the Turn of the 21st Century”

Laurie N. Taylor on Aug 11th 2008

Last week, UC Santa Barbara announced that they received a massive collection of aerial photography, valued at $14.3 Million, from Pacific Western Aerial Surveys of Santa Barbara. The collection includes more than 500,000 aerial images of 65 major metropolitan areas in the United States at the turn of the 21st Century (1999-2002). This is really amazing, especially so because UCSB Map & Imagery Library is home to the Alexandria Digital Library (ADL), so these materials will be preserved and accessible in the future.

Filed in access, aerials, gis, mapping, preservation | No responses yet

US National Archives in the World Digital Library

Laurie N. Taylor on Jul 18th 2008

Rosie the RiveterThe US National Archives announced earlier this week that they will be contributing materials to the World Digital Library! This is not unexpected, but still wonderful news because it will place so many resources together in a convenient interface, and each time one collection is contributed to another mismatches and other conflicts occur that result in better interoperability.

Filed in access, archives, digital collections, interoperability, worlddigitallibrary | No responses yet

RSS Feeds for the University of Florida’s Digital Collections

Laurie N. Taylor on Jun 24th 2008

In our ongoing work to improve the findability of books in the UF Digital Collections (UFDC), we now have an RSS page with feeds for each of the collections. The RSS feed page is http://www.uflib.ufl.edu/ufdc2/rss/.

Please sign up for a feed or two to learn about the great materials added daily, and please share the RSS feeds with others!

Filed in Collection Items, access, deep web, digital collections, findability, rss, seo | No responses yet

Search Engine Optimization

Laurie N. Taylor on Jun 17th 2008

Now that the University of Florida Digital Collections is optimized for internal coding, we’re trying to start optimizing for search engines. We currently use robots.txt to request that search engines do not crawl our site. Doing so was a hard choice because we want our materials to be accessible and used. However, we were forced to stop the search engines because they were crashing our server.  We had a number of overzealous search engines that crawled and re-crawled, and crawled in strange ways. With our JPG2000 images, the over-crawling and overly quick crawling ate too much memory and we couldn’t do it and remain functional. This overcrawling happened even with a site map and all of the proper webmaster configurations. Because the normal right way wasn’t working, we’ve chosen a secondary right way. We hope that this method works until we can make the normal right way work.

We’re currently in the process of building a separate single-page for every item in the collection, and we’ll create these weekly until the normal search indexing works. These pages will live on www.uflib.ufl.edu/ufdc2 as opposed to our real site www.uflib.ufl.edu/ufdc. These pages will have the basic information for each item and the links will go over to the main site (UFDC). By allowing search engines to crawl and index the information on UFDC2, we hope that the search engines will include our information so that site will be more findable without creating huge server memory drains.

We’re not sure what the search engine problems were exactly, just that the engines (from multiple companies) were overcrawling. The University of Florida has an internal Google  search appliance. Theoretically - and I haven’t read anything on this, but I would appreciate more information if anyone can help - Google’s main bots and UF’s instance could have simultaneously crawled, driving up their apparent traffic. However, this doesn’t explain why multiple search engines were overcrawling even with a validated sitemap in use.

Most of the information online explains issues with deep folder hierarchies, dynamic URLs, and masses of pages, but there doesn’t seem to be an easy solution. We’re hoping UFDC2 serves as a solution for now. In the meantime, if anyone has recommendations for other options that have worked for search engine optimization of deep websites, and especially for digital libraries with millions of pages, please let me know (via comments or email).

Also in the meantime, search engines should start crawling UFDC2, and the static pages will be finishing building later today. We’re hoping this works!

Filed in Digital Library, UFDC, access, datamining, deep web, digital collections, seo | No responses yet

Newspapers in History, Making History

Laurie N. Taylor on Jun 8th 2008

Alligator StaffThe University of Florida supports the Florida Digital Newspaper Library and the Caribbean Newspaper Imaging Project. By preserving and digitizing the news of the past, these projects make the news new again.

The Caribbean Newspaper Imaging Project includes papers like Haiti’s Le Nouvelliste, with issues from 1899 - 1902 now online. While the early issues online are imperfect (because of materials and processing with newspaper paper, microfilming, and then digitizing from microfilm) the pages are easily readable. If I could read Haitian Creole, or at least enough French to understand with savvy use of Google’s translator, I’d be able to read the December 30, 1899 Le Nouvelliste and learn how Port-au-Prince was handling the shift into 1900, or perhaps the December 31, 1900 issue would be more interesting because its news would be that of Haiti poised for the start of the Twentieth Century.

The news of the past show has history is made. On a much more localized scale, so too do the photographs of the news in the making. Many issues of the University of Florida’s Florida Alligator newspaper, which later became the Independent Florida Alligator, is included in the Florida Digital Newspaper Library, as are photographs from its early days.

One of the Florida Alligator issues online is from September 21, 1945 and it seems surprisingly mundane when scanned quickly. However, the first page includes two articles on the first page, one on General Van Fleet explaining that the human element he gained at the University of Florida was pivotal for his successes in World War II and the second on the University officially going co-ed, after “Legislature broke down and played ‘Lady Bountiful’ by saying veterans’ wives could come, provided their husbands were here first.” Bits of history are told in these pages, just as they are in photograph above. The University of Florida Digital Newspaper Library has digitized issues from 1945 - 1948, and others await along with additional titles and issues intended for the Florida Digital Newspaper Library and the Caribbean Newspaper Imaging Project.

Filed in Caribbean, Collection Items, access, newspapers, preservation | No responses yet

Next »