Archive for the ‘Info Management and Retrieval’ Category

Podcast: Professor Robert Darnton on Harvard’s Success With Open Access

Tuesday, October 27th, 2009

From the Summary:

In October 2008 Harvard University in the US adopted an open access policy for all its research papers to be made available in their university repository, in an opt out basis. 12 months on, since the policy was adopted, JISC’s Rebecca O’Brien speaks with Professor Robert Darnton, Director of Harvard University Library and trustee of New York Public Library and the Oxford University Press (USA), about the cultural change that is taking place at Harvard and the background to why professors at the university decided to share their knowledge in this way.

The podcast runs 23 minutes. You’ll find it near the bottom of this page.

Source: JISC

See Also: DASH (Digital Access to Scholarship at Harvard): Harvard University Scholarly Repository

See Also: Harvard University Library: Open Collection Program

GeoCities Says So Long as Internet Archive Works to Preseve Content

Tuesday, October 27th, 2009

In August, we first posted about the Internet Archive (IA) asking GeoCities users to make sure their content was archived by the IA. Why? As of yesterday, GeoCities is no longer online.

From the Article:

Yahoo, which acquired the site for $3.57bn (£2.17bn) in 1999 at the height of the dotcom boom, said sites would no longer be accessible from 26th October.

However, many of the pages have been archived and will still be available to view via the nonprofit Internet Archive project.

The giant digital library, which has been archiving the public web since 1996, has set up a special project to archive GeoCities before it is lost forever.

“We’ve collected a lot of GeoCities sites over the years – but might not have every site and every page,” the Internet Archive said.

Access the Complete Article

Source: BBC

See Also: Saving a Historical Record of GeoCities (via Internet Archive)

Library of Congress’ National Digital Information Infrastructure and Preservation Program Wins Government Computing News Award

Saturday, October 24th, 2009

The NDIIPP as one of 11 projects to receive GCN [Government Computing News] Award for Agency IT Achievement.

From the Summary:

It took two centuries for the Library of Congress to acquire its 29 million books and 105 million other items. Today, it only takes 15 minutes for the world to produce an equal amount of information in digital form, creating unprecedented archiving challenges for the Library of Congress. The Library is meeting the challenge of digital preservation by developing new tools to transfer large quantities of digital content. To date, more than 3 million files have been transferred and stored using the BagIt specification. Due to the Library’s digital preservation initiatives, more than 1,000 collections of digital content have been selected, captured, preserved, and made available to the U.S. public and online visitors across the globe.

Access the Complete Article

We are warned to be careful about what we put online because data on the Internet lives forever. But keeping random copies of files on servers, routers and databases is not the same as preservation, said Martha Anderson, director of program management for the Library of Congress’ National Digital Information Infrastructure and Preservation Program. Digital data can be ephemeral. “That is the paradox,” she said.

Much More in the Summary and Complete Article

Source: GCN

See Also: Library of Congress News Release

Electronic Frontier Foundation and Other Groups Send Letter to Judge in Google Book Search Case

Friday, October 23rd, 2009

From a Blog Post:

EFF today led a coalition of authors, publishers, companies and nonprofit organizations in sending a letter to the judge overseeing the Google Book Search settlement urging the Court to ensure that those concerned about the settlement receive adequate notice of, and have sufficient time to study and comment on, any amended settlement agreement that Google, the Authors Guild, and the Association of American Publishers present.

Those following the twists and turns of the Google Book Search settlement will recall that the original Fairness Hearing scheduled for October 7, 2009, was put off because of what the Court called: “significant issues, as demonstrated not only by the number of objections, but also by the fact that the objectors include countries, states, non-profit organizations, and prominent authors and law professors.” The Court received over 400 submissions about the settlement, including the EFF-led coalition of authors and publishers concerned about reader privacy, as well as significant concerns raised by the Department of Justice.

Read the Complete Letter Sent to the Judge Denny Chin (4 pages; PDF)

The letter was signed by a large group of people and organizations including:

+ The Open Book Alliance*
+ Amazon.com
+ The Picture
+ Archive Council Of America
+ National Writers Union
+ Electronic Frontier Foundation
+ Pamela Samuelson (UC Berkeley Law Professor)
+ Microsoft
+ Washington Legal Foundation
+ The Internet Archive
+ Consumer Watchdog
+ Lyrasisk, Nylink and Bibliographical Center for Research Rocky Mountain, Inc.
+ Public Knowledge
+ Urban Libraries Council

+ The Special Libraries Association and the The New York Library Association are two of the members of the Open Book Alliance.

Source: Electronic Frontier Foundation

Getting to Know the HathiTrust Digital Library

Friday, October 23rd, 2009

Barbara Quint Writes:

With all the controversy still swirling around Google Books and its post-settlement offerings, an alternative route to the millions of digitized books and journals supplied by leading Google Book Search library partners has arrived. The HathiTrust (www.hathitrust.org) is a collaboration of 25 research libraries already participating in Google Book Search to produce a shared digital repository for preservation and access to a curated collection. By mid-November, the HathiTrust Digital Library will have a full-featured, full-text search service for 4.3-5 million items. The searches will retrieve bibliographic citations and page references, including those for in-copyright books. Content will extend beyond the digitized copies of books returned to early library partners by Google. HathiTrust is pushing to acquire other digitized special collections from its members, as well as making arrangements for opening access to university press books.

[Snip]

The new launch will open indexing to nearly 1.5 billion pages from well more than 4.3 million volumes with full-text searching by keyword or phrase. (Just between us, if you simply cannot wait until mid-November, go to

http://babel.hathitrust.org/cgi/ls.

[John] Wilkin, [associate university librarian at the University of Michigan and executive director of the HathiTrust], tipped me off that, [our emphasis] although this “experimental search” site claims to search only 500,000 documents, it actually includes the full 4.3-5 million volumes. Feedback options appear at the top and bottom of each search results page.) The system already had the equivalent of library cataloging searching, though they expect to upgrade even that kind of searching under a cooperative program with OCLC.

Much More in the Complete Article

Source: InfoToday NewsBreaks

China: Google Responds to Complaints Regarding Copyright Issues

Friday, October 23rd, 2009

It was just a few days ago when we posted that the China Written Works Copyright Society (CWWCS) was not happy with Google over copyright issues stemming from Google Book Search.

Today, in another Wall Street Journal blog post, we learn that Google has responded to CWWCS.

From the Post:

Here is the latest from Google:

“Today we have more than 50 Chinese publishers participating in Google Book Search, who together have authorized more than 30,000 books to be found through Google web search–and made available through a short preview. We also have some Chinese books that have been scanned by our Book Search library partners; in those cases, we only make the books available as a short snippet of text–as we do with web search–unless the rightsholder authorizes a greater use. We also honor rightsholders’ preferences if they ask not to be included.”

“Like all rightsholders, Chinese authors and publishers will be able to tell Google whether or not to display their books, and will be paid if the books are included in sales or subscriptions authorized under the settlement.”

Source: WSJ

See Also: Here’s How The Story Was Reported in the China Daily
Hat Tip: James Grimmelmann, The Laboratorium

Google Book Search: Video from D for Digitize Conference is Now Available Online

Friday, October 23rd, 2009

A few weeks ago the D for Digitize Conference took place. It was sponsored by the New York Law School and organized by Professor James Grimmelmann. The focus of the conference was Google Book Search (GBS). The list of speakers/panelists reads like a Who’s Who of people representing all sides of the many issues being debated at the conference and elsewhere.

Now, you can watch each session online (free). Even two pre-conference tutorials are included. A list of sessions and speakers along with links to the videos can be accessed here.

Finally, if you want to read about what was discussed during a session before viewing the video or just don’t have time to watch, no worries.
Peter Hirtle from the Law Library Blog provide excellent text summaries of each session.

Law Library Blog is a co-production between Peter and Mary Minow.

See Also: Law Library Blog also has a Twitter feed at:
http://twitter.com/librarylaw

Article: Missing Links: The Enduring Web

Thursday, October 22nd, 2009

From the Abstract:

The Web runs at risk. Our generation has witnessed a revolution in human communications on a trajectory similar to that of the origins of the written word and language itself. Early Web pages have an historical importance comparable with prehistoric cave paintings or proto-historic pressed clay ciphers. They are just as fragile. The ease of creation, editing and revising gives content a flexible immediacy: ensuring that sources are up to date and, with appropriate concern for interoperability, content can be folded seamlessly into any number of presentation layers. How can we carve a legacy from such complexity and volatility?

Access the Complete Article (PDF)

Source: International Journal of Digital Curation (4.2)

Washington University: Libraries receive federal grant to digitize pre-war slave lawsuits

Wednesday, October 21st, 2009

Here’s more about a very brief item we posted when IMLS National Leadership Grants at the end of September.

From the Article:

Washington University Libraries received one of the largest grants in the institution’s history, a $376,426 National Leadership Grant from the Institute of Museum and Library Services. The money will fund the St. Louis Freedom Suits Legal Encoding Project, which aims to digitize pre-Civil War lawsuits that slaves brought against slaveholders in the St. Louis Circuit Court.

[Snip]

The newly funded Freedom Suits Legal Encoding Project takes the digitalization process a step further. In addition to finishing the scanning of more than 20,000 pages of city directories and court records, the project also seeks to transcribe the documents to enable full-text searches.

[Snip]

The primary novel aspect of this project is to “develop extensions to the Text Encoding Initiative (TEI) for encoding legal documents to reflect legal function, genres and roles, and employ these extensions in this collection,” according to a grant announcement.

In other words, this project seeks to develop a computer language for annotating the legal functions of documents. This language would be comparable to HTML, which is used to denote structural semantics for Web pages. Ultimately, this innovation will be integrated into TEI, the existing language, to provide a model for similar archives.

Access the Complete Library

Source: Student Library (Washington University, St. Louis, MO)

IFLA’s Knowledge Management Division Now Has a Twitter Feed

Wednesday, October 21st, 2009

Jane Dysart Writes:

The KM Section of IFLA, or THE VOICE OF GLOBAL KM as we are becoming known, is now on Twitter. Our tag is #IFLAKM. If you are on Twitter already, just include that tag on your post and it will be picked up in the #IFLAKM Twitter Feed that will soon show up on the front page of our website,

See Also: IFLA KM LinkedIn Group

See Also: IFLA KM Facebook Page

Source: IFLA

Nirvanix Provides Permanent Digital Archive for National September 11 Memorial & Museum

Wednesday, October 21st, 2009

From the Announcement:

Nirvanix today announced that the Nirvanix Storage Delivery Network (SDN) is being used as the main digital archive platform for the National September 11 Memorial & Museum’s permanent collection of audio, video and photographs that chronicle the events of the day and their aftermath through the eyes of those who experienced it – both at the attack sites and around the world.

The Museum is actively acquiring materials for its permanent collection from the general public and those directly impacted by the attacks. The collection includes audio recordings of personal experiences of the attacks of September 11, 2001 left by visitors at the 9/11 Memorial Preview Site located next to the World Trade Center. The Museum is actively collecting photographs, audio, video and other 9/11 related material through a number of outreach initiatives, including a “Call to Remember.” This program is designed for 9/11 victims’ family and friends to leave voice mail messages with remembrances about their loved ones. In addition, the Museum has collected thousands of photos submitted by the public through an online initiative, “Make History,” as well as hundreds of hours of video. All of this material will be kept as a permanent digital archive of content that will be an evolving historical record of the day’s events.

Source: Nirvanix
Hat Tip: AMIA Newsbriefs

Google Books Settlement: The Chinese Chapter

Tuesday, October 20th, 2009

From a Blog Post:

The China Written Works Copyright Society (CWWCS) has called on Chinese writers to stand up for their legal rights in the face of Web search giant Google’s proposed book settlement, according to a post published on the official Web site of Chinese Writers’ Association (CWA).

CWWCS claimed to have found copyrighted works written by a number of Chinese writers scanned and posted to Google’s digital library, Google Books.

[Snip]

A Google spokeswoman said, “Google Books promotes and encourages book sales – helping to ensure that authors and publishers are rewarded for their creative efforts. Our goal remains bringing millions of the world’s difficult-to-find, out-of-print books back to life. … The scope of our U.S. settlement is limited to the U.S. and comes under U.S law and only U.S. readers will benefit. Of course, we listen carefully to all concerns and will work hard to address them.”

A Google FAQ on its book program gave more detail about how compensation works for non-US authors:

“Holders of U.S. copyrights world-wide can register their works with the Book Rights Registry and receive compensation from institutional subscriptions, book sales, ad revenues and other possible sources, as well as a cash payment if their works have already been digitized. For example, a foreign author whose book was published outside the U.S. can register with the Book Rights Registry, and receive compensation, if that book is in the collection of a U.S. library from which it was digitized.”

Much More in the Complete Blog Post

Source: China Journal (via Wall Street Journal Digits Blog)

New From the Internet Archive: Bookserver, An Open System Allowing Users to Search Multiple eBook Catalogs from a Single Interface, Makes Crawling Easier Too!

Tuesday, October 20th, 2009

Think one search interface allowing users to search multiple ebook catalogs from various providers. At the same time, making it easy for various search engines to crawl content from these and other ebook sources.

Before moving on to press coverage, you might want to take a look at a slide presentation about BookServer technology from Peter Brantley, Director, Bookserver Project at the Internet Archive and Co-Founder of the Open Book Alliance.

Also, you can get a very basic feel for searching with Bookserver technology by heading to homepage and looking for the search box on the page. You’ll only be searching one catalog (The Open Library) but it uses Bookserver’s Open Publication Distribution System (OPDS). Of course, all of the books you find are free to access, read, download, etc.

Now, on to media coverage.

From an Article

The Internet Archive and various like-minded partners have launched an open architecture for selling and lending digital books online, an effort to consolidate the fledgling market for net texts – and give Google a little food for thought.

Dubbed BookServer, the open platform is meant to provide a standard means for booksellers, publishers, libraries, and individual authors to serve texts onto laptops, netbooks, smartphones, game consoles, and specialized ereaders a la the Amazon Kindle. The Archive has already demonstrated an early incarnation of the architecture with the Kindle and Sony’s Reader Digital Book.

See Also: Access the Complete Article

Source: The Register

See Also: Internet Archive’s BookServer could ‘dominate’ Amazon (via News.com)

Internet Archive founder Brewster Kahle told CNET News that BookServer is about creating an open system that allows search engines to index books that are available from a wide group of sources. Effectively, commercial publishers, lending libraries and even individual authors would have a way to index their work and offer easy digital distribution under BookServer, Kahle said.

[Snip]

Kahle said that he’s been thinking about such a project since before the advent of the World Wide Web, but that the technology has never been ready. But that’s changed over the last 20 years, he said. “We’ve now gotten universal access to free (content),” Kahle added. “Now it’s time to get universal access to all knowledge, and not all of this will be free.”

Much More After a Click
(more…)

Brown Univesity Gets Ready to Launch Digital Repository Using Fedora Software

Tuesday, October 20th, 2009

Yesterday, we posted about a massive digitization project recently announced at NYU and an editorial from the Princeton University newspaper focusing on Google digitization and the HathiTrust.

Today, we move to Providence, RI and Brown University.

From the Article:

The Center for Digital Initiatives is preparing to launch the Brown Digital Repository, an online database to allow faculty members to easily and safely store thousands of documents — and share them with their students and colleagues.

The service, which aims to make faculty research and teaching materials more accessible in the present as well as preserve them for posterity, could be operational as soon as next semester, according to Patrick Yott, head of the library’s digital services department and the Center for Digital Initiatives.

[Snip]

Many digital repositories, such as the Massachusetts Institute of Technology’s, use a program called DSpace, which Yott said is less flexible. The new program, Fedora, improves on older repository software.

“We were waiting for some of the technologies to mature,” Yott said. “We just waited to do it the way we wanted to do it.”

[Snip]

The repository will allow users to upload faculty papers, research data, electronic dissertations, teaching materials and other files. One feature of the Fedora platform allows files to be updated into newer formats should old ones become obsolete, preserving the documents for generations to come.

Source: The Brown Daily Herald

See Also: Learn More About Fedora (via Wikipedia)

Major Digitization Program Announced: NYU Announces Plans to Digitize All Holdings of Bobst Library

Monday, October 19th, 2009

UPDATE: 10/21: Library Journal reports that the article in the Washington Square News the other day, that’s to linked to below is inaccurate.

Josh Taylor, senior director, communications, NYU Abu Dhabi responds to what he told and is quoted saying in the article. He notes the conversation with the reporter was done using e-mail. Taylor’s comments appear on a Chronicle of Higher Education weblog).

This is a case of a reporter being 100-percent accurate with a quote, but drawing a wholly different conclusion than what she was actually told (or in this case, e-mailed).

The final part of my quote [in the article below] is critical to understanding our long-term thinking on the subject. Digitizing for digitization’s sake isn’t a sound academic or economic strategy. However, digitizing as we identify specific curricular and research needs that would benefit from students and faculty being able to access materials in a city other than their own is an essential component of NYU’s vision for the global network university.

NYU Library dean Carol Mandel today told LJ that “our plan, pending more approvals, is to do some significant selected appropriate digitization projects.” She goes on to tell Library Journal that the number one priority of a digitization project is that they meet “curricular and research needs.” Mandel also lists other factors that include, not digitizing content that has been digitized elsewhere, accessibility issues, and having permissions in order.

While Mandel was not ready to discuss the size or cost of the project, she said that it was not, as implied in the article, mass digitization but rather “more akin to digital collection development.” Small projects, she said, could begin this year or next.

From the Article:

————-

With the financial backing of Abu Dhabi, NYU is planning to digitize Bobst Library.

This will be perhaps NYU Abu Dhabi’s most visible change for the university’s Washington Square campus. A digital database of all the holdings in Bobst would serve to connect Abu Dhabi and New York’s research materials.

[Snip]

“We do plan on the future digitization of materials at Bobst, for access by those in Abu Dhabi, and elsewhere in the global network university, as curricular and research needs demand it,” NYUAD spokesman Josh Taylor wrote in an e-mail.

[Snip]

No other university has a completely digitized library, though many universities have made partial steps toward digitization, usually beginning with rare collections.

Kirtas Technologies, a leading company in digitization services, has worked to digitize portions of libraries at Yale and Cornell universities and at the University of Pennsylvania.

While working on a Microsoft-funded project at Yale and Cornell, Kirtas was digitizing three million pages a month — to the tune of 10 or 12 cents per page, according to Marketing Operations Manager Todd Whiting. That adds up to approximately $3 to $4 million a year.

NYU currently has no time frame for when the project will start. The university’s libraries have a combined 5.1 million volumes.

[Snip]

The large-scale digitization projects at Yale and Cornell did encounter some hiccups. According to Whiting, Kirtas had to develop a completely new machine to digitize the pull-out maps and diagrams in many of the rare books at the two universities.

Access the Complete Article

See Also: The NYU School Paper, Washington Square News Has Published an Editorial About the Project titled, “Ditigizing Bobst is Smart Yet Risky.”

From the Editorial:

The WSN Editorial Board thinks the benefits of a digitized library are innumerable. Access to all Bobst stacks though an online, searchable database will help all students and faculty, and the project shows that NYU is on the forefront of embracing technology in academia.

However, some members of our board feel a sense of uneasiness over the project’s funding. The Abu Dhabi government will cover 100 percent of the costs in digitizing Bobst, just as it has covered all costs in building NYU’s campus in Abu Dhabi.

Source; Washington Square News
Hat Tip: Jerome McDonough

European Commission Puts Challenges of Books Digitisation for Authors, Libraries and Consumers on EU’s Agenda

Monday, October 19th, 2009

From the Announcement:

The European Commission today adopted a Communication on Copyright in the Knowledge Economy aiming to tackle the important cultural and legal challenges of mass-scale digitisation dissemination of books, in particular of European library collections. The Communication was jointly drawn up by Commissioners Charlie McCreevy and Viviane Reding. Digital libraries such as Europeana will provide researchers and consumers across Europe with new ways to gain access to knowledge. For this, however, the EU will need to find a solution for orphan works, whose uncertain copyright status means they often cannot be digitised. Improving the distribution and availability of works for persons with disabilities, particularly the visually impaired, is another cornerstone of the Communication.

On adoption, Commissioners McCreevy and Reding stressed that the debate over the Google Books Settlement in the United States once again has shown that Europe could not afford to be left behind on the digital frontier.

“We must boost Europe as a centre of creativity and innovation. The vast heritage in Europe’s libraries cannot be left to languish but must be made accessible to our citizens”, Commissioner McCreevy, responsible for the Internal Market, stated .

Commissioner Reding, in charge of Information Society and Media, said: ” Important digitisation efforts have already started all around the globe. Europe should seize this opportunity to take the lead, and to ensure that books digitisation takes place on the basis of European copyright law, and in full respect of Europe’s cultural diversity. Europe, with its rich cultural heritage, has most to offer and most to win from books digitisation. If we act swiftly, pro-competitive European solutions on books digitisation may well be sooner operational than the solutions presently envisaged under the Google Books Settlement in the United States.”

The announcement goes on to discuss three main issues:

+ Digital Preservation and Dissemination

+ Orphan Works

The digitisation and dissemination of orphan works pose a particular cultural and economic challenge – the absence of a known rightholder means that users are unable to obtain the required authorisation, e.g. a book cannot be digitised. Orphan works represent a substantial part of the collections of Europe’s cultural institutions (e.g., the British Library estimates that 40 percent of its copyrighted collections are orphan ). The Commission will now examine this phenomenon more in detail via an impact assessment.

+ Access for Persons with Disabilities

Much More in the Complete Announcement

Source: EUROPA

See Also: Summary: Commission Communication on Copyright in the Knowledge Economy (1 page; PDF)

See Also: Communication from the Commission: Copyright in the Knowledge Economy, October 19, 2009 (10 pages; PDF)

The Daily Princetonian (Princeton U.) Editorial: Going Beyond Google Books

Monday, October 19th, 2009

From the Editorial:

In 2007, Princeton signed on as one of the partner libraries in the Google Books Search project. By the end of this six-year agreement, the University will have sent Google about one million books to be scanned. All of the books — which the University has ensured are in the public domain — will then be available for free on the internet. This is an exciting project, allowing Princeton to share some of its intellectual wealth with readers around the world.

[Snip]

Though there is currently a lawsuit pending against the Google Books Search project by the Authors Guild and the Association of American Publishers, it does not pertain to Princeton’s participation in the project.

[Snip]

But this legal challenge is a reminder that Princeton’s involvement with Google — though a positive and useful partnership — could pose problems in the future. For one, Google, a for-profit corporation, may not be around forever, as it is subject to the intense competition of the technology sector. And though Google’s current goal is to digitize every book ever published, this may not always be the case. It is not unreasonable to imagine that in the future Google may develop a commercial interest in digitizing only works that would appeal to large audiences to make time for its workforce to focus on more profitable ventures.

[Snip]

One way to both reconcile this disparity between the profit motives of Google and the academic goals of Princeton, as well as to contribute to a more stable, long-term initiative, is for Princeton to join the HathiTrust. This promising nonprofit database started by Indiana University and the University of Michigan now includes 25 large university partners intent on creating a permanent database of digitized books not subject to the economic pressures corporations face.

[Snip]

The Google Books Search project has given our library a great head start into the sphere of digital libraries, and at no cost. But to protect the purely academic spirit of digitized libraries, the University should seek alternatives to its participation in the Google project.

Access the Complete Editorial

Source: The Daily Princetonian
Hat Tip: Library Stuff

New Essays Discuss the Future of Scholarly Publishing

Monday, October 19th, 2009

A new book, The Fourth Paradigm: Data-Intensive Scientific Discovery, is available online (full text, free) and is a tribute to the work of the computer scientist, Jim Gray, who was lost at sea in 2007. The entire book is a collection of essays and can be downloaded as a single PDF or individual parts.

Here’s a link to the table of contents.

The part of the book containing essays on the future of scholarly publishing will likely be of greatest interest to ResourceShelf readers. It contains an introduction along with six essays written by a who’s who in scholarly publishing and information retrieval. What follows are direct links to each essay (PDF):

+ Introduction by Lee Dirks

+ Jim Gray’s fourth paradigm and the construction of the scientific record by Clifford Lynch

+ Text in a data-centric world Paul Ginsparg

+ All aboard: toward a machine-friendly scholarly communication system by Herbert Van de Sompel, Carl Lagoze

+ The future of data policy by Anne Fitzgerald, Brian Fitzgerald, Kylie Pappalardo

+ I have seen the paradigm shift, and it is us by John Wilbanks

+ From web 2.0 to the global database by Timo Hannay

Access the Complete Table of Contents for All Four Parts of the Book

Source: Microsoft Research

Brewster Kahle, Co-Founder of the Internet Archive, Named a Visionary Who is Changing Our World

Monday, October 19th, 2009

Congrats Brewster!

The UTNE Reader is out with their list of “50 Visionaries Changing Our World” and Brewster Kahle from the Internet Archive and Open Content Alliance is on the list. You can Brewster’s entry here and review the entire list here.

His entry includes a link to a Slate article about Kahle from 2005 and a SF Chronicle article also from 2005.

The Internet Archive is not only home to the essential The Wayback Machine but it’s also home to over 1 million digitized books, thousands of hours of music*** and film. The Internet Archive also runs the Archive-IT program that works with numerous organizations to archive their specific content.

*** The Internet Archive is home to two music collections. The first contains live music and the other features a wide variety of audio including audiobooks and poetry.

The Book That Contains All Books

Sunday, October 18th, 2009

From a Column by Stephen Marche in the WSJ:

On Monday, the Kindle 2 will become the first e-reader available globally. The only other events as important to the history of the book are the birth of print and the shift from the scroll to bound pages. The e-reader, now widely available, will likely change our thinking and our being as profoundly as the two previous pre-digital manifestations of text.

[Snip]

The introduction of the printing press brought a similarly enormous change to the nature of reading. One of the most interesting figures in that transformation is the great Benedictine scholar Trithemius. He lived in Sponheim in the 15th century and managed to amass a library fully half the size of the Vatican library, an incredible achievement. He was also the author of “In Praise of Scribes,” the foremost defense of scribal practice, in favor of writing things out and against printing them.

[Snip]

But I am immensely excited for the new phase of the book. So far the new technology has been called the “e-reader,” a term obviously picked by engineers, not poets. In literary terms it’s a transbook, by which I mean that it is the book which can contain all books.

[Snip]

We are still in early days, but it is obvious where the transbook is headed: It will eventually provide access to all text that is non-copyright, and to the purchase of every book in or out of “print.” Kindle 2’s boast of being able to hold 1,500 titles will eventually sound as ludicrous as those early ads for floppy disks boasting that they could hold up to 64k of data. We will want everything and we will get it. Possibly there will eventually develop a subscription service, which provides access to all books for a monthly fee. At any rate, a single object will contain the contents of all the world’s libraries. It’s just a matter of when that will happen. And who will profit.

Much More in the Complete Column

Stephen Marche is the pop culture columnist at Esquire magazine.

Source: Wall Street Journal