Archive for the ‘Digitization Projects’ Category

Collection Development: Want a Non-Stop Stream of Recently Digitized eBooks to Choose From? Check This Out!

Friday, November 20th, 2009

A Never Ending “Virtual Stream” of Digitized Text
by Gary Price, Senior Editior

When Chris Sherman and I were writing and then giving book talks and presentations about The Invisible Web, we said John Mark Ockerbloom’s Online Books Page was an essential resource for anyone interested in digitized, full text books. Now referred by most as eBooks. More than eight years later I feel the same way about this awesome and well organized collection.

Where do you begin with a site so full of content? For me, that’s easy. Monitoring the latest additions to the catalog/page. I am always blown away by the amount of new listings (when does Ockerbloom sleep?) and the number of organizations digitizing books. If you think it’s only Google digitizing books (of course they are a major player) but not they’re far from the only one doing this type of work. Just look for yourself. The page even has an RSS feed.

So, the Online Books Page is not only a “must have” searchable directory of ebooks but it can also be a great collection development resource to find and add digitized content to your local collection/OPAC.

But wait, we’ve got more.

The Online Books Page new listings only includes some of the digitized text output from the Internet Archive (IA).

If you want to be able to review (at your leisure) all of the new digitized content text content that the IA produces, it’s possible by subscribing to this RSS feed. Even if you’re not going to review the titles, just let it run for a few days to see the AMOUNT of text material that’s digitized in variety of formats. It’s an understatement to say that the scanners at the IA are cranking it out on all cylinders. So, collection development types, subscribe to both RSS feeds and have a large virtual bookshelf to choose from each day. If you don’t do the collection development thing both feeds are useful to illustrate the amount of material being digitized each day, week, month.

UPDATE: Not an RSS user? No problem. Just visit this Internet Archive page and refresh it a few times a day. The most recent addition is at the top.

Judge Gives Preliminary Approval to Google Deal, Sets Feb. 18 for Final Hearing

Friday, November 20th, 2009

From the Article:

Judge Denny Chin has given his preliminary approval to the Google Book Search settlement agreement and established a timeline to move the agreement toward a final resolution. A final settlement/fairness hearing has been set for February 18 at which Judge Chin will hear arguments to determine whether the agreement is “fair, reasonable, adequate;” consider whether to certify the class for purposes of the settlement; and to make a determination whether to approve the agreement.

Prior to the hearing, the judge has ordered that supplemental notices about the amended agreement be sent beginning December 14, and he set a January 28 deadline for objections to be filed with the court.

[Snip]

As part of the amended settlement, companies from outside of the U.S. were to be added as plaintiffs. The order notes that new plaintiffs include Harlequin, Melbourne University Publishing Ltd., and The Text Publishing Company.

Source: Publisher’s Weekly

HathiTrust Offers Full-Text Search of Millions of Digitized Books and Journals

Thursday, November 19th, 2009

From the Announcement:

A year after its launch by 25 leading U.S. research libraries, HathiTrust Digital Library announces a service that will transform how researchers use the more than 1.6 billion pages (4.6 million volumes) in its collections.

The breakthrough allows for full-text searching capabilities across the entire library. Researchers can now search public domain and in-copyright works by keyword or phrase.

Based on open source Solr/Lucene technology, the service expands on an experimental search of public domain volumes introduced in November 2008. Full-text search will continue to be supported across the repository as it grows at a rate of hundreds of thousands of volumes every month.

“The HathiTrust partners are pleased to offer a search service that helps mine this growing body of authoritative library materials,” said John Wilkin, HathiTrust executive director and associate university librarian at the University of Michigan. “HathiTrust continues to distinguish itself with its reliability and with its efforts to broaden the availability of digitized library collections in the flow of scholarly discourse. We see this valuable discovery service as one in a series of major steps HathiTrust is taking to shed light on this vast body of material.”

In combination with the HathiTrust Digital Library’s carefully curated bibliographic data, the new functionality allows researchers to more efficiently locate items relevant to their research. It also lays the foundation for future services such as full-text search with faceted browsing, advanced search, “more like this” options, and tools that can be used in computational research.

The effort to provide full-text searching capabilities across the repository has yielded valuable benchmarking data, methods, and code to the broader large-scale search community, said Wilkin.

The HathiTrust partners are committed to developing the repository and its services to meet the long-term needs of their academic communities, and offer a unique resource on the Web for scholarship and research.

Source: HathiTrust / University of Michigan

See Also: HathiTrust Home Page and List of Partners

See Also: Access HathiTrust Search Interfaces (Including Full Text Search)

Australia: Historic Newspaper Digitisation: Early Editions of Sydney Morning Herald Now Available Online; What is Trove?

Thursday, November 19th, 2009

From an e-Mail:

The National Library’s Australian Newspapers service has recently made available https://mail.google.com/mail/?zx=1rjv366gqucji&shva=1#inbox/1250a7f37fa96144early editions of The Sydney Morning Herald.

The digitisation of The Sydney Morning Herald was made possible by a $1 million contribution from the Vincent Fairfax Family Foundation. Eventually, all out-of-copyright editions of the Herald will be available, from its inception in 1831 to 1954.

It is now just over a year since Australian Newspapers was released to the public and there are 8.5 million articles available from 33 newspaper titles. A community of volunteer ‘text correctors’ has now corrected 7 million lines of the electronically translated text in 318 000 articles, enabling more accurate search results.

Access the Collection (via Trove)

Btw, what is Trove?

“one search…a wealth of information”

Trove is our new free online service that gathers information about Australia and Australians in a single search.

Discover:
+ Digitised Australian newspapers, 1803 – 1954
+ Books, magazines and articles
+ Pictures and photographs
+ Music, oral histories and videos
+ Maps
+ Archived websites
+ Biographical information

Source: National Library of Australia

Just Released: Shakespeare Quartos Archive Opens Access to Hamlet

Tuesday, November 17th, 2009

We are looking forward to spending some quality time with this very high quality resources (that’s also free). If nothing else, it really shows off the power of digital archives and digitization.

From the Announcement:

The highly-anticipated Shakespeare Quartos Archive has been officially launched today with a complete digital collection of rare early editions of Hamlet.

For the first time, all 32 existing quarto copies of the play held by participating UK and US institutions are freely available online in one place. This initiative is jointly led by the Bodleian Library, University of Oxford and the Folger Shakespeare Library, Washington DC, through a joint transatlantic grant from Jisc in the UK and the National Endowment for the Humanities in the US.

[Snip]

Now scholars can explore these different quarto versions side by side for the first time. It features high-quality reproductions and searchable full text of surviving copies of Shakespeare’s Hamlet in quarto in an interactive interface. The project, which began in April 2008, reunites all 75 pre-1642 quarto editions of Shakespeare’s plays into a single online collection. The prototype interface is at present fully functional only for Hamlet, but the Shakespeare Quartos Archive plans to apply this technology to all the plays in quarto, and to seek involvement from new partner institutions.

Now scholars can explore these different quarto versions side by side for the first time on the project website. It features high-quality reproductions and searchable full text of surviving copies of Shakespeare’s Hamlet in quarto in an interactive interface. Functions and tools – such as the ability to overlay images, compare them side-by-side, and mark and tag features with user annotations – facilitate scholarly research, performance studies, and new applications for learning and teaching.

The project, which began in April 2008, reunites all 75 pre-1642 quarto editions of Shakespeare’s plays into a single online collection. The prototype interface is at present fully functional only for Hamlet, but the Shakespeare Quartos Archive plans to apply this technology to all the plays in quarto, and to seek involvement from new partner institutions.

Direct to Shakespeare Quartos Archive

Source: JISC, NEH

Bookless Libraries Increase Accessibility

Monday, November 16th, 2009

From the Article

Carnegie Mellon has for years already been building its own “bookless library”: “For the nearly 15 years I’ve been in the Carnegie Mellon Libraries, we’ve been working hard to provide the campus with what I think is a very realistic view of the library of the future. We are working towards a hybrid of an online and paper-based library,” computer science librarian Missy Harvey explained.

[Snip]

The rarity of many works in the Posner Memorial Collection highlights the advantages of the increased accessibility and longevity of electronic republishing. “Rare books [can be] scanned and delivered via the World Wide Web to scholars in places such as Argentina and Germany who could not visit the books,” Mary Catharine Johnsen, the special collections and design librarian, said. “Electronic versions save wear and tear on using the physical book, which is important if you are a book from 1755 and your leather spine is dry and cracking.”

[Snip]

Johnsen also said that even in successful scans of books to electronic formats, the many subtleties in a book’s presentation and metadata may be lost in an online medium. “For literature students, you really want to see the original format of the work as received by its first public. Was it a fancy coffee-table book? Was it a cheap paperback or flimsy pamphlet? Was it a colorful book to tempt you in a Victorian train station or an airport bookstall?”

Source: The Tartan (Carnegie Mellon)

Press Review+: Google Book Search Revised Settlement (2.0) Released; What About Libraries?

Friday, November 13th, 2009

We’re going to on the lookout for news, commentary from experts, and viewpoints from various organizations and companies involved in the GBS story. We’re posting selected snippets with links to the full text. We also know that in the document filed with the court, there is one mention of libraries, public libraries to be specific.

From Google and Others Involved:

+ Modifications to the Google Books Settlement (via Google Public Policy Blog, Dan Clancy)

The changes we’ve made in our amended agreement address many of the concerns we’ve heard (particularly in limiting its international scope), while at the same time preserving the core benefits of the original agreement: opening access to millions of books while providing rightsholders with ways to sell and control their work online.

The blog post also links to a settlement modifications overview (3 pages) and a Revised Settlement FAQ (2 pages).

Are libraries mentioned in these documents? Yes. As you’ll read not much is different in terms of access except that the amended agreement allows the Registry to increase the amount of terminals in a public library.

On Page 2 of the Overview it States:

The amended settlement does not change the primary access models outlined in the original agreement, including enabling readers to preview and purchase books, selling institutional subscriptions to the whole database, and giving libraries free access at designated terminals. Under the revised agreement, possible additional access models to which Google and the Registry might agree in the future have been reduced and are now limited to: print-on-demand*, file download, and consumer subscription. The amended agreement also enables the Registry to increase the number of terminals at a public library building

* The Amended Settlement limits POD, if approved, to Books that are not Commercially Available.

There is no mention of the words library or libraries in the FAQ.

There is a third document, a Supplemental Notice (an actual court filing; 6 pages; PDF),  listing all of the changes to the settlement.  #17 talks about the terminals in public libraries that we mentioned a moment ago.

Here are a few more changes (via the supplemental notice) that might be of special interest:

+ #16:

The Amended Settlement provides that the Registry will facilitate Rightsholders’ wishes to allow their works to be made available through alternative licenses for Consumer Purchase, including through a Creative Commons license…The Amended Settlement also clarifies that Rightsholders are free to set the Consumer Purchase price of their Books at zero.

+ #18:

The Amended Settlement no longer includes children’s book illustrations in the definition of Inserts. (ASA Section 1.75) The Amended Settlement, however, does not change the inclusion of pictorial works, such as graphic novels and children’s picture books, in the definition of Books and provides that the Amended Settlement only authorizes Google to display the pictorial images in such Books if a U.S. copyright owner of the pictorial image also is a Rightsholder of the Book. The Amended Settlement also clarifies that comic books are considered to be Periodicals and that Periodicals (as well as compilations of Periodicals) are not included in the definition of “Books,” and thus are not in the Amended Settlement.

Finally, if you would like to read the complete Amended Settlement Agreement, here’s the 173 page PDF file.

+ Amended Google/AAP Settlement (via Coyle’s InFormation, Karen Coyle)
An excellent overview of Settlement 2.0 from librarian Karen Coyle. She brings up several library related issues including the removal of an OCLC “exception”; download formats and course packs; and much more. This is must read material.

+ Is the Google Books Settlement Worth the Wait?

The Open Book Alliance–SLA and The New York Library Association–are two of its members has posted their views after a preliminary reading of the revised settlement. Here are a few snippets.

Open Book Alliance co-chair Peter Brantley said, “Our initial review of the new proposal tells us that Google and its partners are performing a sleight of hand; fundamentally, this settlement remains a set-piece designed to serve the private commercial interests of Google and its partners. None of the proposed changes appear to address the fundamental flaws illuminated by the Department of Justice and other critics that impact public interest.

[Snip]

Most critically, the settlement proposal must not grant Google an exclusive set of rights (de facto or otherwise) or result in any one entity gaining control over access to and distribution of the world’s largest digital database of books. It is clear that Google has failed to meet these requirements.

UPDATE: 11/17 The Monopoly Continues (Source: Open Book Alliance)

UPDATE: 11/17 Proposed Changes Fail to Address Fundamental Flaws, Says Open Book Alliance Co-Chair (via Open Book Alliance)

+ Revised Google Book Settlement Filed & Live Blogging The Press Call (via Search Engine Land, Danny Sullivan)

Danny took the time to live blog the conference call that took place early Saturday morning, east coast time. On the call were:

+ Richard Sarnoff, chairman of the American Association of Publishers

+ Paul Aiken, executive director of the Authors Guild

+ Daniel Clancy, engineering director for Google Books

Here’s how they responded to the Open Book Alliance comments that are posted and linked to above this item.

So the response to that? Clancy stepped up, saying there were lots of discussions on how to change things. Adjustments were made to address class member concerns (the people involved in the lawsuit, rather the the Open Book Alliance, which is not a party to the suit). “I understand Amazon, Microsoft and the Internet Archive don’t want to increase access to these books,” he said, or very close to that. That was a zinger, stressing that the Open Book Alliance just happens to be backed by major Google competitors. Not that Google minds. Clancy said they welcome the competition and feel the settlement addresses concerns.

Aiken: “These are substantial changes.” He added that yes, the core settlement was largely protected but that it had to be, as it was in general seen correct.

Sarnoff: Said he assumed the OBA hadn’t read the settlement. That was probably true enough. The press conference itself appears to have started about 1/2 hour after the settlement was out. Some reporters on the call mentioned they hadn’t even read it.

+ The Authors Guild Has a Review of the MaJor Changes on their Site

+ Google Book Search Settlement Revised: No Reader Privacy Added (From the Electronic Frontier Foundation)

Unfortunately, the parties did not add any reader privacy protections. The only nominal change was that they formally confirmed a position they had long taken privately that information will not be freely shared between Google and the Registry. Our partners at the ACLU of Northern California have a blog post describing the changes we, and the authors we represent, have demanded and continuing the call for readers everywhere to let Google CEO Eric Schmidt know that reader privacy should not be left behind as books move into the digital age.

+ Amended Google Book Settlement: Doesn’t Deal with Privacy Problems (ACLU of Northern California)

One of our core privacy concerns with the Settlement has been that reading records are not properly protected from disclosure to the government and third parties. Readers should be able to use Google Book Search without worrying that the government or a third party is reading over their shoulder. No Settlement should be approved that allows reading records to be disclosed without a properly-issued warrant from law enforcement and court orders from third parties.

The Amended Settlement does not resolve this concern, with its only new privacy provision being the following:

“The revised agreement includes language that specifies that Google will not share any private information with the Registry without valid legal process.”

Much More After a Click
(more…)

Digitized Historic Newspapers: Topic Guides for Chronicling America

Friday, November 13th, 2009

The Chronicling America from the NEH and the Library of Congress is searchable database containing more than million digitized American newspapers pages (and growing) from 1880-1922.

Guides cover topics “widely covered” in the American press of the time. As of today (11/13/2009), there are 21 guides available with more expected soon.

Here are the titles of a few of the guides:

+ Annexation of Hawaii
+ Bloomer Girls (Women’s baseball)
+ Clara Barton
+ Comic Strips
+ Ellis Island
+ Jack Johnson vs. James J. Jeffries
+ Jack the Ripper
+ Patent Medicines
+ Presidential Election of 1896
+ Pullman Porters
+ San Francisco Earthquake, 1906

You can the find the complete list and register for alerts when new guides released here.

Milestones: The British Library’s Digital Library Passes 500,000 Items

Friday, November 13th, 2009

From the Announcement:

The British Library has added the 500,000th item to its long-term Digital Library System. The milestone item was a digitised copy of a newspaper originally published in 1864 and scanned as part of the Library’s 19th Century British Library Newspapers project, which recently made more than 2 million pages of historic newspapers available online. [Subscription Required].

[Snip]

Steve Green, Head of the Digital Library Programme at the British Library said: “The task of collecting, preserving and providing long-term access to the nation’s digital assets is in many ways a daunting and complex undertaking. The sheer amount of material being published digitally is challenging enough in itself, but the wide range of different formats – many of which will inevitably become obsolete – makes preservation and future accessibility far from straightforward. The Digital Library Programme has made huge progress in the past few years and we now have the foundations of a robust and fully scaleable system that can handle large quantities of digital items, ensuring their availability for future generations of researchers just as our historic print collections remain available for users today.”

Currently the Digital Library System holds:

+ 386,000 items received through the Voluntary Deposit of Electronic Publications (VDEP) scheme
+ 23,000 British Library Sound Archive master files
+ 65,000 19th century digitised books
+ 2,000 electronic journal items
+ 29,000 newspaper items

Source: British Library

Update: NARA/Footnote Holocaust Collection of Digitized Records, Materials to Remain Free Through December 31st

Thursday, November 12th, 2009

At the end of September we posted an in-depth overview about a new collection of digitized Holocaust records from the National Archives (NARA) and Footnote. Our post said that at the end of October a sizable portion of the content would only be available to Footnote.com subscribers.

Today, an update. All of the material will remain free through December 31, 2009.

From an E-Mail:

…due to the popularity of this collection, we have decided to keep the records open free to the public through the rest of this year. This will enable more people to search and explore the original records from the National Archives. On January 1, 2010 these records will become part of the paid subscription on Footnote.com. These records, however, will remain free to access through any of the National Archives physical locations.

Digital Archives: Siegfried Sassoon Collection Added to First World War Digital Poetry Archive

Thursday, November 12th, 2009

We first posted about The First World War Digital Poetry Archive from Oxford University in September.

The First World War Poetry Digital Archive is an online repository of over 4000 items of text, images, audio, and video for teaching, learning, and research.

The heart of the archive consists of collections of highly valued primary material from major poets of the period, including Wilfred Owen, Isaac Rosenberg, Robert Graves, Vera Brittain, and Edward Thomas. This is supplemented by a comprehensive range of multimedia artefacts from the Imperial War Museum, a separate archive of over 6,500 items contributed by the general public, and a set of specially developed educational resources.

Yesterday, the Siegfried Sassoon Collection was added to the archive.

Jennifer Howard from The Wired Campus writes:

Although it contains photographs and other materials, the collection centers on manuscripts of Sassoon’s poems, drawn from holdings at Oxford’s Bodleian Library and at the University of Cambridge, the New York Public Library, and the Harry Ransom Center at the University of Texas at Austin. A draft of Sassoon’s poem “Standing With the Dead” turns up in a June 19, 1918, letter to his friend Robert Nichols.

A New Digital Collection: A Calm Voice in a Strident World: Senator J.W. Fulbright Speaks

Wednesday, November 11th, 2009

This new digital collection comes from the University of Arkansas Libraries.

From the About Page:

For three decades following World War II, J.W. Fulbright represented Arkansas in the Congress of the United States. His single term in the House and four terms in the Senate saw Fulbright rise to become the foremost congressional authority on American foreign policy. From the beginning, Fulbright was a voice of calmness in the halls of congress, counseling international cooperation, the exchange of information, and support for the United Nations.

This digital collection contains fifty speeches Fulbright made during his congressional career. While the speeches deal with many topics, the emphasis is given to foreign affairs.

In order to put the speeches into their historical context, a variety of resources are included—including a detailed time line, a bibliography on the senator, and a selection of photographs.

Readers are reminded that these 50 speeches and related materials included on this site represent only a tiny fraction of the J.W. Fulbright Papers, comprised of over 1400 linear feet, held by the University of Arkansas Libraries. Serious students of Senator Fulbright and his era are urged to consult the full collection at the University. The collection, and a partial guide to the collection, may be accessed
[here].

Access: A Calm Voice in a Strident World:Senator J.W. Fulbright Speaks

See Also: More Digital Collections from the University of Arkansas

More in this News Release from the U. of Arkansas

Source: U. of Arkansas Libraries

Princeton Theological Seminary Library Partnering with Internet Archive to Digitize Content

Tuesday, November 10th, 2009

From the Web Site:

The Princeton Seminary library is delighted to be working in partnership with the Internet Archive, a 501(c)3 organization in San Francisco, to digitize a selection of out-of-copyright books from its collections. Approximately 21,000 items have been scanned to date. These books include historical sources about Princeton and Princeton Seminary, early editions of John Calvin in Latin and English, illustrated works on early Protestant missions, and classic biblical commentaries, among much else. All of these books are available to alumni/ae without restriction.
,
The URL of the Internet Archive is http://www.archive.org/details/Princeton.

Note: The article includes a picture of a person scanning materials into the database using the IA’s scanning technology.

Source: Internet Archive / Princeton Princeton Theological Seminary Library

Parties Involved Ask for a Brief Extension Before Google Book Search Settlement 2.0 Released

Monday, November 9th, 2009

An new version of the Google Book Search settlement was expected to be released today. However, as the NY Times reports, those involved have asked for more time. As of today, “Settlement 2.0″ will be released this Friday.

The parties to the Google book settlement, which would legalize the creation of a vast library of digital books, have asked the judge overseeing a revision of the agreement for an extension to this Friday, Nov. 13.

The parties told Judge Denny Chin of the Federal District Court of the Southern District of New York that they would submit a revised settlement for the court’s preliminary approval by Nov. 9.

But on Monday, the parties submitted a letter to the court requesting an extension to Nov. 13. In the letter, the group indicated that it had met with the Justice Department before and after the October status hearing and had met as recently as Friday, Nov. 6.

Source: Media Coder Blog (NY Times)

Source: Open Book Alliance Releases Baseline Requirements for Revised Google Book Settlement Proposal

An Interview with Project Gutenberg Founder Michael Hart

Sunday, November 8th, 2009

Andrea Kobeskzo recently interviewed Michael Hart and now you can read the Q & A interview on the Project Gutenberg News Portal.

Here’s an interesting passage when Kobeskzo asks Hart about how Project Gutenberg has evolved over the years (PG began in 1971). Hart says:

Believe it or not people were still saying eBooks were never going to make it just a few years ago. Look for a quote in the Wall St. Journal: “Ebooks are never going to make it.” Before that the NY Times: look for: “twitchy” screen. However now that it’s obvious they are moving eBooks on their own, but I can’t tell how serious they are. They may just be following the rule of simple reporting: “Follow The Money.” If eBooks fall flat will they all just move on and pretend there was never any interest?

The first goal of PG was just to prove eBooks feasible. My own estimations were that it would take about 10,000, and that seems to have proved correct as Google called me in to advise them ASAP after we hit 10,000, and we went to do just that on December 14, 2003: and they announced they had invented eBooks and eLibraries December 14, 2004. However, they did the opposite, or rather exact opposite of what I said they should do and look what happened. Most of the big legal fray is because they were more money oriented, and as such may have intentionally played the copyright cards that got them in the big legal hassles. If they had started out by emphasizing the public domain it probably would have worked out a lot better for them in the press as the good will they would have built up would have gone a long way.

Personally, I am OK with nearly any eBook format that is compact and search quote friendly.

Access the Complete Interview

Open Book Alliance Releases Baseline Requirements for Revised Google Book Settlement Proposal

Friday, November 6th, 2009

On Monday (November 9th), a revised proposed settlement (aka Settlement 2.0) from Google, the Authors Guild and the Association of American Publisher will be released. The Open Book Alliance (OBA) has posted on its web site what they call “baseline requirements” for the Settlement 2.0.

The Special Libraries Association and The New York Library Association are members of the OBA.

From the Blog Post:

The Open Book Alliance is issuing the following baseline requirements that the new settlement proposal must meet if it is to achieve those critical objectives. These requirements reflect the collective expression of concerns by the U.S. Department of Justice, authors, publishers, academics, libraries, foreign nations, state Attorneys General, consumer advocacy groups, and many others, and thus we think it appropriate to review the revised settlement within this framework.

[Snip]

+ The settlement must not grant Google an exclusive set of rights (de facto or otherwise) or result in any one entity gaining control over access to and distribution of the world’s largest digital database of books.

+ Authors and other rights holders must retain meaningful rights and the ability to determine the use of their works that have been scanned by Google.

+ The settlement must result in the creation of a true digital library that grants all researchers and users, commercial and non-commercial, full access that guarantees the ability to innovate on the knowledge it contains.

+ All class members must be treated equitably.

+ The settlement cannot provide for competition by making others engage in future litigation.

+ Congress must retain the exclusive authority granted by the U.S. Constitution to set copyright policy.

+ All rights holders impacted by the settlement must have a meaningful ability to receive notice, understand its terms and opt-out.

+ The parties that negotiated the settlement must live under the terms to which they seek to bind others, rather than their own separately negotiated arrangements.

Access the Complete Blog Post

Source: Open Book Alliance

See Also: Press Review: Judge Chin Sets Nov. 9 Deadline For Revised Google Book Settlement (via ResourceShelf, October 7, 2009)

Internet Archive Founder Brewster Kahle Profiled in Forbes

Wednesday, November 4th, 2009

Brewster Kahle has many titles. These days he’s best known as founder of the Internet Archive (home of The Wayback Machine) and founding member of the Open Content Alliance.

From the Article:

“We have to have universal access to everything, just like a library,” he says. “Do we want that under a single corporation’s control? It is openness, not corporate control, that propels capitalism.”

[Snip]

Digital libraries will shape education, creativity and our shared intellectual heritage, Kahle declares. As founder and director of the Internet Archive, Kahle has posted online digital copies of 1.7 million books, 100,000 hours of television, 200,000 video clips, 70,000 concerts and 415,000 audio recordings. All that material can be downloaded for free from the Archive’s Web site.

[Snip]

Bookserver* uses a range of open source and proprietary electronic book standards, search algorithms, editing tools and libraries. The architecture, as Kahle calls it, potentially separates manufacturers of devices from control over much of the content inside them. It also preserves the idea of the lending library–if you “check out” a volume, others cannot access it in the time allowed to you. Publishers sell their books in the system using credit cards.

The article continues with more about Google Book Search and Kahle’s background.

We were surprised not to see The Wayback Machine mentioned in the stats about the Internet Archive listed above. At the moment (and we know of nothing coming), “Wayback” is probably the best chance a researcher has to access a page no longer on the Internet. Material in “Wayback” dates back to 1996 and as of today, contains more than 150 BILLION archived pages. The Internet Archive also offers a fee-based service that helps organizations organize and archive their web content. It’s called, Archive-It.

* See Also: We Have an In-Depth Post About Bookserver on ResourceShelf
It Includes an comprehensive press review the day after the Bookserver announcement.

Source: Forbes

Bibliotheca Alexandrina: A Digital Revival

Tuesday, November 3rd, 2009

The Bibliotheca Alexandrina is one busy place. If you want to learn more read on through our highlights but make sure to read the complete article. Our highlights is just a sample of what’s going on.

From the Article:

The International School of Information Science (ISIS) a research institute affiliated with the BA [Bibliotheca Alexandrina], aims at furthering the BA’s goals of being a leading institution in knowledge dissemination and, specifically, promoting research and development related to the digital libraries. Toward that goal, ISIS has embarked on an array of ambitious projects, in partnership with world-class institutions. These include hosting a mirror site for the Internet Archive, participating in the Million Book Project, organizing the digital archive of the Gamal Abdel Nasser collection, digitizing 113 years of Al-Hilal magazine, presenting the first-ever complete digital version of Description de l’Egypte, conducting advanced research such as the Arabic component of the UN-sponsored Universal Networking Language computerized multi-language translation program, and offering the most advanced 3D virtual imaging techniques in a virtual immersive environment for science and technology applications. Thus, despite being barely seven years in existence, the BA already has a substantial record of achievements.

Among the other projects you’ll read about are:

+ The Digital Assets Repository (DAR)

+ Memory of Modern Egypt Digital Repository

+ Archive documenting the history of the Suez Canal

+ SuperCourse

To empower science educators worldwide, the BA is working with a team of specialists, in partnership with the University of Pittsburgh, to launch the first science SuperCourse, comprising thousands of PowerPoint lectures made available for free to teachers and lecturers, who can use the lectures as they see fit in their teaching of science. The SuperCourse has been effectively implemented in the area of Public Health and Epidemiology, with a network of 65,000 scientists in 174 countries, providing more than 3,500 lectures in 31 languages. The BA maintains a mirror site of SuperCourse, which receives an average of one million hits per month, and is working on setting up a similar course in all fields of science.

Much More in the Complete Article

Source: EDUCAUSE Review
Hat Tip: OAN

More Digitization Underway: This Time Footnote.com is Digitizing the U.S. Census from 1790-1930

Tuesday, November 3rd, 2009

Footnote.com is once again partnering the National Records and Administration Agency (NARA) to digitize massive amounts of content and then make that material available, often for a fee, available online. Footnote is becoming–and for some has already become–and important resource for historians, genealogists, students, and others.

This time around, Footnote.com, is digitizing all publicly available Census materials from 1790-1930. These dates represent the period when all materials (including names) from a given census have been made publicly available. Through its partnership with NARA, Footnote.com will add more than 9.5 million pages of content when the census database project is complete. We’ve learned that Footnote.com is digitizing all of this material on their own.

From a Footnote.com Blog Post:

With over 60 million historical records already online, Footnote.com will use the U.S. Census records to tie content together, creating a pathway to discover additional records that previously have been difficult to find.

The Interactive Census Project Home Page offers much more detail and examples. You can also create email alerts when new states are added to the census database. On the lower-left side of the page you can track the progress of each census has been digitized. As you’ll see, the 1860 census is complete and the 1930 census is just about done.

Searching is free, Footnote provides numerous options to refine your search (here’s an example). Accessing the complete record is fee-based either subscribing to the database for a annually or monthly. You can also by individual documents for $2.95. Btw, Footnote.com also sells institutional access to libraries through EBSCO.

Footnote looks at the census project as a “highway” to assist the researcher in finding more information in other databases.

If you’ve been reading ResourceShelf for a while you’ve seen an increasing number of mention their services. Here’s a list of a few of them,

+ In August of 2009. we posted on the release of a joint project with the National Archives (NARA) to digitize holocaust material.

+ In December of 2008, in a partnership with NARA, Footnote released the largest interactive World War II collection online.

+ In March, 2008 we posted about Footnote.com offering an interactive version of the Vietnam Wall.

Our first post about Footnote dates back to January, 2007.

If you run this search using the ResourceShelf database, you’ll be able to see and read all of our Footnote.com posts.

But wait, there’s more. A quick review of the Footnote “press room” offers up even more projects. You can learn about them here.

The Complete Archive of National Geographic Magazine on Six DVD’s

Sunday, November 1st, 2009

Every now and then a fee-based product comes around that we believe deserves your attention. The following is one of them.

Chris Pendleton on the Bing Blog reminds us that a major digitization project, every issue ever published of National Geographic from 1887-2008, is now available (it was officially released yesterday according to this media announcement) on 6 DVD’s or an external hard drive. That’s right, all of the writing, the legendary imagery, the supplement , even the advertisements are included. For many topics, Nat Geo magazine is a resource that documents people, places, and events, on a global scale. In other words, for all of the reasons just mentioned and many others, makes the magazine an important part of the historical record.

By the way, the reason it was mentioned on the Bing Blog is because Bing is providing some the technology that powers the digitized version this recently released collection.

From the Blog Post

Nat Geo uses Bing Maps in their Geobrowse functionality which allows you to browse a map anywhere in the world to find locations where relevant articles are referenced using geographic metadata.

Yes, we still love paper and those massive collections of past issues of the print version of National Geographic Magazine many people own (where are yours)? They’re also important.

That said, we also hear and read that for today’s student, it’s all about digital access. Yes, of course, that’s rather sad. However, a digitized archive of this size and scope can truly demonstrate the power of digital info technology for people of all ages and turn 120 years of content into important research and learning resources.

Another digitized archive of the magazine was released seven years abut this 120 year collection is the most complete version ever published with more content, more search options, saving/sharing tools, interactive maps, and more. One thing we noticed right of the bat is the that the new version is available for both PC and Mac. The “112 year version” was PC only.

Here are a few fast facts about the new collection. They were gleaned from Nat Geo site (including the video overview) and news release.

+ All Issues from October, 1888-December, 2008 are included

+ Six DVD’s include more than 200,000 pages; 300 wall map supplements, more than 8,400 articles; more than 250,000 photographs

+ All images scanned in high-resolution

+ Flip one page at a time, zoom, print

+ Geobrowse

A new Geobrowse function powered by Bing Maps that allows users with Internet access to search nearly 5,000 locations on a globe that are featured in the magazine’s archive of articles and maps.

+ Search by keyword, date, contributor, and topic; refine by date or content type

+ Browse by month or year

+ Create personalized reading lists; share these lists with other users in the Nat Geo community

+ Pre-loaded “favorite article lists” compiled by experts

National Geographic is selling the DVD’s for $69.95/US and the hard drive version for $199.95/US.

The lowest price we found as of Sunday November 1st was $42.78 from an Amazon.com Merchant. The DVD’s directly from Amazon.com are $44.99/US.

We’ve ordered a copy of the DVD’s and after spending some time with them we will report back.