Archive for the ‘Information Science’ Category

The October, 2009 Issue of the IFLA Journal is Now Available

Saturday, November 7th, 2009

Access the Complete Issue (35.3) (PDF)

This Issue (full text online, free) includes:

+ Editorial: Reading, Information Literacy and Professional Development

+Reading Sources and Reading Spaces in Honduras

+ Information Literacy and Scholarly Investigation: a British perspective

+ Our Space: professional development for new graduates and professionals in Australia

+ Open Access Repositories in Computer Science and Information Technology: an evaluation

+ The Library Services to People with Special Needs Section of IFLA: an historical overview

Access the Complete Issue (35.3; PDF)

Source: International Federation of Library Associations and Institutions

GAO — National Archives: Progress and Risks in Implementing its Electronic Records Archive Initiative

Friday, November 6th, 2009

National Archives: Progress and Risks in Implementing its Electronic Records Archive Initiative (PDF: 154 KB)
From Highlights (PDF; 45 KB):

NARA has completed two of five planned increments of ERA, but has experienced schedule delays and cost overruns, and several functions planned for the system’s initial release were deferred. Although NARA initially planned for the system to be capable of ingesting federal and presidential records in September 2007, the two system increments to support those records did not achieve initial operating capability until June 2008 and December 2008, respectively. In addition, NARA reportedly spent about $80 million on the base increment, compared to its planned cost of about $60 million. Finally, a number of functions originally planned for the base increment were deferred to later increments, including the ability to delete records and to ingest redacted records. In fiscal year 2010, NARA plans to complete the third increment, which is to include new systems for Congressional records and public access, and begin work on the fourth.

Source: Government Accountability Office (David A. Powner, director, information technology management issues, before the Subcommittee on Information Policy, Census and National Archives, House Committee on Oversight and Government Reform)

Digital Preservation: ACM Will Partner with Portico and CLOCKSS for Preservation of Its Digital Library Resources

Friday, November 6th, 2009

From an Announcement:

ACM (the Association for Computing Machinery) announced today that it is providing its institutional library customers with advanced electronic archiving services to preserve their valuable electronic resources. These services, provided by Portico and CLOCKSS, address the scholarly community’s critical need for long-term solutions that assure reliable, secure, deliverable access to their burgeoning digital collection of scholarly works. ACM is offering these services to protect the vast online collection of resources in its Digital Library (DL), which are used by over 1 million computing professionals and students worldwide.

“By partnering with Portico and CLOCKSS, we are able to meet a growing demand in the library community for a trusted, reliable third-party archive, and to ensure that digital collections remain accessible to future scholars, researchers, and students,” said Scott Delman, ACM Group Publisher. “Scientific discovery and the educational process are not possible without reliable access to the accumulated scholarship of the past and secure preservation of the scholarly record, and these agreements are a clear step forward with the relationship between the ACM and the library community.”

By investing in long-term digital preservation of content, ACM’s aim is to make it easier for libraries to accelerate their transition away from print and free up resources invested in print collections in favor of new and innovative electronic products and services.

Much More After a Click
(more…)

New Video on Web Archiving

Friday, November 6th, 2009

From the Description:

Web content changes all the time. If we don’t save that content before it disappears, a major part of our cultural history will be lost.

The Library of Congress is working to provide permanent access to web content of historical importance. It selects websites for collection, requests permissions from the website owners, addresses the technology of collecting websites and preserves the websites and makes them available.

This video examines those four challenges.

Access the Video (embedded here)

A text transcript is also available (PDF)

Source: National Digital Information Infrastructure and Preservation Program

Open Book Alliance Releases Baseline Requirements for Revised Google Book Settlement Proposal

Friday, November 6th, 2009

On Monday (November 9th), a revised proposed settlement (aka Settlement 2.0) from Google, the Authors Guild and the Association of American Publisher will be released. The Open Book Alliance (OBA) has posted on its web site what they call “baseline requirements” for the Settlement 2.0.

The Special Libraries Association and The New York Library Association are members of the OBA.

From the Blog Post:

The Open Book Alliance is issuing the following baseline requirements that the new settlement proposal must meet if it is to achieve those critical objectives. These requirements reflect the collective expression of concerns by the U.S. Department of Justice, authors, publishers, academics, libraries, foreign nations, state Attorneys General, consumer advocacy groups, and many others, and thus we think it appropriate to review the revised settlement within this framework.

[Snip]

+ The settlement must not grant Google an exclusive set of rights (de facto or otherwise) or result in any one entity gaining control over access to and distribution of the world’s largest digital database of books.

+ Authors and other rights holders must retain meaningful rights and the ability to determine the use of their works that have been scanned by Google.

+ The settlement must result in the creation of a true digital library that grants all researchers and users, commercial and non-commercial, full access that guarantees the ability to innovate on the knowledge it contains.

+ All class members must be treated equitably.

+ The settlement cannot provide for competition by making others engage in future litigation.

+ Congress must retain the exclusive authority granted by the U.S. Constitution to set copyright policy.

+ All rights holders impacted by the settlement must have a meaningful ability to receive notice, understand its terms and opt-out.

+ The parties that negotiated the settlement must live under the terms to which they seek to bind others, rather than their own separately negotiated arrangements.

Access the Complete Blog Post

Source: Open Book Alliance

See Also: Press Review: Judge Chin Sets Nov. 9 Deadline For Revised Google Book Settlement (via ResourceShelf, October 7, 2009)

New Research Paper from Stanford InfoLab: A Dynamic Navigation Guide for Webpages

Thursday, November 5th, 2009

Ed. Note: One thing that we used to do more of on ResourceShelf was to occasionally link to new and hopefully interesting research papers that we came across . Granted, the papers could sometimes get very technical (even for the editors) but those readers who could read the technical content appreciated the material while non-techies could get a good idea about the research by reading the abstract and usually the first several paragraphs of the paper. So, let’s restart this feature again with a new paper the InfoLab at Stanford Univesity.

A Dynamic Navigation Guide for Webpages (4 pages; PDF)
by Jawed Karim and Ioannis Antonellis and Varun Ganapathi and Hector Garcia-Molina
Note: This version of the paper has been submitted for publication

Navigating websites is often a frustrating process: Website visitors, despite their widely varying and individual information-seeking needs, must contend with static, general-purpose link structures that have been set in place by website owners. Because many visitors tend to browse for the same content, they are individually repeating the same navigation activity. Visitors would benefit from being able to take advantage of the collective search and discovery work that has already been performed by other visitors. Although many attempts have been made to improve website navigation by tapping into the “wisdom of the crowds”, the currently available approaches suffer from maintenance, usability, and user interface integration issues. We present a navigation guide for websites that provides visitors with helpful suggestions based on their browsing activity and the browsing activity of prior, similar visitors. Our navigation guide does not require any downloads, can be easily added to websites by website owners, and automatically remains up-to-date.

Sections of the Paper Include:

+ Introduction
+ Current Methods
+ The Wisdom of Crowds
+ A Dynamic Navigation Guide
+ How it Works
+ Related Work
+ Conclusion and Future Work

Source: Stanford InfoLab

Webcast: Preserving OSTI’s Printed Archive

Thursday, November 5th, 2009

Webcast: Preserving OSTI’s Printed Archive
A three minute video from the Office of Scientific and Technical Information at the U.S. Department of Energy.

Here’s the Blurb:

The American public has invested billions of dollars in the atomic energy and subsequent related programs. This investment has mostly been in the form of the printed page. OSTIs historical preservation is described.

Direct to “Printed Archive” Video (via YouTube)

Direct to OSTI YouTube Channel

Direct to OSTI Home Page

While print preservation is essential, OSTI is home to many free online databases including:

+ Science Accelerator
+ Science.gov (Content from Many Government Databases, Search Tecnology from OSTI)
+ WorldWideScience (Global in Scope)
+ Information Bridge: DOE Scientific and Technical Information (Includes over 210K Full Text Documents)
+ DOE Data Explorer
+ Energy Citations Database
+ E-print Network
+ Several Others Linked on the OSTI Home Page

Source: OSTI

UK: Interim Findings from Study Looking at Information-Seeking Behaviour of Generation Y Doctoral Students

Thursday, November 5th, 2009

Interim findings of major study of doctoral researchers indicate urgent need for libraries and universities to develop their understanding of ‘Generation Y’

From the News Release/Summary:

Emerging findings from a major three-year research study into the information-seeking behaviour of doctoral students have highlighted the need for far greater understanding of the generation born between 1982 and 1994 – commonly dubbed Generation Y.

Researchers of Tomorrow was commissioned by the British Library and JISC to establish a benchmark for research behaviour, against which future generations can be measured – and also to provide guidance for librarians and information specialists on how best to meet the research needs of Generation Y scholars.

The longitudinal study will be supported by a number of surveys to establish the wider context of the doctoral research landscape. The first of these surveys has just been completed; it surveyed a representative sample of all doctoral students in the UK and yielded a number of significant interim findings.

+ Information format. Three quarters of Generation Y students – more than those in any other age group – found the information they sought in an e-journal article.

+ Emergent technology. Only a small proportion of respondents (10-30%) in any age group say they use ‘emergent technology’ – such as wikis, virtual research environments, social networking and other Web 2.0 applications – in their research, Of those that do use them, more generally find them useful in their research than not.

+ Help and advice. Fewer Generation Y students than other age groups say they regularly use library staff support to find research resources (11% of Generation Y compared to an average of 17% for other age groups), or take advice from subject specialist librarians (4% compared to 9% average). More Generation Y respondents (46%) than any other age group turn to their fellow students and/or supervisors for support in using emergent technologies.

+ Location of work. Compared to other age groups, more Generation Y researchers work from a dedicated or shared office space (or laboratory or studio) (40%), than work from their own home (39%).

[Snip]

Joanna Newman, the British Library’s Head of Higher Education, said, “The interim findings of the Researchers of Tomorrow study provide a fascinating snapshot of current research behaviour of doctoral students. [Our emphasis] It’s perhaps surprising that so few researchers in the 21-27 range really use the wide range of Web 2.0 applications for research or collaborative working. And when it comes to emergent technology, they’re more likely to seek the advice of their peers or supervisors than librarians or information specialists – a finding that could suggest a need for professionals to rethink how best to deliver advice and support in this area.”

More in the News Release/Summary

See Also: Researchers of Tomorrow Home Page

Source: The British Library

New Keys for the Diffusion of Information in Social Networks

Wednesday, November 4th, 2009

A brief but very interesting read!

From the Summary:

Information in social networks moves at an unexpectedly slow pace, with the exception of some mass events. This is one of the main conclusions of a study carried out by researchers at Carlos III University of Madrid (UC3M) and IBM, who analyze how the behaviour of internauts influences the spread of information through Internet.

The spread of information in social networks, something of crucial importance in awareness and marketing campaigns or the spreading of rumours and viruses, for example, is largely determined by the great heterogeneity of internauts in their response time, according to the researchers. Traditional models estimated that internauts respond in approximately one day and that, as such, it took one day for information to be transmitted. However, this study, based on an actual experiment by IBM to observe and quantify the spread of business information in social networks, points out that it occurs at two speeds due to user activity. “Those who respond very quickly to e-mails, technology addicts who are always connected, are the ones responsible for spreading certain rumours or campaigns quickly via Internet,” notes Esteban Moro, professor of Mathematics at the UC3M.

Source: Carlos III University of Madrid

Internet Archive Founder Brewster Kahle Profiled in Forbes

Wednesday, November 4th, 2009

Brewster Kahle has many titles. These days he’s best known as founder of the Internet Archive (home of The Wayback Machine) and founding member of the Open Content Alliance.

From the Article:

“We have to have universal access to everything, just like a library,” he says. “Do we want that under a single corporation’s control? It is openness, not corporate control, that propels capitalism.”

[Snip]

Digital libraries will shape education, creativity and our shared intellectual heritage, Kahle declares. As founder and director of the Internet Archive, Kahle has posted online digital copies of 1.7 million books, 100,000 hours of television, 200,000 video clips, 70,000 concerts and 415,000 audio recordings. All that material can be downloaded for free from the Archive’s Web site.

[Snip]

Bookserver* uses a range of open source and proprietary electronic book standards, search algorithms, editing tools and libraries. The architecture, as Kahle calls it, potentially separates manufacturers of devices from control over much of the content inside them. It also preserves the idea of the lending library–if you “check out” a volume, others cannot access it in the time allowed to you. Publishers sell their books in the system using credit cards.

The article continues with more about Google Book Search and Kahle’s background.

We were surprised not to see The Wayback Machine mentioned in the stats about the Internet Archive listed above. At the moment (and we know of nothing coming), “Wayback” is probably the best chance a researcher has to access a page no longer on the Internet. Material in “Wayback” dates back to 1996 and as of today, contains more than 150 BILLION archived pages. The Internet Archive also offers a fee-based service that helps organizations organize and archive their web content. It’s called, Archive-It.

* See Also: We Have an In-Depth Post About Bookserver on ResourceShelf
It Includes an comprehensive press review the day after the Bookserver announcement.

Source: Forbes

Bibliotheca Alexandrina: A Digital Revival

Tuesday, November 3rd, 2009

The Bibliotheca Alexandrina is one busy place. If you want to learn more read on through our highlights but make sure to read the complete article. Our highlights is just a sample of what’s going on.

From the Article:

The International School of Information Science (ISIS) a research institute affiliated with the BA [Bibliotheca Alexandrina], aims at furthering the BA’s goals of being a leading institution in knowledge dissemination and, specifically, promoting research and development related to the digital libraries. Toward that goal, ISIS has embarked on an array of ambitious projects, in partnership with world-class institutions. These include hosting a mirror site for the Internet Archive, participating in the Million Book Project, organizing the digital archive of the Gamal Abdel Nasser collection, digitizing 113 years of Al-Hilal magazine, presenting the first-ever complete digital version of Description de l’Egypte, conducting advanced research such as the Arabic component of the UN-sponsored Universal Networking Language computerized multi-language translation program, and offering the most advanced 3D virtual imaging techniques in a virtual immersive environment for science and technology applications. Thus, despite being barely seven years in existence, the BA already has a substantial record of achievements.

Among the other projects you’ll read about are:

+ The Digital Assets Repository (DAR)

+ Memory of Modern Egypt Digital Repository

+ Archive documenting the history of the Suez Canal

+ SuperCourse

To empower science educators worldwide, the BA is working with a team of specialists, in partnership with the University of Pittsburgh, to launch the first science SuperCourse, comprising thousands of PowerPoint lectures made available for free to teachers and lecturers, who can use the lectures as they see fit in their teaching of science. The SuperCourse has been effectively implemented in the area of Public Health and Epidemiology, with a network of 65,000 scientists in 174 countries, providing more than 3,500 lectures in 31 languages. The BA maintains a mirror site of SuperCourse, which receives an average of one million hits per month, and is working on setting up a similar course in all fields of science.

Much More in the Complete Article

Source: EDUCAUSE Review
Hat Tip: OAN

More Digitization Underway: This Time Footnote.com is Digitizing the U.S. Census from 1790-1930

Tuesday, November 3rd, 2009

Footnote.com is once again partnering the National Records and Administration Agency (NARA) to digitize massive amounts of content and then make that material available, often for a fee, available online. Footnote is becoming–and for some has already become–and important resource for historians, genealogists, students, and others.

This time around, Footnote.com, is digitizing all publicly available Census materials from 1790-1930. These dates represent the period when all materials (including names) from a given census have been made publicly available. Through its partnership with NARA, Footnote.com will add more than 9.5 million pages of content when the census database project is complete. We’ve learned that Footnote.com is digitizing all of this material on their own.

From a Footnote.com Blog Post:

With over 60 million historical records already online, Footnote.com will use the U.S. Census records to tie content together, creating a pathway to discover additional records that previously have been difficult to find.

The Interactive Census Project Home Page offers much more detail and examples. You can also create email alerts when new states are added to the census database. On the lower-left side of the page you can track the progress of each census has been digitized. As you’ll see, the 1860 census is complete and the 1930 census is just about done.

Searching is free, Footnote provides numerous options to refine your search (here’s an example). Accessing the complete record is fee-based either subscribing to the database for a annually or monthly. You can also by individual documents for $2.95. Btw, Footnote.com also sells institutional access to libraries through EBSCO.

Footnote looks at the census project as a “highway” to assist the researcher in finding more information in other databases.

If you’ve been reading ResourceShelf for a while you’ve seen an increasing number of mention their services. Here’s a list of a few of them,

+ In August of 2009. we posted on the release of a joint project with the National Archives (NARA) to digitize holocaust material.

+ In December of 2008, in a partnership with NARA, Footnote released the largest interactive World War II collection online.

+ In March, 2008 we posted about Footnote.com offering an interactive version of the Vietnam Wall.

Our first post about Footnote dates back to January, 2007.

If you run this search using the ResourceShelf database, you’ll be able to see and read all of our Footnote.com posts.

But wait, there’s more. A quick review of the Footnote “press room” offers up even more projects. You can learn about them here.

Life Scientists’ Information Use & One Size Does Not Fit All

Monday, November 2nd, 2009

From the Announcement and Summary

A report by the British Library and the Research Information Network (RIN) launched today provides unique insight into how information is used by researchers across life sciences.

[Snip]

The report, Patterns of Information Use and Exchange: Case Studies of Researchers in Life Sciences was developed using an innovative approach to capture the day-to-day patterns of information use in seven research teams from a wide range of disciplines, from botany to clinical neuroscience. The study undertaken over 11 months and involving 56 participants found that there is a significant gap between how researchers behave and the policies and strategies of funders and service providers. This suggests that the attempts to implement such strategies have had only a limited impact. Key findings from the report include:

+ Researchers use informal and trusted sources of advice from colleagues, rather than institutional service teams, to help identify information sources and resources

+ The use of social networking tools for scientific research purposes is far more limited than expected

+ Data and information sharing activities are mainly driven by needs and benefits perceived as most important by life scientists rather than ‘top-down’ policies and strategies

+ There are marked differences in the patterns of information use and exchange between research groups active in different areas of the life sciences, reinforcing the need to avoid standardised policy approaches

Access the Full Text Report (56 pages; PDF) ||| Access the Report Annex (227 pages; PDF)

Source: British Library / Research Information Network

Survey of Higher Education Faculty: Use of Print & Electronic Library Collections of Scholarly Journals

Monday, November 2nd, 2009

This new report was released today by Primary Research. The full text is fee-based but a few highlights from the report are available online.

The report is based on a representative survey of more than 550 higher education faculty in the United States and Canada. Faculty present their opinions on preferences for print or paper journal formats, degree of problems with archival access, use of url-catalog links to journals, extent to which their college library journal collection satisfies their scholarly needs, and frequency of database access and library visits. Data is broken out by 12 criteria including age, academic field or specialty, type of college, size of college, frequency of library use, and many other factors.

Here are Just a Few Findings From the Report:

+ Canadian faculty were more likely than American faculty to think of the paper copies as a waste of time – nearly 45% thought so.

+ In general, age was highly inversely correlated with the tendency to think of paper copies as wasteful and redundant when online versions were available.

+ Only 13.86% of faculty at research universities prefer paper to online journal formats.

+ Only a third of community college faculty express support for increased spending on academic journals while about 64.3% of faculty in MA/Ph.D. granting colleges expressed such support.

See Also: The Survey of Academic & Research Library Journal Purchasing Practices

Here’s another related fee-based report from a related Primary Research report.

Some Findings:

+ The libraries in the sample acquired a mean of more than 46% of their journal subscriptions in bundles of more then 50 titles.

+ The libraries in sample canceled a mean of 53 journal titles in the past year.

+ Mean spending on print edition only subscriptions was $130,721, less than a sixth of total spending.

+ About a quarter of the libraries in the sample believe that open access has already slowed the increase in journal prices.

+ 15.56% of the libraries in the sample have paid a publication fee on behalf of an author from their institution.

+ For 42.22% of the libraries in the sample, all new subscriptions to journals include electronic access.

+ More than 64% of the libraries in the sample keep track of their various journal subscriptions through use of a commercial software product.

+ In general, subscription agents seem to enjoy a relatively high level of customer satisfaction. On the issue of timeliness of service, none of the libraries in the sample said that they were highly dissatisfied with their subscription agent and only 2.22% said that they were dissatisfied.

+ Non-academic research libraries have done more than their academic counterparts to make sure that contracts renew at the same time. Smaller institutions, those with journal budgets of less than $100,000 per year, were less likely to make such efforts than libraries with higher budgets.

Source: Primary Research

Semantic Search: The European Research Project Named MESH (Part 1)

Monday, November 2nd, 2009

From the Report:

But while there is a phenomenal amount of content, most of it is not that easy to find. Sure, text content can be skimmed or glanced, but audiovisual content has to be viewed in linear time. We cannot easily search inside a film or audio recording for relevant information.

That is changing, and one European project has created the first integrated platform for semantic search that can return results based on the content and context of film and audio files, as well as text.

But European researchers in the MESH project have developed an integrated platform which they say, for the first time, can combine semantic search – or search by the meaning of the words – and a host of associated tools to deliver more relevant information, from a wide variety of sources that can be accessed from an individual user.

[Snip]

These technologies are becoming common in particular knowledge domains, and more are emerging every day, but most relate to the concepts behind text-based documents. The MESH platform sought to use semantic search for every type of media.

On the way, it created some cutting-edge technology. “Our automatic annotation for video, for example, is state of the art,” explains Pedro Concejero, coordinator of the MESH project.

“The annotation system is capable of identifying the general scene setting, such as whether a video is a studio shot or a shot recorded on location. With adequate training, it can also detect (within some error margins) the general topic of the video, such as a scene about an earthquake or a flood. It can also find a number of salient objects within the scene, such as persons or fire, but cannot yet identify consistently objects with great variations in shape or aspect.”

One of the major challenges of the project was a product of its own success: It annotated too much information!

Much More fiin the Complete Report.

We will post Part II as soon as it become available.

See Also: MESH Project Home Page
Several videos are available.

Source: ICT Results

The Complete Archive of National Geographic Magazine on Six DVD’s

Sunday, November 1st, 2009

Every now and then a fee-based product comes around that we believe deserves your attention. The following is one of them.

Chris Pendleton on the Bing Blog reminds us that a major digitization project, every issue ever published of National Geographic from 1887-2008, is now available (it was officially released yesterday according to this media announcement) on 6 DVD’s or an external hard drive. That’s right, all of the writing, the legendary imagery, the supplement , even the advertisements are included. For many topics, Nat Geo magazine is a resource that documents people, places, and events, on a global scale. In other words, for all of the reasons just mentioned and many others, makes the magazine an important part of the historical record.

By the way, the reason it was mentioned on the Bing Blog is because Bing is providing some the technology that powers the digitized version this recently released collection.

From the Blog Post

Nat Geo uses Bing Maps in their Geobrowse functionality which allows you to browse a map anywhere in the world to find locations where relevant articles are referenced using geographic metadata.

Yes, we still love paper and those massive collections of past issues of the print version of National Geographic Magazine many people own (where are yours)? They’re also important.

That said, we also hear and read that for today’s student, it’s all about digital access. Yes, of course, that’s rather sad. However, a digitized archive of this size and scope can truly demonstrate the power of digital info technology for people of all ages and turn 120 years of content into important research and learning resources.

Another digitized archive of the magazine was released seven years abut this 120 year collection is the most complete version ever published with more content, more search options, saving/sharing tools, interactive maps, and more. One thing we noticed right of the bat is the that the new version is available for both PC and Mac. The “112 year version” was PC only.

Here are a few fast facts about the new collection. They were gleaned from Nat Geo site (including the video overview) and news release.

+ All Issues from October, 1888-December, 2008 are included

+ Six DVD’s include more than 200,000 pages; 300 wall map supplements, more than 8,400 articles; more than 250,000 photographs

+ All images scanned in high-resolution

+ Flip one page at a time, zoom, print

+ Geobrowse

A new Geobrowse function powered by Bing Maps that allows users with Internet access to search nearly 5,000 locations on a globe that are featured in the magazine’s archive of articles and maps.

+ Search by keyword, date, contributor, and topic; refine by date or content type

+ Browse by month or year

+ Create personalized reading lists; share these lists with other users in the Nat Geo community

+ Pre-loaded “favorite article lists” compiled by experts

National Geographic is selling the DVD’s for $69.95/US and the hard drive version for $199.95/US.

The lowest price we found as of Sunday November 1st was $42.78 from an Amazon.com Merchant. The DVD’s directly from Amazon.com are $44.99/US.

We’ve ordered a copy of the DVD’s and after spending some time with them we will report back.

Cornell University Library Publishes New Digitization Manual

Friday, October 30th, 2009

Our friends at TeleRead.org let us know about a new digitization manual from Cornell University Library.

From the Announcement:

“Copyright and Cultural Institutions: Guidelines for Digitization for U.S. Libraries, Archives, and Museums,” a new book published today by Cornell University Library, can help professionals at these institutions answer that question.

Based on a well-received Australian manual written by Emily Hudson and Andrew T. Kenyon of the University of Melbourne, the book has been developed by Cornell University Library’s senior policy advisor Peter B. Hirtle, along with Hudson and Kenyon, to conform to American law and practice.

The development of new digital technologies has led to fundamental changes in the ways that cultural institutions fulfill their public missions of access, preservation, research, and education. Many institutions are developing publicly accessible Web sites that allow users to visit online exhibitions, search collection databases, access images of collection items, and in some cases create their own digital content. Digitization, however, also raises the possibility of copyright infringement. It is imperative that staff in libraries, archives, and museums understand fundamental copyright principles and how institutional procedures can be affected by the law.

“Copyright and Cultural Institutions” was written to assist understanding and compliance with copyright law. It addresses the basics of copyright law and the exclusive rights of the copyright owner, the major exemptions used by cultural heritage institutions, and stresses the importance of “risk assessment” when conducting any digitization project. Case studies on digitizing oral histories and student work are also included.

The rest of the news release provides background about each of the Peter Hirtle and Anne R. Kenney, the authors of the manual.

Access
The manual is available for purchase $39.95 from CreateSpace.

You can also download the entire book for free by visiting the Social Science Research Network and the eCommons@Cornell.

Source: Cornell University Libraries
Hat Tip: TeleRead

The Library of Congress Unveils API for Chronicling America Digitized Newspaper Database and Directory

Friday, October 30th, 2009

What follows is a post that might be of special interest to web developers, webmasters, site owners, or anyone who can work with an API (Application Programming Interface), It comes from a digitized collection of more than 1 million historic newspapers and a searchable directory of newspaper info. Even if you are don’t have the technical skills required, it’s possible you know someone who does and with their help you can partner to develop new resources, create mashups, etc. Btw, if you know of people who are able to work with an API, feel free to share this post with them.

First, some background.

We’ve posted about the CA program since the day it launched in March, 2007. The project is a joint effort between the Library of Congress and the National Endowment for the Humanities to digitize historic American newspapers. In addition to the digitized newspaper database CA also provides Chronicling America directory. It’s both searchable with a powerful interface (a great example of what good metadata can do) and browsable. The directory contains information about most American newspapers published from 1690 to today.

On June 16, 2009, we ran a story about CA reaching a milestone. CA had just hit the one million digitized pages mark. It has grown a lot since then. About five weeks ago we posted an item about CA adding more than 192,000 pages to CA. The media release said the size of the database at that time contained 1,442,000 digitized pages from 171 titles, that were published between 1880 and 1922.

Thanks for the info but what about the API (Application Programming Interface) ?

The following from the “About the Chronicling America API” web page:

Chronicling America provides access to information about historic newspapers and select digitized newspaper pages. To encourage a wide range of potential uses, we designed several different views of the data we provide, all of which are publicly visible. Each uses common Web protocols, and access is not restricted in any way. You do not need to apply for a special key to use them. Together they make up an extensive application programming interface (API) which you can use to explore all of our data in many ways.

The rest of the web page offers technical details about the API.

Programmable Web has also posted about the new API.

Here are a couple of highlights:

Search results are available on the web site appear with terms highlighted. The API does not have access to highlight information, but it does contain thumbnails. Each page has a permalink back to the Library of Congress site, which displays the page in a zoomable, draggable viewer similar to Google Map.

The Library of Congress is focused on making these public domain works widely available. As such, this is an API without any registration or key necessary. That’s pretty wide open.

Among the interesting technical details is that the API can return linked data via RDF. It’s good to see reference sites, especially government ones, support semantic web formats (there are now 20 APIs in our directory with RDF support.)

Sources: Library of Congress, Programmable Web
Hat Tip: Dan C.