Archive for April, 2007

Speech: My First Two Years: Access Issues at the National Archives by Allen Weinstein

Thursday, April 26th, 2007

Transcript: My First Two Years: Access Issues at the National Archives
The full text of a recent speech at the National Press Club by Allen Weinstein, Archivist of the United States.

From the speech:

Turning to NARA: I have now served as Archivist of the United States for two years and bring you a brief interim report on this process as it applies to today’s topic. First, as always, follow the funding. Despite the generosity of the President’s 2008 budget for NARA, we need additional resources if we are to fulfill our mission. But even with available funds, there has been visible progress on a number of our strategic goals in the past two years, goals linked to NAR’s newly adopted 10-year Strategic Plan:

+ We are making steady progress towards an Electronic Records Archives (ERA) that will ensure preservation of, and access to, today’s electronic records far into the future.

+ We are working closely with the intelligence community and other key agencies to ensure that we can build a National Declassification Initiative to transform the way documents are reviewed and released.

+ We are working with the private sector in partnerships to digitize key collections, to ensure the widest possible access for the American public, and to build a Learning Center at the National Archives in Washington that will parallel the wonderful learning labs that we have across the country, among a number of educational programs we have, using documents and designed to strengthen civic literacy.

+ We have started an effort to replace the existing inadequate Military Personnel Records Center in St. Louis with a facility that will provide critical improvements to the environmental storage conditions for the two million cubic feet of records we store for the military. The new facility will enable us to ensure the preservation of essential military personnel files so they will be there when they are needed by our country’s veterans to guarantee their rights and entitlements.

+ We are completing the planning to bring the privately held Nixon Library and Museum in Yorba Linda, California, into the family of Federal Presidential libraries.

+ We are working with the White House to organize a smooth transfer of the textual and digital records of the current administration to the National Archives and to plan for the George W. Bush Presidential Library.

Source: NARA

See Also: Coming Next Week at NARA-Adelphi:
href=”http://www.archives.gov/preservation/conferences/2007/”>21st Annual Preservation Conference Managing the Intangible: Creating, Storing and Retrieving Digital Surrogates of Historical Materials

Why Libraries Matter More Than Ever

Thursday, April 26th, 2007

Why Libraries Matter More Than Ever

Libraries are perhaps our most enduring public institutions–priceless repositories of history, language, and culture. The dawn of the “information superhighway” threatened to make them less relevant, even obsolete. Yet now, these institutions are as prominent as ever, with a wave of innovation as the next generation of libraries extend their mission well beyond the storage of knowledge.

Source: Project for Public Spaces Newsletter (April 2007)

See Also: New Report: Who Needs Libraries Anyway?
Source: CURL / Research Information Network (UK)

Meet More Digitization Pioneers: The LOCKSS Team, Vicky Reich & David Rosenthal

Thursday, April 26th, 2007

Meet More Digitization Pioneers: The LOCKSS Team, Vicky Reich and David Rosenthal
History is so important but often overlooked when it comes to online info retrieval and digitization. That’s why a big kudos goes to the National Digital Information Infrastructure and Preservation Program at the Library of Congress with their new series honoring digitization pioneers. The first profile offered a profile of Myron Gutman from ICPSR. We blogged about it a few weeks ago.

Now, a new profile is online that offers a brief history and profile of Vicky Reich & David Rosenthal, the co-founders of LOCKSS.

From the profile:

It has been almost 10 years since Vicky Reich and David Rosenthal co-founded LOCKSS (which stands for “Lots of Copies Keep Stuff Safe”) with the purpose of enabling libraries to preserve their own digital collections. Since then LOCKSS has continually evolved to meet the needs of librarians and has been widely adopted as an economical, easy-to-use digital archiving solution. And it has evolved in a few directions that Vicky and David did not foresee.

In the late 1990s, Vicky Reich – who by then had years of library administrative experience – was assistant director of Stanford’s HighWire Press. She noticed that an increasing number of important publications were showing up solely in digital format, not in print, and she became concerned that librarians might lose custody of those copies, especially since many librarians tended to know very little about preserving digital assets. David Rosenthal, a distinguished Silicon Valley engineer with a long history at Sun Microsystems, became involved in LOCKSS from the technical side, focusing on preservation of the bits and bytes.

Source: NDIIPP

Briefs: The myth of short attention spans on the Web; SHERPA Receives SPARC Europe Award for Outstanding Achievements in Scholarly Communications, 2007; Two Upcoming IR Workshops

Thursday, April 26th, 2007

+ SHERPA Receives SPARC Europe Award for Outstanding Achievements in Scholarly Communications, 2007

The SHERPA partnership is the recipient of the Second SPARC Europe Award for Outstanding Achievements in Scholarly Communications. The decision to present the Award to SHERPA was made in recognition of their advocacy for the adoption of institutional repositories and their development of a suite of tools in support of Open Access, including OpenDOAR, JULIET, and RoMEO.

+ The myth of short attention spans on the Web (via OCLC and Poynter)
More info here (PDF).

Two Upcoming Workshops
++ Call for Papers: Web Information-Seeking and Interaction (WISI)
++ Call for Papers: Workshop on Searching Spontaneous Conversational Speech

Matt Raymond Launches The Library of Congress Blog

Thursday, April 26th, 2007

We post plenty of info about events, personnel news, new/updated resources, and other goings on at The Library of Congress. However, it’s often just the tip of the iceberg. Now, according to this National Journal article, Matt Raymond from the Library of Congress has started the Library of Congress Blog. A new source for LC news. Congrats, Matt! The RS team and I’m sure many RS readers will be checking the weblog or RSS feed or both on a very regular basis.

Sources: Library of Congress Blog, National Journal
Thanks to Pete Weiss for the news tip.

See Also: Some Recent Library of Congress Related Posts
+ Winston Tabb receives 2007 Lippincott Award

+ The March 2007 Issue of the Global Legal Monitor is Now Online

+ URI Resource Pages from IFLA, CDNL, and LC

+ The Library of Congress and PBS Partner to Collect/Preserve Oral Histories of U.S. Wartime Veterans

+ New and Updated Research Guides from the LoC Science Reference Service

+ Testimony: Librarian of Congress James H. Billington Testifies on the Library’s Digital Evolution

Google Domain Name Housekeeping with Endoxon

Thursday, April 26th, 2007

Nothing major here but since many people like to track domain names registered to Google a few updates.

Last December Google announced they acquired a portion of Endoxon, a software company based in Switzerland developing online mapping software. Over the past few day, several domains that appear to have been registered to Endoxon were transferred to Google Inc. Here’s the list:
(more…)

Business Research: Hoover’s Launches the Hoover’s Index of Leading Public and Private Companies, Non-Profits, and Associations

Thursday, April 26th, 2007

From the announcement:

[The index is] a free, proprietary monthly index of the leading public and private companies, non-profits, and associations which represent the brand leaders, up-and-comers and “buzz” creators driving the U.S. and international economies. The Hoover’s Index, which reveals monthly spikes in company search activity, represents a relative ranking of companies generating interest and exposure independent of pure fiscal performance measured by most business ranking indexes.

The Hoover’s Index is based on a proprietary algorithm that takes into account the search trends of business professionals, including both organic and internal searches on Hoover’s site, as well as business-related searches conducted via major search engines. Movement above and below index will determine ranking, instead of gross search volume. A company’s ranking is derived from the changes in search volume, indexed against the previous 52-week average for that company.

1000 companies are listed in the Hoover’s Index. If you would like to review the complete list, you need to complete a registration form.

Direct to Hoover’s Index Web Site

Top 5 (as of today)
1. Clearwire Corporation
2. The Blackstone Group L.P.
3. Vans, Inc.
4. TXU Corp.
5. WebEx Communications, Inc.

See Also: Hoover’s Archive of CEO and Analyst Video Interviews

Three Ask.com Tidbits & News From Snap, Google, Microsoft, Yahoo, and Others

Thursday, April 26th, 2007

Here at ResourceShelf we do our best to cover the major search players as well as smaller web search, specialty search (verticals), and many other information retrieval companies. Although Gary is the Director of Online Info Resources at Ask.com we post interesting tidbits and news items about Ask.com as we also do about other companies. In most cases, when Gary posts about Ask.com we place this content “below the fold” for those who would like to easily skip it. By “below the fold” we mean it requires an extra click to get to the full text from the RS home page.

(more…)

Free Online Education: NAIC Launches Insure U for Small Business

Thursday, April 26th, 2007

NAIC Launches Insure U for Small Business

The National Association of Insurance Commissioners (NAIC) today launched a comprehensive public education program to assist small businesses with information about business risks and insurance options. Under the banner of Insure U for Small Business, the campaign includes an online education site, public service announcements in English and Spanish, and community outreach by public information officers of state insurance departments.

The Insure U for Small Business curriculum, available at http://www.insureuonline.org/smallbusiness/, includes the following six categories of vital information to small businesses: workers’ compensation, group health and disability, business property and liability, commercial auto, group life and key person life, and home-based business insurance. After reviewing the curriculum’s helpful explanations, tips and considerations, small business owners and managers can test their knowledge about insurance issues by taking an online quiz. Upon successful completion, they can download an Insure U for Small Business diploma.

See Also: Insuring Your Business: a Small Business Owners ‘ Guide to Insurance (via III)
A New Resource

Source: National Association of Insurance Commissioners (NAIC)

New Resources Added to FREE Directory of Educational Web Sites and Tools

Thursday, April 26th, 2007

New Resources for Educators via FREE
FREE (Federal Resources for Educational Excellence) aggregates education resources from numerous government agencies and compiled/maintained by members of the Office of the Chief Information Officer, U.S. Department of Education.

Recently Added:

+ Practical Uses of Math and Science offers over 60 one-page examples of how science and math can be used in interesting settings and everyday life. Topics include clouds (why they float), social security benefits (algebra), Pythagoras theorem (cabinet corners), ice sheets … (National Aeronautics and Space Administration)

+ Microscope Imaging Station presents videos and narrative that help reveal: What is blood? What happens when the immune system goes awry? How is our heart like a zebrafish’s? What are stem cells? How does the sea urchin help us understand genes, reproduction … (Multiple Agencies)

+ Gift of the Indus: The Arts and Culture of Pakistan introduces teenagers to the music, theater, dance, and visual arts of Pakistan. See video clips of calligraphers, dancers, and other artists. Hear sitars, tablas, and “qawwali” — the mystical songs of Sufi poets. Use the arts to … (Department of Education)

Source: US Dept. of Education

New Online: Site Devised to Promote Environmental Awareness from National Geographic Society

Thursday, April 26th, 2007

Green.nationalgeographic.com

Green.nationalgeographic.com is an online multimedia experience offering consumers the opportunity to explore the environment by engaging in a variety of interactive activities. Green.nationalgeographic.com encourages visitors to watch videos, download podcasts and photos, read daily news, take quizzes, participate in polls, learn about energy-saving products, download “how to” videos and participate in activities geared to inspiring people to make eco-friendly choices in their everyday lives…”We’ve created Green.nationalgeographic.com to be a multimedia tool kit for green living. It’s a powerful communication tool that is interactive, engaging and fun for all ages,” said Betsy Scolnik, president, National Geographic Digital Media. “Our goal is to encourage individuals to make a difference by giving them practical advice that empowers them to become environmental stewards.”

Additonal background here.

Source: NGS

Business Research: More Country Commercial Guides Hit the Web

Thursday, April 26th, 2007

More 2007 Country Commercial Guides from the U.S. Commercial Service are now available.
All documents are PDF files.

+ The Bahamas
49 pages.

+ Kenya
80 pages.

+ Haiti
67 pages.

+ Algeria
89 pages.

+ Nepal
59 pages.

+ Laos
49 pages.

See Also: More Guides in These Recent Posts: 1 ||| 2 ||| 3

State of Fertility Report 2007…and other full-text reports on DocuTicker

Thursday, April 26th, 2007

Posted 25 April 2007 on DocuTicker:
+ State of Fertility Report 2007 (IntegraMed America, Inc.)
+ The Scarborough Newspaper Audience Ratings Report 2007 (Scarborough Research)
+ Reluctant Gatekeepers: Corporate Ethics on a Filtered Internet (Global Information Technology Report)

How to make decisions regarding database subscriptions

Wednesday, April 25th, 2007

How to make decisions regarding database subscriptions
by Laurie A. Prange (2006)
2 pages; PDF.

A two page summary of the questions you should ask when evaluating whether or not to subscribe to a library database.

Source: Laurie the Librarian via e-LIS

What’s New in PANDORA, Australia’s Web Archive

Wednesday, April 25th, 2007

What’s New in PANDORA, Australia’s Web Archive
Every two months, the PANDORA team at the National Library of Australia and partner organizations prepare a list of titles (web pages and sites) added to the web archive. You’ll find lists back to March – April 2004. The most current list, January – February 2007 is available here.

See Also: PANDORA Stats and PANDORA Fact Sheet (Word Doc)

Article: Web Archiving at the National Library of Australia. PANDORA: Australia’s Web Archive
by Paul Koerbin.

See Also: Web Archiving Initiatives Around the World (via PADI)

See Also: International Internet Preservation Consortium

See Also: Web Archiving Software

New Demos from Next Generation Search Group at University of Helsinki

Wednesday, April 25th, 2007

New Demos from The Next Generation Search Group at University of Helsinki
From the NGIR web site:

The Next Generation Information Retrieval group looks at search and information retrieval in a world impacted by Linux and Google where open source and open standards are becoming a dominant paradigm for internet services, and information retrieval is viewed as a key function in productive internet use. The group uses probabilistic and information-theoretic methods to model information retrieval, and is committed to open source software development. The group also believes distributed, semantic-based and multilingual methods will have a central role in the future of information retrieval.

Alvis — Superpeer Semantic Search Engine
What is ALVIS?

The project will conduct research in the design, use and interoperability of topic-specific search engines with the goal of developing an open source prototype of a distributed, semantic-based search engine. Existing search engines provide poor foundation for semantic web operations…Alvis is funded by EU’s Sixth Framework Programme for Research and Technological Development.

Demos
+ Alvis News Search Engine

A collection of news articles collected from a predefined set of Internet Search related news sites at regular intervals. The database covers the latest articles as well as archives from different popular Web news and blogs resources.

+ Alvis Wikipedia Search
Background about Wikipedia search engine.

+ SMART
Topic specific search for the recent EU project Statistical Multilingual Analysis for Retrieval and Translation.
Overview of SMART search available here. Uses ALVIS technology.

The crawl uses the ALVIS focussed crawler that is guided by keywords. The key phrases relevant to the crawl are one of the following:

relevance to statistical machine translation with key phrases: cross-?lingual information access, smt systems?, statistical machine translation, textual information access, statistical translation models?, cross-?lingual information retrieval, information extraction,

or both of :

relevance to machine learning with key phrases: machine learning, statistical learning, kernel methods?, string kernels?, rational kernels?, online learning, support vector machines?, SVM, principal component analysis, independent component analysis, PCA, ICA, discriminative language models?, canonical correlation analysis, margin-?based translation models?, statistical language, latent dirichlet, automatic processing,

+ relevance to machine translation with key phrases: machine translation, information retrieval, language models?, translation models?, computational linguistics, lexicon extraction, comprehension aids?, multilingual lexicon, user trials, user evaluation, parallel corpora, language modelling, computer aided translation, comprehension aids, multilingual lexicons?, multilingual corpora, cross-?language information retrieval, natural language processing, multilingual lexicon extraction, human language technology, machine translation technology, machine translation systems?, cross-?lingual information retrieval, linguistic resources.

A list of seed sites for crawl is also available here.

Source: ALVIS Consortium, Next Generation Information Search at University of Helsinki

Briefs: Digg.com reveals news stories fade after 1 hour; Yahoo, Gracenote launch lyrics service; SearchTools Reports on IBM OmniFind Enterprise Search, Yahoo! Edition, BT Selects Autonomy Technology

Wednesday, April 25th, 2007

+ Digg.com reveals news stories fade after 1 hour (via New Scientist)

Online news articles can lose their appeal in as little as an hour. That is the message from two statistical physicists who analysed the way people access information on the user-driven news site Digg.com.

Direct to Full Text of Paper: Novelty and Collective Attention
From the abstract:

The subject of collective attention is central to an information age where millions of people are inundated with daily messages. It is thus of interest to understand how attention to novel items propagates and eventually fades among large populations. We have analyzed the dynamics of collective attention among one million users of an interactive website –digg.com- devoted to thousands of novel news stories. The observations can be described by a dynamical model characterized by a single novelty factor. Our measurements indicate that novelty within groups decays with a stretched-exponential law, suggesting the existence of a natural time scale over which attention fades.

+ Yahoo, Gracenote launch lyrics service (via Reuters)

“Finally, a free, legal and definitive way to settle a bet with the guy sitting next to you at the bar who is certain the Ramones’ most famous anthem declares, ‘I wanna piece of bacon,’” said Ian Rogers, general manager of Yahoo Music, which will offer lyrics to hundreds of thousands of songs.

400,000 songs according to SEL. You can access song lyrics by going to a artist/group page and selecting lyrics or by going to http://music.yahoo.com/lyrics where lyrics can also be searched. Examples:
+ Coldplay
+ No Doubt
+ James Taylor
+ Rolling Stones
Two notes:
1) Not every song by every artist is available.
2) Lyrics appear if an artist/group performed the song. You can find the name(s) of the songwriter(s) in a box directly above the lyrics. For example, this list of Stevie Wonder lyrics includes Blowin’ in the Wind written by Bob Dylan.
See Also: More Info from Search Engine Land

+ BT Selects Autonomy for Search
The news release says Autonomy will power, “a number of search applications.” Specifics were not described in the news release.

+ Beware of data seepage on Google Calendar (via ZDNet)

+ SearchTools.com Product Report: IBM OmniFind Enterprise Search, Yahoo! Edition

+ New download: Libraries and Google Book Search (via Google Librarian Central)

Wikipedia Users: A New Report from the Pew Internet and American Life Project

Wednesday, April 25th, 2007

Wikipedia users: A New Report from the Pew Internet and American Life Project
8 pages; PDF.

More than a third of American adult internet users (36%) consult the Citizen-generated online encyclopedia Wikipedia, according to a new nationwide survey by the Pew Internet & American Life Project. And on a typical day in the winter of 2007, 8% of online Americans consulted Wikipedia.

A Few Quick Reactions to the Report.
(more…)

Roy Tennant Takes Job With OCLC’s RLG Group

Wednesday, April 25th, 2007

The official news release announcing Roy Tennant’s new position with OCLC’s RLG Group in Mountain View can be accessed here.

Roy has been a friend, teacher, inspiration, and motivator for many years and for many people including the ResourceShelf team. We’re also honored to call him a ResourceShelf reader. We wish Roy the very best in his new job. He will be joining others at RLG including Walt Crawford, Nancy Elkington, James Michalko, Merrilee Profitt, et al. Quite a team. Score one for OCLC/RLG.

See Also: Roy Shares Some Thoughts on His Blog

See Also: RLG/OCLC Transition Info and Resources