Thursday, October 11, 2007

Validation Records

From the LC Cataloging Newsline Vol. 15, no. 3 (October 2007)
The Cataloging Policy and Support Office (CPSO) has begun creating and distributing subject authority records called "validation records" that represent valid 6XX headings plus subdivision strings (topical, chronological, geographic, and form), including strings with free-floating subdivisions for which subject authority records were not previously made. Validation records are being created to improve the "validation" capability of many integrated library management systems used by the Library of Congress and others by providing an authorized form of subject heading strings for machine matching.

The validation records are identified by the presence of the 667 field which reads: "Record generated for validation purposes." All validation records will appear in LC's online catalogs but will not be printed in the annual edition of Library of Congress Subject Headings nor will they appear as proposed headings on the LC Subject Headings Weekly List. As of Sept. 25, 2007, 1,900 validation records have been distributed. Some examples are:
  • sh2007005269 Abdominal wall$xAbnormalities (May Subd Geog)
  • sh2007100421 United States$xEconomic policy$vPeriodicals
  • sh2007100247 Great Britain$xRelations$zUnited States
  • sh2007100224 Indians of North America$vSongs and music
CPSO is creating the validation records by using a combination of one-by-one record creation as subject strings are encountered in weekly operations and use of an automated program to generate and distribute validation records without human intervention. For this latter automated method, the focus is on subject heading strings applied since the year 2002 for which the LC catalog contains fifty or more bibliographic records that include the same 6XX string. Once the automated program is tested and approved, several thousand records are expected to be generated and distributed each week. CPSO will make an announcement before the automated method is put into full production.

Wednesday, October 10, 2007

Folksonomies

The October/November 2007 issue of the Bulletin of the American Society for Information Science and Technology includes a special section on Folksonomies.
  • Introduction: Folksonomies and Image Tagging: Seeing the Future? by Diane Neal, Guest Editor
  • Why Are They Tagging, and Why Do We Want Them To? by P. Jason Morrison
  • Trouble in Paradise: Conflict Management and Resolution in Social Classification Environments by Chris Landbeck
  • Image Indexing: How Can I Find a Nice Pair of Italian Shoes? by Elaine Ménard
  • Flickr Image Tagging: Patterns Made Visible by Joan Beaudoin
Seen on Christina's LIS Rant.

Monday, October 08, 2007

Family

A couple pictures of the family are up.

Video Language Coding

The OLAC CAPC Video Language Coding Best Practices Task Force Draft Recommendations (October 2007) are now available.
The task force was charged with creating a set of best practices for coding MARC 008/lang and 041 language information for videos, especially DVDs, and with using that exercise to examine whether any changes could be made to the MARC format (coding or directions) that would improve access to the multiple types of language information found on videos.

Email Access

Providing access to a collection of email messages isn't something we worry much about, unless we are archivists. Still providing access is what catalogers do. An IMAP plugin for SquirrelRDF by John Recker, Davide Eynard, and Craig Sayers. HPL-2007-161.
The Semantic Web aims to make information accessible to both humans and machines, using standard formats for data and making information available in a formal and structured way. Since the advent of RDF (Resource Description Framework) there have been many efforts to extract and convert existing information in this format. In this paper we describe an adapter tool for the IMAP protocol, developed as a plugin of SquirrelRDF1, which allows users to query IMAP mailboxes using SPARQL. The information returned looks like RDF, is always current, and can be reused and integrated inside other applications.