Friday, December 19, 2008

PURLs

PURLZ Server version 1.2 has been released.
Purlz are Web addresses or Uniform Resource Locators (URLs) that act as permanent identifiers in the face of a dynamic and changing Web infrastructure. Instead of resolving directly to Web resources, PURLs provide a level of indirection that allows the underlying Web addresses of resources to change over time without negatively affecting systems that depend on them. This capability provides continuity of references to network resources that may migrate from machine to machine for business, social or technical reasons.

Cataloging Video Discs

DVDImage via Wikipedia

Another from the draft folder. The DVD Guide Update Task Force of the Cataloging Policy Committee (CAPC) of OLAC has completed the document, Guide to Cataloging DVD and Blu-ray Discs Using AACR2r and MARC 21 (2008 update).

Thanks to all involved.



Reblog this post [with Zemanta]

Cataloging Video Discs

The DVD Guide Update Task Force of the Cataloging Policy Committee (CAPC) of OLAC has completed the document, Guide to Cataloging DVD and Blu-ray Discs Using AACR2r and MARC 21 (2008 update).

Thanks to all involved.

Additions to the MARC Code Lists for Relators, Sources, Descriptions

The code listed below has been recently approved for use in MARC 21 records. The code will be added to MARC Code Lists for Relators, Sources, Description Conventions.

The code should not be used in exchange records until after February 16, 2008. This 60-day waiting period is required to provide MARC 21 implementers time to include newly-defined codes in any validation tables they may apply to the MARC fields where the codes are used.

Category Code Sources

The following code is for use in subfield $2 in field 072 in Authority and Bibliographic records (Subject Category Code) and in subfield $z in field 073 (Subdivision Usage) in Authority records.

Addition:
eflch
E4Libraries Category Headings
[use only after February 16, 2008]

Term, Name, Title Sources

The following code is for use in subfield $2 in fields 600-657 (Subject Added Entries/Index Terms) in Bibliographic and Community Information records; subfield 662 (Subject Added Entry) in Bibliographic records; subfield $2 in fields 700-754 (Index Terms) in Classification records; subfield $2 in fields 700-788 (Heading Linking Entries) in Authority records; and subfield $f in field 040 (Cataloging Source) in Authority records.

Addition:
eflch
E4Libraries Category Headings
[use only after February 16, 2008]

Nature Now has XMP

Nature now includes XMP semantic data in the PDF version of their articles.
We now have a complete bibliographic record (including DOI) embedded in the PDF using structured markup. And, moreover, we also have a solid bedrock for adding in any additional metadata should the need arise. This semantic labelling is available on all new issues of Nature and will be added to other NPG titles over the coming months.

XMP as a labelling technology could well go a long way towards addressing concerns raised by Olivia Judson in an op-ed piece earlier this week in the New York Times: Defeating Bedlam. The author decries that "downloading papers from journal Web sites" means that "access to information is easier and faster than ever before ... but there’s been no obvious way to manage it once you’ve got it." Those days may soon be over.

Now with XMP all manner of scholarly content - documents, images and other media types - can be properly labelled and many programs (not just Zotero and Papers which she reviews) can directly profit from the richness of semantic web descriptions.

Indexing 2.0

Unshelved is going to use use the crowd to index their strips. Ohnorobot.com is the tool they selected to do the indexing. It is also a tool for searching across over 91,000 Web comics.

Monday, December 15, 2008

Timeline and Plan for the Next Five Library of Congress Genre/Form Projects

News from LC.
In July, 2008, the Library of Congress Acquisitions and Bibliographic Access (ABA) management team approved five new genre/form projects to be undertaken by the Cataloging Policy and Support Office (now the Policy and Standards Division): cartography, law, literature, music, and religion. On October 31st, 2008, the Division presented its timeline and plan for those projects to the ABA management team, and it was approved on November 17th.

The plan follows the principles and recommendations for the management of the genre/form projects, as outlined in the moving image project report; provides opportunities for involvement by other libraries and organizations with an interest in genre/form headings; requests input from the broader library community at various points in each project; and, furnishes a timeline that will allow for the orderly roll-out of genre/form headings in each of the five disciplines under development during the next four years.

Tuesday, December 09, 2008

Bay Area Youth Singers Holiday Concert

Bay Area Youth Singers (BAYS) Holiday Concert
December 14 4:00 p.m.
University Baptist Church
Tickets $10 Adults $3 Students

Contact me for tickets.

IFLA GMD Paper

The IFLA Cataloguing Section, ISBD Review Group has the document Proposed Area 0 for ISBD up for review.
The Working Group on General Material Designations of the IFLA Meeting of Experts on an International Cataloguing Code (IME ICC) held in Frankfurt in 2003 suggested that the GMD seemed unsatisfactory because the presence of the content of the resource and of the presentation of the resource were mixed, confusing more than clarifying. Other comments were on its present location, interrupting the logical order of the title information. It was also thought that the GMD was important enough to be at the beginning of the record, and that it should not be optional as it currently is.
Comment by 30 January 2009.

GPO Separate Record Cataloging Policy

The Government Printing Office (GPO) has adopted a separate record cataloging policy.
At the request of the Federal Depository Library community, the Government Printing Office, Library Services & Content Services, Library Technical Information Services (LTIS) staff has formulated a policy for creating separate records for every manifestation of a document. This policy follows an internal review of the current approach of single record cataloging.

Monday, December 08, 2008

Preliminary Authority Records

Here is another preliminary authority record:

American Association of Petroleum Geologists. |b Committee on Structural Nomenclature

It was created in 1985 as shown by the ID n 85806886. It was updated in 2008 as shown by 005. But it is still PRELIMINARY. How long can these stay preliminary?

COinS in WordPress

The OpenBook Book Data plug-in for WordPress by John Miedema now supports COinS.
OpenBook is for book reviewers, book bloggers, library webmasters, anyone who wants to put book covers and data on their WordPress blog or website.

OpenBook gets its covers and book data from Open Library (http://openlibrary.org), the only source of bibliographic data that is both open source and open data, hence the OpenBook label.

About COinS
The goal is to embed citation metadata into html in such a way that processing agents can discover, process and make use of the metadata. Since an important use of this metadata will be to allow processing agents to make OpenURL hyperlinks for users in libraries (latent OpenURL), the method must allow the metadata to be placed any where in HTML that a link might appear. In the absence of some metadata-aware agent, the embedded metadata must be invisible to the user and innocuous with respect to HTML markup. To meet these requirements, the span element was selected. The NISO OpenURL ContextObject is selected as the specific metadata package. The resulting specification is named "ContextObject in SPAN" or COinS for short.

Thursday, December 04, 2008

Problems with Microformats

This is old news, but new to me and maybe someone else. There are some basic problems with many of the microformats, including the hCalendar. The BBC has stopped using microformats.
Since /programmes first went live we've been working to ensure that programme data was accessible to people and machines alike. The API design was baked in at the application design stage. Similarly we've worked on adding microformats to HTML pages as a lightweight API. All broadcasts use the hCalendar microformat to add start times, end times, broadcast channels etc.

Unfortunately there have been a number of concerns over hCalendar's use of the abbreviation design pattern.

They were considering RDFa as an alternative.

So, does anyone know of any tools to easily create RDFa? Something to just plug in the info and have it pop out?

Wednesday, December 03, 2008

Name Authority Records

News form LC concerning Name Authority Records.
The Library of Congress is pleased to announce that OCLC has completed the pre-population of the NACO authority file with non-Latin references (authority 4XX fields) derived from non-Latin bibliographic heading fields in WorldCat, a use of data-mining techniques originally developed for the WorldCat Identities project. The pre-population project, which began in mid-July, added non-Latin script references to 497,576 name authority records for personal names and corporate bodies.

**For NACO catalogers, this means that the moratorium on updating 100/110 authority records that existed prior to July 2008 to add non-Latin script references is now lifted. All name authority records are now candidates for the addition of non-Latin script references. Thanks for your patience during this period.**

LC hopes to announce soon a process by which catalogers that have been examining the non-Latin script references added by this project can contribute to the development of policies and practices for the future, such as the issues raised in the white paper on non-Latin script references in name authority records.

Special thanks to Robert Bremer, and colleagues at OCLC, for all the efforts to make this pre-population a reality.

Tuesday, December 02, 2008

RDFa

Elias Torres, Ben Adida discuss RDFa on Technometria with Phil Windley
While the web is primarily for human consumption, more sites are including machine readable data. However, this information is usually included separately. As the RFDa Primer states, RFDa provides a set of XHTML attributes to augment visual data with machine-readable hints. RDFa helps bloggers and website authors make their web pages smarter by adding computer-readable information to a site. Elias Torres and Ben Adida talk about it, including its history and what problems RFDa is attempting to solve.

Torres and Adida also discuss the technical details of RDFa and give a detailed technical description of how RDFa works. They review the mechanics of RDFa and give examples of its usage.

Wednesday, November 26, 2008

Metadata Extraction

Effective Metadata Extraction from Irregularly Structured Web Content by Baoyao Zhou, Wei Liu, Yu Yang, Weichun Wang Ming Zhang, (HPL-2008-203)
Metadata extraction is one crucial module for domain specific Web content discovery and management, because the accuracy and completeness of the extracted metadata would directly affect the quality of subsequent domain information services. Our Online Course Organization project aims to build an online course portal to serve the course information obtained from the Web. Since most course pages are irregularly structured, most existing approaches are not effective for extracting course metadata. In this paper, we proposed a novel hierarchical clustering approach to generate a web page semantic structure model from the DOM tree, called Logical Structure Model, such that the hidden patterns and knowledge can be revealed and used to facilitate identifying course metadata. The experimental results have shown that our solution can achieve effective metadata extraction

Library Weblogs

Now available, The Liblog Landscape 2007-2008 by Walt Crawford.
Liblogs--blogs written by library people, as opposed to official library blogs--provide some of today's most interesting and useful library literature. This book offers a broad look at English-language liblogs as they are and as they've changed between 2007 and 2008. The book includes more than 600 blogs with detailed analysis of 27 metrics for 2007 and 2008 and changes from 2007 to 2008--and, for 143 of them, 2006 as well. Through tables, charts and text, we explore the liblog landscape.

MODS Tools

The MODS users are collecting examples of tools using MODS. One example is Tellico.
Tellico is a KDE application for organizing your collections. It provides default templates for books, bibliographies, videos, music, video games, coins, stamps, trading cards, comic books, and wines.

Tellico allows you to enter your collection in a catalogue database, saving many different properties like title, author, etc. Two different views of your collection are shown. On the left, your entries are grouped together by any field you like, allowing you to see how many are in each group. On the right, selected fields are shown in column format, allowing you to sort by any field. On the bottom is a customizable HTML view of the current entry. The entry editor is a dialog box where you enter the data.

Tuesday, November 25, 2008

Cataloging Tools

LibLime has announced the beta test of a suite of cataloging tools, ‡biblios.net.
‡biblios.net is a subscription-based, hosted version of the open-source ‡biblios metadata editor that we released earlier this year. In addition to the editor, ‡biblios.net includes some extended community features such as integrated real-time chat, forums, and private messaging.

‡biblios.net also provides access to the world's largest database of freely-licensed library records. The database will be freely available to ‡biblios.net subscribers and non-subscribers alike via Z39.50, OAI, and direct download.

Furthermore, the database itself will be maintained by ‡biblios.net users similar to the way that Wikipedia's database is maintained by users.

We're now looking for enthusiastic participants to help shape the final production release of ‡biblios.net.

Ways you can help:

  • Become a beta tester for the ‡biblios.net platform by filling out the beta tester application form.
  • Donate your records to ‡biblios.net. Upload records to http://archive.org, and drop us an email at 'info AT liblime DOT com'
  • Get involved in the ‡biblios open-source community: get your copy of ‡biblios and join the development team at http://biblios.org
An aside, wouldn't it make more sense in that first paragraph to link "‡biblios metadata editor" rather than "released earlier this year?" Links are a form of mark-up and clean mark-up matters.

Monday, November 24, 2008

Metadata Matters

Diane Hillmann and Jon Phipps have started a new weblog, Metadata Matters. Based on the names, I'd call this a must read. Subscribed.