Thursday, June 27, 2002
Authority Resource
Wednesday, June 26, 2002
Open Source
The OCLC SiteSearch toolkit provides a comprehensive solution for managing distributed library information resources in a World Wide Web environment. WebZ™ is java-based Z39.50 client/server component used to create an interface to information resources. Database Builder provides the tools to build local databases in various record formats, that can then be accessed through the WebZ interface.
Open Archives Initiative
(http://jodi.ecs.soton.ac.uk/Articles/v02/i04/Liu/). - The [30]Open Archives Initiative is our latest, best hope for a technology that can knit together access to a variety of dispersed repositories of information. Although the underlying protocol for "crawling" (fetching information from) such repositories is now in place, there remain some thorny issues -- not the least of which is how to handle disparate sets of metadata. In building the [31]ARC Cross Archive Search Service, the authors had to make decisions on how to handle this issue, which they share in this piece. They determined that to federate metadata from disparate sources, no single approach would work well in all cases. One relatively easy method is to use keyword searching, but then any additional functionality offered by well-described metadata is lost. Another approach is to map each set of elements into a common set. Their decisions on how to handle this dilemma (not surprisingly, a hybrid approach), as well as a thorough accounting of the over 1 million records from dozens of archives with which they were working, round out this very informative article. As libraries try to build cross-archive search services, we will need to become very good at dealing with the challenges outlined here, as well as others. This early work should prove helpful. - [32]RT
Current Cites 13(6) (June 2002) ISSN: 1060-2356 Copyright B) 2002 by the Regents of the University of California All rights reserved.
Copying is permitted for noncommercial use by computerized bulletin board/conference systems, individual scholars, and libraries. Libraries are authorized to add the journal to their collections at no cost. This message must appear on copied material. All commercial use requires permission from the editor. All product names are trademarks or registered trade marks of their respective holders. Mention of a product in this publication does not necessarily imply endorsement of the product. To subscribe to the Current Cites distribution list, send the message "sub cites [your name]" to [50]listserv@library.berkeley.edu, replacing "[your name]" with your name. To unsubscribe, send the message "unsub cites" to the same address.
Not Cataloging
The Librarian's Book Club is a group to read and discuss books that are about libraries and the library profession. Every two months the Librarian's Book Club will select a new book to discuss. All you need to do is get yourself a copy of the current book and subscribe to the discussion list. Click on the book covers to learn more about the current selection. (Note: Although this book club discussion group is focused towards librarians, we welcome members that are not librarians.)Thanks to Blake on LISNews for alerting me to this.
Tuesday, June 25, 2002
GILS & Dublin Core
"Colleagues,
I am gratified to report the Web Metadata Standard for Minnesota has been approved by the MN Information Policy Council. This Dublin Core-based standard (ANSI/NISO Z39.85) has been in use by many agencies, beginning with the Foundations Project in 1998. The standard is integrated into the new North Star Portal content management templates, and the state search engine is tuned to use the DC elements in results ranking. It is part of the Enterprise Architecture, along with the GIS metadata standard and the recordkeeping metadata standard.
Located at http://bridges.state.mn.us/bestprac/mn_dc_standard.pdf
Eileen Quam
Information Architect
Minnesota Dept. of Natural Resources
eileen.quam@dnr.state.mn.us
651.297.2341
651.297.4946 FAX"
Resume Online
Monday, June 24, 2002
Metadata
"Large multimedia database systems have great potential for educational use. Their assets can often be used to support educational and research activities in a wide variety of educational contexts, supporting learners and educators from many subject areas. This article focuses on what appears to be a fundamental dilemma for the developers of such systems regarding how to tag or index their assets with metadata so as to support discovery of the assets by these educational users."
MARC21
Resource Description Framework
Markup Languages
Open Source Library System
"The LearningAccess ILS is a full-feature Open Source library automation system developed for use by small public and school libraries in the U.S. and the rest of the world. The Institute will make this system available free to libraries that, because of cost, have been unable to achieve the benefits of automation.
Main Components
The LearningAccess ILS consists of three modules: the patron or user module (OPAC), the cataloging module and the circulation module. In future releases it may also include an acquisition module. All modules are Web-interface based and are multi-lingual user capable, with our initial release supporting English, Spanish and French.
The system supports the full MARC21 format for bibliographic, holding, authority and community records. It has an intuitive importing program to add records to its database. The cataloging client includes Z39.50 searching capabilities to allow for copy cataloging against OCLC or other larger union databases. Future releases will also support Z39.50 searches against the database."
Customizable Local WebPAC Pages
OPACS On the Web
Joint Press Release from the British Library and the Consortium of University Research Libraries (CURL)
The British Library and the Consortium of University Research Libraries (CURL) are delighted to announce that from 17 June 2002 the British Library Public Catalogue (BLPC) will be accessible online, free of charge, from COPAC.
COPAC is an online union catalogue, hosted at MIMAS, which gives free access to the merged catalogues of 22 of the largest university research libraries in the UK and Ireland. Now that the British Library has become a full member of CURL and the BLPC has been added to COPAC, COPAC users will have free online access, via a single search interface, to more than 20 million catalogue records, which they can search simultaneously by title, author or subject.
The National Library of Scotland and the National Library of Wales / Llyfrgell Genedlaethol Cymru have also become full members of CURL and their records will be available via COPAC within the next twelve months. Meanwhile their catalogues can be searched simultaneously with the COPAC database, which also includes the BLPC, using the experimental version COPAC V3.
More details about the content of the BLPC are available.
More information about CURL and MIMAS can be found on their respective web sites.
Saturday, June 22, 2002
Information Architecture
Our focus is on the specification of shared interfaces among systems operated by e-Government initiative participants. The current draft calls out five minimum and critical agreements required for interoperability:
There is also a short paragraph noting some requirements applicable to Government participants, such as open and equal public access, permanence of public access, privacy, security, records management, and information management.
Anyone who would like to review the draft or advise on other major principles, please contact me by e-mail to echristi@usgs.gov
(Also, please forward this note to other groups interested in information architecture principles for e-Government.)
Thanks!
Eliot
Friday, June 21, 2002
Wireless Access?
Metadata Object Description Schema (MODS)
As an XML schema it is intended to be able to carry selected data from existing MARC 21 records as well as to enable the creation of original resource description records. It includes a subset of MARC fields and uses language-based tags rather than numeric ones, in some cases regrouping elements from the MARC 21 bibliographic format. The elements inherit MARC semantics, so are more compatible with existing library data than other metadata schemes.
MODS could potentially be used as follows:
MODS includes a subset of data from the MARC 21 Format for Bibliographic Data. As an element set that allows for the representation of data already in MARC-based systems, it is intended to allow for the conversion of core fields from a MARC 21 record, while some specific data may be dropped. As an element set for original resource description, it allows for a simple record to be created, in some cases using more general tags than those available in the MARC record. The Library of Congress has developed transformations to move MARC 21 records from 2709 form to MARCXML to MODS, with the software available from the MARCXML Web site.
There has been wide review and input to the development of the schema, which is now available for trial use. It will remain stable for the next six months, during which we invite comments as a result of experimentation.
For questions or comments please email the Office at ndmso@loc.gov
Sally McCallum, Chief
Network Development and MARC Standards Office
Library of Congress, Washington, DC, USA"
This is very much like MARC in XML with name tags rather than numeric.
The British Library Public Catalogue
The service allows quick and easy access to bibliographic records from the catalogues of the main British Library collections - already available online at http://blpc.bl.uk. In addition to this, the new feature will allow users with suitable retrieval software to download and make use of the Library's bibliographic citations for their own personal research purposes. This should prove of particular use to researchers and students compiling bibliographies for monographs, research reports or theses.
Developed and hosted by MIMAS (Manchester Information and Associated Services) at the University of Manchester, BLPCZ is the latest result of a co-operative alliance between the British Library and CURL (the Consortium of University Research Libraries) which is aimed at providing wider access to the Library's catalogues. "Further details are available.
Thursday, June 20, 2002
Open Source OPAC
Nearly a year since the last release, we have a new stable release of koha.
The main new features in this version are:
Main Bugfixes:
There have been numerous little fixes and a lot of code clean up.
The structure of the tarball has changed to a more sensible layout, and the INSTALL document has been rewritten.
There is also now a installer script, as well as a functioning database update script.
All this should mean its the easiest Koha to install or upgrade yet."
Info
Download
Wednesday, June 12, 2002
MeSH
Half the respondents (overwhelmingly the largest group) do their authority control manually. Half have brought their MeSH authorities up to date sometime in the last two years, but a third never have. About 70 percent have a mixture in their OPACs of pre-1999, post-1999 and NLM distributed MeSH.
A third of the respondents are using post-1999 MeSH practices in their original cataloging, and almost half are applying post-1999 MeSH practices in their OPACs. The number who would prefer that NLM provide records to bibliographic utilities with post-1999 MeSH nearly equals those who prefer NLM to continue its current practice of providing records with distributed MeSH structure. Thus, respondents appear to be divided on the post-1999 MeSH vs. distributed MeSH structure issue.
The survey also reveals that most vendor systems' global modification functionality are not adequate to convert old records in catalogs to agree with either NLM's current practices (post-1999) or NLM's distributed structure (distributed through utilities like OCLC).
Almost 60 percent enrich OCLC records with NLM classification and MeSH on a regular basis. Almost three-fourths would download MeSH authority records from OCLC if available.
Finally, an unexpectedly large number of comments probably indicate that MeSH authority control is a topic of very high interest to respondents.
For the entire survey results, including graphical displays, please refer to this PDF file, using Adobe Acrobat reader or click the survey link on this webpage
We hope you will enjoy examining the survey results and that you will contact the survey coordinators if you are interested in the work that HSOCLCUG is doing. MeSH authorities, OCLC new directions, changes at NLM, and resource sharing/universal MARC Holdings are four areas of special interest to HSOCLCUG.
The survey coordinators:
Dan Kniesner (kniesner@ohsu.edu) and Judith Dzierba (Judith_L_Dzierba@rush.edu)."
Wireless Web Browsers & the OPAC
"Patrons can interact with the Millennium system from anywhere accessible by wireless networks, allowing a user to place a hold from a classroom, check due dates and renew items during a long commute, or search the catalog while running errands to find the closest library branch where a desired title is held."
Does the Handheld Librarian know about this one?
Vacation
Cataloging in Other Languages
Preservation Metadata
DUBLIN, Ohio, June 11, 2002--The Working Group on Preservation Metadata, an initiative jointly sponsored by OCLC and RLG, has released 'A Metadata Framework to Support the Preservation of Digital Objects,' a new report available on the OCLC Web site.
The report is a comprehensive guide to preservation metadata that is applicable to a broad range of digital preservation activities. Preservation metadata is the information infrastructure necessary to support processes associated with the long-term retention of digital resources, and is an essential component of most digital preservation systems.
The report represents the consensus of leading experts and practitioners comprising the working group, and is intended for use by organizations and institutions managing, or planning to manage, the long-term retention of digital resources.
The working group based its work on preservation metadata element sets developed by several leading institutions and organizations in the digital preservation community, as well as the Open Archival Information System (OAIS) reference model. OAIS is a conceptual framework that articulates the primary functional components of a digital archive and establishes concepts and terminology for describing and comparing architectures and data models.
The report follows on the working group's earlier white paper, 'Preservation Metadata for Digital Objects: A Review of the State of the Art,' which defined and discussed the concept of preservation metadata, reviewed current thinking and practice in the use of preservation metadata, and identified starting points for consensus-building activity in this area. The white paper is also available on the OCLC Web site.
"The working group's report illustrates the value of cooperatively resolving the challenges of digital preservation," said Brian Lavoie, research scientist in the OCLC Office of Research. "The most effective solutions for the long-term retention of digital resources are likely to emerge from collaboration within the digital preservation community, pooling the expertise of researchers and practitioners from a variety of institutional and geographical backgrounds. Taken together, these consensus-building efforts will establish the necessary infrastructure, in the form of standards and best practices, to support digital preservation activity."
"For institutions creating and preserving digital objects, this new OCLC/RLG report is a good resource for both preservation metadata and its relationship to the OAIS reference model," said Robin Dale, RLG program officer. "It broadens the digital preservation toolset we've been building, which includes such aids as the RLG/OCLC report on the attributes of trusted digital repositories and RLG's OAIS practitioners' Web page."
"The work of the OCLC/RLG Working Group on Preservation Metadata has been an extremely important effort as libraries and related cultural institutions begin to tackle issues concerning digital preservation," said Rebecca S. Guenther, senior networking and standards specialist, Library of Congress, and working group member. "The working group document takes the OAIS information model closer to the goal of actually implementing archival information systems that document preservation requirements and processes and is particularly relevant to the needs of libraries. This document is an important contribution to the development of preservation metadata standards in information technology, an area where much additional work is needed."
The Working Group on Preservation Metadata is part of a cooperative effort between OCLC and RLG to support consensus-building activity in the development of key infrastructure for digital preservation.
Tuesday, June 11, 2002
Dewey in Spanish
Open Archives Initiative
The following changes/improvements were made:
Thanks to FOS for bringing this to my attention.
Monday, June 10, 2002
FRBR & the OPAC
"VTLS Inc. Announces FRBR Implementation
VIRTUA ILS NOW SUPPORTS FRBR
ALA JUNE 2002
Blacksburg, VA -- Virtua ILS – Integrated Library Systems newest release is currently the only commercially available ILS system in the marketplace to support the IFLA Functional Requirements for Bibliographic Records (FRBR). The FRBR model is the latest development in cataloging practice. It is designed to serve as a conceptual framework for systems that share and use bibliographic data, nationally or internationally. The concept was developed by IFLA with participation from National Libraries like Library of Congress and the National Library of Canada.
Present cataloging practices do not support "linked hierarchic metadata records" making it difficult to catalog certain types of materials. The FRBR model is designed to address this difficulty. The basic FRBR record consists of three entities - work, expression and manifestation. In addition, item records (holding records) can be attached to the manifestation. For example, Beethoven’s Fifth Symphony represents a work. It can be independently cataloged. The performance of the symphony by, say, the New York Philharmonic Orchestra represents an "expression" of the work. A CD by Columbia records containing the particular performance represents a "manifestation" of the work. Two copies of the CD in the library represent two "items" of this manifestation.
Using the FRBR model, librarians do not have to re-catalog the entire work again for different performances or for different manifestations of the same performance. Instead, FRBR allows the new performance to be cataloged separately and linked to a previously cataloged "work". This hierarchical parent child relationship in metadata records is the key feature of FRBR. The example above was from music; however the hierarchic relationship can exist in other material types like printed materials with different editions or language versions.
In implementing FRBR, VTLS has created an environment in which records in the FRBR model and records in the traditional cataloging model can co-exist in the same database. Such an implementation allows libraries to ease into the use of the FRBR model without requiring massive record conversions. It is clear that not every metadata record lends itself to FRBR. For example, many records do not have alternative expressions or manifestations. For this reason, Virtua ILS supports classic cataloging of metadata records along with FRBR records in the same database. The software is "FRBR aware" and automatically changes display formats depending on the type of record (FRBR or regular).
With a click of a button, the Virtua software converts a regular record to three records of the FRBR model and attaches all items to the manifestation record. This process can also be performed in a batch mode.
Refinements on the FRBR implementation continue as we receive valuable input from interested parties.
For more information on FRBR please see:
IFLA web site
VTLS web site"
Cataloging in Languages Other than English
Thesauri and Web Logs
Friday, June 07, 2002
Catalogablog
Not Cataloging
The STIX Project (http://www.stixfonts.org)
Melville, NY, June 10, 2002 - After years of planning, a group of scientific publishers today formally announced the Scientific and Technical Information Exchange (STIX) font creation project and the launch of the STIX web site. The STIX publishers aim to develop a comprehensive set of fonts for mathematics and other special characters used in Scientific, Technical, and Medical publishing. The web site provides information for potential users within the scientific and publishing communities, and a special area for software developers who may want to incorporate support for the STIX Fonts into their products.
Six publishers - the American Chemical Society (ACS), the American Institute of Physics (AIP), the American Mathematical Society (AMS), the American Physical Society (APS), Elsevier Science, and the Institute of Electrical and Electronic Engineers (IEEE) -- came together to design, fund and manage the STIX project. They have awarded the font development contract to a respected font development company, which has begun the process of designing and delivering nearly eight thousand characters/glyphs. The design submissions of the various character sets are currently being evaluated by a Technical Review Committee consisting of representatives of the six participating publishers.
There is currently a clear need for a new font set for mathematical and other scientific symbols, especially in the area of on-screen display in electronic publishing. Today, scientists must assemble scientific symbols and special characters from a variety of fonts, many of which may vary in character style, positioning, or size. The resulting documents typically have an unsatisfactory, jumbled appearance. Even more importantly, when posted to a web site, these documents may not be properly rendered unless the viewers of the document have all of the same specialized fonts available on the computer workstations they are using. This new set of fonts, known as the STIX Fonts, will solve both of these problems, serving the scientific and engineering community in the process from manuscript creation all the way through to final publication, both in electronic and print formats. It will unify support for all special symbols and alphabets into a single, comprehensive font set.
The STIX fonts will be made available, under royalty-free license, to anyone, including publishers, software developers, scientists, students and the general public. Target for completion of the project is the Fall of 2003.
By making the fonts freely available, the STIX project hopes to encourage the development of applications that make use of these fonts. In particular the STIX project will create a TEX implementation that TEX users can install and configure with minimal effort. TEX is a computer language designed for typesetting, with particular application to mathematics and other technical material.
The STIX mission will be fully realized when:
For more information visit the STIX Fonts web site
For the STIX Fonts project:
T.C. Ingoldsby (Chairman)
American Institute of Physics
2 Huntington Quadrangle, Suite 1NO1
Melville, NY 11747-4502
Phone: +1 516 576 2265
Fax: +1 516 576 2327
Email: tingoldsby@aip.org"
CONSER Task Group on FRBR and Continuing Resources
Charge
Conferences
MARC21 Codes
Network Development and MARC Standards Office Library of Congress
---------------------------------------------------
The following 12 codes have been recently approved for use in MARC 21 records. They include 2 category code source codes, 3 classification scheme source codes, and 6 subject source codes, and 1 name source code. These new codes have been added to the online "MARC Codes Lists for Relators, Sources, Description Conventions" but should not be used in exchange records until after July 30, 2002. This 60-day waiting period is required to provide MARC 21 implementers with time to include newly defined codes in any validation tables they may apply to the MARC fields where these codes are used.
---------------------------------------------------
MARC Category Code Source Codes
Changes:
agricola - AGRICOLA subject category codes (subfield $2 in Bibliographic field 072) [use after 07-30-2002]
fiaf - Classification scheme for literature on film and television (subfield $2 in Bibliographic field 072) [use after 07-30-2002]
---------------------------------------------------
MARC Classification Scheme Sources
Changes:
agricola - AGRICOLA subject category codes (subfield $2 in Bibliographic field 084 and Bibliographic/Holdings field 852)[use after 07-30-2002]
agrissc - AGRIS: subject category codes (subfield $2 in Bibliographic field 084 and Bibliographic/Holdings field 852)[use after 07-30-2002]
fiaf - Classification scheme for literature on film and television (subfield $2 in Bibliographic field 084 and Bibliographic/Holdings field 852)[use after 07-30-2002]
---------------------------------------------------
MARC Term, Name, Title Sources
Changes:
albt - Arbetslivsbibliotekets tesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
eclas - ECLAS thesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
ncjt - National criminal justice thesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
nznb - New Zealand national bibliography (subfield $2 in Bibliographic and Community Information fields 600-651, 655-658 subfield $2 in Authorities fields 700-751 and subfield $f in field 040) [use after 07-30-2002]
prvt - Patent- och registreringsverket tesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
sfit - Svenska filminstitutets tesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
sot - Schools online thesaurus (subfield $2 in Bibliographic and Community Information records in fields 600-651) [use after 07-30-2002]
Thursday, June 06, 2002
NISO Standards for Comment
ANSI/NISO/ISO 12083 - 1995 Electronic Manuscript Preparation and Markup
Abstract: In complete conformance with ISO 8879 (SGML - Standard Generalized Markup Language), 12083 provides a toolkit for developing customized SGML applications. Four Document Type Definitions are specified for books, serials, articles, and mathematics. Instructions for the preparation of text for the near automatic conversion to grade-2 braille and for publication in large-print and computer voice editions are included.
ANSI/NISO Z39.14 - 1997 Guidelines for Abstracts
Abstract: This standard helps authors and editors prepare useful abstracts by describing the components of an abstract and the appropriate styles and formats. Numerous examples illustrate the instructions presented in the standard and clarify how to handle special cases.
ANSI/NISO Z39.23 - 1997 Standard Technical Report Number Format and Creation
Abstract: Defines a unique numbering system that improves access to the wealth of scientific and technical reports issued by the government and private organizations. The STRN is an alphanumeric code with a maximum length of 34; for international application an optional country code can be added. The standard explains how and where the code should be assigned and used. A central authority to coordinate and monitor assignments of the code is designated.
ANSI/NISO Z39.26 - 1997 Micropublishing Product Information
Abstract: Content guidelines are given for advertising materials used to describe micropublications intended for long-term retention and use. Vendor Note: This standard details the information which publishers should incorporate in advertising materials for micropublications so users will have a full description of the product. This revision now includes a section on guarantees and the publisher's policy on replacement of defective portions of the micropublication.
ANSI/NISO Z39.32 - 1996 Information on Microfiche Headers
Equivalent international standard: ISO 5123
Abstract: Defines the specific eye-legible information that should appear on the limited space available on microfiche so the fiche can be correctly identified and properly filed.The standard describes where to place the data, the order of the information, and a recommended type size and contrast to maximize readability. Many examples show how to use the standard.
ANSI/NISO Z39.41 - 1998 Printed Information on Spines
Abstract: Describes how information is best presented on the spines of books or the containers that contain an information product, such as a CD-ROM, a microform, a disk, videotape or an audiotape. Explains what information to include and how to arrange it for maximum clarity. Using this standard will help your users identify your product and use it!
ANSI/NISO Z39.48 - 1992(R1997) Permanence of Paper for Publications and Documents in Libraries and Archives
Equivalent international standard: ISO 9706
Abstract: Publishers and paper manufacturers, take note! This standard sets the basic criteria for coated and uncoated papers that will last several hundred years under normal use. It covers ph value, tear resistance, alkaline reserve and lignin threshold. Recycled papers will meet the criteria specified. This revision to the original 1984 standard is based on testing conducted by the Institute of Paper Science and Technology and contributions from paper makers, publishers, printers, and the preservation community.
Wednesday, June 05, 2002
MARC21 & XML
The Library of Congress Network Development and MARC Standards Office announces completion of a schema for MARC 21 records in an XML structure for use in communicating MARC 21 records. This schema was developed in collaboration with OCLC and RLG and reviewed by the National Library of Canada and the National Library of Medicine (NLM), after a survey of schemas in use in various projects. Many schemas have taken the "slim" approach but all vary slightly. This schema will be maintained by the Library of Congress as will software that enables lossless conversion to and from MARC 21 records in the ISO 2709 structure. As illustrated in the introductory information on the web site, the Library of Congress will develop and provide, downloadable from the MARCXML web site, tools for various transformations and for record validations. A single schema serves all the five MARC 21 formats.
By collaboratively developing a communications schema, the Library of Congress encourages the standardization of MARC 21 exchange records in the XML environment, recognizing that MARC 21 records inside systems will continue to use different record configurations, tailored to the characteristics of the system. Provision of the tools for transformations to and from other metadata approaches, such as Dublin Core and the Metadata Object Description Schema (MODS), will help to standardize derivative metadata records also. (MODS is a new schema for a bibliographic element set that is a subset of MARC expressed in XML with language-based rather than numeric tags.) The tools take the mappings between MARC and other metadata sets, that have been maintained on the MARC web site, to an operational level.
One project interested in a standard, lossless MARCXML schema is the Open Archive Initiative (OAI) which found it necessary to draft a schema in the absence of an official one. The Library of Congress worked with the OAI to provide a transformation from the original oai_marc schema to this one so the Initiative can take advantage of a schema that is maintained by the MARC 21 maintenance agency and in broad use. The transformation is available from the MARCXML web site.
With the slim approach, schema-driven validation is only possible at the highest structural level. The Network Development and MARC Standards Office will therefore maintain downloadable tag, subfield, and value validation software on the web site that will enable users to build validation programs for their needs. Use of these standard validations represent another attempt to assure standardization of records to support effective record interchange.
The Library has maintained two SGML DTDs (for Bibliographic-type and Authority-type records) since 1996, which take a different approach to the data elements in MARC - an approach that enables validation of data through the DTD itself but requires a very large DTD and DTD maintenance. The Bibliographic-type DTD was converted to an XML DTD in 2000. These DTDs have been effectively used by some agencies (including the Library of Congress), primarily for internal processes, therefore transformations between them and the new slim MARCXML schema are being provided. Maintenance techniques and/or possible revision of the XML DTDs are under consideration.
For questions or comments please email the Office at ndmso@loc.gov.
Cataloging in the Real World
I have some questions on why some of these non-standard practices exist. Why don't the institutions using their own subject headings submit them to SACO? Or at least place them in a 653 field? Why would anyone buy a system that could not access the full MARC record? My system is an inexpensive one for the PC. It is far from perfect, in display there are problems, but I can at least create and edit the full record. Authority records or editing authority records would solve some of the problems. Do some systems not use MARC authority records, or are they too hard to acquire so institutions just don't bother?
"The following message is regarding an article that has been submitted for publication to Library Resources and Technical Services. I would ask that anyone who is a potential reviewer for LRTS read no further.
As you know back in February I posted a message asking for your local practices that break cataloging standards, broadly defined as AACR2, classification systems, encoding standards, etc. The response was overwhelming, 94 original responses came back and I couldn't have been happier.
You'll all be glad to know that I have submitted a draft of an article discussing the results of this survey to Library Resources and Technical Services for publication. The current title is "Breaking the Law: a survey of non-standard cataloging practices."
This research could not have been done without the support and input from Autocat, and I wanted to thank you all. A static working draft of the article is available on my website.
WARNING: Please don't all go visit the site at once, because my ISP only allows a limited amount of data transfer per hour. If you do want to read the article, please bookmark the address above, and visit either in a few days or during an evening hour. I would hate to have you get frustrated by not being able to access the article, and not take the time to read it.
Thank you.
Eric S. Riley Graduate Student (for one more week!)
Information School University of Washington, Seattle"
Tuesday, June 04, 2002
Metadata
MARC Geographic Codes
Addition to the MARC Country and Geographic Area Code Lists for Aruba, Bonaire and East Timor
Network Development and MARC Standards Office
As the result of East Timor's gaining its independence from Indonesia in May 2002, new country and geographic area codes are being defined for use in MARC records.
The new codes for East Timor are:
em (Country code)
a-em (Geographic area code)
The Netherlands Antilles is an internally self-governing and integral part of the Netherlands. It consists of the islands of Bonaire, Curacao, Saba, Saint Eustatius and the southern half of the island of Saint Martin. Aruba was part of the Netherlands Antilles until it seceded in 1986 to become a separate, autonomous member of the Kingdom of the Netherlands. Therefore, a new geographic area code is being defined for Aruba. Moreover, because all of the separate islands of the Netherlands Antilles have their geographic area codes, a geographic area code is being defined for Bonaire for use in MARC records.
The new code for Aruba is:
nwaw (Geographic area code)
The new code for Bonaire is:
nwbn (Geographic area code)
Subscribers can anticipate receiving MARC records reflecting these changes in all distribution services not earlier than September 3, 2002. Questions regarding the country and geographic area codes should be directed to:
Cataloging Policy and Support Office
Library of Congress
Washington, DC 20540-4305
(202) 707-4380
INTERNET: cpso@loc.gov
June 3, 2002
Professional Reading
Spirit Writing
Shameless Plug
Monday, June 03, 2002
Taxonomy
"The EDUCAUSE Taxonomy, a hierarchical subject listing of over 450 terms, helps you more easily explore topics and find specific documents in the association’s extensive collection of online information resources. The EDUCAUSE taxonomy is applied to all information resources library documents, EDUCAUSE publications, Effective Practices and Solutions, and EDUCAUSE conference sessions."
RSS
Personal RSS Aggregators by Jon Udell describes many of the programs to use with RSS feeds.
If you have a site providing an RSS feed you can provide a link tag to alert news readers that there is an RSS feed. This effort is being led by Mark Pilgrim.
Friday, May 31, 2002
Not Cataloging Related
"The purpose of this mysterious device is the conversion of text files (be they pasted from your clipboard or Stripped by this very engine from the URL you provide) from their ASCII state into a .pdb file which can be read on your palm-top computer (e.g. a Palm Pilot, Visor, one of those crazy PDA-cellphone thingies, a Psion, a Jornada, etc., etc., ad infinitum.) By the glory and miracle of science, the letters of your text doc are whisked through the motivator, into the grinder and then ejected from the vaporisor to the Vagrant's Library where the finished product is stored for download and use."
Thanks to Matthew Eberle at Library Techlog for pointing this out to me.
Cataloging Instruction
"The course is designed for members of the library community who do not have formal library training. It will help you understand the basic processes of technical services and cataloging."
MARC21
It will be discussed in a meeting of the MARC Advisory Committee on June 15-16, 2002 in Atlanta.
A draft agenda for that meeting is available
Thursday, May 30, 2002
Metadata
"Object ID is an international standard for describing cultural objects. It has been developed through the collaboration of the museum community, police and customs agencies, the art trade, insurance industry, and valuers of art and antiques.
The Object ID project was initiated by the J. Paul Getty Trust in 1993 and the standard was launched in 1997. It is being promoted by major law enforcement agencies, including the FBI, Scotland Yard and Interpol; museum, cultural heritage, art trade and art appraisal organisations; and insurance companies.
Having established the descriptive standard, the Object ID project now helps to combat art theft by encouraging use of the standard and by bringing together organisations around the world that can encourage its implementation."
Classification
"ICONCLASS is a subject specific international classification system for iconographic research and the documentation of images. It was developed by Henri van de Waal (1910-1972), Professor of Art History at the University of Leiden, and completed by his staff. ICONCLASS is a collection of ready-made definitions of objects, persons, events, situations and abstract ideas that can be the subject of an image. ICONCLASS organizes iconography into 10 'main divisions' in which the definitions are ordered hierarchically."
Not Cataloging Related
Wednesday, May 29, 2002
About this Blog
Maintaining the Catalog
"MarcXGen extracts URLs from MARC 21 bibliographic records and generates HTML code to create a single web page of hyperlinks that can be used with third party Link Checking software such as LinkBot and Xenu's Link Sleuth. With Version 2, MarcXGen also creates separate files of delimited data that may be used to build a relational database environment that may simplify some maintenance tasks associated with bad or problem URLs in library database records."
OCLC
A document entitled Guide to Migration that explains current plans is located on the OCLC Web sits.
Dublin Core
"I am a student from the University of Salzburg, Austria and I developed coins and banknotes templates under DC for my PHD thesis. Libraries, archives or a coin dealer could use any of these templates which are launched with the cooperation with the REGNET Project. (REGNET-Projet under the search engine google.de) Dr. Koch from Graz www.cscaustria.at launched it.
I am the person who created the templates for the coin note short CN-Collector.
If you are interested in that program please contact me and then you could start to try these out. My goal for the PHD is to get a database, where all the people can create the on catalog online. Right now I write a handbook for the templates to get good entries.
Thank you
Alexander H.T. Schultheis
E-Mail: ahtschultheis@hotmail.com"
Friday, May 24, 2002
Metadata
Digital Libraries- Metadata As If Libraries Depended on It
Digital Libraries- The Importance of Being Granular
Digital Libraries- The Consequences of Cataloging
I like his definition "cataloging by those paid better than librarians."
GILS
Through the efforts of several people, we now have a draft Web Services Description Language (WSDL) file for the GILS search service, available for review.
This draft only defines GILS search using "HTTP Get", following the extended ZURL. The search response message follows the definition given. (BTW, these definitions align with the ZX client from Dave Vieglais.)
Future discussions of GILS as a Web Service will be conducted mostly on the GILS Version 3 Discussion List. Subscribe to the GILS V3 list.
Z39.50
"The current version of YAZ includes experimental support for the industry standard ZOOM API for Z39.50. This API vastly simplifies the process of writing new clients using YAZ, and it reduces your dependency on any single toolkit. Future versions of YAZ may include support for other emerging IR protocols through the same interface."
Thursday, May 23, 2002
Art Metadata
"The Categories describe the content of art databases by articulating a conceptual framework for describing and accessing information about objects and images. They identify vocabulary resources and descriptive practices that will make information residing in diverse systems both more compatible and more accessible. They also provide a framework to which existing art information systems can be mapped and upon which new systems can be developed."
I've had the pleasure of hearing Murtha Baca speak at the ALCTS Metadata and AACR2 Institute. If you get the opportunity to hear her talk about what the Getty is doing, it is well worth the time.
MARC21
The following papers are now available:
Proposal No. 2002-12: Coding for Publication Pattern at the First Level of Enumeration in MARC 21 Holdings Records
Proposal No. 2002-13: Changes for Faceted Application of Subject Terminology (FAST) Subject Headings Changes in Field 008 in the MARC 21 Holdings Format
Proposal No. 2002-14: Proposal No. 2002-14: Changes for UKMARC Format Alignment
Proposal No. 2002-15: Defining field 065 (Other Classification Number) in the MARC 21 Authority Format
Wednesday, May 22, 2002
Metadata
Semantic Web
The Semantic Web and Libraries by Art is "a column I am working on for InsideOLITA and would welcome any and all feedback."
Tuesday, May 21, 2002
Resource Description Framework
"The EOR toolkit is a collection of extensible Java classes and services which serve as a code base, demonstrating by example functions and services common to RDF applications, i.e., metadata capture, search engines, etc.. The current release provides services designed to validate RDF, build and search RDF triple stores (HTTP and Java API) and render RDF data using XSLT."
The latest issue of Information Technologies & Libraries (2002), v. 21, no. 1 pp. 27-31 has an article on the toolkit. "The EOR Toolkit: An Open Source Solution for RDF Metadata" by Harry R. Wagner.
Library Catalogs
"The new version has been released. The file can be downloaded PhpMyLibrary-1.0.4b here! This is a version that have a Holdings and Cataloging Modules Added. The recent module only consisted of WebPAC module and Record Import module, now you can make your own catalog entry, put an accession to it, and you're done. Your new book or material are ready for browsing online."
Monday, May 20, 2002
Spelling
Controlled Vocabularies
Geographic Cutters
"California regional and city geographic cutter numbers (G4362 and G4364) have been updated by Traci Penrod of the Earth Sciences and Map Library and the University of California, Berkeley. These lists are on the web:
Region Cutters
City Cutters"
Thursday, May 16, 2002
Postings to Catalogablog
D-Lib Magazine
The articles include:
A Metadata Registry for the Semantic Web
Rachel Heery, UKOLN, and Harry Wagner, OCLC / Dublin Core Metadata Initiative
Meta-Design of a Community Digital Library
Michael Wright and Mary Marlino, University Corporation for Atmospheric Research; and Tamara Sumner, University of Colorado at Boulder
Levels of Service for Digital Repositories
William G. LeFurgy, U.S. National Archives and Records Administration
Indigenous Cultural and Intellectual Property Rights: A Digital Library Context
Robert Sullivan, University of Auckland, New Zealand
Keywords
Genre Headings
Wednesday, May 15, 2002
Text Encoding Initiative
The primary goal of this revision has been to make available a new and corrected version of the TEI Guidelines which:
is expressed in XML and conforms to a TEI-conformant XML DTD;
generates a set of DTD fragments that can be combined together to form either SGML or XML document type definitions;
corrects blatant errors, typographical mishaps, and other egregious editorial oversights;
can be processed and maintained using readily available XML tools instead of the special-purpose ad hoc software originally used for TEI P3.
A second major design goal of this revision has been to ensure that the DTD fragments generated would not break existing documents: in other words, that any document conforming to the original TEI P3 SGML DTD would also conform to the new XML version of it. Although full backwards compatibility cannot be guaranteed, we believe our implementation is consistent with that goal.
Metadata
Tuesday, May 14, 2002
Metadata
LC Classification
To expedite your order, please read all of the explanatory information carefully and fill out the order form as completely as possible. If the information that accompanies the order form as well as the product information is not sufficient to answer your questions, email your inquiries to cdsinfo@loc.gov.
We appreciate your ongoing interest in Class Web and look forward to having it ready for access on June 1, 2002.
Cheryl C. Cook
Cataloging Distribution Service"
MARC21
Proposal no. 2002-11: Repertoire Expansion in the Universal Character Set for Canadian Aboriginal Syllabics.
Monday, May 13, 2002
Dewey Classification
The Library of Congress Subject Headings/DDC page is a "list of LC subject headings is selected from recent Weekly Lists accompanied by candidate DDC numbers from Edition 21. The purpose of the list is to provide classifier assistance for topics of recent interest not mentioned explicitly in Edition 21. The DDC numbers listed are not exhaustive, and the schedules and tables should be consulted before applying a number from this list."
The New and Changed Entries page provides PDF and Word versions of those entries. I have set a TrackEngine on the LCSH/DDC page to get the page e-mailed to me whenever it changes.
I've added the ability to make comments to the postings. Let me know if it is not working for you.
MARC21
The following papers are now available:
Discussion Paper No. 2002-6R: Changes in Field 008 in the MARC 21 Holdings Format
Proposal No. 2002-10: Defining URI Subfields in Field 506 (Restrictions on Access Note) and Field 540 (Terms Governing Use and Reproduction Note) in the MARC 21 Bibliographic Format
Proposal No. 2001-10R: Definition of Additional Codes in Field 007/10 (Type of Material) for Sound Recordings in the MARC 21 Bibliographic and Holdings Formats
Other papers will be posted next week.
Friday, May 10, 2002
Profession
Resource Description Framework
"The Resource Description Framework (RDF) is a general-purpose language for representing information in the World Wide Web. It is particularly intended for representing metadata about Web resources, such as the title, author, and modification date of a Web page, the copyright and syndication information about a Web document, the availability schedule for some shared resource, or the description of a Web user's preferences for information delivery. RDF provides a common framework for expressing this information in such a way that it can be exchanged between applications without loss of meaning. Since it is a common framework, application designers can leverage the availability of common RDF parsers and processing tools. Exchanging information between different applications means that the information may be made available to applications other than those for which it was originally created. This Primer is designed to provide the reader the basic fundamentals required to effectively use RDF in their particular applications."--abstract.
Thursday, May 09, 2002
Standard Address Number (SAN)
Summaries or Abstracts
Wednesday, May 08, 2002
Another Controlled Vocabulary
Recently on AUTOCAT someone asked the question "what to read in the area of subject headings and classification?" I do believe that the introduction of several thesaurus and subject heading lists would be a valuable part of that reading list. They provide concrete examples of how and why the list was constructed. The AAT and DDC have perhaps the best introductions but the others give the reader some contrast and renforcement of principles.
Schedule G
The pages are formatted to be trimmed down and tipped into the schedule book.
Tuesday, May 07, 2002
Genre Terms
Controlled Vocabulary
The Art & Architecture Thesaurus (AAT) is a structured vocabulary of around 125,000 terms, scope notes, and other information for describing fine art, architecture, decorative arts, archival materials, and material culture.
The Getty Thesaurus of Geographic Names (TGN) is a structured vocabulary of around 1,000,000 geographic names, including vernacular and historical names, coordinates, and place types, and focusing on places important for the study of art and architecture.
The Union List of Artist Names (ULAN) is a structured vocabulary containing around 220,000 names and biographical information about artists and architects, including a wealth of variant names, pseudonyms, and language variants.
Monday, May 06, 2002
Classification
Cataloging Skills
Both reference and tech services would benefit from seeing things from the different perspective. It would be ideal if catalogers could sit on the reference desk for four hours a week. Reference staff could work on MARC records for their four hours, adding genre/form headings would be useful and not too much of a stretch. Then catalogers could go back and create some of those cross references in the authority file, add annotations using language of the patron's, create headings for characters that would benefit their local users. The reference folks could see that we are not just making it up to have job security.
Friday, May 03, 2002
Genre and Form Headings
MeSH
The staff of Ehrman Medical Library, NYU School of Medicine have developed a product m[n]m. This PERL program provides a way to update the locally needed records from the entire file provided from the download.
Thursday, May 02, 2002
Digital Libraries
The presentations from the conference are now available.
Some of the talks include:
Community, Consensus, and the Trajectory of Progress: Reflections on the Dublin Core experience and what it tells us about the future.
RDF Query by example.
LDAP based repositories for Metadata and Ontologies.
Semantic problems of thesaurus mapping.
MARC Tool
Wednesday, May 01, 2002
Sandy Berman
Authority Records
For some time now, LC's Cataloging Distribution Service (CDS) has been processing and distributing to subscribers, including the bibliographic utilities, name authority records derived from the National Library of Canada that contain 7XX fields. NACO members will soon begin seeing 7XX "linking references" in name authority records originating from other institutions. These linking references are AACR2 authoritative forms of name used in national bibliographies or for other special purposes by national libraries/bibliographic entities.
Interim instructions, pending the release of revised documentation, are available at the PCC Web site at URL http://www.loc.gov/catdir/pcc/naco/7xx.html [May 2002]
This is excellent news, one step closer to a patron friendly authority access system. Where each institution or even each user can select the form of names they desire to use. Currently only the Library of Canada has input. Soon the Library of South Africa will as well.
Tuesday, April 30, 2002
Classification
General Material Designations
1. GMD helps patrons find, separate, collocate, and select records for print and non-print materials. Since non-print materials are often the minority in collections, is the GMD the best way to alert users of the catalog?
2. How are GMD's best applied when describing reproductions and multiple resources such as digital sound recording?
3. Professional report inconsistencies of how to use GMD particularly for non-print materials- so will rules oriented towards print materials help?
4. When should a GMD be in a record (e.g. even if there is no intrinsic relation to a transcribed title), how specific should it be?
5. Materials are described via the GMD, should they be relegated to subject classification?
6. Is SMD an alterative for GMD?
7. GMD is perceived as an example of broad issues in the rules. Thus, should GMD be addressed at all through cataloging rules?
If you have an opinion on this let the members of CC:DA know.
Monday, April 29, 2002
Serial Subscriptions
A questionnaire for libraries has been placed on the NISO web site. Because NISO supports the development of standards for the information community based on consensus among constitiuents, we are grateful for the interest of libraries who benefit so much from these standards.
Please take some time to visit the NISO web site and fill out the survey which can be found at: http://www.niso.org/survey/serials.cfm. The survey will close on May 30. A final report on this study, including recommendations, will be on the NISO website in late June."--from an e-mail received 4/26/02


