News articles and Web directories represent some of the most popular and commonly accessed content on the Web. Information designers normally define categories that model these knowledge domains (i.e., news topics or Web categories), and domain experts assign documents to these categories. The paper describes how machine learning and automatic document classification techniques can be used to manage large numbers of news articles or Web page descriptions, lightening the load on domain experts. Seen on ResourceShelf.
Thursday, June 17, 2004
"Managing Content with Automatic Document Classification," by Rafael A. Calvo, Jae-Moon Lee, and Xiaobo Li, appears in the Journal of Digital Information, vol. 5, no. 2.