TY - GEN
T1 - Leveraging the legacy of conventional libraries for organizing digital libraries
AU - Joorabchi, Arash
AU - Mahdi, Abdulhussain E.
PY - 2009
Y1 - 2009
N2 - With the significant growth in the number of available electronic documents on the Internet, intranets, and digital libraries, the need for developing effective methods and systems to index and organize E-documents is felt more than ever. In this paper we introduce a new method for automatic text classification for categorizing E-documents by utilizing classification metadata of books, journals and other library holdings that already exists in online catalogues of libraries. The method is based on identifying all references cited in a given document and, using the classification metadata of these references as catalogued in a physical library, devising an appropriate class for the document itself according to a standard library classification scheme with the help of a weighting mechanism. We have demonstrated the application of the proposed method and assessed its performance by developing a prototype classification system for classifying electronic syllabus documents archived in the Irish National Syllabus Repository according to the well-known Dewey Decimal Classification (DDC) scheme.
KW - Bibliography
KW - Collective classification
KW - Digital library organization
KW - Library classification schemes
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=77952082692&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-04346-8_3
DO - 10.1007/978-3-642-04346-8_3
M3 - Conference contribution
AN - SCOPUS:77952082692
SN - 3642043453
SN - 9783642043451
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 3
EP - 14
BT - Research and Advanced Technology for Digital Libraries - 13th European Conference, ECDL 2009, Proceedings
T2 - 13th European Conference on Research and Advanced Technologies for Digital Libraries, ECDL 2009
Y2 - 27 September 2009 through 2 October 2009
ER -