论文标题
时间概念漂移和结盟:随着时间的推移比较知识组织系统的一种经验方法
Temporal Concept Drift and Alignment: An empirical approach to comparing Knowledge Organization Systems over time
论文作者
论文摘要
这项研究探讨了知识组织系统(KOS)中的时间概念漂移和时间对齐。使用1910年国会主题标题,2020快速主题和自动索引进行比较分析。用例涉及90个19世纪的大不列颠百科全书。使用两种方法对条目进行了索引:1)全文索引; 2)使用1910 LCSH和快速主题的辅助跨学科词汇应用程序(HIVE),使用STANZA,Stanford的NLP Toolkit(Stanza,Stanza,Stanza,Stanza,Stanza,Stanza,Stanza)进行的识别的命名。分析的重点是三个目标:1)确定1910年LCSH输出独有的结果; 2)在当代LCSH中删除的独家集合中的术语,证明了时间概念漂移; 3)探索这些贬低的条款的历史意义。结果证实,历史词汇可以用来产生过时的主题标题,代表了KOS和历史资源的概念漂移。做出了一种方法上的贡献,证明了如何随着时间的推移研究KOS的变化并改善了历史人文资源的情境化。
This research explores temporal concept drift and temporal alignment in knowledge organization systems (KOS). A comparative analysis is pursued using the 1910 Library of Congress Subject Headings, 2020 FAST Topical, and automatic indexing. The use case involves a sample of 90 nineteenth-century Encyclopedia Britannica entries. The entries were indexed using two approaches: 1) full-text indexing; 2) Named Entity Recognition was performed upon the entries with Stanza, Stanford's NLP toolkit, and entities were automatically indexed with the Helping Interdisciplinary Vocabulary application (HIVE), using both 1910 LCSH and FAST Topical. The analysis focused on three goals: 1) identifying results that were exclusive to the 1910 LCSH output; 2) identifying terms in the exclusive set that have been deprecated from the contemporary LCSH, demonstrating temporal concept drift; and 3) exploring the historical significance of these deprecated terms. Results confirm that historical vocabularies can be used to generate anachronistic subject headings representing conceptual drift across time in KOS and historical resources. A methodological contribution is made demonstrating how to study changes in KOS over time and improve the contextualization of historical humanities resources.