Harvesting Knowledge from Cultural Heritage Artifacts in Museums of India

22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018)

Publication date: June 3, 2018

Abhilasha Sancheti, Paridhi Maheshwari, Rajat Chaturvedi, Anish Monsy, Tanya Goyal, Balaji Vasan Srinivasan

Recent efforts towards digitization of cultural heritage artifacts have resulted in a surge of information around these artifacts. However, the organization of these artifacts falls short with respect to accessing the facts across these entities. In this paper, we present a method to harvest the knowledge and form a knowledge graph from the digitized artifacts in the Museums of India repository via distant supervision to enable better accessibility of the facts and ability to extract new insights around the artifacts. Triples extracted from an open information extractor are first canonicalized to a standard taxonomy based on a metric-based scoring. Since a standard taxonomy is insufficient to capture all the relationships, we propose a sequential clustering based approach to add artifact specific relationships to the taxonomy (and to the knowledge graph). The graph is enriched by inferring missing facts based on a probabilistic soft logic approach seeded from a frequent item set framework. Human evaluation of the final knowledge graph showed an accuracy of

Learn More