SEMANTIC ANNOTATION TO SUPPORT AUTOMATIC TAXONOMY CLASSIFICATION
DS 36: Proceedings DESIGN 2006, the 9th International Design Conference, Dubrovnik, Croatia
                        Year: 2006
                        Editor: Marjanovic, D.
                        Author: Kim, S.; Bracewell, R.H.; Ahmed, S.; Wallace, K.M.
                        Section: COMPETENCIES & COMMUNICATIONS
                        Page(s): 1171-1178
                        
Abstract
The paper presents a new taxonomy classification method that generates classification criteria from a small number of important sentences identified through semantic annotations. Rhetorical Structure Theory (RST) is used to discover the semantics. The annotations identify which parts of a text are more important for understanding its contents. The extraction of salient sentences is a major issue in text summarisation. Statistical analysis is commonly used, but for subject-matter type texts, linguistically motivated natural language processing techniques, e.g. semantic annotations, are preferred. An experiment to test the method using documents collected from industry demonstrated that classification accuracy can be improved by up to 16%.
Keywords: corporate taxonomy classification, semantic annotation, natural language processing, rhetorical structural theory