MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING

Shared by Elizabeth Foughty, updated on Oct 13, 2010

Summary

Abstract

MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED
SUBSPACE CLUSTERING

MOHAMMAD SALIM AHMED, LATIFUR KHAN, NIKUNJ OZA, AND MANDAVA RAJESWARI

Abstract. There has been a lot of research targeting text classification. Many of them focus
on a particular characteristic of text data - multi-labelity. This arises due to the fact that a
document may be associated with multiple classes at the same time. The consequence of such a
characteristic is the low performance of traditional binary or multi-class classification techniques on
multi-label text data. In this paper, we propose a text classification technique that considers this
characteristic and provides very good performance. Our multi-label text classification approach
is an extension of our previously formulated [3] multi-class text classification approach called
SISC (Semi-supervised Impurity based Subspace Clustering). We call this new classification model
as SISC-ML(SISC Multi-Label). Empirical evaluation on real world multi-label NASA ASRS
(Aviation Safety Reporting System) data set reveals that our approach outperforms state-of-theart
text classification as well as subspace clustering algorithms.

show more info
Publication Name
N/A
Publication Location
N/A
Year Published
N/A

Files

Paper 22 .pdf
MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING
555.8 KB 694 downloads

Discussions

Add New Comment