ANALYZING AVIATION SAFETY REPORTS: FROM TOPIC MODELING TO SCALABLE MULTI-LABEL CLASSIFICATION

Shared by Elizabeth Foughty, updated on Oct 13, 2010

Summary

Abstract

ANALYZING AVIATION SAFETY REPORTS: FROM TOPIC MODELING TO
SCALABLE MULTI-LABEL CLASSIFICATION

AMRUDIN AGOVIC, HANHUAI SHAN, AND ARINDAM BANERJEE*

Abstract. The Aviation Safety Reporting System (ASRS) is used to collect voluntarily submitted
aviation safety reports from pilots, controllers and others. As such it is particularly useful
in researching aviation safety deficiencies. In this paper we address two challenges related to the
analysis of ASRS data: (1) the unsupervised extraction of meaningful and interpretable topics
from ASRS reports and (2) multi-label classification of ASRS data based on a set of predefined
categories. For topic modeling we investigate the practical usefulness of Latent Dirichlet Allocation
(LDA) when it comes to modeling ASRS reports in terms of interpretable topics. We also
utilize LDA to generate a more compact representation of ASRS reports to be used in multi-label
classification. For multi-label classification we propose a novel and highly scalable multi-label classification
algorithm based on multi-variate regression. Empirical results indicate that our approach
is superior to several baseline and state-of-the-art approaches.

show more info
Publication Name
N/A
Publication Location
N/A
Year Published
N/A

Files

Paper 7 .pdf
ANALYZING AVIATION SAFETY REPORTS: FROM TOPIC MODELING TO SCALABLE MULTI-LABEL CLASSIFICATION
579.1 KB 160 downloads

Discussions

Add New Comment