Incident and Problem Ticket Clustering and Classification Using Deep Learning

Release Date:2023-12-25 FENG Hailin, HAN Jing, HUANG Leijun, SHENG Ziwei, GONG Zican

Abstract: In this paper we present a holistic analysis of problem and incident tickets in a real production Cloud service environment. By extracting different bags of words, we use principal component analysis (PCA) to examine the clustering characteristics of these tickets. Then K-means and latent dirichlet allocation (LDA) are applied to show the potential clusters within this Cloud environment. The second part of our study uses a pre-trained bidirectional encoder representation from transformers (BERT) model to classify the tickets, with the goal to predict the optimal dispatching department for a given ticket. Experimental results show that due to the unique characteristics of ticket description, pre-processing with domain knowledge turns out to be critical in both clustering and classification. Our classification model yields 86% accuracy when predicting the target dispatching department.

Keywords: problem ticket; ticket clustering; ticket classification

download: PDF