According to public information released by China’s National Anti-Fraud Center, telecom fraud has become the crime with the largest number of cases, the fastest growth rate, and the widest coverage in recent years. By the end of 2022, China’s public security departments had cracked 1.156 million telecom fraud cases, arrested 1.553 million suspects, and intercepted over 916.5 billion yuan of funds related to fraud cases. The increasing prevalence of telecom fraud pose a significant threat to personal and property safety.
Difficulties and Challenges in SMS Fraud Monitoring
SMS fraud stands out as one of the most common types of telecom fraud. Fraudsters constantly alter and adapt SMS contents to bypass the SMS monitoring system of telecom operators. Some common tactics employed by fraudsters include:
The traditional fraud management solution with a long upgrade period faces huge challenges. Overly lax policies may lead to low interception efficiency, while overly strict policies affect normal communication.
AI Model Enables Technological Revolution
On November 30, 2022, OpenAI launched ChatGPT, which obtained 100 million users within two months after its launch. Built on the transformer neural network architecture, ChatGPT, a large language model (LLM), has made major breakthroughs across multiple deep learning fields, including large-scale natural language processing, sequence data analysis, and target detection. Trained on extensive corpora, LLMs can acquire generalized knowledge and a deep understanding of languages and dialogues. Moreover, targeted training allows LLMs to solve problems in specific fields and rapidly adapt to new tasks and scenarios.
Accurately identifying fraudulent SMS messages requires a deep understanding of natural languages. Furthermore, it is necessary to classify sensitive information and identify the real intentions conveyed in the content. Lastly, given the evolving nature of fraudulent SMS messages, it is necessary to learn from samples and dynamically upgrade knowledge and models. These are the technologies where transformer-based LLMs excel. It is worthwhile to develop new SMS anti-fraud technologies and products utilizing AI models through prototype testing and exploration.
Rapid Technical Breakthrough Helps Tackle Difficulties
In the early stage of the project, we faced several challenges in selecting LLMs:
To achieve rapid technical breakthrough, we dared to try different approaches, make mistakes and adjust solutions promptly.
In terms of model selection, during the initial exploration phase, we tried models with less than 100 million parameters to 340 million, 7 billion and 13 billion parameters. This process encompassed four parameter scales and included six different models from both domestic and international sources, including self-developed ones. We evaluated a total of more than 20 combinations.
In terms of corpus and fine-tuning, we obtained a first-hand, high-quality corpus compliant with regulations, tried various fine-tuning solutions, and finally devised the most effective approach: "special prompt words+sample fine-tuning", greatly improving recognition accuracy and recall rate.
To address the challenges of high GPU quantity and high costs, we designed a multi-layer architecture with cache acceleration at the front and utilized a combination of small models and large models. Additionally, we implemented inference acceleration to achieve optimal performance.
After evaluating the effects and cost indicators of the models, we selected the most optimal solution and passed legal compliance review.
Perfect Combination of Communications and AI
Through continuous innovation, ZTE has successfully released the industry's first anti-fraud big model system called “Smart Safeguard” (Fig. 1). With its out-of-the-box functionality, the system automatically identifies illegal SMS messages without policy configuration. This greatly reduces the complexity and workload of on-site policy O&M, while enhancing the accuracy of illegal SMS message identification and recall rate. It enables integrated management of identifying, preventing, and controlling junk and fraudulent SMS messages.
Currently, the system has implemented the industry’s first LLM-based SMS anti-fraud management pilot in pilot offices of operators A and B in China and quickly transitioned into commercial use.
In addition, ZTE’s AI anti-fraud technologies have been chosen by China’s Ministry of Industry and Information Technology as an innovative technology application for preventing and controlling telecom fraud, and they have been promoted nationwide.
Future Evolution and Prospect
The introduction of anti-fraud model marks the beginning of AI model application in the communication sector. The Smart Safeguard series models will be developed, evolved, and applied across multiple domains, including service scope, media capabilities, and industrial applications.