DRILL: Dual-Reasoning Large Language Models for Phishing Email Detection with Limited Data

Greenewald, Calvin; Ashmore, Bradley; Poon, Chien-Sing; Chen, Lingwei

Citation Details

As phishing emails pose a growing threat to individuals and organizations alike, there is an urgent need to develop more accurate detection methods. Large Language Models (LLMs) have recently garnered major attention in this line of research; however, they often require large-scale data for fine-tuning, which is impractical in real-world application scenarios. This paper proposes DRILL, a new simple and efficient mechanism, for dual-reasoning LLMs to detect phishing emails with extremely small data. DRILL distills the reasoning ability from an LLM into a target small LM model, while integrating trainable perturbations to manipulate the inputs, which in turn adaptively enhances the inference ability of the target LM. Extensive experiments are conducted on multiple real-world email datasets, and the evaluation results demonstrate that DRILL can benefit from dual LMs, which significantly reduces training parameters and data required, while maintaining state-of-the-art performance in phishing email detection with limited data. more »

Award ID(s):: 2245968

PAR ID:: 10559232

Author(s) / Creator(s):: Greenewald, Calvin; Ashmore, Bradley; Poon, Chien-Sing; Chen, Lingwei

Publisher / Repository:: International Conference on Neural Information Processing

Date Published:: 2024-12-02

Subject(s) / Keyword(s):: Phishing Email Detection Large Language Models Reasoning Data-Limited Learning

Format(s):: Medium: X

Location:: Auckland, New Zealand

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this