URL Classification

URL classification aims to automatically categorize URLs as benign or malicious (e.g., phishing, spam, malware), a crucial task for cybersecurity and online safety. Current research focuses on improving the accuracy and robustness of classification models, employing techniques like convolutional neural networks (CNNs), graph neural networks (GNNs), and large language models (LLMs), often incorporating contrastive learning and adversarial training to enhance performance against sophisticated evasion techniques. The development of explainable models and efficient methods for handling large-scale datasets are also key areas of investigation, with the ultimate goal of providing more reliable and understandable URL security solutions.

Papers

October 1, 2024

LinkThief: Combining Generalized Structure Knowledge with Node Similarity for Link Stealing Attack against GNN
Yuxing Zhang, Siyuan Meng, Chunchun Chen, Mengyao Peng, Hongyan Gu, Xinli Huang
Graph Neural Network Gene Level GNN Tiltable Link Node Similarity Shadow Model URL Classification Generalized Knowledge Linkage Attack Target Graph

September 22, 2024

LLMs are One-Shot URL Classifiers and Explainers
Fariza Rashid, Nishavi Ranaweera, Ben Doyle, Suranga Seneviratne
Large Language Model Url Detection URL Classification

April 27, 2024

PhishGuard: A Convolutional Neural Network Based Model for Detecting Phishing URLs with Explainability Analysis
Md Robiul Islam, Md Mahamodul Islam, Mst. Suraiya Afrin, Anika Antara, Nujhat Tabassum, Al Amin
Deep Learning Cybersecurity Perspective Explanation Model Uniform Resource Locator 1D Convolutional URL Classification

February 18, 2024

URLBERT:A Contrastive and Adversarial Pre-trained Model for URL Classification
Yujie Li, Yanbin Wang, Haitao Xu, Zhenhao Guo, Zheng Cao, Lun Zhang
Url Detection URL Classification

September 10, 2023

Classification of Spam URLs Using Machine Learning Approaches
Omar Husni Odeh, Anas Arram, Murad Njoum
Machine Learning Model Classification Code Random Forest Spam Detection Uniform Resource Locator Image Spam URL Classification

May 8, 2023

Web Content Filtering through knowledge distillation of Large Language Models
Tamás Vörös, Sean Paul Bergeron, Konstantin Berlin
Knowledge Distillation Content Filtering URL Classification

September 3, 2022

Phishing URL Detection: A Network-based Approach Robust to Evasion
Taeri Kim, Noseong Park, Jiwon Hong, Sang-Wook Kim
Internet Service Domain Network Inference Robust Network Url Detection URL Classification Common Pattern

April 27, 2022

An Adversarial Attack Analysis on Malicious Advertisement URL Detection Framework
Ehsan Nowroozi, Abhishek, Mohammadreza Mohammadi, Mauro Conti
Adversarial Attack Uniform Resource Locator Url Detection URL Classification