Scopus Harvesting Series

A graph convolutional network-based sensitive information detection algorithm

Ying Liu, Anhui University of Science and Technology
Chao Yu Yang, Anhui University of Science and Technology
Jie Yang, University of Wollongong

Publication Name

Complexity

Abstract

In natural language processing (NLP), sensitive information detection is the task of identifying sensitive words in given documents. The majority of existing detection methods are based on the sensitive-word tree, which is usually constructed from the common prefixes of the sensitive words in a given corpus. These traditional methods, however, suffer from drawbacks such as poor generalization and low efficiency. To address them, this paper proposes a novel self-attention-based detection algorithm built on a graph convolutional network (GCN). The main contribution is twofold. First, we use a weighted GCN to better encode word pairs from the given documents and corpus. Second, we introduce a simple yet effective attention mechanism to further integrate the interaction between candidate words and the corpus. Experimental results on the THUC news benchmark dataset demonstrate promising detection performance compared with existing work.
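For orientation, the sensitive-word tree baseline mentioned in the abstract can be sketched as a prefix tree (trie) that shares common prefixes among sensitive words. This is a minimal illustration only, not the paper's implementation: the names `TrieNode`, `build_trie`, and `find_sensitive`, and the sample word list, are hypothetical.

```python
from typing import Dict, List, Tuple


class TrieNode:
    """One node of a sensitive-word tree (hypothetical name)."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_word = False  # True if a sensitive word ends at this node


def build_trie(words: List[str]) -> TrieNode:
    """Insert each word so that words with a common prefix share nodes."""
    root = TrieNode()
    for word in words:
        node = root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True
    return root


def find_sensitive(text: str, root: TrieNode) -> List[Tuple[int, str]]:
    """Return (start_index, word) for every exact match found in text."""
    hits: List[Tuple[int, str]] = []
    for start in range(len(text)):
        node = root
        for end in range(start, len(text)):
            node = node.children.get(text[end])
            if node is None:
                break  # no sensitive word continues with this character
            if node.is_word:
                hits.append((start, text[start:end + 1]))
    return hits


# Assumed sample data, for illustration only.
root = build_trie(["bad", "badge", "ban"])
matches = find_sensitive("a badge ban", root)
```

Because such a tree only matches character sequences that were inserted verbatim, a variant spelling such as "bdaly" yields no match, which illustrates the generalization limit the abstract attributes to tree-based methods.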

Open Access Status

This publication may be available as open access

Volume

2021

Article Number

6631768

Link to publisher version (DOI)

http://dx.doi.org/10.1155/2021/6631768
