LSTM网络应用于DGA域名识别--文献翻译--更新中

原文献名称 Predicting Domain Generation Algorithms with Long Short-Term Memory Networks
原文作者 Jonathan Woodbridge, Hyrum S. Anderson, Anjum Ahuja, and Daniel Grant

The LSTM cell’s design with multiplicative gates allows a network to store and access state over long sequences, thereby
mitigating the vanishing gradients problem. For our use with domain names, the state space is intended to capture combi-
nations of letters that are important to discriminating DGA domains from non-DGA domains. This flexible architecture
generalizes manual feature extraction via bigrams, for example, but instead learns dependencies of one or multiple characters,
whether in succession or with arbitrary separation.

LSTM网络的多重门结构使得LSTM网络可以存储并且获取长序列的状态,因而减轻了特征失效这一个问题。在域名检测这一问题

上,我们主要利用LSTM的状态空间来存储字母之间的综合特征,这些状态是将DGA域名区分出来的关键。这样存储的综合特征覆盖

范围之广,包括绝大多数手工创建的特征,例如:依赖于一个字母或者多个字母的连续特征或者间断特征。

猜你喜欢

转载自www.cnblogs.com/blili/p/9472654.html