IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Speech and Image Processing, Recognition>
An Attention Mechanism Extension of Automatic HTML Generation from Web Page Design Images
Hiroki ChinenHidehiro OhkiKeiji GyohtenToshiya Takami
Author information
JOURNAL RESTRICTED ACCESS

2020 Volume 140 Issue 12 Pages 1393-1401

Details
Abstract

The purpose of this study is to improve the accuracy of automatic HTML generation from web page design images. pix2code is the state of art in this field. It is consist of design image learning part by CNN and HTML learning part by LSTM. We propose three improvements of adding a word embedding layer, applying VGG16 fine tuning to CNN, replacing LSTM to Bidirectional LSTM or GRU, and introducing attention mechanism. In the experiment, we employed a conventional data set which was used in pix2code and evaluated by a standard natural language generation metric called BLEU. As the results, the one of proposed models that contained the word embedding layer and the attention mechanism scored the accuracy of 99%. It overcomes the result of state of art scored 88%.

Content from these authors
© 2020 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top