Document and Text Processing
Data Extraction using Content-Based Handles

A. Pouramini; S. Khaje Hassani; Sh. Nasiri

Volume 6, Issue 2 , July 2018, , Pages 399-407

https://doi.org/10.22044/jadm.2017.990

Abstract
  In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired ...  Read More