Document and Text Processing
1. External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages

A. Shojaie; F. Safi-Esfahani

Volume 7, Issue 3 , Summer 2019, , Pages 451-466

http://dx.doi.org/10.22044/jadm.2018.4328.1517

Abstract
  With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique and existing ...  Read More

Document and Text Processing
2. A Joint Semantic Vector Representation Model for Text Clustering and Classification

S. Momtazi; A. Rahbar; D. Salami; I. Khanijazani

Volume 7, Issue 3 , Summer 2019, , Pages 443-450

http://dx.doi.org/10.22044/jadm.2019.7400.1876

Abstract
  Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, ...  Read More

Document and Text Processing
3. Automatic Construction of Persian ICT WordNet using Princeton WordNet

A. Ahmadi Tameh; M. Nassiri; M. Mansoorizadeh

Volume 7, Issue 1 , Winter 2019, , Pages 109-119

http://dx.doi.org/10.22044/jadm.2018.4966.1601

Abstract
  WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word ...  Read More

Document and Text Processing
4. A survey on Automatic Text Summarization

N. Nazari; M. A. Mahdavi

Volume 7, Issue 1 , Winter 2019, , Pages 121-135

http://dx.doi.org/10.22044/jadm.2018.6139.1726

Abstract
  Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major ...  Read More

Document and Text Processing
5. Data Extraction using Content-Based Handles

A. Pouramini; S. Khaje Hassani; Sh. Nasiri

Volume 6, Issue 2 , Summer 2018, , Pages 399-407

http://dx.doi.org/10.22044/jadm.2017.990

Abstract
  In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired ...  Read More

Document and Text Processing
6. English-Persian Plagiarism Detection based on a Semantic Approach

F. Safi-Esfahani; Sh. Rakian; M.H. Nadimi-Shahraki

Volume 5, Issue 2 , Summer 2017, , Pages 275-284

http://dx.doi.org/10.22044/jadm.2016.770

Abstract
  Plagiarism which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them” poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories of direct, paraphrasing (rewriting), ...  Read More