H.6.5.14. Text processing
Abolfazl Adressi; َAmirhossein Amiri
Abstract
Identifying and classifying anomalies in textual data from social networks is challenging due to the linguistic complexity and diverse user expressions. While deep learning and machine learning techniques offer promise in tackling this problem, their effectiveness is limited by insufficient data. The ...
Read More
Identifying and classifying anomalies in textual data from social networks is challenging due to the linguistic complexity and diverse user expressions. While deep learning and machine learning techniques offer promise in tackling this problem, their effectiveness is limited by insufficient data. The effect of Generative Adversarial Networks (GANs) on anomaly detection and Classification is assessed in this paper, along with their relevance for generating synthetic text data. Combining synthetic and real data enhances classification accuracy, especially in settings of limited data. In this paper, Lasso and Ridge regression techniques are used for anomaly detection and classification. Experimental results reveal the superior performance of the proposed model in identifying and classifying anomalies under new datasets generated by GAN. By combining statistical methods with generative techniques, the solution becomes not only more interpretable and scalable but also better suited for advanced text analysis in fast-changing environments like social media platforms.
H.6.5.14. Text processing
Amir Ali Kharazmi; Hamid Hassanpour
Abstract
Advancements in artificial intelligence have produced powerful language models that enhance scientific writing through automated evaluation and proofreading. Effective use of these models relies on prompt engineering—the precise formulation of requests—which directly influences output quality. ...
Read More
Advancements in artificial intelligence have produced powerful language models that enhance scientific writing through automated evaluation and proofreading. Effective use of these models relies on prompt engineering—the precise formulation of requests—which directly influences output quality. As the saying goes, "Asking correctly is half of knowledge," emphasizing the importance of well-crafted prompts. In this study, we introduce a novel approach utilizing the simple language model Gemma-7b-it to improve scientific writing. By detailing the specific characteristics and structures of each section of a scientific paper, we prompt the model to evaluate and proofread text for clarity, coherence, and adherence to academic standards. Our method comprises three stages: initial evaluation, feedback-based proofreading, and iterative refinement using textual gradient optimization. Tested on a dataset of 25 scientific articles, expert evaluations confirm that this method achieves significant enhancements in abstract quality. These findings demonstrate that meticulous prompt engineering can enable simpler language models to produce results comparable to advanced models like GPT-4, underscoring the critical role of prompt optimization in achieving high-quality scientific writing.