Reaching Quality and Efficiency with a Parameter-Efficient Controllable Sentence Simplification Approach

Antonio Menta¹ and Ana Garcia-Serrano¹

  1. E.T.S.I. Informática (UNED)
    C. de Juan del Rosal, 14, 28040 Madrid, Spain
    amenta1@alumno.uned.es, agarcia@lsi.uned.es

Abstract

The task of Automatic Text Simplification (ATS) aims to transform texts to improve their readability and comprehensibility. Current solutions are based on Large Language Models (LLMs). These models perform well but require powerful computing resources and large amounts of data for fine-tuning in specific, technical domains, which prevents most researchers from adapting them to their area of study. The main contributions of this research are: (1) an accurate solution for settings where powerful resources are not available, which exploits transfer learning across domains by combining a set of linguistic features with a reduced-size pre-trained language model (T5-small), making it accessible to a broader range of researchers and individuals; (2) an evaluation of our model on two well-known datasets, TurkCorpus and ASSET, and an analysis of the influence of control tokens on the SimpleText corpus, focusing on the domains of Computer Science and Medicine. Finally, a detailed discussion comparing our approach with state-of-the-art models for sentence simplification is included.
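
As a rough illustration of what "control tokens" can mean in controllable simplification, the sketch below prepends feature tokens to a source sentence before it would be passed to a sequence-to-sequence model such as T5-small. The feature names (`char_ratio`, `word_rank_ratio`), the token spelling (`<CHAR_…>`, `<WORDRANK_…>`), and the `simplify:` task prefix are assumptions made for this example; the abstract does not specify the linguistic features or input format the authors use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ControlTokens:
    """Hypothetical control features; names and values are illustrative only."""
    char_ratio: float = 0.8       # assumed: target length / source length
    word_rank_ratio: float = 0.8  # assumed: relative lexical complexity


def char_length_ratio(source: str, target: str) -> float:
    """Compute a length ratio from a (complex, simple) training pair."""
    return round(len(target) / max(len(source), 1), 2)


def build_model_input(sentence: str, controls: ControlTokens) -> str:
    """Prefix the sentence with control tokens (assumed format, not the paper's)."""
    prefix = (
        f"<CHAR_{controls.char_ratio:.2f}> "
        f"<WORDRANK_{controls.word_rank_ratio:.2f}>"
    )
    return f"simplify: {prefix} {sentence.strip()}"


if __name__ == "__main__":
    controls = ControlTokens(char_ratio=0.5, word_rank_ratio=0.75)
    print(build_model_input("The cat sat on the mat.", controls))
```

At training time, values such as `char_length_ratio(source, target)` would be computed from each sentence pair; at inference time they are chosen by the user to steer how aggressively the sentence is simplified.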

Key words

Text Simplification, Transfer Learning, Language Models

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS230912017M

Publication information

Volume 21, Issue 3 (June 2024)
Year of Publication: 2024
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium

How to cite

Menta, A., Garcia-Serrano, A.: Reaching Quality and Efficiency with a Parameter-Efficient Controllable Sentence Simplification Approach. Computer Science and Information Systems, Vol. 21, No. 3, 719-741. (2024), https://doi.org/10.2298/CSIS230912017M