ASHuR: Evaluation of the Relation Summary-Content Without Human Reference Using ROUGE

Alan Ramírez-Noriega

Universidad Autónoma de Baja California Facultad de Ciencias Químicas e Ingeniería Calzada Universitaria 14418, Parque Industrial Internacional Tijuana, Baja California, C.P. 22390 México
Reyes Juárez-Ramírez

Universidad Autónoma de Baja California Facultad de Ciencias Químicas e Ingeniería Calzada Universitaria 14418, Parque Industrial Internacional Tijuana, Baja California, C.P. 22390 México
Samantha Jiménez

Universidad Autónoma de Baja California Facultad de Ciencias Químicas e Ingeniería Calzada Universitaria 14418, Parque Industrial Internacional Tijuana, Baja California, C.P. 22390 México
Sergio Inzunza

Universidad Autónoma de Baja California Facultad de Ciencias Químicas e Ingeniería Calzada Universitaria 14418, Parque Industrial Internacional Tijuana, Baja California, C.P. 22390 México
Yobani Martínez-Ramírez

Universidad Autónoma de Sinaloa Facultad de Ingeniería Mochis Fuente de Poseidon y Angel Flores s/n, Col. Jiquilpan Los Mochis, Sinaloa, C.P. 81223 México


keywords: Text summarization, summary evaluation, ROUGE, sentence extraction

In written documents, the summary is a brief description of the important aspects of a text. The degree of similarity between the summary and the content of a document indicates how reliable the summary is. Some efforts have been made to automate the evaluation of summaries. ROUGE metrics can evaluate a summary automatically, but they require a model summary written by humans. The goal of this study is to find a quantitative relation between the content of an article and its summary using ROUGE tests, without a human-written model summary. This work proposes ASHuR, a method based on automatic extractive summarization for evaluating a summary. ASHuR extracts the best sentences of an article based on the frequency of concepts, cue words, title words, and sentence length. The extracted sentences capture the essence of the article and form the model summary. We performed two experiments to assess the reliability of ASHuR. The first experiment compared ASHuR against similar approaches based on sentence extraction; ASHuR ranked first in every test applied. The second experiment compared ASHuR against human-made summaries and yielded a Pearson correlation of 0.86. These assessments indicate that ASHuR is reliable for evaluating summaries written by users on collaborative sites (e.g., Wikipedia) or for reviewing texts generated by students in online learning systems (e.g., Moodle).
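The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the cue-word list, the equal weighting of the four features, the use of raw term frequency as a stand-in for concept frequency, and all function names are assumptions made purely for illustration. The sketch extracts the highest-scoring sentences of an article as a model summary and then computes a ROUGE-1 style unigram recall between a candidate summary and that model summary.

```python
import re
from collections import Counter

# Hypothetical cue-word list; the abstract does not enumerate the cue words.
CUE_WORDS = {"propose", "results", "conclude", "method", "study"}


def tokenize(text):
    """Return lowercase alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def score_sentence(sentence, freq, title_tokens, max_len):
    """Sum four features, each scaled to [0, 1] (equal weights are an assumption)."""
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    max_freq = max(freq.values())
    # Average term frequency, a simple stand-in for concept frequency.
    term = sum(freq[t] for t in tokens) / (len(tokens) * max_freq)
    cue = sum(t in CUE_WORDS for t in tokens) / len(tokens)
    title = sum(t in title_tokens for t in tokens) / len(tokens)
    length = len(tokens) / max_len
    return term + cue + title + length


def extract_model_summary(title, article, n=2):
    """Return the n best-scoring sentences, kept in their original order."""
    sentences = split_sentences(article)
    if not sentences:
        return []
    freq = Counter(tokenize(article))
    title_tokens = set(tokenize(title))
    max_len = max(len(tokenize(s)) for s in sentences) or 1
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: score_sentence(sentences[i], freq, title_tokens, max_len),
        reverse=True,
    )
    return [sentences[i] for i in sorted(ranked[:n])]


def rouge_1_recall(candidate, reference):
    """Clipped unigram overlap divided by the number of reference unigrams."""
    cand = Counter(tokenize(candidate))
    ref = Counter(tokenize(reference))
    if not ref:
        return 0.0
    overlap = sum(min(count, cand[tok]) for tok, count in ref.items())
    return overlap / sum(ref.values())
```

Under these assumptions, a summary would be evaluated by calling `extract_model_summary(title, article)`, joining the returned sentences into one string, and passing that string as the `reference` argument of `rouge_1_recall`, with the summary under evaluation as `candidate`.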

mathematics subject classification 2000: 68-U15, 68-T50

reference: Vol. 37, 2018, No. 2, pp. 509–532

doi: 10.4149/cai_2018_2_509

Computing and Informatics

formerly Computers and Artificial Intelligence
