Abstract
This paper aims to investigate the feasibility of utilising Large Language Models (LLMs) and Latent Diffusion Models (LDMs) for automatically categorising word basicness and concreteness, i.e. two well-known aspects of language having significant relevance on tasks such as text simplification. To achieve this, we propose two distinct approaches: i) a generative Transformer-based LLM, and ii) a image+text multi-modal pipeline, referred to as stableKnowledge, which utilises a LDM to map terms to the image level. The evaluation results indicate that while the LLM approach is particularly well-suited for recognising word basicness, stableKnowledge outperforms the former when the task shifts to measuring concreteness.
Note: This work is based on the author’s Master’s thesis, which received a Special Mention at the AIxIA “Leonardo Lesmo” Award for best AI Master Thesis in 2023.
Citation
Federico Torrielli, Amon Rapp, and Luigi Di Caro, “How Shall a Machine Call a Thing?” in Natural Language Processing and Information Systems - 28th International Conference on Applications of Natural Language to Information Systems, NLDB 2023, Derby, UK, June 21-23, 2023, Proceedings, Elisabeth Métais, Farid Meziane, Vijayan Sugumaran, Warren Manning, and Stephan Reiff-Marganiec, Eds., ser. Lecture Notes in Computer Science, vol. 13913, Springer, 2023, pp. 546–557. DOI: 10.1007/978-3-031-35320-8_41
@inproceedings{DBLP:conf/nldb/TorrielliRC23,
title = {How Shall a Machine Call a Thing?},
author = {Federico Torrielli and Amon Rapp and Luigi Di Caro},
year = 2023,
booktitle = {Natural Language Processing and Information Systems - 28th International Conference on Applications of Natural Language to Information Systems, {NLDB} 2023, Derby, UK, June 21-23, 2023, Proceedings},
publisher = {Springer},
series = {Lecture Notes in Computer Science},
volume = 13913,
pages = {546--557},
doi = {10.1007/978-3-031-35320-8_41},
url = {https://doi.org/10.1007/978-3-031-35320-8_41},
editor = {Elisabeth M{\'{e}}tais and Farid Meziane and Vijayan Sugumaran and Warren Manning and Stephan Reiff{-}Marganiec},
timestamp = {Fri, 07 Jul 2023 23:30:36 +0200},
biburl = {https://dblp.org/rec/conf/nldb/TorrielliRC23.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}
Links
Awards
Special Mention - AIxIA “Leonardo Lesmo” Award for best AI Master Thesis (2023)
Master’s Thesis
This paper is based on the Master’s thesis:
“How shall a machine call a thing?” Exploring Basicness in Language through Attention-based Neural Networks and Human-in-the-Loop Methodology
Final Vote: 110L/110 with dignity of printing
University of Torino, 2023