Ensemble-based Short Text Similarity: An Easy Approach for Multilingual Datasets using Transformers and WordNet in Real-world Scenarios

by Isabella Gagliardi and Maria Teresa Artese

To be submitted to the Big Data and Cognitive Computing special issue "Artificial Intelligence in Digital Humanities"



Some of the datasets used in the paper (work in progress):

Data extracted from DPLA using the 'Wedding' query in QueryLab, first ten JSON files
1 2 3 4 5 6 7 8 9
Data extracted from V&A using the 'Wedding' query in QueryLab, first ten JSON files
1 2 3 4 5 6 7 8 9
Data extracted from Europeana using the 'Mariage' query in QueryLab, first ten JSON files
1 2 3 4 5 6 7 8 9
Data extracted from RMN using the 'Mariage' query in QueryLab, first ten JSON files
1 2 3 4 5 6 7 8 9


IMATI - CNR
IMATI-MI Multimedia Information Systems Lab

Isabella Gagliardi personal page
Maria Teresa Artese personal page

2023/09/15