Paper ID: 2304.03098

Static Fuzzy Bag-of-Words: a lightweight sentence embedding algorithm

Matteo Muffo, Roberto Tedesco, Licia Sbattella, Vincenzo Scotti

The introduction of embedding techniques has pushed forward significantly the Natural Language Processing field. Many of the proposed solutions have been presented for word-level encoding; anyhow, in the last years, new mechanism to treat information at an higher level of aggregation, like at sentence- and document-level, have emerged. With this work we address specifically the sentence embeddings problem, presenting the Static Fuzzy Bag-of-Word model. Our model is a refinement of the Fuzzy Bag-of-Words approach, providing sentence embeddings with a predefined dimension. SFBoW provides competitive performances in Semantic Textual Similarity benchmarks, while requiring low computational resources.

Submitted: Apr 6, 2023