Abstract
Word Sense Disambiguation (WSD) is a key task in Natural Language Processing (NLP): selecting the correct
meaning of a word based on its context. With Pretrained Language Models (PLMs) such as BERT and DeBERTa now well established,
significant progress has been made in understanding contextual semantics. Nevertheless, how well these models inherently
disambiguate word senses remains uncertain. In this work, we evaluate several encoder-only PLMs across two popular sense inventories,
namely WordNet and the Oxford Dictionary of English (ODE), by analyzing their ability to separate word senses without any task-specific
fine-tuning. We compute a centroid for each word sense from contextual embeddings and measure the similarity of target-word representations to these centroids, assessing performance across different layers. Our
results show that DeBERTa-v3 delivers the best performance on the task, with the middle layers (specifically the 7th and 8th layers)
achieving the highest accuracy, outperforming the output layer by approximately 15 percentage points. Our experiments also
explore the inherent structure of the WordNet and ODE sense inventories, highlighting their influence on overall model behavior
and performance. Finally, based on our findings, we develop a small, efficient model for the WSD task that attains robust
performance while significantly reducing the carbon footprint.
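The following is a minimal, illustrative sketch of the centroid-based layer-wise evaluation described above; the model choice, helper names, and naive sub-token matching are assumptions for illustration, not the paper's exact setup.

```python
# Illustrative sketch: per-sense centroids of contextual embeddings and
# 1-nearest-centroid prediction by cosine similarity, evaluated at a chosen layer.
import torch
from transformers import AutoTokenizer, AutoModel

MODEL_NAME = "microsoft/deberta-v3-base"  # placeholder; any encoder-only PLM works
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME, output_hidden_states=True).eval()

def target_embedding(sentence: str, target: str, layer: int) -> torch.Tensor:
    """Hidden state of the target word at a given layer (sub-tokens mean-pooled)."""
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).hidden_states[layer][0]   # (seq_len, dim)
    tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    # naive string match to locate sub-tokens of the target word
    idx = [i for i, t in enumerate(tokens) if target.lower() in t.lower().lstrip("▁#")]
    return hidden[idx].mean(dim=0) if idx else hidden.mean(dim=0)

def sense_centroids(examples: dict[str, list[tuple[str, str]]], layer: int) -> dict:
    """examples maps a sense id to (sentence, target word) pairs."""
    return {
        sense: torch.stack([target_embedding(s, w, layer) for s, w in pairs]).mean(dim=0)
        for sense, pairs in examples.items()
    }

def predict_sense(sentence: str, target: str, centroids: dict, layer: int) -> str:
    """Assign the sense whose centroid is most cosine-similar to the target embedding."""
    emb = target_embedding(sentence, target, layer)
    sims = {s: torch.cosine_similarity(emb, c, dim=0).item() for s, c in centroids.items()}
    return max(sims, key=sims.get)
```

A layer sweep would then simply repeat centroid construction and prediction for each hidden-state index and compare accuracies across layers.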