10 years

Solutions
Large Language Models
Empower your content with human-like text generation across languages.
Minerva AI
Agentic AI
Retrieval-Augmented Generation (RAG)
Model Fine Tuning
AI Evaluation
More about LLM
Text Analytics
Unlock actionable insights from unstructured text for strategic decision-making.
Named Entity Recognition
Keyword Extraction
Relation Extraction
Entity Linking
Sentiment Analysis
Word Sense Disambiguation
More about Text Analytics
Knowledge Graphs
Search, visualize and explore data connections for deep insights and complex queries.
Next-Generation KG
Rich Semantic Information
Custom Enterprise KG Development
More about Knowledge Graphs
Semantic Search
Refine searches with context-aware results that understand user intent multilingually.
Advanced Query Understanding
Contextual Results Ranking
Customizable Search Framework
Semantic Annotation
More about Semantic Search
Minerva AI
Agentic AI
Retrieval-Augmented Generation (RAG)
Model Fine Tuning
AI Evaluation
Named Entity Recognition
Keyword Extraction
Relation Extraction
Entity Linking
Sentiment Analysis
Word Sense Disambiguation
Next-Generation KG
Rich Semantic Information
Custom Enterprise KG Development
Advanced Query Understanding
Contextual Results Ranking
Customizable Search Framework
Semantic Annotation
More about LLM
More about Text Analytics
More about Knowledge Graphs
More about Semantic Search
Solutions
Products
Babelscape Vera
LLM-powered, grounded fact-checking
WordAtlas
the next-generation multilingual knowledge graph
Comprehendo
disambiguate and semantically tag text in hundreds of languages
Extraggo
extract knowledge from text and analyze key concepts, entities and domains
Emotionary
the next-generation language abuse and emotion detection AI
NLP Pipeline
large-scale, parallel, multilingual and modularized
Semantic Paths
A multilingual semantic search engine and information monitor
LexTag
Create your semantically-annotated datasets with ease
myKnowledgeGraph
organize your enterprise documents into a structured knowledge base
TraDeInterpret
Revolutionize the way you work with trademark denominations
Products
Research
About
News
API & Demos
Explore Babelscape's API Console

Register to get a free API key or purchase one to access our powerful multilingual AI solutions. Test live demos, experience entity linking, semantic search, and more - unlocking the full potential of AI-powered text understanding for your industry.

APIs Console
Discover Babelscape's techology in action

See firsthand how our products can transform your business by providing advanced multilingual understanding, entity linking, semantic search, and more. Explore the demos below and unlock the potential of AI-driven solutions tailored to your needs.

Live demos
API & Demos
Contact us Contact us

More about Text Analytics

More about Knowledge Graphs

More about Semantic Search

Babelscape Vera

LLM-powered, grounded fact-checking

the next-generation multilingual knowledge graph

disambiguate and semantically tag text in hundreds of languages

extract knowledge from text and analyze key concepts, entities and domains

the next-generation language abuse and emotion detection AI

large-scale, parallel, multilingual and modularized

A multilingual semantic search engine and information monitor

Create your semantically-annotated datasets with ease

myKnowledgeGraph

organize your enterprise documents into a structured knowledge base

Revolutionize the way you work with trademark denominations

Explore Babelscape's API Console

Register to get a free API key or purchase one to access our powerful multilingual AI solutions. Test live demos, experience entity linking, semantic search, and more - unlocking the full potential of AI-powered text understanding for your industry.

Discover Babelscape's techology in action

See firsthand how our products can transform your business by providing advanced multilingual understanding, entity linking, semantic search, and more. Explore the demos below and unlock the potential of AI-driven solutions tailored to your needs.

Header shape illustration 1

Header shape illustration 2

Back

FENICE: Factuality Evaluation of summarization based on NLI and Claim Extraction

Alessandro Scirè, Karim Ghonim, Roberto Navigli

Abstract

Recent advancements in text summarization, particularly with the advent of Large Language Models (LLMs), have shown remarkable performance. However, a notable challenge persists as a substantial number of automatically-generated summaries exhibit factual inconsistencies, such as hallucinations. In response to this issue, various approaches for the evaluation of consistency for summarization have emerged. Yet, these newly-introduced metrics face several limitations, including lack of interpretability, focus on short document summaries (e.g., news articles), and computational impracticality, especially for LLM-based metrics. To address these shortcomings, we propose Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction (FENICE), a more interpretable and efficient factuality-oriented metric. FENICE leverages an NLI-based alignment between information in the source document and a set of atomic facts, referred to as claims, extracted from the summary. Our metric sets a new state of the art on AGGREFACT, the de-facto benchmark for factuality evaluation. Moreover, we extend our evaluation to a more challenging setting by conducting a human annotation process of long-form summarization. In the hope of fostering research in summarization factuality evaluation, we release the code of our metric and our factuality annotations of long-form summarization at https://github.com/Babelscape/FENICE.

https://aclanthology.org/2024.findings-acl.841.pdf
Alessandro Scirè, Karim Ghonim, Roberto Navigli. 2024. FENICE: Factuality Evaluation of summarization based on NLI and Claim Extraction. Findings of the Association for Computational Linguistics ACL 2024, pages 14148–14161, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.