Abstract
Process Reward Models (PRMs) have emerged as a powerful tool for providing step-level feedback when evaluating the reasoning of
Large Language Models (LLMs), which frequently produce chains of thought (CoTs) containing errors even when the final answer is
correct. However, existing PRM datasets remain expensive to construct, prone to annotation errors, and predominantly limited to
the mathematical domain. This work introduces a novel and scalable approach to PRM dataset generation based on solving planning
problems expressed in the Planning Domain Definition Language (PDDL). Using this method, we generate a corpus of approximately
one million reasoning steps across various PDDL domains and use it to train PRMs. Experimental results show that augmenting
widely-used PRM training datasets with PDDL-derived data yields substantial improvements in both mathematical and non-mathematical
reasoning, as demonstrated across multiple benchmarks. These findings indicate that planning problems constitute a scalable and
effective resource for generating robust, precise, and fine-grained PRM training data, going beyond the classical mathematical
sources that dominate the field.