Positive Characteristic Sets for Relational Pattern Languages

📅 2025-11-15

📈 Citations: 0

✨ Influential: 0

career value

190K/year

🤖 AI Summary

This paper addresses the fundamental challenge in formal language learning of identifying relational pattern languages (L) solely from positive examples—strings belonging to (L)—without negative evidence. We introduce, for the first time, the notion of a *positive characteristic set*: a minimal set of positive examples that uniquely identifies (L) within a given language class. Methodologically, we construct polynomial-size positive characteristic sets by leveraging formal representations of relational pattern languages and analyzing their identifiability properties, thereby establishing necessary and sufficient conditions for their existence. Our main contribution is twofold: (i) it overcomes the classical requirement for both positive and negative examples, enabling efficient and exact identification of (L) from positive data alone; and (ii) it systematically introduces and develops the theory of positive characteristic sets for relational pattern languages, providing both a theoretical foundation and a constructive framework for positive-example-driven language identification.

Technology Category

Application Category

📝 Abstract

In the context of learning formal languages, data about an unknown target language L is given in terms of a set of (word,label) pairs, where a binary label indicates whether or not the given word belongs to L. A (polynomial-size) characteristic set for L, with respect to a reference class L of languages, is a set of such pairs that satisfies certain conditions allowing a learning algorithm to (efficiently) identify L within L. In this paper, we introduce the notion of positive characteristic set, referring to characteristic sets of only positive examples. These are of importance in the context of learning from positive examples only. We study this notion for classes of relational pattern languages, which are of relevance to various applications in string processing.

Problem

Research questions and friction points this paper is trying to address.

Defining positive characteristic sets for learning from only positive examples

Studying characteristic sets for relational pattern language classes

Enabling efficient language identification using positive examples only

Innovation

Methods, ideas, or system contributions that make the work stand out.

Positive characteristic sets for relational patterns

Learning from positive examples only

Efficient identification with polynomial-size sets

🔎 Similar Papers

AMR-RE: Abstract Meaning Representations for Retrieval-Based In-Context Learning in Relation Extraction