Towards Universal Neural Inference

📅 2025-08-12

📈 Citations: 0

✨ Influential: 0

career value

159K/year

🤖 AI Summary

Real-world heterogeneous structured data lack unified schemas, exhibit semantic inconsistencies, and possess arbitrary feature orderings, severely hindering cross-dataset generalization. To address this, we propose ASPIRE—the first general-purpose neural reasoning model for open-domain heterogeneous tabular data. Its core innovation lies in a permutation-invariant set-based Transformer architecture, augmented by semantic anchoring and multi-source semantic grounding (natural language descriptions, metadata, and contextual examples) to enable cross-table semantic alignment and dynamic reasoning. ASPIRE is the first to support zero-shot task transfer without fine-tuning and jointly optimizes prediction accuracy with budget-constrained active feature acquisition at inference time. Experiments across multiple benchmarks demonstrate that ASPIRE significantly outperforms existing methods, robustly generalizes to unseen datasets, and effectively performs both semantic reasoning and target prediction.

Technology Category

Application Category

📝 Abstract

Real-world data often appears in diverse, disjoint forms -- with varying schemas, inconsistent semantics, and no fixed feature ordering -- making it challenging to build general-purpose models that can leverage information across datasets. We introduce ASPIRE, Arbitrary Set-based Permutation-Invariant Reasoning Engine, a Universal Neural Inference model for semantic reasoning and prediction over heterogeneous structured data. ASPIRE combines a permutation-invariant, set-based Transformer with a semantic grounding module that incorporates natural language descriptions, dataset metadata, and in-context examples to learn cross-dataset feature dependencies. This architecture allows ASPIRE to ingest arbitrary sets of feature--value pairs and support examples, align semantics across disjoint tables, and make predictions for any specified target. Once trained, ASPIRE generalizes to new inference tasks without additional tuning. In addition to delivering strong results across diverse benchmarks, ASPIRE naturally supports cost-aware active feature acquisition in an open-world setting, selecting informative features under test-time budget constraints for an arbitrary unseen dataset. These capabilities position ASPIRE as a step toward truly universal, semantics-aware inference over structured data.

Problem

Research questions and friction points this paper is trying to address.

Handling diverse disjoint data with varying schemas and semantics

Building general-purpose models for cross-dataset feature dependencies

Supporting cost-aware active feature acquisition under budget constraints

Innovation

Methods, ideas, or system contributions that make the work stand out.

Set-based Transformer for permutation-invariant reasoning

Semantic grounding module with natural language descriptions

Cost-aware active feature acquisition under budget constraints

🔎 Similar Papers

BrainWave: A Brain Signal Foundation Model for Clinical Applications