export.arxiv.org
CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions
View a PDF of the paper titled CATArena: Evaluation of LLM Agents through Iterative Tournament Compe ...View More
Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base
View a PDF of the paper titled Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Sc ...View More
The Denario project: Deep knowledge AI agents for scientific discovery
[Submitted on 30 Oct 2025] Authors:Francisco Villaescusa-Navarro, Boris Bolliet, Pablo Villanueva ...View More
Cognition Envelopes for Bounded AI Reasoning in Autonomous UAS Operations
[Submitted on 30 Oct 2025] View a PDF of the paper titled Cognition Envelopes for Bounded AI Rea ...View More
SUSTAINABLE Platform: Seamless Smart Farming Integration Towards Agronomy Automation
[Submitted on 30 Oct 2025] View a PDF of the paper titled SUSTAINABLE Platform: Seamless Smart F ...View More
Causal Masking on Spatial Data: An Information-Theoretic Case for Learning Spatial Datasets with Unimodal Language Models
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly ...View More
e1: Learning Adaptive Control of Reasoning Effort
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly ...View More
Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement
View a PDF of the paper titled Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Impro ...View More
CombiGraph-Vis: A Curated Multimodal Olympiad Benchmark for Discrete Mathematical Reasoning
[Submitted on 31 Oct 2025] View a PDF of the paper titled CombiGraph-Vis: A Curated Multimodal Oly ...View More
Glia: A Human-Inspired AI for Automated Systems Design and Optimization
View a PDF of the paper titled Glia: A Human-Inspired AI for Automated Systems Design and Optimizati ...View More
