March 2026

WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement

digitado ⋅ March 25, 2026

arXiv:2603.22352v1 Announce Type: new Abstract: Recent progress in reinforcement learning with verifiable rewards (RLVR) offers a practical path to self-improvement of language models, but existing methods face a key trade-off: endogenous self-play can drift over iterations, while corpus-grounded approaches rely on curated data environments. We present WIST, a Web-grounded Iterative Self-play Tree framework for domain-targeted reasoning improvement that learns directly from the open web without requiring any pre-arranged domain corpus. WIST incrementally expands a domain tree for exploration, […]

Session Risk Memory (SRM): Temporal Authorization for Deterministic Pre-Execution Safety Gates

digitado ⋅ March 25, 2026

arXiv:2603.22350v1 Announce Type: new Abstract: Deterministic pre-execution safety gates evaluate whether individual agent actions are compatible with their assigned roles. While effective at per-action authorization, these systems are structurally blind to distributed attacks that decompose harmful intent across multiple individually-compliant steps. This paper introduces Session Risk Memory (SRM), a lightweight deterministic module that extends stateless execution gates with trajectory-level authorization. SRM maintains a compact semantic centroid representing the evolving behavioral profile of an agent session and accumulates a […]

Personalized Federated Sequential Recommender

digitado ⋅ March 25, 2026

arXiv:2603.22349v1 Announce Type: new Abstract: In the domain of consumer electronics, personalized sequential recommendation has emerged as a central task. Current methodologies in this field are largely centered on modeling user behavior and have achieved notable performance. Nevertheless, the inherent quadratic computational complexity typical of most existing approaches often leads to inefficiencies that hinder real-time recommendation. Moreover, these methods face challenges in being effectively adapted to the personalized requirements of users across diverse scenarios. To tackle these issues, […]

COMPASS-Hedge: Learning Safely Without Knowing the World

digitado ⋅ March 25, 2026

arXiv:2603.22348v1 Announce Type: new Abstract: Online learning algorithms often face a fundamental trilemma: balancing regret guarantees between adversarial and stochastic settings and providing baseline safety against a fixed comparator. While existing methods excel in one or two of these regimes, they typically fail to unify all three without sacrificing optimal rates or requiring oracle access to problem-dependent parameters. In this work, we bridge this gap by introducing COMPASS-Hedge. Our algorithm is the first full-information method to simultaneously achieve: […]

Intelligence Inertia: Physical Principles and Applications

digitado ⋅ March 25, 2026

arXiv:2603.22347v1 Announce Type: new Abstract: While Landauer’s principle establishes the fundamental thermodynamic floor for information erasure and Fisher Information provides a metric for local curvature in parameter space, these classical frameworks function effectively only as approximations within regimes of sparse rule-constraints. They fail to explain the super-linear, and often explosive, computational and energy costs incurred when maintaining symbolic interpretability during the reconfiguration of advanced intelligent systems. This paper introduces the property of intelligence inertia and its underlying physical […]

First-Mover Bias in Gradient Boosting Explanations: Mechanism, Detection, and Resolution

digitado ⋅ March 25, 2026

arXiv:2603.22346v1 Announce Type: new Abstract: We isolate and empirically characterize first-mover bias — a path-dependent concentration of feature importance caused by sequential residual fitting in gradient boosting — as a specific mechanistic cause of the well-known instability of SHAP-based feature rankings under multicollinearity. When correlated features compete for early splits, gradient boosting creates a self-reinforcing advantage for whichever feature is selected first: subsequent trees inherit modified residuals that favor the incumbent, concentrating SHAP importance on an arbitrary feature […]

Dynamic Fusion-Aware Graph Convolutional Neural Network for Multimodal Emotion Recognition in Conversations

digitado ⋅ March 25, 2026

arXiv:2603.22345v1 Announce Type: new Abstract: Multimodal emotion recognition in conversations (MERC) aims to identify and understand the emotions expressed by speakers during utterance interaction from multiple modalities (e.g., text, audio, images, etc.). Existing studies have shown that GCN can improve the performance of MERC by modeling dependencies between speakers. However, existing methods usually use fixed parameters to process multimodal features for different emotion types, ignoring the dynamics of fusion between different modalities, which forces the model to balance […]

Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

digitado ⋅ March 25, 2026

arXiv:2603.22344v1 Announce Type: new Abstract: Large language model (LLM)-assisted literature retrieval may lead to erroneous references, but these errors have not been rigorously quantified. Therefore, we quantitatively assess errors in reference retrieval of widely used free-version LLM platforms and identify the factors associated with retrieval errors. We evaluated 2,000 references retrieved by 5 LLMs (Grok-2, ChatGPT GPT-4.1, Google Gemini Flash 2.5, Perplexity AI, and DeepSeek GPT-4) for 40 randomly selected original articles (10 per journal) published Jan. 2024 […]

Cloud-Edge Collaborative Large Models for Robust Photovoltaic Power Forecasting

digitado ⋅ March 25, 2026

arXiv:2603.22343v1 Announce Type: new Abstract: Photovoltaic (PV) power forecasting in edge-enabled grids requires balancing forecasting accuracy, robustness under weather-driven distribution shifts, and strict latency constraints. Local specialized models are efficient for routine conditions but often degrade under rare ramp events and unseen weather patterns, whereas always relying on cloud-side large models incurs substantial communication delay and cloud overhead. To address this challenge, we propose a risk-aware cloud-edge collaborative framework for latency-sensitive PV forecasting. The framework integrates a site-specific […]

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

digitado ⋅ March 25, 2026

arXiv:2603.22341v1 Announce Type: new Abstract: While prior red-teaming efforts have focused on eliciting harmful text outputs from large language models (LLMs), such approaches fail to capture agent-specific vulnerabilities that emerge through multi-step tool execution, particularly in rapidly growing ecosystems such as the Model Context Protocol (MCP). To address this gap, we propose a trajectory-aware evolutionary search method, T-MAP, which leverages execution trajectories to guide the discovery of adversarial prompts. Our approach enables the automatic generation of attacks that […]
