digitado

About digitado

https://www.digitado.com.br


Procedural Knowledge at Scale Improves Reasoning

digitado ⋅ 3 April 2026

arXiv:2604.01348v1 Announce Type: new Abstract: Test-time scaling has emerged as an effective way to improve language models on challenging reasoning tasks. However, most existing methods treat each problem in isolation and do not systematically reuse knowledge from prior reasoning trajectories. In particular, they underutilize procedural knowledge: how to reframe a problem, choose an approach, and verify or backtrack when needed. We introduce Reasoning Memory, a retrieval-augmented generation (RAG) framework for reasoning models that explicitly retrieves and reuses procedural […]


technocracy

Safety, Security, and Cognitive Risks in World Models

digitado ⋅ 3 April 2026

arXiv:2604.01346v1 Announce Type: new Abstract: World models — learned internal simulators of environment dynamics — are rapidly becoming foundational to autonomous decision-making in robotics, autonomous vehicles, and agentic AI. Yet this predictive power introduces a distinctive set of safety, security, and cognitive risks. Adversaries can corrupt training data, poison latent representations, and exploit compounding rollout errors to cause catastrophic failures in safety-critical deployments. World model-equipped agents are more capable of goal misgeneralisation, deceptive alignment, and reward hacking precisely […]


technocracy

Malliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement Learning

digitado ⋅ 3 April 2026

arXiv:2604.01345v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) recovers the loss function of a forward learner from its observed responses. Adaptive IRL aims to reconstruct the loss function of a forward learner by passively observing its gradients as it performs reinforcement learning (RL). This paper proposes a novel passive Langevin-based algorithm that achieves adaptive IRL. The key difficulty in adaptive IRL is that the required gradients in the passive algorithm are counterfactual, that is, they are conditioned […]


technocracy

IDEA2: Expert-in-the-loop competency question elicitation for collaborative ontology engineering

digitado ⋅ 3 April 2026

arXiv:2604.01344v1 Announce Type: new Abstract: Competency question (CQ) elicitation represents a critical but resource-intensive bottleneck in ontology engineering. This foundational phase is often hampered by the communication gap between domain experts, who possess the necessary knowledge, and ontology engineers, who formalise it. This paper introduces IDEA2, a novel, semi-automated workflow that integrates Large Language Models (LLMs) within a collaborative, expert-in-the-loop process to address this challenge. The methodology is characterised by a core iterative loop: an initial LLM-based extraction […]


technocracy

Perceptual misalignment of texture representations in convolutional neural networks

digitado ⋅ 3 April 2026

arXiv:2604.01341v1 Announce Type: new Abstract: Mathematical modeling of visual textures traces back to Julesz’s intuition that texture perception in humans is based on local correlations between image features. An influential approach for texture analysis and generation generalizes this notion to linear correlations between the nonlinear features computed by convolutional neural networks (CNNs), compiled into Gram matrices. Given that CNNs are often used as models for the visual system, it is natural to ask whether such “texture representations” spontaneously […]


technocracy

A High Voltage Test System Meeting Requirements Under Normal and All Single Contingencies Conditions of Peak, Dominant, and Light Loadings for Transmission Expansion Planning Studies (TEP) and TEP Case Studies

digitado ⋅ 3 April 2026

arXiv:2604.01338v1 Announce Type: new Abstract: This paper presents a high-voltage test system designed specifically for transmission expansion planning (TEP) and explores multiple TEP studies using this test system. The network incorporates long transmission lines, lines are accurately modeled, and line parameters are calculated using the equivalent π circuit model for long transmission lines to account for the distributed nature of line parameters. The paper provides detailed load flow analyses for both normal and all contingency conditions for three […]


technocracy

SECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous Driving

digitado ⋅ 3 April 2026

arXiv:2604.01337v1 Announce Type: new Abstract: While deep learning has significantly advanced accident anticipation, the robustness of these safety-critical systems against real-world perturbations remains a major challenge. We reveal that state-of-the-art models like CRASH, despite their high performance, exhibit significant instability in predictions and latent representations when faced with minor input perturbations, posing serious reliability risks. To address this, we introduce SECURE – Stable Early Collision Understanding Robust Embeddings, a framework that formally defines and enforces model robustness. SECURE […]


technocracy

Bias Inheritance in Neural-Symbolic Discovery of Constitutive Closures Under Function-Class Mismatch

digitado ⋅ 3 April 2026

arXiv:2604.01335v1 Announce Type: new Abstract: We investigate the data-driven discovery of constitutive closures in nonlinear reaction-diffusion systems with known governing PDE structures. Our objective is to robustly recover diffusion and reaction laws from spatiotemporal observations while avoiding the common pitfall where low residuals or short-horizon predictions are conflated with physical recovery. We propose a three-stage neural-symbolic framework: (1) learning numerical surrogates under physical constraints using a noise-robust weak-form-driven objective; (2) compressing these surrogates into restricted interpretable symbolic families […]


technocracy

Disclosure or Marketing? Analyzing the Efficacy of Vendor Self-reports for Vetting Public-sector AI

digitado ⋅ 3 April 2026

arXiv:2604.01332v1 Announce Type: new Abstract: Documentation-based disclosure has become a central governance strategy for responsible AI, particularly in public-sector procurement. Tools such as model cards, datasheets, and AI FactSheets are increasingly expected to support accountability, risk assessment, and informed decision-making across organizational boundaries. Yet there is limited empirical evidence about how these artifacts are produced, interpreted, and used in practice. In this paper, we present a qualitative study of the GovAI Coalition FactSheet, a widely adopted transparency document […]


technocracy

Evolutionary Multi-Objective Fusion of Deepfake Speech Detectors

digitado ⋅ 3 April 2026

arXiv:2604.01330v1 Announce Type: new Abstract: While deepfake speech detectors built on large self-supervised learning (SSL) models achieve high accuracy, employing standard ensemble fusion to further enhance robustness often results in oversized systems with diminishing returns. To address this, we propose an evolutionary multi-objective score fusion framework that jointly minimizes detection error and system complexity. We explore two encodings optimized by NSGA-II: binary-coded detector selection for score averaging and a real-valued scheme that optimizes detector weights for a weighted […]
