digitado

About digitado

https://www.digitado.com.br

Posts by :

Open-Domain Safety Policy Construction

digitado ⋅ 3 de April de 2026

arXiv:2604.01354v1 Announce Type: new Abstract: Moderation layers are increasingly a core component of many products built on user- or model-generated content. However, drafting and maintaining domain-specific safety policies remains costly. We present Deep Policy Research (DPR), a minimal agentic system that drafts a full content moderation policy based on only human-written seed domain information. DPR uses a single web search tool and lightweight scaffolding to iteratively propose search queries, distill diverse web sources into policy rules, and organize […]

Ver mais

Like 0

Liked Liked

technocracy

Open-loop POMDP Simplification and Safe Skipping of Replanning with Formal Performance Guarantees

digitado ⋅ 3 de April de 2026

arXiv:2604.01352v1 Announce Type: new Abstract: Partially Observable Markov Decision Processes (POMDPs) provide a principled mathematical framework for decision-making under uncertainty. However, the exact solution to POMDPs is computationally intractable. In this paper, we address the computational intractability by introducing a novel framework for adaptive open-loop simplification with formal performance guarantees. Our method adaptively interleaves open-loop and closed-loop planning via a topology-based belief tree, enabling a significant reduction in planning complexity. The key contribution lies in the derivation of […]

Ver mais

Like 0

Liked Liked

technocracy

No Attacker Needed: Unintentional Cross-User Contamination in Shared-State LLM Agents

digitado ⋅ 3 de April de 2026

arXiv:2604.01350v1 Announce Type: new Abstract: LLM-based agents increasingly operate across repeated sessions, maintaining task states to ensure continuity. In many deployments, a single agent serves multiple users within a team or organization, reusing a shared knowledge layer across user identities. This shared persistence expands the failure surface: information that is locally valid for one user can silently degrade another user’s outcome when the agent reapplies it without regard for scope. We refer to this failure mode as unintentional […]

Ver mais

Like 0

Liked Liked

technocracy

PI-JEPA: Label-Free Surrogate Pretraining for Coupled Multiphysics Simulation via Operator-Split Latent Prediction

digitado ⋅ 3 de April de 2026

arXiv:2604.01349v1 Announce Type: new Abstract: Reservoir simulation workflows face a fundamental data asymmetry: input parameter fields (geostatistical permeability realizations, porosity distributions) are free to generate in arbitrary quantities, yet existing neural operator surrogates require large corpora of expensive labeled simulation trajectories and cannot exploit this unlabeled structure. We introduce textbf{PI-JEPA} (Physics-Informed Joint Embedding Predictive Architecture), a surrogate pretraining framework that trains emph{without any completed PDE solves}, using masked latent prediction on unlabeled parameter fields under per-sub-operator PDE residual […]

Ver mais

Like 0

Liked Liked

technocracy

Procedural Knowledge at Scale Improves Reasoning

digitado ⋅ 3 de April de 2026

arXiv:2604.01348v1 Announce Type: new Abstract: Test-time scaling has emerged as an effective way to improve language models on challenging reasoning tasks. However, most existing methods treat each problem in isolation and do not systematically reuse knowledge from prior reasoning trajectories. In particular, they underutilize procedural knowledge: how to reframe a problem, choose an approach, and verify or backtrack when needed. We introduce Reasoning Memory, a retrieval-augmented generation (RAG) framework for reasoning models that explicitly retrieves and reuses procedural […]

Ver mais

Like 0

Liked Liked

technocracy

Safety, Security, and Cognitive Risks in World Models

digitado ⋅ 3 de April de 2026

arXiv:2604.01346v1 Announce Type: new Abstract: World models — learned internal simulators of environment dynamics — are rapidly becoming foundational to autonomous decision-making in robotics, autonomous vehicles, and agentic AI. Yet this predictive power introduces a distinctive set of safety, security, and cognitive risks. Adversaries can corrupt training data, poison latent representations, and exploit compounding rollout errors to cause catastrophic failures in safety-critical deployments. World model-equipped agents are more capable of goal misgeneralisation, deceptive alignment, and reward hacking precisely […]

Ver mais

Like 0

Liked Liked

technocracy

Malliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement Learning

digitado ⋅ 3 de April de 2026

arXiv:2604.01345v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) recovers the loss function of a forward learner from its observed responses adaptive IRL aims to reconstruct the loss function of a forward learner by passively observing its gradients as it performs reinforcement learning (RL). This paper proposes a novel passive Langevin-based algorithm that achieves adaptive IRL. The key difficulty in adaptive IRL is that the required gradients in the passive algorithm are counterfactual, that is, they are conditioned […]

Ver mais

Like 0

Liked Liked

technocracy

IDEA2: Expert-in-the-loop competency question elicitation for collaborative ontology engineering

digitado ⋅ 3 de April de 2026

arXiv:2604.01344v1 Announce Type: new Abstract: Competency question (CQ) elicitation represents a critical but resource-intensive bottleneck in ontology engineering. This foundational phase is often hampered by the communication gap between domain experts, who possess the necessary knowledge, and ontology engineers, who formalise it. This paper introduces IDEA2, a novel, semi-automated workflow that integrates Large Language Models (LLMs) within a collaborative, expert-in-the-loop process to address this challenge. The methodology is characterised by a core iterative loop: an initial LLM-based extraction […]

Ver mais

Like 0

Liked Liked

technocracy

Perceptual misalignment of texture representations in convolutional neural networks

digitado ⋅ 3 de April de 2026

arXiv:2604.01341v1 Announce Type: new Abstract: Mathematical modeling of visual textures traces back to Julesz’s intuition that texture perception in humans is based on local correlations between image features. An influential approach for texture analysis and generation generalizes this notion to linear correlations between the nonlinear features computed by convolutional neural networks (CNNs), compiled into Gram matrices. Given that CNNs are often used as models for the visual system, it is natural to ask whether such “texture representations” spontaneously […]

Ver mais

Like 0

Liked Liked

technocracy

A High Voltage Test System Meeting Requirements Under Normal and All Single Contingencies Conditions of Peak, Dominant, and Light Loadings for Transmission Expansion Planning Studies (TEP) and TEP Case Studies

digitado ⋅ 3 de April de 2026

arXiv:2604.01338v1 Announce Type: new Abstract: This paper presents a high-voltage test system designed specifically for transmission expansion planning (TEP) and explores multiple TEP studies using this test system. The network incorporates long transmission lines, lines are accurately modeled, and line parameters are calculated using the equivalent {pi} circuit model for long transmission lines to account for the distributed nature of line parameters. The paper provides detailed load flow analyses for both normal and all contingency conditions for three […]

Ver mais

Like 0

Liked Liked