digitado

About digitado

https://www.digitado.com.br

On the Exploitability of FTRL Dynamics

digitado ⋅ April 8, 2026

arXiv:2604.05129v1 Announce Type: new Abstract: In this paper we investigate the exploitability of a Follow-the-Regularized-Leader (FTRL) learner with constant step size $\eta$ in $n \times m$ two-player zero-sum games played over $T$ rounds against a clairvoyant optimizer. In contrast with prior analysis, we show that exploitability is an inherent feature of the FTRL family, rather than an artifact of specific instantiations. First, for fixed optimizer, we establish a sweeping law of order $\Omega(N/\eta)$, proving that exploitation scales to the […]

technocracy

Offline RL for Adaptive Policy Retrieval in Prior Authorization

digitado ⋅ April 8, 2026

arXiv:2604.05125v1 Announce Type: new Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing retrieval-augmented systems rely on static top-$K$ strategies with fixed numbers of retrieved sections. Such fixed retrieval can be inefficient and gather irrelevant or insufficient information. We model policy retrieval for PA as a sequential decision-making problem, formulating adaptive retrieval as a Markov Decision Process (MDP). In our system, an agent iteratively selects policy chunks from a top-$K$ candidate set or […]

technocracy

Designing Digital Humans with Ambient Intelligence

digitado ⋅ April 8, 2026

arXiv:2604.05120v1 Announce Type: new Abstract: Digital humans are lifelike virtual agents capable of natural conversation and are increasingly deployed in domains like retail and finance. However, most current digital humans operate in isolation from their surroundings and lack contextual awareness beyond the dialogue itself. We address this limitation by integrating ambient intelligence (AmI) – i.e., environmental sensors, IoT data, and contextual modeling – with digital human systems. This integration enables situational awareness of the user’s environment, anticipatory and […]

technocracy

Governance-Aware Agent Telemetry for Closed-Loop Enforcement in Multi-Agent AI Systems

digitado ⋅ April 8, 2026

arXiv:2604.05119v1 Announce Type: new Abstract: Enterprise multi-agent AI systems produce thousands of inter-agent interactions per hour, yet existing observability tools capture these dependencies without enforcing anything. OpenTelemetry and Langfuse collect telemetry but treat governance as a downstream analytics concern, not a real-time enforcement target. The result is an “observe-but-do-not-act” gap where policy violations are detected only after damage is done. We present Governance-Aware Agent Telemetry (GAAT), a reference architecture that closes the loop between telemetry collection and automated […]

technocracy

Watch Before You Answer: Learning from Visually Grounded Post-Training

digitado ⋅ April 8, 2026

arXiv:2604.05117v1 Announce Type: new Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual cues. However, despite rapid progress in multimodal modeling, video understanding performance still lags behind text-based reasoning. In this work, we find that progress is even worse than previously assumed: commonly reported long video understanding benchmarks contain 40-60% of questions that can be answered using text cues alone. Furthermore, we find that these issues are also pervasive in widely used […]

technocracy

Uncertainty-Guided Latent Diagnostic Trajectory Learning for Sequential Clinical Diagnosis

digitado ⋅ April 8, 2026

arXiv:2604.05116v1 Announce Type: new Abstract: Clinical diagnosis requires sequential evidence acquisition under uncertainty. However, most Large Language Model (LLM) based diagnostic systems assume fully observed patient information and therefore do not explicitly model how clinical evidence should be sequentially acquired over time. Even when diagnosis is formulated as a sequential decision process, it is still challenging to learn effective diagnostic trajectories. This is because the space of possible evidence-acquisition paths is relatively large, while clinical datasets rarely provide […]

technocracy

Probabilistic Tree Inference Enabled by FDSOI Ferroelectric FETs

digitado ⋅ April 8, 2026

arXiv:2604.05115v1 Announce Type: new Abstract: Artificial intelligence applications in autonomous driving, medical diagnostics, and financial systems increasingly demand machine learning models that can provide robust uncertainty quantification, interpretability, and noise resilience. Bayesian decision trees (BDTs) are attractive for these tasks because they combine probabilistic reasoning, interpretable decision-making, and robustness to noise. However, existing hardware implementations of BDTs based on CPUs and GPUs are limited by memory bottlenecks and irregular processing patterns, while multi-platform solutions exploiting analog content-addressable memory […]

technocracy

$\pi^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models

digitado ⋅ April 8, 2026

arXiv:2604.05114v1 Announce Type: new Abstract: We study a pipeline that curates reasoning data from initial structured data for improving long-context reasoning in large language models (LLMs). Our approach, $\pi^2$, constructs high-quality reasoning data through rigorous QA curation: 1) extracting and expanding tables from Wikipedia, 2) from the collected tables and relevant context, generating realistic and multi-hop analytical reasoning questions whose answers are automatically determined and verified through dual-path code execution, and 3) back-translating step-by-step structured reasoning traces as […]

technocracy

CRAB: Codebook Rebalancing for Bias Mitigation in Generative Recommendation

digitado ⋅ April 8, 2026

arXiv:2604.05113v1 Announce Type: new Abstract: Generative recommendation (GeneRec) has introduced a new paradigm that represents items as discrete semantic tokens and predicts items in a generative manner. Despite its strong performance across multiple recommendation tasks, existing GeneRec approaches still suffer from severe popularity bias and may even exacerbate it. In this work, we conduct a comprehensive empirical analysis to uncover the root causes of this phenomenon, yielding two core insights: 1) imbalanced tokenization inherits and can further amplify […]

technocracy

Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner

digitado ⋅ April 8, 2026

arXiv:2604.05112v1 Announce Type: new Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training generalist agents that can acquire new tasks directly at inference. Algorithm Distillation (AD) pioneered this paradigm and was subsequently scaled to multi-domain settings, although its ability to generalize to unseen tasks remained limited. The Decision Pre-Trained Transformer (DPT) was introduced as an alternative, showing stronger in-context reinforcement learning abilities in simplified domains, but its scalability had not been established. In […]
