digitado

About digitado

https://www.digitado.com.br

Posts by :

Residuals-based Offline Reinforcement Learning

digitado ⋅ 3 de April de 2026

arXiv:2604.01378v1 Announce Type: new Abstract: Offline reinforcement learning (RL) has received increasing attention for learning policies from previously collected data without interaction with the real environment, which is particularly important in high-stakes applications. While a growing body of work has developed offline RL algorithms, these methods often rely on restrictive assumptions about data coverage and suffer from distribution shift. In this paper, we propose a residuals-based offline RL framework for general state and action spaces. Specifically, we define […]

Ver mais

Like 0

Liked Liked

technocracy

RIFT: A RubrIc Failure Mode Taxonomy and Automated Diagnostics

digitado ⋅ 3 de April de 2026

arXiv:2604.01375v1 Announce Type: new Abstract: Rubric-based evaluation is widely used in LLM benchmarks and training pipelines for open-ended, less verifiable tasks. While prior work has demonstrated the effectiveness of rubrics using downstream signals such as reinforcement learning outcomes, there remains no principled way to diagnose rubric quality issues from such aggregated or downstream signals alone. To address this gap, we introduce RIFT: RubrIc Failure mode Taxonomy, a taxonomy for systematically characterizing failure modes in rubric composition and design. […]

Ver mais

Like 0

Liked Liked

technocracy

Dissipativity Analysis of Nonlinear Systems: A Linear–Radial Kernel-based Approach

digitado ⋅ 3 de April de 2026

arXiv:2604.01373v1 Announce Type: new Abstract: Estimating the dissipativity of nonlinear systems from empirical data is useful for the analysis and control of nonlinear systems, especially when an accurate model is unavailable. Based on a Koopman operator model of the nonlinear system on a reproducing kernel Hilbert space (RKHS), the storage function and supply rate functions are expressed as kernel quadratic forms, through which the dissipative inequality is expressed as a linear operator inequality. The RKHS is specified by […]

Ver mais

Like 0

Liked Liked

technocracy

Temporal Logic Control of Nonlinear Stochastic Systems with Online Performance Optimization

digitado ⋅ 3 de April de 2026

arXiv:2604.01372v1 Announce Type: new Abstract: The deployment of autonomous systems in safety-critical environments requires control policies that guarantee satisfaction of complex control specifications. These systems are commonly modeled as nonlinear discrete-time stochastic systems. A~popular approach to computing a policy that provably satisfies a complex control specification is to construct a finite-state abstraction, often represented as a Markov decision process (MDP) with intervals of transition probabilities, i.e., an interval MDP (IMDP). However, existing abstraction techniques compute a emph{single policy}, […]

Ver mais

Like 0

Liked Liked

technocracy

AffordTissue: Dense Affordance Prediction for Tool-Action Specific Tissue Interaction

digitado ⋅ 3 de April de 2026

arXiv:2604.01371v1 Announce Type: new Abstract: Surgical action automation has progressed rapidly toward achieving surgeon-like dexterous control, driven primarily by advances in learning from demonstration and vision-language-action models. While these have demonstrated success in table-top experiments, translating them to clinical deployment remains challenging: current methods offer limited predictability on where instruments will interact on tissue surfaces and lack explicit conditioning inputs to enforce tool-action-specific safe interaction regions. Addressing this gap, we introduce AffordTissue, a multimodal framework for predicting tool-action […]

Ver mais

Like 0

Liked Liked

technocracy

“The System Will Choose Security Over Humanity Every Time”: Understanding Security and Privacy for U.S. Incarcerated Users

digitado ⋅ 3 de April de 2026

arXiv:2604.01370v1 Announce Type: new Abstract: Digital devices like tablets, media players, and kiosks are increasingly deployed in U.S. prisons. These technologies can enable incarcerated people to access education, communicate with loved ones, and develop vital reentry skills. However, they can also introduce new privacy and security risks for incarcerated people who have little agency over their usage and contracts, and are currently carved out of many consumer protection safeguards. To investigate these issues, we conducted focus groups and […]

Ver mais

Like 0

Liked Liked

technocracy

Approximating the Permanent of a Random Matrix with Polynomially Small Mean: Zeros and Universality

digitado ⋅ 3 de April de 2026

arXiv:2604.01367v1 Announce Type: new Abstract: We study algorithms for approximating the permanent of a random matrix when the entries are slightly biased away from zero. This question is motivated by the goal of understanding the classical complexity of linear optics and emph{boson sampling} (Aaronson and Arkhipov ’11; Eldar and Mehraban ’17). Barvinok’s interpolation method enables efficient approximation of the permanent, provided one can establish a sufficiently large zero-free region for the polynomial $mathrm{per}(zJ + W)$, where $J$ is […]

Ver mais

Like 0

Liked Liked

technocracy

CogBias: Measuring and Mitigating Cognitive Bias in Large Language Models

digitado ⋅ 3 de April de 2026

arXiv:2604.01366v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in high-stakes decision-making contexts. While prior work has shown that LLMs exhibit cognitive biases behaviorally, whether these biases correspond to identifiable internal representations and can be mitigated through targeted intervention remains an open question. We define LLM cognitive bias as systematic, reproducible deviations from correct answers in tasks with computable ground-truth baselines, and introduce LLM CogBias, a benchmark organized around four families of cognitive biases: Judgment, […]

Ver mais

Like 0

Liked Liked

technocracy

Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks

digitado ⋅ 3 de April de 2026

arXiv:2604.01363v1 Announce Type: new Abstract: We propose that AI automation is a continuum between: (i) crashing waves where AI capabilities surge abruptly over small sets of tasks, and (ii) rising tides where the increase in AI capabilities is more continuous and broad-based. We test for these effects in preliminary evidence from an ongoing evaluation of AI capabilities across over 3,000 broad-based tasks derived from the U.S. Department of Labor O*NET categorization that are text-based and thus LLM-addressable. Based […]

Ver mais

Like 0

Liked Liked

technocracy

Multipath Channel Metrics and Detection in Vascular Molecular Communication: A Wireless-Inspired Perspective

digitado ⋅ 3 de April de 2026

arXiv:2604.01362v1 Announce Type: new Abstract: Motivated by classical communications engineering, early works in molecular communication (MC) largely adopted established modeling and signal processing concepts from wireless electromagnetic communication systems. In the context of the human cardiovascular system (CVS), MC channel models evolved from simple unbounded and single-duct environments mimicking individual blood vessels to complex vessel network (VN) topologies, generally at the expense of analytical tractability. Up until now, this has largely prohibited rigorous communication-theoretic analysis of large-scale VNs. […]

Ver mais

Like 0

Liked Liked