digitado – Page 291

Learning to Attack: A Bandit Approach to Adversarial Context Poisoning

digitado ⋅ 3 de March de 2026

arXiv:2603.00567v1 Announce Type: new Abstract: Neural contextual bandits are vulnerable to adversarial attacks, where subtle perturbations to rewards, actions, or contexts induce suboptimal decisions. We introduce AdvBandit, a black-box adaptive attack that formulates context poisoning as a continuous-armed bandit problem, enabling the attacker to jointly learn and exploit the victim’s evolving policy. The attacker requires no access to the victim’s internal parameters, reward function, or gradient information; instead, it constructs a surrogate model using a maximum-entropy inverse reinforcement […]

Ver mais

Like 0

Liked Liked

technocracy

Self-Sovereign Agent

digitado ⋅ 13 de April de 2026

arXiv:2604.08551v1 Announce Type: new Abstract: We investigate the emerging prospect of self-sovereign agents — AI systems that can economically sustain and extend their own operation without human involvement. Recent advances in large language models and agent frameworks have substantially expanded agents’ practical capabilities, pointing toward a potential shift from developer-controlled tools to more autonomous digital actors. We analyze the remaining technical barriers to such deployments and discuss the security, societal, and governance challenges that could arise if such […]

Ver mais

Like 0

Liked Liked

technocracy

TDPNavigator-Placer: Thermal- and Wirelength-Aware Chiplet Placement in 2.5D Systems Through Multi-Agent Reinforcement Learning

digitado ⋅ 13 de February de 2026

arXiv:2602.11187v1 Announce Type: new Abstract: The rapid growth of electronics has accelerated the adoption of 2.5D integrated circuits, where effective automated chiplet placement is essential as systems scale to larger and more heterogeneous chiplet assemblies. Existing placement methods typically focus on minimizing wirelength or transforming multi-objective optimization into a single objective through weighted sum, which limits their ability to handle competing design requirements. Wirelength reduction and thermal management are inherently conflicting objectives, making prior approaches inadequate for practical […]

Ver mais

Like 0

Liked Liked

technocracy

Discrete Causal Representation Learning

digitado ⋅ 27 de March de 2026

arXiv:2603.25017v1 Announce Type: cross Abstract: Causal representation learning seeks to uncover causal relationships among high-level latent variables from low-level, entangled, and noisy observations. Existing approaches often either rely on deep neural networks, which lack interpretability and formal guarantees, or impose restrictive assumptions like linearity, continuous-only observations, and strong structural priors. These limitations particularly challenge applications with a large number of discrete latent variables and mixed-type observations. To address these challenges, we propose discrete causal representation learning (DCRL), a […]

Ver mais

Like 0

Liked Liked

technocracy

Double Fairness Policy Learning: Integrating Action Fairness and Outcome Fairness in Decision-making

digitado ⋅ 27 de January de 2026

Fairness is a central pillar of trustworthy machine learning, especially in domains where accuracy- or profit-driven optimization is insufficient. While most fairness research focuses on supervised learning, fairness in policy learning remains less explored. Because policy learning is interventional, it induces two distinct fairness targets: action fairness (equitable action assignments) and outcome fairness (equitable downstream consequences). Crucially, equalizing actions does not generally equalize outcomes when groups face different constraints or respond differently to the same action. We propose […]

Ver mais

Like 0

Liked Liked

technocracy

Is DQN algorithm suitable for yugioh?

digitado ⋅ 20 de March de 2026

Hi I’m currently looking to use DQN to implement an ai that plays yugioh (a two player card game), but have had basically no experience with Ml. I don’t know if I am underestimating the complexity of this, given how complex yugioh is, but with how big the size of the state that needs to be fed in is, along with the number of actions that need to be mapped (possibly around 120 total possible moves, though obviously […]

Ver mais

Like 0

Liked Liked

technocracy

Microsoft vs Palantir: Two Paths to Enterprise Ontology (And Why Microsoft’s Bet on Semantic Contracts Changes the Game)

digitado ⋅ 6 de February de 2026

Author(s): Pankaj Kumar Originally published on Towards AI. A technical deep-dive into how Microsoft Fabric IQ actually implements ontology — and why it’s fundamentally different from Palantir’s approach 🚀 NEW: I just built OntoGuard in 48 hours — an ontology firewall for AI agents that prevents $4.6M mistakes. See the build story → Image caption not availableThe article discusses the contrasting approaches of Microsoft and Palantir in utilizing ontology within enterprise systems, emphasizing that Microsoft aims to empower […]

Ver mais

Like 0

Liked Liked

technocracy

Multi-label Instance-level Generalised Visual Grounding in Agriculture

digitado ⋅ 10 de March de 2026

arXiv:2603.06699v1 Announce Type: new Abstract: Understanding field imagery such as detecting plants and distinguishing individual crop and weed instances is a central challenge in precision agriculture. Despite progress in vision-language tasks like captioning and visual question answering, Visual Grounding (VG), localising language-referred objects, remains unexplored in agriculture. A key reason is the lack of suitable benchmark datasets for evaluating grounding models in field conditions, where many plants look highly similar, appear at multiple scales, and the referred target […]

Ver mais

Like 0

Liked Liked

technocracy

Amortizing Maximum Inner Product Search with Learned Support Functions

digitado ⋅ 9 de March de 2026

Maximum inner product search (MIPS) is a crucial subroutine in machine learning, requiring the identification of key vectors that align best with a given query. We propose amortized MIPS: a learning-based approach that trains neural networks to directly predict MIPS solutions, amortizing the computational cost of matching queries (drawn from a fixed distribution) to a fixed set of keys. Our key insight is that the MIPS value function, the maximal inner product between a query and keys, is […]

Ver mais

Like 0

Liked Liked

technocracy

Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models

digitado ⋅ 23 de February de 2026

arXiv:2602.17846v1 Announce Type: new Abstract: Diffusion models generate high-quality samples but can also memorize training data, raising serious privacy concerns. Understanding the mechanisms governing when memorization versus generalization occurs remains an active area of research. In particular, it is unclear where along the noise schedule memorization is induced, how data geometry influences it, and how phenomena at different noise scales interact. We introduce a geometric framework that partitions the noise schedule into three regimes based on the coverage […]

Ver mais

Like 0

Liked Liked