digitado

GUI-Eyes: Tool-Augmented Perception for Visual Grounding in GUI Agents

digitado ⋅ 16 de January de 2026

arXiv:2601.09770v1 Announce Type: new Abstract: Recent advances in vision-language models (VLMs) and reinforcement learning (RL) have driven progress in GUI automation. However, most existing methods rely on static, one-shot visual inputs and passive perception, lacking the ability to adaptively determine when, whether, and how to observe the interface. We present GUI-Eyes, a reinforcement learning framework for active visual perception in GUI tasks. To acquire more informative observations, the agent learns to make strategic decisions on both whether and […]

Ver mais

Like 0

Liked Liked

technocracy

Two Ways to Build a Skill Server for Your AI Agent

digitado ⋅ 13 de February de 2026

How I stopped fighting context window limits and started giving my agents deeper, richer tool knowledge — on demand. The Problem with Loading Everything at Startup If you’ve built AI agents with more than a handful of tools, you’ve probably noticed something: the more tools you register, the worse the agent gets at choosing the right one. This isn’t a model limitation. It’s an architecture problem. When you register a tool in any agent framework — LlamaIndex, LangChain, Microsoft Agent Framework, or through MCP — its name, […]

Ver mais

Like 0

Liked Liked

technocracy

Task-Uniform Convergence and Backward Transfer in Federated Domain-Incremental Learning with Partial Participation

digitado ⋅ 29 de January de 2026

Real-world federated systems seldom operate on static data: input distributions drift while privacy rules forbid raw-data sharing. We study this setting as Federated Domain-Incremental Learning (FDIL), where (i) clients are heterogeneous, (ii) tasks arrive sequentially with shifting domains, yet (iii) the label space remains fixed. Two theoretical pillars remain missing for FDIL under realistic deployment: a guarantee of backward knowledge transfer (BKT) and a convergence rate that holds across the sequence of all tasks with partial participation. We […]

Ver mais

Like 0

Liked Liked

technocracy

Phantom Transfer: Data-level Defences are Insufficient Against Data Poisoning

digitado ⋅ 6 de February de 2026

arXiv:2602.04899v1 Announce Type: new Abstract: We present a data poisoning attack — Phantom Transfer — with the property that, even if you know precisely how the poison was placed into an otherwise benign dataset, you cannot filter it out. We achieve this by modifying subliminal learning to work in real-world contexts and demonstrate that the attack works across models, including GPT-4.1. Indeed, even fully paraphrasing every sample in the dataset using a different model does not stop the […]

Ver mais

Like 0

Liked Liked

technocracy

Grables: Tabular Learning Beyond Independent Rows

digitado ⋅ 5 de February de 2026

arXiv:2602.03945v1 Announce Type: new Abstract: Tabular learning is still dominated by row-wise predictors that score each row independently, which fits i.i.d. benchmarks but fails on transactional, temporal, and relational tables where labels depend on other rows. We show that row-wise prediction rules out natural targets driven by global counts, overlaps, and relational patterns. To make “using structure” precise across architectures, we introduce grables: a modular interface that separates how a table is lifted to a graph (constructor) from […]

Ver mais

Like 0

Liked Liked

technocracy

Finetuning Large Language Models On A Single GPU Using Gradient Accumulation

digitado ⋅ 28 de March de 2023

Previously, I shared an article using multi-GPU training strategies to speed up the finetuning of large language models. Several of these strategies include mechanisms such as model or tensor sharding that distributes the model weights and computations across different devices to work around GPU memory limitations. However, many of us don’t have access to multi-GPU resources. So, this article illustrates a simple technique that works as a great workaround to train models with larger batch sizes when GPU […]

Ver mais

Like 0

Liked Liked

technocracy

Tighter bounds in the prime number theorem

digitado ⋅ 16 de January de 2026

The most elementary form of the prime number theorem says that π(x), the number of prime numbers less than x, is asymptotically equal to x / log(x). That’s true, but a more accurate result says π(x) is asymptotically equal to li(x) where Five years ago I wrote about a result that was new at the time, giving a bound on |π(x) − li(x)| for x > exp(2000). This morning I saw a result in a blog post by Terence […]

Ver mais

Like 0

Liked Liked

technocracy

Simple Network Graph Comparative Learning

digitado ⋅ 15 de January de 2026

The effectiveness of contrastive learning methods has been widely recognized in the field of graph learning, especially in contexts where graph data often lack labels or are difficult to label. However, the application of these methods to node classification tasks still faces a number of challenges. First, existing data enhancement techniques may lead to significant differences from the original view when generating new views, which may weaken the relevance of the view and affect the efficiency of model […]

Ver mais

Like 0

Liked Liked

technocracy

Policy4OOD: A Knowledge-Guided World Model for Policy Intervention Simulation against the Opioid Overdose Crisis

digitado ⋅ 16 de February de 2026

arXiv:2602.12373v1 Announce Type: new Abstract: The opioid epidemic remains one of the most severe public health crises in the United States, yet evaluating policy interventions before implementation is difficult: multiple policies interact within a dynamic system where targeting one risk pathway may inadvertently amplify another. We argue that effective opioid policy evaluation requires three capabilities — forecasting future outcomes under current policies, counterfactual reasoning about alternative past decisions, and optimization over candidate interventions — and propose to unify […]

Ver mais

Like 0

Liked Liked

technocracy

Radon–Wasserstein Gradient Flows for Interacting-Particle Sampling in High Dimensions

digitado ⋅ 9 de February de 2026

arXiv:2602.05227v2 Announce Type: replace Abstract: Gradient flows of the Kullback–Leibler (KL) divergence, such as the Fokker–Planck equation and Stein Variational Gradient Descent, evolve a distribution toward a target density known only up to a normalizing constant. We introduce new gradient flows of the KL divergence with a remarkable combination of properties: they admit accurate interacting-particle approximations in high dimensions, and the per-step cost scales linearly in both the number of particles and the dimension. These gradient flows are […]

Ver mais

Like 0

Liked Liked