digitado – Page 454

SafeClaw-R: Towards Safe and Secure Multi-Agent Personal Assistants

digitado ⋅ 1 de April de 2026

arXiv:2603.28807v1 Announce Type: new Abstract: LLM-based multi-agent systems (MASs) are transforming personal productivity by autonomously executing complex, cross-platform tasks. Frameworks such as OpenClaw demonstrate the potential of locally deployed agents integrated with personal data and services, but this autonomy introduces significant safety and security risks. Unintended actions from LLM reasoning failures can cause irreversible harm, while prompt injection attacks may exfiltrate credentials or compromise the system. Our analysis shows that 36.4% of OpenClaw’s built-in skills pose high or […]

Ver mais

Like 0

Liked Liked

technocracy

Privately Learning Decision Lists and a Differentially Private Winnow

digitado ⋅ 10 de February de 2026

arXiv:2602.07370v1 Announce Type: cross Abstract: We give new differentially private algorithms for the classic problems of learning decision lists and large-margin halfspaces in the PAC and online models. In the PAC model, we give a computationally efficient algorithm for learning decision lists with minimal sample overhead over the best non-private algorithms. In the online model, we give a private analog of the influential Winnow algorithm for learning halfspaces with mistake bound polylogarithmic in the dimension and inverse polynomial […]

Ver mais

Like 0

Liked Liked

technocracy

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

digitado ⋅ 10 de February de 2026

Recent advances in large language model (LLM) have empowered autonomous agents to perform complex tasks that require multi-turn interactions with tools and environments. However, scaling such agent training is limited by the lack of diverse and reliable environments. In this paper, we propose Agent World Model (AWM), a fully synthetic environment generation pipeline. Using this pipeline, we scale to 1,000 environments covering everyday scenarios, in which agents can interact with rich toolsets (35 tools per environment on average) […]

Ver mais

Like 0

Liked Liked

technocracy

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

digitado ⋅ 25 de March de 2026

arXiv:2603.22386v1 Announce Type: new Abstract: Large language model (LLM)-based systems are becoming increasingly popular for solving tasks by constructing executable workflows that interleave LLM calls, information retrieval, tool use, code execution, memory updates, and verification. This survey reviews recent methods for designing and optimizing such workflows, which we treat as agentic computation graphs (ACGs). We organize the literature based on when workflow structure is determined, where structure refers to which components or agents are present, how they depend […]

Ver mais

Like 0

Liked Liked

technocracy

Game-to-Real Gap: Quantifying the Effect of Model Misspecification in Network Games

digitado ⋅ 26 de January de 2026

arXiv:2601.16367v1 Announce Type: new Abstract: Game-theoretic models and solution concepts provide rigorous tools for predicting collective behavior in multi-agent systems. In practice, however, different agents may rely on different game-theoretic models to design their strategies. As a result, when these heterogeneous models interact, the realized outcome can deviate substantially from the outcome each agent expects based on its own local model. In this work, we introduce the game-to-real gap, a new metric that quantifies the impact of such […]

Ver mais

Like 0

Liked Liked

technocracy

Learning functional components of PDEs from data using neural networks

digitado ⋅ 13 de February de 2026

Partial differential equations often contain unknown functions that are difficult or impossible to measure directly, hampering our ability to derive predictions from the model. Workflows for recovering scalar PDE parameters from data are well studied: here we show how similar workflows can be used to recover functions from data. Specifically, we embed neural networks into the PDE and show how, as they are trained on data, they can approximate unknown functions with arbitrary accuracy. Using nonlocal aggregation-diffusion equations […]

Ver mais

Like 0

Liked Liked

technocracy

ESA considers righting the wrongs of Ariane 6 by turning it into a Franken-rocket

digitado ⋅ 10 de January de 2026

It took a while, but a consensus has emerged in Europe that the continent’s space industry needs to develop reusable rockets. How to do it and how much to spend on it remain unresolved questions. Much of the discourse around reusable rockets in Europe has focused on developing a brand new rocket that might eventually replace the Ariane 6, which debuted less than two years ago but still uses the use it and lose it model embraced by […]

Ver mais

Like 0

Liked Liked

technocracy

Reinforcement Learning: Supervised, Unsupervised, or Something Else? (When to Use Each)

digitado ⋅ 6 de January de 2026

By the end of this tutorial, you will clearly understand: Why RL looks similar to supervised learning—but behaves completely differently, Why unsupervised learning is closer philosophically, yet still not the right definition, When RL is the right tool, and when supervised is faster, cheaper, safer, and better, How cost, risk, and feedback shape the correct choice, How hybrid pipelines (Behavioral Cloning (BC) –> RL) work in the real world, How to test your problem using a simple decision […]

Ver mais

Like 0

Liked Liked

technocracy

Retro Rewind re-creates the glorious drudgery of working a ’90s video store

digitado ⋅ 13 de April de 2026

If you were working a retail job at a movie rental store in the early ’90s, there’s a decent chance you couldn’t wait to clock out for the day and escape from the daily grind with a mindless video game. Here in the 2020s, on the other hand, at least one mindless video game is striving to re-create the daily grind of working at a video rental store. Retro Rewind: Video Store Simulator is the latest in a […]

Ver mais

Like 0

Liked Liked

technocracy

Resilient Class-Incremental Learning: on the Interplay of Drifting, Unlabelled and Imbalanced Data Streams

digitado ⋅ 10 de February de 2026

In today’s connected world, the generation of massive streaming data across diverse domains has become commonplace. In the presence of concept drift, class imbalance, label scarcity, and new class emergence, they jointly degrade representation stability, bias learning toward outdated distributions, and reduce the resilience and reliability of detection in dynamic environments. This paper proposes SCIL (Streaming Class-Incremental Learning) to address these challenges. The SCIL framework integrates an autoencoder (AE) with a multi-layer perceptron for multi-class prediction, uses a […]

Ver mais

Like 0

Liked Liked