digitado

About digitado

https://www.digitado.com.br

Posts by :

Simulate realistic users to evaluate multi-turn AI agents in Strands Evals

digitado ⋅ 2 de April de 2026

Evaluating single-turn agent interactions follows a pattern that most teams understand well. You provide an input, collect the output, and judge the result. Frameworks like Strands Evaluation SDK make this process systematic through evaluators that assess helpfulness, faithfulness, and tool usage. In a previous blog post, we covered how to build comprehensive evaluation suites for AI agents using these capabilities. However, production conversations rarely stop at one turn. Real users engage in exchanges that unfold over multiple turns. […]

Ver mais

Like 0

Liked Liked

technocracy

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

digitado ⋅ 2 de April de 2026

Agent skills, structured packages of procedural knowledge and executable resources that agents dynamically load at inference time, have become a reliable mechanism for augmenting LLM agents. Yet inference-time skill augmentation is fundamentally limited: retrieval noise introduces irrelevant guidance, injected skill content imposes substantial token overhead, and the model never truly acquires the knowledge it merely follows. We ask whether skills can instead be internalized into model parameters, enabling zero-shot autonomous behavior without any runtime skill retrieval. We introduce […]

Ver mais

Like 0

Liked Liked

technocracy

New Rowhammer attacks give complete control of machines running Nvidia GPUs

digitado ⋅ 2 de April de 2026

The cost of high-performance GPUs, typically $8,000 or more, means they are frequently shared among dozens of users in cloud environments. Two new attacks demonstrate how a malicious user can gain full root control of a host machine by performing novel Rowhammer attacks on high-performance GPU cards made by Nvidia. The attacks exploit memory hardware’s increasing susceptibility to bit flips, in which 0s stored in memory switch to 1s and vice versa. In 2014, researchers first demonstrated that […]

Ver mais

Like 0

Liked Liked

technocracy

Renewables dominate 2025’s newly installed generating capacity

digitado ⋅ 2 de April de 2026

On Wednesday, the International Renewable Energy Agency (IRENA) released its numbers on what was built in 2025. And much as we saw in the US, solar power is the primary driver of change. The numbers show that the world installed an average of 1.4 gigawatts of solar capacity every day last year, for a total of 511 GW. That brings the total solar capacity up to 2.4 Terawatts, making it the largest single source of renewable capacity by […]

Ver mais

Like 0

Liked Liked

technocracy

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

digitado ⋅ 2 de April de 2026

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under time-varying dynamics. We consider a continual model-based reinforcement learning setting in which an agent repeatedly learns and controls a dynamical system whose transition dynamics evolve across episodes. We analyze the problem using Gaussian process dynamics models under frequentist variation-budget assumptions. Our analysis shows that persistent non-stationarity requires explicitly […]

Ver mais

Like 0

Liked Liked

technocracy

Smoothing the Landscape: Causal Structure Learning via Diffusion Denoising Objectives

digitado ⋅ 2 de April de 2026

Understanding causal dependencies in observational data is critical for informing decision-making. These relationships are often modeled as Bayesian Networks (BNs) and Directed Acyclic Graphs (DAGs). Existing methods, such as NOTEARS and DAG-GNN, often face issues with scalability and stability in high-dimensional data, especially when there is a feature-sample imbalance. Here, we show that the denoising score matching objective of diffusion models could smooth the gradients for faster, more stable convergence. We also propose an adaptive k-hop acyclicity constraint […]

Ver mais

Like 0

Liked Liked

technocracy

BVFLMSP : Bayesian Vertical Federated Learning for Multimodal Survival with Privacy

digitado ⋅ 2 de April de 2026

Multimodal time-to-event prediction often requires integrating sensitive data distributed across multiple parties, making centralized model training impractical due to privacy constraints. At the same time, most existing multimodal survival models produce single deterministic predictions without indicating how confident the model is in its estimates, which can limit their reliability in real-world decision making. To address these challenges, we propose BVFLMSP, a Bayesian Vertical Federated Learning (VFL) framework for multimodal time-to-event analysis based on a Split Neural Network architecture. […]

Ver mais

Like 0

Liked Liked

technocracy

(PAC-)Learning state machines from data streams: A generic strategy and an improved heuristic (Extended version)

digitado ⋅ 2 de April de 2026

This is an extended version of our publication Learning state machines from data streams: A generic strategy and an improved heuristic, International Conference on Grammatical Inference (ICGI) 2023, Rabat, Morocco. It has been extended with a formal proof on PAC-bounds, and the discussion and analysis of a similar approach has been moved from the appendix and is now a full Section. State machine models are models that simulate the behavior of discrete event systems, capable of representing systems […]

Ver mais

Like 0

Liked Liked

technocracy

(PAC-)Learning state machines from data streams: A generic strategy and an improved heuristic (Extended version)

digitado ⋅ 2 de April de 2026

This is an extended version of our publication Learning state machines from data streams: A generic strategy and an improved heuristic, International Conference on Grammatical Inference (ICGI) 2023, Rabat, Morocco. It has been extended with a formal proof on PAC-bounds, and the discussion and analysis of a similar approach has been moved from the appendix and is now a full Section. State machines models are models that simulate the behavior of discrete event systems, capable of representing systems […]

Ver mais

Like 0

Liked Liked

technocracy

KiloClaw targets shadow AI with autonomous agent governance

digitado ⋅ 2 de April de 2026

With the launch of KiloClaw, enterprises now have a tool to enforce governance over autonomous agents and manage shadow AI. While businesses spent the last year securing large language models and formalising vendor agreements, developers and knowledge workers started moving on their own. Employees are bypassing official procurement, deploying autonomous agents on personal infrastructure to automate their daily workflows. This practice, known as ‘Bring Your Own AI’ or BYOAI, exposes proprietary enterprise data to unregulated external environments. To […]

Ver mais

Like 0

Liked Liked