digitado – Page 10

Federated Learning and Class Imbalances

digitado ⋅ 10 de January de 2026

Federated Learning (FL) enables collaborative model training across decentralized devices while preserving data privacy. However, real-world FL deployments face critical challenges such as data imbalances, including label noise and non-IID distributions. RHFL+, a state-of-the-art method, was proposed to address these challenges in settings with heterogeneous client models. This work investigates the robustness of RHFL+ under class imbalances through three key contributions: (1) reproduction of RHFL+ along with all benchmark algorithms under a unified evaluation framework; (2) extension of […]

Ver mais

Like 0

Liked Liked

technocracy

Beware the Real-Time Trap: Your Fresh Data Could Be Slowing Down Your Dashboards

digitado ⋅ 31 de January de 2026

“Speed” in data engineering is a trade-off, not a single metric. To build effective systems, you must distinguish between two competing concepts: – Data Latency (Freshness): How long it takes for an event to reach your report. – Query Latency (Responsiveness): How long a user waits for a dashboard to load. The Conflict: Optimizing for real-time freshness often slows down query performance because the system can’t pre-calculate data. Conversely, pre-calculating data for “snappy” dashboards usually requires batching, which […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

Robust Federated Learning via Byzantine Filtering over Encrypted Updates

digitado ⋅ 5 de February de 2026

Federated Learning (FL) aims to train a collaborative model while preserving data privacy. However, the distributed nature of this approach still raises privacy and security issues, such as the exposure of sensitive data due to inference attacks and the influence of Byzantine behaviors on the trained model. In particular, achieving both secure aggregation and Byzantine resilience remains challenging, as existing solutions often address these aspects independently. In this work, we propose to address these challenges through a novel […]

Ver mais

Like 0

Liked Liked

technocracy

Mind the Performance Gap: Capability-Behavior Trade-offs in Feature Steering

digitado ⋅ 6 de February de 2026

arXiv:2602.04903v1 Announce Type: new Abstract: Feature steering has emerged as a promising approach for controlling LLM behavior through direct manipulation of internal representations, offering advantages over prompt engineering. However, its practical effectiveness in real-world applications remains poorly understood, particularly regarding potential trade-offs with output quality. We show that feature steering methods substantially degrade model performance even when successfully controlling target behaviors, a critical trade-off. Specifically, we evaluate Goodfire’s Auto Steer against prompt engineering baselines across 14 steering queries […]

Ver mais

Like 0

Liked Liked

technocracy

RewriteNets: End-to-End Trainable String-Rewriting for Generative Sequence Modeling

digitado ⋅ 14 de January de 2026

arXiv:2601.07868v1 Announce Type: new Abstract: Dominant sequence models like the Transformer represent structure implicitly through dense attention weights, incurring quadratic complexity. We propose RewriteNets, a novel neural architecture built on an alternative paradigm: explicit, parallel string rewriting. Each layer in a RewriteNet contains a set of learnable rules. For each position in an input sequence, the layer performs four operations: (1) fuzzy matching of rule patterns, (2) conflict resolution via a differentiable assignment operator to select non-overlapping rewrites, […]

Ver mais

Like 0

Liked Liked

technocracy

Here’s Why WebMCP is Exciting

digitado ⋅ 24 de June de 2026

WebMCP is an open web standard that lets websites expose structured, callable tools directly to browser-based agents. Find out what makes it exciting.

Ver mais

Like 0

Liked Liked

technocracy

Optimize video semantic search intent with Amazon Nova Model Distillation on Amazon Bedrock

digitado ⋅ 17 de April de 2026

Optimizing models for video semantic search requires balancing accuracy, cost, and latency. Faster, smaller models lack routing intelligence, while larger, accurate models add significant latency overhead. In Part 1 of this series, we showed how to build a multimodal video semantic search system on AWS with intelligent intent routing using the Anthropic Claude Haiku model in Amazon Bedrock. While the Haiku model delivers strong accuracy for user search intent, it increases end-to-end search time to 2-4 seconds. This […]

Ver mais

Like 0

Liked Liked

technocracy

Rede Mater Dei de Saúde: Monitoring AI agents in the revenue cycle with Amazon Bedrock AgentCore

digitado ⋅ 15 de April de 2026

This post is cowritten by Renata Salvador Grande, Gabriel Bueno and Paulo Laurentys at Rede Mater Dei de Saúde. The growing adoption of multi-agent AI systems is redefining critical operations in healthcare. In large hospital networks, where thousands of decisions directly impact cash flow, service delivery times, and the risk of claim denials, the ability to monitor, track, and govern AI agents has become essential for operational sustainability. This is the journey of Rede Mater Dei de Saúde, […]

Ver mais

Like 0

Liked Liked

technocracy

What Do Agents Learn from Trajectory-SFT: Semantics or Interfaces?

digitado ⋅ 2 de February de 2026

Large language models are increasingly evaluated as interactive agents, yet standard agent benchmarks conflate two qualitatively distinct sources of success: semantic tool-use and interface-specific interaction pattern memorization. Because both mechanisms can yield identical task success on the original interface, benchmark scores alone are not identifiable evidence of environment-invariant capability. We propose PIPE, a protocol-level evaluation augmentation for diagnosing interface reliance by minimally rewriting environment interfaces while preserving task semantics and execution behavior. Across 16 environments from AgentBench and […]

Ver mais

Like 0

Liked Liked