digitado

About digitado

https://www.digitado.com.br

Posts by :

Semantic-aware Token Selection and Resource Optimization for Communication-efficient Split Federated Fine-tuning in Edge Intelligence

digitado ⋅ 27 de May de 2026

arXiv:2605.26120v1 Announce Type: new Abstract: Deploying large Transformer-based vision models on resource-limited mobile devices at network edge is severely constrained by hardware limitations and dynamic wireless environments. While federated learning (FL) enables collaborative training without sharing raw data, strictly local fine-tuning of such massive models remains computationally prohibitive for edge devices. Split federated learning (SFL) alleviates this burden by offloading deep layers to the edge server, yet it suffers from heavy communication overhead when transmitting high-dimensional activation tokens. […]

Ver mais

Like 0

Liked Liked

technocracy

Edge AI Deployment Beyond Models: A BSP-Aware Systems Framework for Industrial Embedded Platforms

digitado ⋅ 27 de May de 2026

arXiv:2605.26119v1 Announce Type: new Abstract: Industrial Edge AI programs often begin with the model and only later confront the platform. That sequencing is attractive because it allows early demonstrations, but it breaks down when the deployment target is an embedded system with long product lifecycles, vendor-specific kernels, heterogeneous accelerators, safety constraints, and nontrivial I/O paths. In that environment, a model is only one component of a larger execution chain that begins at the sensor, traverses the board support […]

Ver mais

Like 0

Liked Liked

technocracy

Xe-Forge: Multi-Stage LLM-Powered Kernel Optimization for Intel GPU

digitado ⋅ 27 de May de 2026

arXiv:2605.26118v1 Announce Type: new Abstract: Porting deep learning algorithms to new hardware accelerators requires developers to repeatedly apply the same low-level optimizations — quantization, memory access coalescing, tile size tuning, and architecture-specific workarounds — to every Triton kernel in their code-base. This manual, repetitive effort is a major bottleneck: each kernel demands the same cycle of trial-and-error profiling against hardware constraints that vary across devices, yet the underlying optimization patterns remain largely consistent. We present Xe-Forge, a multi-stage […]

Ver mais

Like 0

Liked Liked

technocracy

From user-understandable to technical process model: a model-driven approach using cuta4bpm

digitado ⋅ 27 de May de 2026

arXiv:2605.26117v1 Announce Type: new Abstract: For business process modeling, we can choose between graph-oriented and block-oriented languages. Block-oriented languages are more structured and therefore better understandable for domain experts, while graph-oriented languages allow more modeling freedom and technical versatility for process designers. To bridge this gap between understandability and technical versatility, we propose a participative forward engineering approach. It uses our block-oriented CUTA4BPM language to support high level process modeling together with domain experts, while graph-oriented BPMN is […]

Ver mais

Like 0

Liked Liked

technocracy

5 Things Broke When I Shipped a RAG + MCP Agent to Production.

digitado ⋅ 27 de May de 2026

Author(s): Sudip P. Originally published on Towards AI. 5 Things Broke When I Shipped a RAG + MCP Agent to Production. Diagram-1: RAG vs MCP agent architecture: a small LLM router classifies each user query as either a Knowledge request (hybrid search → cross-encoder rerank) or an Action request (validate input → tool call). Both paths converge at a single frontier model for synthesis, then pass through eval and logging before returning a response. Read this article for […]

Ver mais

Like 0

Liked Liked

technocracy

Election information and safeguards in 2026

digitado ⋅ 27 de May de 2026

Ahead of global elections, we’re helping people access information, supporting cyber defenders, and increasing AI transparency

Ver mais

Like 0

Liked Liked

technocracy

Warp’s big bet on building open source with GPT-5.5

digitado ⋅ 27 de May de 2026

Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.

Ver mais

Like 0

Liked Liked

technocracy

The pressure

digitado ⋅ 27 de May de 2026

The pressure Daniel Stenberg on the unprecedented level of pressure the curl team are facing right now thanks to the deluge of (credible) AI-assisted security issues being reported. The rate of incoming security reports is 4-5 times higher than it was in 2024 and double the speed of 2025 — meaning that on average we now get more than one report per day. The quality is way higher than ever before. The reports are typically very detailed and […]

Ver mais

Like 0

Liked Liked

technocracy

Design a High-Precision Retrieve-and-Rerank Pipeline with ZeroEntropy Zerank-2 Reranker

digitado ⋅ 27 de May de 2026

In this tutorial, we use zeroentropy/zerank-2-reranker, a 4B Qwen3-based cross-encoder reranker, to improve retrieval quality. We start by setting up the runtime, loading the reranker, and understanding how it scores query-document pairs. Then, we move from simple pairwise scoring to a practical two-stage retrieve-and-rerank pipeline, where a fast bi-encoder first retrieves candidates and zerank-2 reranks them for better precision. We also evaluate the impact using NDCG@10 and test the reranker across finance, legal, and code examples to assess […]

Ver mais

Like 0

Liked Liked

technocracy

Stability AI Releases Stable Audio 3: A Family of Fast Latent Diffusion Models for Audio Generation and Editing

digitado ⋅ 27 de May de 2026

Stability AI has released open weights for Stable Audio 3 along with a technical research paper. Stable Audio 3 is a family of latent diffusion models that generate stereo audio at 44.1 kHz. The models support variable-length outputs, inpainting-based editing, and fast inference. What Is Stable Audio 3? Stable Audio 3 is a family of three model scales: small, medium, and large. A latent diffusion model generates audio by learning to progressively remove noise from a compressed representation […]

Ver mais

Like 0

Liked Liked