Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU
arXiv:2604.15464v1 Announce Type: new Abstract: Large Language Model (LLM) deployment is increasingly shifting to cost-efficient accelerators such as Google’s Tensor Processing Units (TPUs), prioritizing both performance and total cost of ownership (TCO). However, existing LLM inference kernels and serving systems remain largely GPU-centric, and there is no well-established approach for efficiently mapping LLM workloads onto TPU architectures, particularly under the dynamic and ragged execution patterns common in modern serving. In this paper, we present Ragged Paged Attention (RPA), a high-performance […]