digitado

About digitado

https://www.digitado.com.br

Posts by :

Fast Heterogeneous Serving: Scalable Mixed-Scale LLM Allocation for SLO-Constrained Inference

digitado ⋅ 10 de April de 2026

arXiv:2604.07472v1 Announce Type: new Abstract: Deploying large language model (LLM) inference at scale requires jointly selecting base models, provisioning heterogeneous GPUs, configuring parallelism, and distributing workloads under tight latency, accuracy, and budget constraints. Exact mixed-integer linear programming (MILP) approaches guarantee optimality but scale poorly. We propose two constraint-aware heuristics: a Greedy Heuristic (GH) for single-pass allocation, and an Adaptive Greedy Heuristic (AGH) that enhances GH via multi-start construction, relocate-based local search, and GPU consolidation. Three constraint-aware mechanisms — […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond Single Reports: Evaluating Automated ATT&CK Technique Extraction in Multi-Report Campaign Settings

digitado ⋅ 10 de April de 2026

arXiv:2604.07470v1 Announce Type: new Abstract: Large-scale cyberattacks, referred to as campaigns, are documented across multiple CTI reports from diverse sources, with some providing a high-level overview of attack techniques and others providing technical details. Extracting attack techniques from reports is essential for organizations to identify the controls required to protect against attacks. Manually extracting techniques at scale is impractical. Existing automated methods focus on single reports, leaving many attack techniques and their controls undetected, resulting in a fragmented […]

Ver mais

Like 0

Liked Liked

technocracy

To Layer or Not to Layer? Evaluating the Effects and Mechanisms of LLM-Generated Feedback on learning performance

digitado ⋅ 10 de April de 2026

arXiv:2604.07469v1 Announce Type: new Abstract: Feedback is vital for learning, yet its effectiveness depends not only on its content but also on how it engages students in the learning process. Large Language Models (LLMs) offer novel opportunities to efficiently generate rich, formative feedback, ranging from direct explanations to incrementally layered scaffolding designed to foster learner autonomy. Despite these affordances, it remains unclear whether layered feedback (which sequences encouragement and prompts prior to revealing the correct answer) actually improves […]

Ver mais

Like 0

Liked Liked

technocracy

M-ArtAgent: Evidence-Based Multimodal Agent for Implicit Art Influence Discovery

digitado ⋅ 10 de April de 2026

arXiv:2604.07468v1 Announce Type: new Abstract: Implicit artistic influence, although visually plausible, is often undocumented and thus poses a historically constrained attribution problem: resemblance is necessary but not sufficient evidence. Most prior systems reduce influence discovery to embedding similarity or label-driven graph completion, while recent multimodal large language models (LLMs) remain vulnerable to temporal inconsistency and unverified attributions. This paper introduces M-ArtAgent, an evidence-based multimodal agent that reframes implicit influence discovery as probabilistic adjudication. It follows a four-phase protocol […]

Ver mais

Like 0

Liked Liked

technocracy

Lexical Tone is Hard to Quantize: Probing Discrete Speech Units in Mandarin and Yor`ub’a

digitado ⋅ 10 de April de 2026

arXiv:2604.07467v1 Announce Type: new Abstract: Discrete speech units (DSUs) are derived by quantising representations from models trained using self-supervised learning (SSL). They are a popular representation for a wide variety of spoken language tasks, including those where prosody matters. DSUs are especially convenient for tasks where text and speech are jointly modelled, such as text-to-speech and multimodal dialogue systems. But we have found that DSUs encode suprasegmental information less reliably than segmental structure, which we demonstrate in this […]

Ver mais

Like 0

Liked Liked

technocracy

Cross-Tokenizer LLM Distillation through a Byte-Level Interface

digitado ⋅ 10 de April de 2026

arXiv:2604.07466v1 Announce Type: new Abstract: Cross-tokenizer distillation (CTD), the transfer of knowledge from a teacher to a student language model when the two use different tokenizers, remains a largely unsolved problem. Existing approaches rely on heuristic strategies to align mismatched vocabularies, introducing considerable complexity. In this paper, we propose a simple but effective baseline called Byte-Level Distillation (BLD) which enables CTD by operating at a common interface across tokenizers: the byte level. In more detail, we convert the […]

Ver mais

Like 0

Liked Liked

technocracy

CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection

digitado ⋅ 10 de April de 2026

arXiv:2604.07457v1 Announce Type: new Abstract: While decoupled control schemes for legged mobile manipulators have shown robustness, learning holistic whole-body control policies for tracking global end-effector poses remains fragile against Out-of-Distribution (OOD) inputs induced by sensor noise or infeasible user commands. To improve robustness against these perturbations without sacrificing task performance and continuity, we propose Competence Manifold Projection (CMP). Specifically, we utilize a Frame-Wise Safety Scheme that transforms the infinite-horizon safety constraint into a computationally efficient single-step manifold inclusion. […]

Ver mais

Like 0

Liked Liked

technocracy

Munkres’ General Topology Autoformalized in Isabelle/HOL

digitado ⋅ 10 de April de 2026

arXiv:2604.07455v1 Announce Type: new Abstract: We describe an experiment in LLM-assisted autoformalization that produced over 85,000 lines of Isabelle/HOL code covering all 39 sections of Munkres’ Topology (general topology, Chapters 2–8), from topological spaces through dimension theory. The LLM-based coding agents (initially ChatGPT 5.2 and then Claude Opus 4.6) used 24 active days for that. The formalization is complete: all 806 formal results are fully proved with zero sorry’s. Proved results include the Tychonoff theorem, the Baire category […]

Ver mais

Like 0

Liked Liked

technocracy

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

digitado ⋅ 10 de April de 2026

arXiv:2604.07430v1 Announce Type: new Abstract: We introduce HY-Embodied-0.5, a family of foundation models specifically designed for real-world embodied agents. To bridge the gap between general Vision-Language Models (VLMs) and the demands of embodied agents, our models are developed to enhance the core capabilities required by embodied intelligence: spatial and temporal visual perception, alongside advanced embodied reasoning for prediction, interaction, and planning. The HY-Embodied-0.5 suite comprises two primary variants: an efficient model with 2B activated parameters designed for edge […]

Ver mais

Like 0

Liked Liked

technocracy

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

digitado ⋅ 10 de April de 2026

arXiv:2604.07429v1 Announce Type: new Abstract: Towards an embodied generalist for real-world interaction, Multimodal Large Language Model (MLLM) agents still suffer from challenging latency, sparse feedback, and irreversible mistakes. Video games offer an ideal testbed with rich visual observations and closed-loop interaction, demanding fine-grained perception, long-horizon planning, and precise control. However, systematically evaluating these capabilities is currently hindered by heterogeneous action interfaces and heuristic verification. To this end, we introduce GameWorld, a benchmark designed for standardized and verifiable evaluation […]

Ver mais

Like 0

Liked Liked