digitado

About digitado

https://www.digitado.com.br

Posts by :

DOSE: Data Selection for Multi-Modal LLMs via Off-the-Shelf Models

digitado ⋅ 21 de April de 2026

arXiv:2604.16979v1 Announce Type: new Abstract: High-quality and diverse multimodal data are essential for improving vision-language models (VLMs), yet existing datasets often contain noisy, redundant, and poorly aligned samples. To address these problems, data filtering is commonly used to enhance the efficiency and performance of multimodal learning, but it introduces extra computational cost because filtering models are usually trained on the same data they are meant to screen. To reduce this cost, we study DOSE, which explores whether off-the-shelf […]

Ver mais

Like 0

Liked Liked

technocracy

UGD: An Unsupervised Geometric Distance for Evaluating Real-world Noisy Point Cloud Denoising

digitado ⋅ 21 de April de 2026

arXiv:2604.16976v1 Announce Type: new Abstract: Point cloud denoising is a fundamental and crucial challenge in real-world point cloud applications. Existing quantitative evaluation metrics for point cloud denoising methods are implemented in a supervised manner, which requires both the denoised point cloud and the corresponding ground-truth clean point cloud to compute a representative geometric distance. This requirement is highly problematic in real-world scenarios, where ground-truth clean point clouds are often unavailable. In this paper, we propose a simple yet […]

Ver mais

Like 0

Liked Liked

technocracy

MCPO: Mastery-Consolidated Policy Optimization for Large Reasoning Models

digitado ⋅ 21 de April de 2026

arXiv:2604.16972v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising approach to improve the reasoning abilities of Large Language Models (LLMs). Among RLVR algorithms, Group Relative Policy Optimization (GRPO) and its variants have demonstrated strong performance and high training efficiency. However, GRPO-style objectives exhibit two issues on high accuracy prompts including mastered prompts (rollout accuracy =1) and majority-correct prompts (rollout accuracy in (0.5,1)). For mastered prompts, group-relative advantages vanish, yielding no training […]

Ver mais

Like 0

Liked Liked

technocracy

Hyperspectral Unmixing Hierarchies

digitado ⋅ 21 de April de 2026

arXiv:2604.16969v1 Announce Type: new Abstract: Unmixing reveals the spatial distribution and spectral details of different constituents, called endmembers, in a hyperspectral image. Because unmixing has limited ground truth requirements, can accommodate mixed pixels, and is closely tied to light propagation, it is a uniquely powerful tool for analyzing hyperspectral images. However, spectral variability inhibits unmixing performance, the proper way to determine the number of endmembers is ambiguous, and the clarity of the endmembers degrades as more are included. […]

Ver mais

Like 0

Liked Liked

technocracy

On Safety Risks in Experience-Driven Self-Evolving Agents

digitado ⋅ 21 de April de 2026

arXiv:2604.16968v1 Announce Type: new Abstract: Experience-driven self-evolution has emerged as a promising paradigm for improving the autonomy of large language model agents, yet its reliance on self-curated experience introduces underexplored safety risks. In this study, we investigate how experience accumulation and utilization in self-evolving agents affect safety performance across web-based and embodied environments. Notably, experience gathered solely from benign tasks can still compromise safety in high-risk scenarios. Further analysis attributes this degradation to the execution-oriented nature of accumulated […]

Ver mais

Like 0

Liked Liked

technocracy

NaviFormer: A Deep Reinforcement Learning Transformer-like Model to Holistically Solve the Navigation Problem

digitado ⋅ 21 de April de 2026

arXiv:2604.16967v1 Announce Type: new Abstract: Path planning is usually solved by addressing either the (high-level) route planning problem (waypoint sequencing to achieve the final goal) or the (low-level) path planning problem (trajectory prediction between two waypoints avoiding collisions). However, real-world problems usually require simultaneous solutions to the route and path planning subproblems with a holistic and efficient approach. In this paper, we introduce NaviFormer, a deep reinforcement learning model based on a Transformer architecture that solves the global […]

Ver mais

Like 0

Liked Liked

technocracy

Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

digitado ⋅ 21 de April de 2026

arXiv:2604.16966v1 Announce Type: new Abstract: The evolution from static ranking models to Agentic Recommender Systems (Agentic RecSys) empowers AI agents to maintain long-term user profiles and autonomously plan service tasks. While this paradigm shift enhances personalization, it introduces a vulnerability: reliance on Long-term Memory (LTM). In this paper, we uncover a threat termed “Visual Inception.” Unlike traditional adversarial attacks that seek immediate misclassification, Visual Inception injects triggers into user-uploaded images (e.g., lifestyle photos) that act as “sleeper agents” […]

Ver mais

Like 0

Liked Liked

technocracy

Different Perspectives of Memory System Simulation

digitado ⋅ 21 de April de 2026

arXiv:2604.16965v1 Announce Type: new Abstract: Memory simulators are used to estimate application performance on advanced memory systems, yet they may exhibit significant discrepancies compared to real hardware. This paper investigates two key questions: (1) what causes these inaccuracies, and (2) how can simulators be properly validated to ensure reliable performance predictions. We propose a methodology that evaluates memory performance from three complementary perspectives: the memory simulator, the CPU-memory interface, and the application. Our analysis reveals that these perspectives […]

Ver mais

Like 0

Liked Liked

technocracy

E2AFS: Energy-Efficient Approximate Floating Point Square Rooter for Error Tolerant Computing

digitado ⋅ 21 de April de 2026

arXiv:2604.16964v1 Announce Type: new Abstract: Floating-point square-root computation is a power- and delay-critical operation in edge-AI, signal-processing, and embedded systems. Conventional implementations typically rely on multipliers or iterative pipelines, resulting in increased hardware complexity, switching activity, and energy consumption. This work presents E2AFS, a lightweight and fully multiplier-free floating-point square-root architecture optimized for energy-efficient computation. By reducing logic depth and minimizing switching activity, the proposed design achieves substantial improvements in hardware efficiency and performance. FPGA implementation on an […]

Ver mais

Like 0

Liked Liked

technocracy

Correcting Low-Signal Sensitivity in the Deliberative Reason Index

digitado ⋅ 21 de April de 2026

arXiv:2604.16963v1 Announce Type: new Abstract: The Deliberative Reason Index (DRI) is increasingly used to assess the coherence between considerations and preferences in deliberative settings, including applications to LLM-generated data. Under low-signal conditions, however, the standard DRI can produce inflated scores by treating near-zero correlations as evidence of consistency. Monte Carlo simulations across common study designs show that this bias increases with group size and yields positive values even under random response. A modified DRI is introduced that applies […]

Ver mais

Like 0

Liked Liked