digitado

About digitado

https://www.digitado.com.br

Posts by :

D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation

digitado ⋅ 21 de April de 2026

arXiv:2604.16940v1 Announce Type: new Abstract: Supervised Fine-Tuning (SFT) accelerates taskspecific large language models (LLMs) development, but the resulting proliferation of finetuned models incurs substantial memory overhead. Delta compression addresses this by retaining a single pre-trained LLM with multiple compressed delta weights. However, existing methods fail on models fine-tuned with largescale datasets. We find that larger SFT data scale amplifies delta parameter magnitude, singular values, and entropy, exacerbating compression errors. To tackle this, we propose DQRELO (Delta Compression via […]

Ver mais

Like 0

Liked Liked

technocracy

No One Fits All: From Fixed Prompting to Learned Routing in Multilingual LLMs

digitado ⋅ 21 de April de 2026

arXiv:2604.16937v1 Announce Type: new Abstract: Translation-based prompting is widely used in multilingual LLMs, yet its effectiveness varies across languages and tasks. We evaluate prompting strategies across ten languages of different resource levels and four benchmarks. Our analysis shows that no single strategy is universally optimal. Translation strongly benefits low-resource languages even when translation quality is imperfect, high-resource languages gain little, and prompt-based self-routing underperforms explicit translation. Motivated by these findings, we formulate prompting strategy selection as a learned […]

Ver mais

Like 0

Liked Liked

technocracy

Adaptive receptive field-based spatial-frequency feature reconstruction network for few-shot fine-grained image classification

digitado ⋅ 21 de April de 2026

arXiv:2604.16936v1 Announce Type: new Abstract: Feature reconstruction techniques are widely applied for few-shot fine-grained image classification (FSFGIC). Our research indicates that one of the main challenges facing existing feature-based FSFGIC methods is how to choose the size of the receptive field to extract feature descriptors (including spatial and frequency feature descriptors) from different category input images, thereby better performing the FSFGIC tasks. To address this, an adaptive receptive field-based spatial-frequency feature reconstruction network (ARF-SFR-Net) is proposed. The designed […]

Ver mais

Like 0

Liked Liked

technocracy

LLMs can persuade only psychologically susceptible humans on societal issues, via trust in AI and emotional appeals, amid logical fallacies

digitado ⋅ 21 de April de 2026

arXiv:2604.16935v1 Announce Type: new Abstract: Scarce longitudinal evidence examines LLMs’ persuasiveness and humanness along time-evolving psychological frameworks. We introduce Talk2AI, a longitudinal framework quantifying psycho-social, reasoning and affective dimensions of LLMs’ persuasiveness about polarizing societal topics. In a four-way longitudinal setup, Talk2AI’s 770 participants engaged in structured conversations with one of four leading LLMs on topics like climate change, social media misinformation, and math anxiety. This produced 3,080 conversations over 60,000 turns. After each wave, participants reported conviction […]

Ver mais

Like 0

Liked Liked

technocracy

Treating Run-time Execution History as a First-Class Citizen: Co-Versioning Run-time Behavior alongside Code

digitado ⋅ 21 de April de 2026

arXiv:2604.16933v1 Announce Type: new Abstract: Behavioral Co-Versioning remains absent from mainstream practice: while developers routinely version source code with Git, they rarely persist and query how run-time behavior evolves across revisions. This paper argues that this mismatch contributes to a blind spot in software evolution analysis and CI, where rich execution information is discarded and typically reduced to pass/fail outcomes — despite partial test oracles, flakiness, and silent output or performance drift. We propose textit{Behavioral Co-Versioning}, a paradigm […]

Ver mais

Like 0

Liked Liked

technocracy

Playing Psychic: Using Thought Trees to Predict Reasoning Models Accuracy on Coding Tasks

digitado ⋅ 21 de April de 2026

arXiv:2604.16931v1 Announce Type: new Abstract: Recent advances in large language models (LLMs) have shown that test-time scaling can substantially improve model performance on complex tasks, particularly in the coding domain. Under this paradigm, models use a larger token budget during inference to generate intermediate reasoning traces before producing a final answer. However, current evaluations primarily rely on competitive programming benchmarks, which may not capture the full range of reasoning abilities. In this work, we perform a systematic study […]

Ver mais

Like 0

Liked Liked

technocracy

CoGR-MoE: Concept-Guided Expert Routing with Consistent Selection and Flexible Reasoning for Visual Question Answering

digitado ⋅ 21 de April de 2026

arXiv:2604.16930v1 Announce Type: new Abstract: Visual Question Answering (VQA) requires models to identify the correct answer options based on both visual and textual evidence. Recent Mixture-of-Experts (MoE) methods improve option reasoning by grouping similar concepts or routing based on examples. However, unstable routing can lead to inconsistent expert selection in the same question type, while overly stable routing may reduce flexibility. To address this, we propose Concept-Guided Routing framework (CoGR-MoE), which incorporates semantics of the answer options to […]

Ver mais

Like 0

Liked Liked

technocracy

MeasHalu: Mitigation of Scientific Measurement Hallucinations for Large Language Models with Enhanced Reasoning

digitado ⋅ 21 de April de 2026

arXiv:2604.16929v1 Announce Type: new Abstract: The accurate extraction of scientific measurements from literature is a critical yet challenging task in AI4Science, enabling large-scale analysis and integration of quantitative research findings. However, Large Language Models (LLMs) frequently exhibit severe hallucinations, which significantly undermine the reliability of automated scientific document understanding systems. To address this problem, we propose MeasHalu, a novel framework for mitigating scientific measurement hallucinations through enhanced reasoning and targeted optimization. We first present a fine-grained taxonomy of […]

Ver mais

Like 0

Liked Liked

technocracy

Test-Time Adaptation for EEG Foundation Models: A Systematic Study under Real-World Distribution Shifts

digitado ⋅ 21 de April de 2026

arXiv:2604.16926v1 Announce Type: new Abstract: Electroencephalography (EEG) foundation models have shown strong potential for learning generalizable representations from large-scale neural data, yet their clinical deployment is hindered by distribution shifts across clinical settings, devices, and populations. Test-time adaptation (TTA) offers a promising solution by enabling models to adapt to unlabeled target data during inference without access to source data, a valuable property in healthcare settings constrained by privacy regulations and limited labeled data. However, its effectiveness for EEG […]

Ver mais

Like 0

Liked Liked

technocracy

Rethinking Cross-Dose PET Denoising: Mitigating Averaging Effects via Residual Noise Learning

digitado ⋅ 21 de April de 2026

arXiv:2604.16925v1 Announce Type: new Abstract: Cross-dose denoising for low-dose positron emission tomography (LDPET) has been proposed to address the limited generalization of models trained at a single noise level. In practice, neural networks trained on a specific dose level often fail to generalize to other dose conditions due to variations in noise magnitude and statistical properties. Conventional “one-size-for-all” models attempt to handle this variability but tend to learn averaged representations across noise levels, resulting in degraded performance. In […]

Ver mais

Like 0

Liked Liked