February 2026

QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals

digitado ⋅ February 4, 2026

arXiv:2602.02581v1 Announce Type: new Abstract: Weight-only quantization is important for compressing Large Language Models (LLMs). Inspired by the spirit of classical magnitude pruning, we study whether the magnitude of weight updates during reasoning-incentivized fine-tuning can provide valuable signals for quantizing Large Reasoning Models (LRMs). We hypothesize that the smallest and largest weight updates during fine-tuning are more important than those of intermediate magnitude, a phenomenon we term “protecting both ends”. Upon hypothesis validation, we introduce QuantLRM, which stands […]
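The "protecting both ends" hypothesis can be illustrated with a small sketch. The abstract is truncated before it describes the actual QuantLRM procedure, so the scoring rule here (distance of each update's rank from the median rank) and the names `update_magnitudes` and `both_ends_scores` are assumptions chosen for illustration, not the paper's method.

```python
# Hypothetical illustration only: the scoring rule below is an assumption,
# not the method from the QuantLRM paper.

def update_magnitudes(base, tuned):
    """Absolute per-weight update |w_tuned - w_base| from fine-tuning."""
    return [abs(t - b) for b, t in zip(base, tuned)]


def both_ends_scores(magnitudes):
    """Score each weight by how far its update rank is from the median rank.

    Scores lie in [0, 1]: 1.0 for the smallest and the largest update,
    0.0 for the update sitting exactly at the median rank.
    """
    n = len(magnitudes)
    if n < 2:
        return [1.0] * n
    # Indices sorted from smallest to largest update magnitude.
    order = sorted(range(n), key=lambda i: magnitudes[i])
    scores = [0.0] * n
    for rank, idx in enumerate(order):
        quantile = rank / (n - 1)  # 0.0 = smallest update, 1.0 = largest
        scores[idx] = abs(2.0 * quantile - 1.0)
    return scores


# Toy weights before and after a (made-up) fine-tuning run.
base = [0.10, -0.20, 0.30, 0.40, -0.50]
tuned = [0.10, -0.25, 0.60, 0.41, -0.40]
mags = update_magnitudes(base, tuned)
scores = both_ends_scores(mags)
```

A quantizer could, under this assumption, give higher-scoring weights more protection (for example, a larger weight in the quantization loss). Whether QuantLRM does exactly this is not stated in the excerpt.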


technocracy

ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation

digitado ⋅ February 4, 2026

arXiv:2602.02579v1 Announce Type: new Abstract: The prefill stage of long-context Retrieval-Augmented Generation (RAG) is severely bottlenecked by computational overhead. To mitigate this, recent methods assemble pre-calculated KV caches of retrieved RAG documents (by a user query) and reprocess selected tokens to recover cross-attention between these pre-calculated KV caches. However, we identify a fundamental “crowding-out effect” in current token selection criteria: globally salient but user-query-irrelevant tokens saturate the limited recomputation budget, displacing the tokens truly essential for answering the […]


WritePolicyBench: Benchmarking Memory Write Policies under Byte Budgets

digitado ⋅ February 4, 2026

arXiv:2602.02574v1 Announce Type: new Abstract: We introduce WritePolicyBench, a benchmark for evaluating memory write policies: decision rules that choose what to store, merge, and evict under a strict byte budget while processing a stream with document/API drift. The benchmark provides (i) task generators with controlled non-stationarity, (ii) an explicit action interface for external memory, (iii) a byte-accurate cost model, and (iv) standardized metrics that measure both task success and budget efficiency.
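To make "memory write policy under a byte budget" concrete, here is a minimal sketch of one possible policy: store new items and evict the least recently used ones until the budget holds. The class `ByteBudgetMemory`, its cost model (UTF-8 key bytes plus value bytes), and the LRU rule are assumptions for illustration; they are not the benchmark's action interface or cost model.

```python
from collections import OrderedDict


class ByteBudgetMemory:
    """Toy external memory: store and evict under a strict byte budget.

    Hypothetical sketch, not the WritePolicyBench interface.
    """

    def __init__(self, budget_bytes):
        self.budget_bytes = budget_bytes
        self.used_bytes = 0
        self.items = OrderedDict()  # key -> bytes payload, least recent first

    @staticmethod
    def cost(key, value):
        """Assumed cost model: UTF-8 bytes of the key plus bytes of the value."""
        return len(key.encode("utf-8")) + len(value)

    def write(self, key, text):
        """Store text under key, evicting least recently used items if needed.

        Returns False (and leaves memory unchanged) if the item alone
        exceeds the budget.
        """
        value = text.encode("utf-8")
        new_cost = self.cost(key, value)
        if new_cost > self.budget_bytes:
            return False
        if key in self.items:
            # Overwrite: release the bytes held by the old value first.
            self.used_bytes -= self.cost(key, self.items.pop(key))
        while self.used_bytes + new_cost > self.budget_bytes:
            old_key, old_value = self.items.popitem(last=False)
            self.used_bytes -= self.cost(old_key, old_value)
        self.items[key] = value
        self.used_bytes += new_cost
        return True

    def read(self, key):
        """Return the stored text, or None; a hit marks the key as recent."""
        value = self.items.get(key)
        if value is None:
            return None
        self.items.move_to_end(key)
        return value.decode("utf-8")


memory = ByteBudgetMemory(budget_bytes=20)
memory.write("a", "12345678")  # costs 9 bytes
memory.write("b", "12345678")  # costs 9 bytes, 18 in use
memory.write("c", "1234")      # costs 5 bytes, forces eviction of "a"
```

A benchmark of the kind described would compare policies like this one on both task success and bytes used; a merge action is omitted here for brevity.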


Product Interaction: An Algebraic Formalism for Deep Learning Architectures

digitado ⋅ February 4, 2026

arXiv:2602.02573v1 Announce Type: new Abstract: In this paper, we introduce product interactions, an algebraic formalism in which neural network layers are constructed from compositions of a multiplication operator defined over suitable algebras. Product interactions provide a principled way to generate and organize algebraic expressions by increasing interaction order. Our central observation is that algebraic expressions in modern neural networks admit a unified construction in terms of linear, quadratic, and higher-order product interactions. Convolutional and equivariant networks arise as […]


Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective

digitado ⋅ February 4, 2026

arXiv:2602.02572v1 Announce Type: new Abstract: Existing alignment methods directly use the reward model learned from user preference data to optimize an LLM policy, subject to KL regularization with respect to the base policy. This practice is suboptimal for maximizing the user’s utility because the KL regularization may cause the LLM to inherit the bias in the base policy that conflicts with user preferences. While amplifying rewards for preferred outputs can mitigate this bias, it also increases the risk of […]
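For readers unfamiliar with the setup, the standard KL-regularized alignment objective and its well-known closed-form solution are written out below. The symbols ($r$ for the reward model, $\beta$ for the regularization strength, $\pi_{\text{base}}$ for the base policy, $Z(x)$ for the normalizer) are conventional notation assumed here, not taken from the paper.

```latex
\max_{\pi} \;
\mathbb{E}_{x \sim \mathcal{D},\, y \sim \pi(\cdot \mid x)}\big[ r(x, y) \big]
\;-\; \beta \,
\mathbb{E}_{x \sim \mathcal{D}}\Big[
  \mathrm{KL}\big( \pi(\cdot \mid x) \,\|\, \pi_{\text{base}}(\cdot \mid x) \big)
\Big]

\pi^{*}(y \mid x) \;=\;
\frac{1}{Z(x)} \, \pi_{\text{base}}(y \mid x) \,
\exp\!\Big( \tfrac{1}{\beta}\, r(x, y) \Big)
```

The second line shows why base-policy bias carries over: the optimal policy multiplies $\pi_{\text{base}}$ by a reward-dependent factor, so outputs the base policy strongly favors stay favored unless the reward term outweighs that preference.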


Trajectory Consistency for One-Step Generation on Euler Mean Flows

digitado ⋅ February 4, 2026

arXiv:2602.02571v1 Announce Type: new Abstract: We propose Euler Mean Flows (EMF), a flow-based generative framework for one-step and few-step generation that enforces long-range trajectory consistency with minimal sampling cost. The key idea of EMF is to replace the trajectory consistency constraint, which is difficult to supervise and optimize over long time scales, with a principled linear surrogate that enables direct data supervision for long-horizon flow-map compositions. We derive this approximation from the semigroup formulation of flow-based models and […]


An Improved Quasi-Physical Dynamic Algorithm for Efficient Circular Coverage in Arbitrary Convex

digitado ⋅ February 4, 2026

arXiv:2602.02570v1 Announce Type: new Abstract: The optimal circle coverage problem aims to find a configuration of circles that maximizes the covered area within a given region. Although theoretical optimal solutions exist for simple cases, the problem’s NP-hardness makes it computationally intractable for complex polygons with numerous circles. Prevailing methods are largely confined to regular domains, while the few algorithms designed for irregular polygons suffer from poor initialization, unmanaged boundary effects, and excessive overlap among circles, resulting […]
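The objective being maximized, covered area within a region, can be sketched with a simple grid estimate. This is only the evaluation of a given configuration, not the quasi-physical algorithm from the paper; the function name `covered_fraction` and the choice of the unit square as the convex region are assumptions for illustration.

```python
def covered_fraction(circles, grid=200):
    """Estimate the fraction of the unit square covered by a union of circles.

    circles: list of (cx, cy, r) tuples.
    The square is sampled at the centers of a grid x grid mesh; a sample
    counts once no matter how many circles contain it, so overlapping
    circles add nothing extra.
    """
    hits = 0
    for i in range(grid):
        x = (i + 0.5) / grid
        for j in range(grid):
            y = (j + 0.5) / grid
            if any((x - cx) ** 2 + (y - cy) ** 2 <= r * r
                   for cx, cy, r in circles):
                hits += 1
    return hits / (grid * grid)


# One circle inscribed in the unit square covers roughly pi / 4 of it.
single = covered_fraction([(0.5, 0.5, 0.5)])
# A duplicated circle is pure overlap and does not increase coverage.
doubled = covered_fraction([(0.5, 0.5, 0.5), (0.5, 0.5, 0.5)])
```

An optimizer for this problem would move circle centers to raise this value; the overlap and boundary effects mentioned in the abstract both show up as area that circles spend without increasing the count of covered samples.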


DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems

digitado ⋅ February 4, 2026

arXiv:2602.02569v1 Announce Type: new Abstract: Fact-checking systems with search-enabled large language models (LLMs) have shown strong potential for verifying claims by dynamically retrieving external evidence. However, the robustness of such systems against adversarial attack remains insufficiently understood. In this work, we study adversarial claim attacks against search-enabled LLM-based fact-checking systems under a realistic input-only threat model. We propose DECEIVE-AFC, an agent-based adversarial attack framework that integrates novel claim-level attack strategies and adversarial claim validity evaluation principles. DECEIVE-AFC systematically […]


Mitigating Task-Order Sensitivity and Forgetting via Hierarchical Second-Order Consolidation

digitado ⋅ February 4, 2026

arXiv:2602.02568v1 Announce Type: new Abstract: We introduce Hierarchical Taylor Series-based Continual Learning (HTCL), a framework that couples fast local adaptation with conservative, second-order global consolidation to address the high variance introduced by random task ordering. To address task-order effects, HTCL identifies the best intra-group task sequence and integrates the resulting local updates through a Hessian-regularized Taylor expansion, yielding a consolidation step with theoretical guarantees. The approach naturally extends to an $L$-level hierarchy, enabling multiscale knowledge integration in a […]
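As background for the "Hessian-regularized Taylor expansion", the generic second-order Taylor expansion of a loss around a reference point is shown below. The symbols ($\mathcal{L}$ for the loss, $\theta_0$ for the reference parameters, $H$ for the Hessian) are standard notation assumed here; the excerpt does not give HTCL's exact consolidation rule, so this is context rather than the paper's formula.

```latex
\mathcal{L}(\theta) \;\approx\;
\mathcal{L}(\theta_0)
\;+\; \nabla \mathcal{L}(\theta_0)^{\top} (\theta - \theta_0)
\;+\; \tfrac{1}{2}\, (\theta - \theta_0)^{\top} H(\theta_0)\, (\theta - \theta_0),
\qquad
H(\theta_0) \;=\; \nabla^{2} \mathcal{L}(\theta_0)
```

The quadratic term penalizes movement along high-curvature directions, which is the usual reason second-order information makes a consolidation step more conservative.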


IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space

digitado ⋅ February 4, 2026

arXiv:2602.02567v1 Announce Type: new Abstract: Arctic sea ice plays a critical role in regulating Earth’s climate system, significantly influencing polar ecological stability and human activities in coastal regions. Recent advances in artificial intelligence have facilitated the development of skillful pan-Arctic sea ice forecasting systems, where data-driven approaches showcase tremendous potential to outperform conventional physics-based numerical models in terms of accuracy, computational efficiency and forecasting lead times. Despite the latest progress made by deep learning (DL) forecasting models, most […]
