February 2026

Beyond Output Critique: Self-Correction via Task Distillation

digitado ⋅ 3 de February de 2026

arXiv:2602.00871v1 Announce Type: new Abstract: Large language models (LLMs) have shown promising self-correction abilities, where iterative refinement improves the quality of generated responses. However, most existing approaches operate at the level of output critique, patching surface errors while often failing to correct deeper reasoning flaws. We propose SELF-THOUGHT, a framework that introduces an intermediate step of task abstraction before solution refinement. Given an input and an initial response, the model first distills the task into a structured template […]

Ver mais

Like 0

Liked Liked

technocracy

Finite Element Eigenfunction Network (FEENet): A Hybrid Framework for Solving PDEs on Complex Geometries

digitado ⋅ 3 de February de 2026

arXiv:2602.00870v1 Announce Type: new Abstract: Neural operators aim to learn mappings between infinite-dimensional function spaces, but their performance often degrades on complex or irregular geometries due to the lack of geometry-aware representations. We propose the Finite Element Eigenfunction Network (FEENet), a hybrid spectral learning framework grounded in the eigenfunction theory of differential operators. For a given domain, FEENet leverages the Finite Element Method (FEM)toperformaone-timecomputationofaneigenfunctionbasisintrinsictothegeometry. PDE solutions are subsequently represented in this geometry-adapted basis, and learning is reduced to […]

Ver mais

Like 0

Liked Liked

technocracy

Improving Flow Matching by Aligning Flow Divergence

digitado ⋅ 3 de February de 2026

arXiv:2602.00869v1 Announce Type: new Abstract: Conditional flow matching (CFM) stands out as an efficient, simulation-free approach for training flow-based generative models, achieving remarkable performance for data generation. However, CFM is insufficient to ensure accuracy in learning probability paths. In this paper, we introduce a new partial differential equation characterization for the error between the learned and exact probability paths, along with its solution. We show that the total variation gap between the two probability paths is bounded above […]

Ver mais

Like 0

Liked Liked

technocracy

Safe Stochastic Explorer: Enabling Safe Goal Driven Exploration in Stochastic Environments and Safe Interaction with Unknown Objects

digitado ⋅ 3 de February de 2026

arXiv:2602.00868v1 Announce Type: new Abstract: Autonomous robots operating in unstructured, safety-critical environments, from planetary exploration to warehouses and homes, must learn to safely navigate and interact with their surroundings despite limited prior knowledge. Current methods for safe control, such as Hamilton-Jacobi Reachability and Control Barrier Functions, assume known system dynamics. Meanwhile existing safe exploration techniques often fail to account for the unavoidable stochasticity inherent when operating in unknown real world environments, such as an exploratory rover skidding over […]

Ver mais

Like 0

Liked Liked

technocracy

Foundation CAN LM: A Pretrained Language Model For Automotive CAN Data

digitado ⋅ 3 de February de 2026

arXiv:2602.00866v1 Announce Type: new Abstract: The Controller Area Network (CAN) bus provides a rich source of vehicular signals increasingly leveraged for applications in automotive and auto insurance domains, including collision detection, predictive maintenance, and driver risk modeling. Despite this potential, existing pipelines largely train isolated task-specific models on raw CAN data, with only limited efforts exploring decoded signals. Such fragmentation prevents shared representation learning and limits cross-task generalization. By contrast, natural language processing (NLP) and computer vision (CV) […]

Ver mais

Like 0

Liked Liked

technocracy

Distill3R: A Pipeline for Democratizing 3D Foundation Models on Commodity Hardware

digitado ⋅ 3 de February de 2026

arXiv:2602.00865v1 Announce Type: new Abstract: While multi-view 3D reconstruction has shifted toward large-scale foundation models capable of inferring globally consistent geometry, their reliance on massive computational clusters for training has created a significant barrier to entry for most academic laboratories. To bridge this compute divide, we introduce Distill3R, a framework designed to distill the geometric reasoning of 3D foundation models into compact students fully trainable on a single workstation. Our methodology centers on two primary innovations: (1) an […]

Ver mais

Like 0

Liked Liked

technocracy

Towards Multiscale Graph-based Protein Learning with Geometric Secondary Structural Motifs

digitado ⋅ 3 de February de 2026

arXiv:2602.00862v1 Announce Type: new Abstract: Graph neural networks (GNNs) have emerged as powerful tools for learning protein structures by capturing spatial relationships at the residue level. However, existing GNN-based methods often face challenges in learning multiscale representations and modeling long-range dependencies efficiently. In this work, we propose an efficient multiscale graph-based learning framework tailored to proteins. Our proposed framework contains two crucial components: (1) It constructs a hierarchical graph representation comprising a collection of fine-grained subgraphs, each corresponding […]

Ver mais

Like 0

Liked Liked

technocracy

Multi-Head Attention Is a Multi-Player Game

digitado ⋅ 3 de February de 2026

arXiv:2602.00861v1 Announce Type: new Abstract: Modern transformer attention is internally multi-agent — heads compete and coordinate — yet we train it as if it were a monolithic optimizer. We formalize this gap: cross-entropy training induces an implicit potential game among heads, and gradient descent converges to Nash equilibria with potentially unbounded inefficiency due to unpriced externalities (redundancy, correlated errors). Our main result bounds the Price of Anarchy by $Gamma(G)$, the off-diagonal mass of a head interaction matrix capturing […]

Ver mais

Like 0

Liked Liked

technocracy

ReACT-TTC: Capacity-Aware Top Trading Cycles for Post-Choice Reassignment in Shared CPS

digitado ⋅ 3 de February de 2026

arXiv:2602.00859v1 Announce Type: new Abstract: Cyber-physical systems (CPS) increasingly manage shared physical resources in the presence of human decision-making, where system-assigned actions must be executed by users or agents in the physical world. A fundamental challenge in such settings is user non-compliance: individuals may deviate from assigned resources due to personal preferences or local information, degrading system efficiency and requiring light-weight reassignment schemes. This paper proposes a post-deviation reassignment framework for shared-resource CPS that operates on top of […]

Ver mais

Like 0

Liked Liked

technocracy

Unifying Adversarial Robustness and Training Across Text Scoring Models

digitado ⋅ 3 de February de 2026

arXiv:2602.00857v1 Announce Type: new Abstract: Research on adversarial robustness in language models is currently fragmented across applications and attacks, obscuring shared vulnerabilities. In this work, we propose unifying the study of adversarial robustness in text scoring models spanning dense retrievers, rerankers, and reward models. This motivates adapting both attacks and adversarial training methods across model roles. Unlike open-ended generation, text scoring failures are directly testable: an attack succeeds when an irrelevant or rejected text outscores a relevant or […]

Ver mais

Like 0

Liked Liked