February 2026

Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

digitado ⋅ 23 de February de 2026

arXiv:2602.17835v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) relies critically on selecting training data that most benefits a model’s downstream performance. Gradient-based data selection methods such as TracIn and Influence Functions leverage influence to identify useful samples, but their computational cost scales poorly, making them impractical for multi-billion-parameter large language models (LLMs). A common alternative is to use off-the-shelf smaller models as proxies, but they remain suboptimal since their learning dynamics are unclear, their sizes cannot be flexibly […]

Ver mais

Like 0

Liked Liked

technocracy

Distributed Triangle Enumeration in Hypergraphs

digitado ⋅ 23 de February de 2026

arXiv:2602.17834v1 Announce Type: new Abstract: In the last decade, subgraph detection and enumeration have emerged as a central problem in distributed graph algorithms. This is largely due to the theoretical challenges and practical applications of these problems. In this paper, we initiate the systematic study of distributed sub-hypergraph enumeration in hypergraphs. To this end, we (1)~introduce several computational models for hypergraphs that generalize the CONGEST model for graphs and evaluate their relative computational power, (2)~devise algorithms for distributed […]

Ver mais

Like 0

Liked Liked

technocracy

MePoly: Max Entropy Polynomial Policy Optimization

digitado ⋅ 23 de February de 2026

arXiv:2602.17832v1 Announce Type: new Abstract: Stochastic Optimal Control provides a unified mathematical framework for solving complex decision-making problems, encompassing paradigms such as maximum entropy reinforcement learning(RL) and imitation learning(IL). However, conventional parametric policies often struggle to represent the multi-modality of the solutions. Though diffusion-based policies are aimed at recovering the multi-modality, they lack an explicit probability density, which complicates policy-gradient optimization. To bridge this gap, we propose MePoly, a novel policy parameterization based on polynomial energy-based models. MePoly […]

Ver mais

Like 0

Liked Liked

technocracy

The Token Games: Evaluating Language Model Reasoning with Puzzle Duels

digitado ⋅ 23 de February de 2026

arXiv:2602.17831v1 Announce Type: new Abstract: Evaluating the reasoning capabilities of Large Language Models is increasingly challenging as models improve. Human curation of hard questions is highly expensive, especially in recent benchmarks using PhD-level domain knowledge to challenge the most capable models. Even then, there is always a concern about whether these questions test genuine reasoning or if similar problems have been seen during training. Here, we take inspiration from 16th-century mathematical duels to design The Token Games (TTG): […]

Ver mais

Like 0

Liked Liked

technocracy

Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

digitado ⋅ 23 de February de 2026

arXiv:2602.17829v1 Announce Type: new Abstract: Inferring causal relations in timeseries data with delayed effects is a fundamental challenge, especially when the underlying system exhibits complex dynamics that cannot be captured by simple functional mappings. Traditional approaches often fail to produce generalized and interpretable explanations, as multiple distinct input trajectories may yield nearly indistinguishable outputs. In this work, we present ruleXplain, a framework that leverages Large Language Models (LLMs) to extract formal explanations for input-output relations in simulation-driven dynamical […]

Ver mais

Like 0

Liked Liked

technocracy

Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge

digitado ⋅ 23 de February de 2026

arXiv:2602.17826v1 Announce Type: new Abstract: Language models exhibit fundamental limitations — hallucination, brittleness, and lack of formal grounding — that are particularly problematic in high-stakes specialist fields requiring verifiable reasoning. I investigate whether formal domain ontologies can enhance language model reliability through retrieval-augmented generation. Using mathematics as proof of concept, I implement a neuro-symbolic pipeline leveraging the OpenMath ontology with hybrid retrieval and cross-encoder reranking to inject relevant definitions into model prompts. Evaluation on the MATH benchmark with […]

Ver mais

Like 0

Liked Liked

technocracy

Evolution of Safety Requirements in Industrial Robotics: Comparative Analysis of ISO 10218-1/2 (2011 vs. 2025) and Integration of ISO/TS 15066

digitado ⋅ 23 de February de 2026

arXiv:2602.17822v1 Announce Type: new Abstract: Industrial robotics has established itself as an integral component of large-scale manufacturing enterprises. Simultaneously, collaborative robotics is gaining prominence, introducing novel paradigms of human-machine interaction. These advancements have necessitated a comprehensive revision of safety standards, specifically incorporating requirements for cybersecurity and protection against unauthorized access in networked robotic systems. This article presents a comparative analysis of the ISO 10218:2011 and ISO 10218:2025 standards, examining the evolution of their structure, terminology, technical requirements, and […]

Ver mais

Like 0

Liked Liked

technocracy

Variational optimization approach for reconstruction of dielectric permittivity and conductivity functions using partial boundary measurements

digitado ⋅ 23 de February de 2026

arXiv:2602.17819v1 Announce Type: new Abstract: We present a variational optimization approach for the solution of a coefficient inverse problem of simultaneous reconstruction of the dielectric permittivity and conductivity functions in time-dependent Maxwell’s system using limited boundary observations of the electric field. The variational optimization approach is based on constructing a weak form of a Lagrangian which allows to use finite element based reconstruction algorithms. The optimality conditions for the Lagrangian and stability estimate for the adjoint problem are […]

Ver mais

Like 0

Liked Liked

technocracy

Lend me an Ear: Speech Enhancement Using a Robotic Arm with a Microphone Array

digitado ⋅ 23 de February de 2026

arXiv:2602.17818v1 Announce Type: new Abstract: Speech enhancement performance degrades significantly in noisy environments, limiting the deployment of speech-controlled technologies in industrial settings, such as manufacturing plants. Existing speech enhancement solutions primarly rely on advanced digital signal processing techniques, deep learning methods, or complex software optimization techniques. This paper introduces a novel enhancement strategy that incorporates a physical optimization stage by dynamically modifying the geometry of a microphone array to adapt to changing acoustic conditions. A sixteen-microphone array is […]

Ver mais

Like 0

Liked Liked

technocracy

GPU Memory and Utilization Estimation for Training-Aware Resource Management: Opportunities and Limitations

digitado ⋅ 23 de February de 2026

arXiv:2602.17817v1 Announce Type: new Abstract: Collocating deep learning training tasks improves GPU utilization but causes drastic slowdowns due to resource contention and risks Out-of-Memory (OOM) failures. Accurate memory estimation is essential for robust collocation, while GPU utilization — a key proxy for resource contention — enables interference-aware scheduling to reduce slowdowns and improve throughput. Existing GPU memory estimators span three paradigms — analytical models, CPU-side libraries, and ML-based estimators — each with distinct limitations: dependence on detailed model […]

Ver mais

Like 0

Liked Liked