February 2026

A$^2$-LLM: An End-to-end Conversational Audio Avatar Large Language Model

digitado ⋅ February 6, 2026

arXiv:2602.04913v1. Abstract: Developing expressive and responsive conversational digital humans is a cornerstone of next-generation human-computer interaction. While large language models (LLMs) have significantly enhanced dialogue capabilities, most current systems still rely on cascaded architectures that connect independent modules. These pipelines are often plagued by accumulated errors, high latency, and poor real-time performance. Lacking access to the underlying conversational context, these pipelines inherently prioritize rigid lip-sync over emotional depth. To address these challenges, we propose A$^2$-LLM, […]


technocracy

Atomic Information Flow: A Network Flow Model for Tool Attributions in RAG Systems

digitado ⋅ February 6, 2026

arXiv:2602.04912v1. Abstract: Many tool-based Retrieval Augmented Generation (RAG) systems lack precise mechanisms for tracing final responses back to specific tool components, a critical gap as systems scale to complex multi-agent architectures. We present Atomic Information Flow (AIF), a graph-based network flow model that decomposes tool outputs and LLM calls into atoms: indivisible, self-contained units of information. By modeling LLM orchestration as a directed flow of atoms from tool and LLM nodes to a response […]



A logical re-conception of neural networks: Hamiltonian bitwise part-whole architecture

digitado ⋅ February 6, 2026

arXiv:2602.04911v1. Abstract: We introduce a simple initial working system in which relations (such as part-whole) are directly represented via an architecture with operating and learning rules fundamentally distinct from standard artificial neural network methods. Arbitrary data are straightforwardly encoded as graphs whose edges correspond to codes from a small fixed primitive set of elemental pairwise relations, such that simple relational encoding is not an add-on, but occurs intrinsically within the most basic components of the […]



Reducing the Costs of Proof Synthesis on Rust Systems by Scaling Up a Seed Training Set

digitado ⋅ February 6, 2026

arXiv:2602.04910v1. Abstract: Large Language Models (LLMs) are widely used for code generation. However, the correctness of code generated by LLMs remains a concern. A potential remedy is to have LLMs generate formal correctness proofs along with such code. However, compared with code generation, code-proof generation requires much higher reasoning capability and has much less existing data to learn from. In this paper, we present VeruSyn, a data synthesis pipeline for Verus, a […]



Learning Where It Matters: Geometric Anchoring for Robust Preference Alignment

digitado ⋅ February 6, 2026

arXiv:2602.04909v1. Abstract: Direct Preference Optimization (DPO) and related methods align large language models from pairwise preferences by regularizing updates against a fixed reference policy. As the policy drifts, however, a static reference can become increasingly miscalibrated, leading to distributional mismatch and amplifying spurious preference signals under noisy supervision. Conversely, reference-free variants avoid mismatch but often suffer from unconstrained reward drift. We propose Geometric Anchor Preference Optimization (GAPO), which replaces the fixed reference with a dynamic, […]



Temporal Pair Consistency for Variance-Reduced Flow Matching

digitado ⋅ February 6, 2026

arXiv:2602.04908v1. Abstract: Continuous-time generative models, such as diffusion models, flow matching, and rectified flow, learn time-dependent vector fields but are typically trained with objectives that treat timesteps independently, leading to high estimator variance and inefficient sampling. Prior approaches mitigate this via explicit smoothness penalties, trajectory regularization, or modified probability paths and solvers. We introduce Temporal Pair Consistency (TPC), a lightweight variance-reduction principle that couples velocity predictions at paired timesteps along the same probability path, operating […]



Physics as the Inductive Bias for Causal Discovery

digitado ⋅ February 6, 2026

arXiv:2602.04907v1. Abstract: Causal discovery is often a data-driven paradigm for analyzing complex real-world systems. In parallel, physics-based models such as ordinary differential equations (ODEs) provide mechanistic structure for many dynamical processes. Integrating these paradigms potentially allows physical knowledge to act as an inductive bias, improving identifiability, stability, and robustness of causal discovery in dynamical systems. However, such integration remains challenging: real dynamical systems often exhibit feedback, cyclic interactions, and non-stationary data trends, while many widely […]



LISA: Laplacian In-context Spectral Analysis

digitado ⋅ February 6, 2026

arXiv:2602.04906v1. Abstract: We propose Laplacian In-context Spectral Analysis (LISA), a method for inference-time adaptation of Laplacian-based time-series models using only an observed prefix. LISA combines delay-coordinate embeddings and Laplacian spectral learning to produce diffusion-coordinate state representations, together with a frozen nonlinear decoder for one-step prediction. We introduce lightweight latent-space residual adapters based on either Gaussian-process regression or an attention-like Markov operator over context windows. Across forecasting and autoregressive rollout experiments, LISA improves over the frozen […]



DCER: Dual-Stage Compression and Energy-Based Reconstruction

digitado ⋅ February 6, 2026

arXiv:2602.04904v1. Abstract: Multimodal fusion faces two robustness challenges: noisy inputs degrade representation quality, and missing modalities cause prediction failures. We propose DCER, a unified framework addressing both challenges through dual-stage compression and energy-based reconstruction. The compression stage operates at two levels: within-modality frequency transforms (wavelet for audio, DCT for video) remove noise while preserving task-relevant patterns, and cross-modality bottleneck tokens force genuine integration rather than modality-specific shortcuts. For missing modalities, energy-based reconstruction recovers representations via […]



Mind the Performance Gap: Capability-Behavior Trade-offs in Feature Steering

digitado ⋅ February 6, 2026

arXiv:2602.04903v1. Abstract: Feature steering has emerged as a promising approach for controlling LLM behavior through direct manipulation of internal representations, offering advantages over prompt engineering. However, its practical effectiveness in real-world applications remains poorly understood, particularly regarding potential trade-offs with output quality. We show that feature steering methods substantially degrade model performance even when they successfully control target behaviors, a critical trade-off. Specifically, we evaluate Goodfire’s Auto Steer against prompt engineering baselines across 14 steering queries […]
