February 2026

Neural network optimization strategies and the topography of the loss landscape

digitado ⋅ February 26, 2026

arXiv:2602.21276v1 Announce Type: cross Abstract: Neural networks are trained by optimizing multi-dimensional sets of fitting parameters on non-convex loss landscapes. Low-loss regions of the landscapes correspond to the parameter sets that perform well on the training data. A key issue in machine learning is the performance of trained neural networks on previously unseen test data. Here, we investigate neural network training by stochastic gradient descent (SGD) – a non-convex global optimization algorithm which relies only on the gradient […]

technocracy

Group Orthogonalized Policy Optimization: Group Policy Optimization as Orthogonal Projection in Hilbert Space

digitado ⋅ February 26, 2026

arXiv:2602.21269v1 Announce Type: cross Abstract: We present Group Orthogonalized Policy Optimization (GOPO), a new alignment algorithm for large language models derived from the geometry of Hilbert function spaces. Instead of optimizing on the probability simplex and inheriting the exponential curvature of Kullback-Leibler divergence, GOPO lifts alignment into the Hilbert space $L^2(\pi_k)$ of square-integrable functions with respect to the reference policy. Within this space, the simplex constraint reduces to a linear orthogonality condition = 0, defining a codimension-one subspace […]

Probing the Geometry of Diffusion Models with the String Method

digitado ⋅ February 26, 2026

arXiv:2602.22122v1 Announce Type: new Abstract: Understanding the geometry of learned distributions is fundamental to improving and interpreting diffusion models, yet systematic tools for exploring their landscape remain limited. Standard latent-space interpolations fail to respect the structure of the learned distribution, often traversing low-density regions. We introduce a framework based on the string method that computes continuous paths between samples by evolving curves under the learned score function. Operating on pretrained models without retraining, our approach interpolates between three […]

Scalable Kernel-Based Distances for Statistical Inference and Integration

digitado ⋅ February 26, 2026

arXiv:2602.21846v1 Announce Type: new Abstract: Representing, comparing, and measuring the distance between probability distributions is a key task in computational statistics and machine learning. The choice of representation and the associated distance determine properties of the methods in which they are used: for example, certain distances can allow one to encode robustness or smoothness of the problem. Kernel methods offer flexible and rich Hilbert space representations of distributions that allow the modeller to enforce properties through the choice […]

Goodness-of-Fit Tests for Latent Class Models with Ordinal Categorical Data

digitado ⋅ February 26, 2026

arXiv:2602.21572v1 Announce Type: new Abstract: Ordinal categorical data are widely collected in psychology, education, and other social sciences, appearing commonly in questionnaires, assessments, and surveys. Latent class models provide a flexible framework for uncovering unobserved heterogeneity by grouping individuals into homogeneous classes based on their response patterns. A fundamental challenge in applying these models is determining the number of latent classes, which is unknown and must be inferred from data. In this paper, we propose one test statistic […]

Fair Model-based Clustering

digitado ⋅ February 26, 2026

arXiv:2602.21509v1 Announce Type: new Abstract: The goal of fair clustering is to find clusters such that the proportion of sensitive attributes (e.g., gender, race, etc.) in each cluster is similar to that of the entire dataset. Various fair clustering algorithms have been proposed that modify standard K-means clustering to satisfy a given fairness constraint. A critical limitation of several existing fair clustering algorithms is that the number of parameters to be learned is proportional to the sample size […]
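
The fairness goal stated in this abstract (each cluster's share of a sensitive attribute should match the dataset-wide share) can be made concrete with a small check. The sketch below is only an illustration of that definition, not the paper's algorithm; the function names `group_proportions` and `max_proportion_gap` are hypothetical.

```python
from collections import Counter


def group_proportions(groups):
    """Return the share of each sensitive-attribute value in `groups`."""
    counts = Counter(groups)
    n = len(groups)
    return {g: c / n for g, c in counts.items()}


def max_proportion_gap(labels, groups):
    """Largest absolute gap between a cluster's group share and the
    dataset-wide share, over all clusters and all groups.

    labels: cluster assignment per sample.
    groups: sensitive-attribute value per sample (same length).
    A value of 0.0 means every cluster mirrors the overall proportions.
    """
    overall = group_proportions(groups)
    worst = 0.0
    for cluster in set(labels):
        members = [g for l, g in zip(labels, groups) if l == cluster]
        local = group_proportions(members)
        for g, p in overall.items():
            # A group absent from a cluster has share 0 there.
            worst = max(worst, abs(local.get(g, 0.0) - p))
    return worst
```

A clustering is closer to the stated notion of fairness the smaller this gap is; how the gap is constrained during model fitting is a separate question that this sketch does not address.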

Global Sequential Testing for Multi-Stream Auditing

digitado ⋅ February 26, 2026

arXiv:2602.21479v1 Announce Type: new Abstract: Across many risk-sensitive areas, it is critical to continuously audit the performance of machine learning systems and detect any unusual behavior quickly. This can be modeled as a sequential hypothesis testing problem with $k$ incoming streams of data and a global null hypothesis that asserts that the system is working as expected across all $k$ streams. The standard global test employs a Bonferroni correction and has an expected stopping time bound of $O\left(\ln\frac{k}{\alpha}\right)$ […]
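
To see where a $\ln\frac{k}{\alpha}$ term can come from, here is a minimal sketch of a Bonferroni-corrected global sequential test. It assumes, beyond what the abstract states, that each stream is monitored with a simple betting martingale on observations in $[-1, 1]$ that have mean zero under the null; the function names and the fixed `bet` parameter are hypothetical. Each stream's wealth must cross $k/\alpha$ rather than $1/\alpha$, so the log-wealth threshold is $\ln(k/\alpha)$.

```python
import math


def bonferroni_log_threshold(k: int, alpha: float) -> float:
    """Log-wealth threshold ln(k / alpha) applied to each of k streams."""
    return math.log(k / alpha)


def global_sequential_test(streams, alpha, bet=0.5):
    """Monitor k streams and stop as soon as any one rejects.

    streams: list of k equal-length lists with values in [-1, 1]
             (assumed mean zero under the null hypothesis).
    bet:     fixed betting fraction in (0, 1); an illustrative choice.
    Returns (stopping_time, stream_index), or None if no stream rejects.
    """
    k = len(streams)
    threshold = bonferroni_log_threshold(k, alpha)
    log_wealth = [0.0] * k
    for t in range(len(streams[0])):
        for i in range(k):
            # Wealth is multiplied by (1 + bet * x): a nonnegative
            # martingale under the null, so Ville's inequality bounds
            # the chance of ever reaching k / alpha by alpha / k.
            log_wealth[i] += math.log(1.0 + bet * streams[i][t])
            if log_wealth[i] >= threshold:
                return t + 1, i
    return None
```

A union bound over the $k$ streams then keeps the global false-alarm probability at most $\alpha$. Under a fixed alternative the log-wealth grows roughly linearly, which is one way to read a stopping time of order $\ln\frac{k}{\alpha}$.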

Efficient Inference after Directionally Stable Adaptive Experiments

digitado ⋅ February 26, 2026

arXiv:2602.21478v1 Announce Type: new Abstract: We study inference on scalar-valued pathwise differentiable targets after adaptive data collection, such as a bandit algorithm. We introduce a novel target-specific condition, directional stability, which is strictly weaker than previously imposed target-agnostic stability conditions. Under directional stability, we show that estimators that would have been efficient under i.i.d. data remain asymptotically normal and semiparametrically efficient when computed from adaptively collected trajectories. The canonical gradient has a martingale form, and directional stability guarantees […]

ConformalHDC: Uncertainty-Aware Hyperdimensional Computing with Application to Neural Decoding

digitado ⋅ February 26, 2026

arXiv:2602.21446v1 Announce Type: new Abstract: Hyperdimensional Computing (HDC) offers a computationally efficient paradigm for neuromorphic learning. Yet, it lacks rigorous uncertainty quantification, leading to open decision boundaries and, consequently, vulnerability to outliers, adversarial perturbations, and out-of-distribution inputs. To address these limitations, we introduce ConformalHDC, a unified framework that combines the statistical guarantees of conformal prediction with the computational efficiency of HDC. For this framework, we propose two complementary variations. First, the set-valued formulation provides finite-sample, distribution-free coverage guarantees. […]

Efficient Uncoupled Learning Dynamics with $\tilde{O}\!\left(T^{-1/4}\right)$ Last-Iterate Convergence in Bilinear Saddle-Point Problems over Convex Sets under Bandit Feedback

digitado ⋅ February 26, 2026

arXiv:2602.21436v1 Announce Type: new Abstract: In this paper, we study last-iterate convergence of learning algorithms in bilinear saddle-point problems, a preferable notion of convergence that captures the day-to-day behavior of learning dynamics. We focus on the challenging setting where players select actions from compact convex sets and receive only bandit feedback. Our main contribution is the design of an uncoupled learning algorithm that guarantees last-iterate convergence to the Nash equilibrium with high probability. We establish a convergence rate […]
