January 2026

Success Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success

digitado ⋅ 27 de January de 2026

arXiv:2601.18175v1 Announce Type: cross Abstract: A widely used technique for improving policies is success conditioning, in which one collects trajectories, identifies those that achieve a desired outcome, and updates the policy to imitate the actions taken along successful trajectories. This principle appears under many names — rejection sampling with SFT, goal-conditioned RL, Decision Transformers — yet what optimization problem it solves, if any, has remained unclear. We prove that success conditioning exactly solves a trust-region optimization problem, maximizing […]

Ver mais

Like 0

Liked Liked

technocracy

Feature-Space Generative Models for One-Shot Class-Incremental Learning

digitado ⋅ 27 de January de 2026

arXiv:2601.17905v1 Announce Type: cross Abstract: Few-shot class-incremental learning (FSCIL) is a paradigm where a model, initially trained on a dataset of base classes, must adapt to an expanding problem space by recognizing novel classes with limited data. We focus on the challenging FSCIL setup where a model receives only a single sample (1-shot) for each novel class and no further training or model alterations are allowed after the base training phase. This makes generalization to novel classes particularly […]

Ver mais

Like 0

Liked Liked

technocracy

A Universal Load Balancing Principle and Its Application to Large Language Model Serving

digitado ⋅ 27 de January de 2026

arXiv:2601.17855v1 Announce Type: cross Abstract: Load balancing-the allocation of work across parallel resources to reduce delay, energy and cost-is a pervasive challenge in science and engineering, from large-scale simulation and data processing to cloud and manufacturing operations. Motivated by the emerging bottleneck in large language model (LLM) serving, we study a particularly stringent regime of load balancing that arises in barrier-synchronized, stateful systems: work cannot be freely migrated and progress is gated by the slowest participant at each […]

Ver mais

Like 0

Liked Liked

technocracy

Semantic-Aware Task Clustering for Federated Cooperative Multi-Task Semantic Communication

digitado ⋅ 27 de January de 2026

arXiv:2601.17419v1 Announce Type: cross Abstract: Task-oriented semantic communication (SemCom) prioritizes task execution over accurate symbol reconstruction and is well-suited to emerging intelligent applications. Cooperative multi-task SemCom (CMT-SemCom) further improves task execution performance. However, [1] demonstrates that cooperative multi-tasking can be either constructive or destructive. Moreover, the existing CMT-SemCom framework is not directly applicable to distributed multi-user scenarios, such as non-terrestrial satellite networks, where each satellite employs an individual semantic encoder. In this paper, we extend our earlier CMT-SemCom […]

Ver mais

Like 0

Liked Liked

technocracy

Covariate-assisted Grade of Membership Models via Shared Latent Geometry

digitado ⋅ 27 de January de 2026

arXiv:2601.17265v1 Announce Type: cross Abstract: The grade of membership model is a flexible latent variable model for analyzing multivariate categorical data through individual-level mixed membership scores. In many modern applications, auxiliary covariates are collected alongside responses and encode information about the same latent structure. Traditional approaches to incorporating such covariates typically rely on fully specified joint likelihoods, which are computationally intensive and sensitive to misspecification. We introduce a covariate-assisted grade of membership model that integrates response and covariate […]

Ver mais

Like 0

Liked Liked

technocracy

A Unified Kantorovich Duality for Multimarginal Optimal Transport

digitado ⋅ 27 de January de 2026

arXiv:2601.17171v1 Announce Type: cross Abstract: Multimarginal optimal transport (MOT) has gained increasing attention in recent years, notably due to its relevance in machine learning and statistics, where one seeks to jointly compare and align multiple probability distributions. This paper presents a unified and complete Kantorovich duality theory for MOT problem on general Polish product spaces with bounded continuous cost function. For marginal compact spaces, the duality identity is derived through a convex-analytic reformulation, that identifies the dual problem […]

Ver mais

Like 0

Liked Liked

technocracy

Falsifying Predictive Algorithm

digitado ⋅ 27 de January de 2026

arXiv:2601.17146v1 Announce Type: cross Abstract: Empirical investigations into unintended model behavior often show that the algorithm is predicting another outcome than what was intended. These exposes highlight the need to identify when algorithms predict unintended quantities – ideally before deploying them into consequential settings. We propose a falsification framework that provides a principled statistical test for discriminant validity: the requirement that an algorithm predict intended outcomes better than impermissible ones. Drawing on falsification practices from causal inference, econometrics, […]

Ver mais

Like 0

Liked Liked

technocracy

Attention-Based Variational Framework for Joint and Individual Components Learning with Applications in Brain Network Analysis

digitado ⋅ 27 de January de 2026

arXiv:2601.17073v1 Announce Type: cross Abstract: Brain organization is increasingly characterized through multiple imaging modalities, most notably structural connectivity (SC) and functional connectivity (FC). Integrating these inherently distinct yet complementary data sources is essential for uncovering the cross-modal patterns that drive behavioral phenotypes. However, effective integration is hindered by the high dimensionality and non-linearity of connectome data, complex non-linear SC-FC coupling, and the challenge of disentangling shared information from modality-specific variations. To address these issues, we propose the Cross-Modal […]

Ver mais

Like 0

Liked Liked

technocracy

Uncertainty Quantification for Named Entity Recognition via Full-Sequence and Subsequence Conformal Prediction

digitado ⋅ 27 de January de 2026

arXiv:2601.16999v1 Announce Type: cross Abstract: Named Entity Recognition (NER) serves as a foundational component in many natural language processing (NLP) pipelines. However, current NER models typically output a single predicted label sequence without any accompanying measure of uncertainty, leaving downstream applications vulnerable to cascading errors. In this paper, we introduce a general framework for adapting sequence-labeling-based NER models to produce uncertainty-aware prediction sets. These prediction sets are collections of full-sentence labelings that are guaranteed to contain the correct […]

Ver mais

Like 0

Liked Liked

technocracy

Out-of-Distribution Radar Detection with Complex VAEs: Theory, Whitening, and ANMF Fusion

digitado ⋅ 27 de January de 2026

arXiv:2601.18677v1 Announce Type: new Abstract: We investigate the detection of weak complex-valued signals immersed in non-Gaussian, range-varying interference, with emphasis on maritime radar scenarios. The proposed methodology exploits a Complex-valued Variational AutoEncoder (CVAE) trained exclusively on clutter-plus-noise to perform Out-Of-Distribution detection. By operating directly on in-phase / quadrature samples, the CVAE preserves phase and Doppler structure and is assessed in two configurations: (i) using unprocessed range profiles and (ii) after local whitening, where per-range covariance estimates are obtained […]

Ver mais

Like 0

Liked Liked