January 2026

A Set of Rules for Model Validation

digitado ⋅ 30 de January de 2026

arXiv:2511.20711v2 Announce Type: replace-cross Abstract: The validation of a data-driven model is the process of assessing the model’s ability to generalize to new, unseen data in the population of interest. This paper proposes a set of general rules for model validation. These rules are designed to help practitioners create reliable validation plans and report their results transparently. While no validation scheme is flawless, these rules can help practitioners ensure their strategy is sufficient for practical use, openly discuss […]

Ver mais

Like 0

Liked Liked

technocracy

Geometry-Aware Deep Congruence Networks for Manifold Learning in Cross-Subject Motor Imagery

digitado ⋅ 30 de January de 2026

arXiv:2511.18940v2 Announce Type: replace-cross Abstract: Cross-subject motor-imagery decoding remains a major challenge in EEG-based brain-computer interfaces. To mitigate strong inter-subject variability, recent work has emphasized manifold-based approaches operating on covariance representations. Yet dispersion scaling and orientation alignment remain largely unaddressed in existing methods. In this paper, we address both issues through congruence transforms and introduce three complementary geometry-aware models: (i) Discriminative Congruence Transform (DCT), (ii) Deep Linear DCT (DLDCT), and (iii) Deep DCT-UNet (DDCT-UNet). These models are evaluated […]

Ver mais

Like 0

Liked Liked

technocracy

Policy Learning with Abstention

digitado ⋅ 30 de January de 2026

arXiv:2510.19672v3 Announce Type: replace-cross Abstract: Policy learning algorithms are widely used in areas such as personalized medicine and advertising to develop individualized treatment regimes. However, most methods force a decision even when predictions are uncertain, which is risky in high-stakes settings. We study policy learning with abstention, where a policy may defer to a safe default or an expert. When a policy abstains, it receives a small additive reward on top of the value of a random guess. […]

Ver mais

Like 0

Liked Liked

technocracy

DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering

digitado ⋅ 30 de January de 2026

arXiv:2510.07132v2 Announce Type: replace-cross Abstract: Clustered Federated Learning (CFL) improves performance under non-IID client heterogeneity by clustering clients and training one model per cluster, thereby balancing between a global model and fully personalized models. However, most CFL methods require the number of clusters K to be fixed a priori, which is impractical when the latent structure is unknown. We propose DPMM-CFL, a CFL algorithm that places a Dirichlet Process (DP) prior over the distribution of cluster parameters. This […]

Ver mais

Like 0

Liked Liked

technocracy

Machine Learning. The Science of Selection under Uncertainty

digitado ⋅ 30 de January de 2026

arXiv:2509.21547v2 Announce Type: replace-cross Abstract: Learning, whether natural or artificial, is a process of selection. It starts with a set of candidate options and selects the more successful ones. In the case of machine learning the selection is done based on empirical estimates of prediction accuracy of candidate prediction rules on some data. Due to randomness of data sampling the empirical estimates are inherently noisy, leading to selection under uncertainty. The book provides statistical tools to obtain theoretical […]

Ver mais

Like 0

Liked Liked

technocracy

High Effort, Low Gain: Fundamental Limits of Active Learning for Linear Dynamical Systems

digitado ⋅ 30 de January de 2026

arXiv:2509.11907v2 Announce Type: replace-cross Abstract: In this work, we consider the problem of identifying an unknown linear dynamical system given a finite hypothesis class. In particular, we analyze the effect of the excitation input on the sample complexity of identifying the true system with high probability. To this end, we present sample complexity lower bounds that capture the choice of the selected excitation input. The sample complexity lower bound gives rise to a system theoretic condition to determine […]

Ver mais

Like 0

Liked Liked

technocracy

Dealing with Uncertainty in Contextual Anomaly Detection

digitado ⋅ 30 de January de 2026

arXiv:2507.04490v2 Announce Type: replace-cross Abstract: Contextual anomaly detection (CAD) aims to identify anomalies in a target (behavioral) variable conditioned on a set of contextual variables that influence the normalcy of the target variable but are not themselves indicators of anomaly. In many anomaly detection tasks, there exist contextual variables that influence the normalcy of the target variable but are not themselves indicators of anomaly. In this work, we propose a novel framework for CAD, normalcy score (NS), that […]

Ver mais

Like 0

Liked Liked

technocracy

Representative Action Selection for Large Action Space Bandit Families

digitado ⋅ 30 de January de 2026

arXiv:2505.18269v4 Announce Type: replace-cross Abstract: We study the problem of selecting a subset from a large action space shared by a family of bandits, with the goal of achieving performance nearly matching that of using the full action space. Indeed, in many natural situations, while the nominal set of actions may be large, there also exist significant correlations between the rewards of different actions. In this paper we propose an algorithm that can significantly reduce the action space […]

Ver mais

Like 0

Liked Liked

technocracy

Metric Graph Kernels via the Tropical Torelli Map

digitado ⋅ 30 de January de 2026

arXiv:2505.12129v2 Announce Type: replace-cross Abstract: We introduce the first graph kernels for metric graphs via tropical algebraic geometry. In contrast to conventional graph kernels based on graph combinatorics such as nodes, edges, and subgraphs, our metric graph kernels are purely based on the geometry and topology of the underlying metric space. A key characterizing property of our construction is its invariance under edge subdivision, making the kernels intrinsically well-suited for comparing graphs representing different underlying metric spaces. We […]

Ver mais

Like 0

Liked Liked

technocracy

Utilising Gradient-Based Proposals Within Sequential Monte Carlo Samplers for Training of Partial Bayesian Neural Networks

digitado ⋅ 30 de January de 2026

arXiv:2505.03797v2 Announce Type: replace-cross Abstract: Partial Bayesian neural networks (pBNNs) have been shown to perform competitively with fully Bayesian neural networks while only having a subset of the parameters be stochastic. Using sequential Monte Carlo (SMC) samplers as the inference method for pBNNs gives a non-parametric probabilistic estimation of the stochastic parameters, and has shown improved performance over parametric methods. In this paper we introduce a new SMC-based training method for pBNNs by utilising a guided proposal and […]

Ver mais

Like 0

Liked Liked