digitado – Page 234

Another Look at Log-PCA for Probability Measures: A Dynamical Formulation and Statistical Convergence

digitado ⋅ 17 June 2026

arXiv:2606.17196v1. Abstract: This paper is concerned with learning principal variations of random probability measures on $\mathbb{R}^m$ under the Wasserstein geometry. We introduce a new dynamical formulation to interpret the log-PCA, a linearized principal geodesic analysis, as a variational approach. Our differentiable version, termed the Wasserstein Tangential PCA (WT-PCA), captures the local principal modes of geodesic variations of a (weighted) probability measure on the Wasserstein space via its covariance operator at the barycenter. Based on the […]

technocracy

Communication-Efficient and Robust Multi-Modal Federated Learning via Latent-Space Consensus

digitado ⋅ 19 March 2026

Federated learning (FL) enables collaborative model training across distributed devices without sharing raw data, but applying FL to multi-modal settings introduces significant challenges. Clients typically possess heterogeneous modalities and model architectures, making it difficult to align feature spaces efficiently while preserving privacy and minimizing communication costs. To address this, we introduce CoMFed, a Communication-Efficient Multi-Modal Federated Learning framework that uses learnable projection matrices to generate compressed latent representations. A latent-space regularizer aligns these representations across clients, improving cross-modal […]

technocracy

MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization

digitado ⋅ 12 January 2026

Group-Relative Policy Optimization (GRPO) has emerged as an efficient paradigm for aligning Large Language Models (LLMs), yet its efficacy is primarily confined to domains with verifiable ground truths. Extending GRPO to open-domain settings remains a critical challenge, as unconstrained generation entails multi-faceted and often conflicting objectives – such as creativity versus factuality – where rigid, static reward scalarization is inherently suboptimal. To address this, we propose MAESTRO (Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization), which introduces […]

technocracy

A Function-Space Stability Boundary for Generalization in Interpolating Learning Systems

digitado ⋅ 3 February 2026

Modern learning systems often interpolate training data while still generalizing well, yet it remains unclear when algorithmic stability explains this behavior. We model training as a function-space trajectory and measure sensitivity to single-sample perturbations along this trajectory. We propose a contractive propagation condition and a stability certificate obtained by unrolling the resulting recursion. A small certificate implies stability-based generalization, while we also prove that there exist interpolating regimes with small risk where such contractive sensitivity cannot hold, showing […]

technocracy

HOSL: Hybrid-Order Split Learning for Memory-Constrained Edge Training

digitado ⋅ 16 January 2026

Split learning (SL) enables collaborative training of large language models (LLMs) between resource-constrained edge devices and compute-rich servers by partitioning model computation across the network boundary. However, existing SL systems predominantly rely on first-order (FO) optimization, which requires clients to store intermediate quantities such as activations for backpropagation. This results in substantial memory overhead, largely negating benefits of model partitioning. In contrast, zeroth-order (ZO) optimization eliminates backpropagation and significantly reduces memory usage, but often suffers from slow convergence […]

technocracy

Imitation learning for clinical decision support in pediatric ECMO

digitado ⋅ 15 May 2026

Pediatric critical care is a dynamic, high-stakes process involving constant monitoring and adjustments in life-saving treatments. Modeling these interventions is crucial for effective decision support. To address the challenges of high complexity and data scarcity in pediatric Extracorporeal Membrane Oxygenation (ECMO), we frame clinical decision-making as learning to act from trajectories, i.e., imitation learning that learns action models from observational data, with a key feature that actions are not directly observed. We consider TabPFN, a recent transformer-based approach […]

technocracy

Approximate Subgraph Matching with Neural Graph Representations and Reinforcement Learning

digitado ⋅ 18 March 2026

Approximate subgraph matching (ASM) is a task that determines the approximate presence of a given query graph in a large target graph. Being an NP-hard problem, ASM is critical in graph analysis with a myriad of applications ranging from database systems and network science to biochemistry and privacy. Existing techniques often employ heuristic search strategies, which cannot fully utilize the graph information, leading to sub-optimal solutions. This paper proposes a Reinforcement Learning based Approximate Subgraph Matching (RL-ASM) algorithm […]

technocracy

Four astronauts are now inexorably bound for the Moon

digitado ⋅ 3 April 2026

The Orion spacecraft successfully fired its main engine for 5 minutes and 50 seconds on Thursday, sending four astronauts on a free-return trajectory around the Moon. For NASA and the Artemis II crew members, this marked a point of no return for more than a week. About three-quarters of the American population has not witnessed humans leaving low-Earth orbit in their lifetimes. The last time this occurred was 1972, with the final Apollo Moon mission. The “translunar injection” […]

technocracy

Certainty robustness: Evaluating LLM stability under self-challenging prompts

digitado ⋅ 5 March 2026

arXiv:2603.03330v1. Abstract: Large language models (LLMs) often present answers with high apparent confidence despite lacking an explicit mechanism for reasoning about certainty or truth. While existing benchmarks primarily evaluate single-turn accuracy, truthfulness or confidence calibration, they do not capture how models behave when their responses are challenged in interactive settings. We introduce the Certainty Robustness Benchmark, a two-turn evaluation framework that measures how LLMs balance stability and adaptability under self-challenging prompts such as uncertainty (“Are […]

technocracy

The Protocol Built to Keep AI Honest on Live Infrastructure

digitado ⋅ 18 March 2026

The Question That Started This

When you put an AI agent on live infrastructure, a question follows immediately: How do you know what it tells you actually happened? Not whether the AI is capable. Whether the observation it hands you (“BGP session down, here’s why”) reflects what the device actually said. Whether the change it claims to have made actually exists on the device. Whether the signed output you’re trusting as ground truth was collected from […]
