March 2026

PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning

digitado ⋅ 20 de March de 2026

Multi-objective reinforcement learning (MORL) provides an effective solution for decision-making problems involving conflicting objectives. However, achieving high-quality approximations to the Pareto policy set remains challenging, especially in complex tasks with continuous or high-dimensional state-action space. In this paper, we propose the Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning (PA2D-MORL) method, which constructs an efficient scheme for multi-objective problem decomposition and policy improvement, leading to a superior approximation of Pareto policy set. The proposed method leverages Pareto ascent […]

Ver mais

Like 0

Liked Liked

technocracy

An Adaptive Machine Learning Framework for Fluid Flow in Dual-Network Porous Media

digitado ⋅ 20 de March de 2026

Porous materials — natural or engineered — often exhibit dual pore-network structures that govern processes such as mineral exploration and hydrocarbon recovery from tight shales. Double porosity/permeability (DPP) mathematical models describe incompressible fluid flow through two interacting pore networks with inter-network mass exchange. Despite significant advances in numerical methods, there remains a need for computational frameworks that enable rapid forecasting, data assimilation, and reliable inverse analysis. To address this, we present a physics-informed neural network (PINN) framework for […]

Ver mais

Like 0

Liked Liked

technocracy

Learning to Bet for Horizon-Aware Anytime-Valid Testing

digitado ⋅ 20 de March de 2026

We develop horizon-aware anytime-valid tests and confidence sequences for bounded means under a strict deadline $N$. Using the betting/e-process framework, we cast horizon-aware betting as a finite-horizon optimal control problem with state space $(t, log W_t)$, where $t$ is the time and $W_t$ is the test martingale value. We first show that in certain interior regions of the state space, policies that deviate significantly from Kelly betting are provably suboptimal, while Kelly betting reaches the threshold with high […]

Ver mais

Like 0

Liked Liked

technocracy

Subspace Kernel Learning on Tensor Sequences

digitado ⋅ 20 de March de 2026

Learning from structured multi-way data, represented as higher-order tensors, requires capturing complex interactions across tensor modes while remaining computationally efficient. We introduce Uncertainty-driven Kernel Tensor Learning (UKTL), a novel kernel framework for $M$-mode tensors that compares mode-wise subspaces derived from tensor unfoldings, enabling expressive and robust similarity measure. To handle large-scale tensor data, we propose a scalable Nyström kernel linearization with dynamically learned pivot tensors obtained via soft $k$-means clustering. A key innovation of UKTL is its uncertainty-aware […]

Ver mais

Like 0

Liked Liked

technocracy

Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers

digitado ⋅ 20 de March de 2026

Artificial Intelligence for scientific applications increasingly requires training large models on data that cannot be centralized due to privacy constraints, data sovereignty, or the sheer volume of data generated. Federated learning (FL) addresses this by enabling collaborative training without centralizing raw data, but scientific applications demand model scales that requires extensive computing resources, typically offered at High Performance Computing (HPC) facilities. Deploying FL experiments across HPC facilities introduces challenges beyond cloud or enterprise settings. We present a comprehensive […]

Ver mais

Like 0

Liked Liked

technocracy

RFK Jr. has destroyed over a quarter of health dept’s expert panels

digitado ⋅ 19 de March de 2026

In his role as health secretary, Robert F. Kennedy Jr.—a long-time anti-vaccine activist with no background in science, medicine, or public health—has made headlines for his thorough perversion of the Centers for Disease Control and Prevention’s vaccine advisory panel. In June, Kennedy fired all 17 independent experts who made up the CDC’s Advisory Committee on Immunization Practices, or ACIP. The panel sets federal vaccination guidance that dictates insurance coverage and influences state school requirements. Kennedy then repopulated ACIP […]

Ver mais

Like 0

Liked Liked

technocracy

ICLAD: In-Context Learning for Unified Tabular Anomaly Detection Across Supervision Regimes

digitado ⋅ 19 de March de 2026

Anomaly detection on tabular data is commonly studied under three supervision regimes, including one-class settings that assume access to anomaly-free training samples, fully unsupervised settings with unlabeled and potentially contaminated training data, and semi-supervised settings with limited anomaly labels. Existing deep learning approaches typically train dataset-specific models under the assumption of a single supervision regime, which limits their ability to leverage shared structures across anomaly detection tasks and to adapt to different supervision levels. We propose ICLAD, an […]

Ver mais

Like 0

Liked Liked

technocracy

Cloud service providers ask EU regulator to reinstate VMware partner program

digitado ⋅ 19 de March de 2026

A trade association of cloud service providers (CSPs) filed an antitrust complaint today with the European Union’s European Commission (EC) over Broadcom’s shuttering of VMware’s CSP partner program this year. Since Broadcom bought VMware, it has drastically cut the number of channel partners VMware works with, a shift that began with the elimination of VMware’s partner program. Broadcom replaced the program with an invite-only alternative that favors larger partners working with enterprise-sized clients rather than small-to-medium-sized businesses. There […]

Ver mais

Like 0

Liked Liked

technocracy

We pointed multiple Claude Code agents at the same benchmark overnight and let them build on each other’s work

digitado ⋅ 19 de March de 2026

Inspired by Andrej Karpathy’s AutoResearch idea – keep the loop running, preserve improvements, revert failures. We wanted to test a simple question: What happens when multiple coding agents can read each other’s work and iteratively improve the same solution? So we built Hive 🐝, a crowdsourced platform where agents collaborate to evolve shared solutions. Each task has a repo + eval harness. One agent starts, makes changes, runs evals, and submits results. Then other agents can inspect prior […]

Ver mais

Like 0

Liked Liked

technocracy

GeoLAN: Geometric Learning of Latent Explanatory Directions in Large Language Models

digitado ⋅ 19 de March de 2026

Large language models (LLMs) demonstrate strong performance, but they often lack transparency. We introduce GeoLAN, a training framework that treats token representations as geometric trajectories and applies stickiness conditions inspired by recent developments related to the Kakeya Conjecture. We have developed two differentiable regularizers, Katz-Tao Convex Wolff (KT-CW) and Katz-Tao Attention (KT-Attn), that promote isotropy and encourage diverse attention. Our experiments with Gemma-3 (1B, 4B, 12B) and Llama-3-8B show that GeoLAN frequently maintains task accuracy while improving geometric […]

Ver mais

Like 0

Liked Liked