March 2026

A Recipe for Stable Offline Multi-agent Reinforcement Learning

digitado ⋅ March 9, 2026

Despite remarkable achievements in single-agent offline reinforcement learning (RL), multi-agent RL (MARL) has struggled to adopt this paradigm, largely persisting with on-policy training and self-play from scratch. One reason for this gap is the instability of non-linear value decomposition, which has led prior works to avoid complex mixing networks in favor of linear value decomposition (e.g., VDN) with the value regularization used in single-agent setups. In this work, we analyze the source of instability in non-linear value decomposition within the […]

technocracy

InFusionLayer: a CFA-based ensemble tool to generate new classifiers for learning and modeling

digitado ⋅ March 9, 2026

Ensemble learning is a well-established body of machine learning methods that improve predictive performance by combining multiple algorithms or models. Combinatorial Fusion Analysis (CFA) provides methods and practice for combining multiple scoring systems, using the rank-score characteristic (RSC) function and cognitive diversity (CD), including ensemble methods and model fusion. However, no general-purpose Python tool currently incorporates these techniques. In this paper we introduce InFusionLayer, a machine learning architecture inspired by CFA at the system fusion […]

technocracy

Perhaps not Boring Technology after all

digitado ⋅ March 9, 2026

A recurring concern I’ve seen regarding LLMs for programming is that they will push our technology choices towards the tools that are best represented in their training data, making it harder for new, better tools to break through the noise. This was certainly the case a couple of years ago, when asking models for help with Python or JavaScript appeared to give much better results than questions about less widely used languages. With the latest models running in […]

technocracy

Chevrolet killed it then brought it back, now we drive it: The 2027 Bolt

digitado ⋅ March 9, 2026

Chevrolet provided flights from Washington, DC, to Los Angeles and accommodation so Ars could drive the Bolt. Ars does not accept paid editorial content. WESTLAKE VILLAGE, Calif.—When the Chevrolet Bolt debuted in 2017, the electric hatchback stood out: Here was an electric vehicle with more than 200 miles of range for less than half the price of a Tesla Model S. The Bolt had its ups and downs, though. A $1.8 billion recall saw the automaker replace the […]

technocracy

Beyond Attention Heatmaps: How to Get Better Explanations for Multiple Instance Learning Models in Histopathology

digitado ⋅ March 9, 2026

Multiple instance learning (MIL) has enabled substantial progress in computational histopathology, where a large number of patches from gigapixel whole slide images are aggregated into slide-level predictions. Heatmaps are widely used to validate MIL models and to discover tissue biomarkers. Yet the validity of these heatmaps has barely been investigated. In this work, we introduce a general framework for evaluating the quality of MIL heatmaps without requiring additional labels. We conduct a large-scale benchmark experiment to assess six […]

technocracy

Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces

digitado ⋅ March 9, 2026

We analyze the Bayesian regret of the Gaussian process posterior sampling reinforcement learning (GP-PSRL) algorithm. Posterior sampling is an effective heuristic for decision-making under uncertainty that has been used to develop successful algorithms for a variety of continuous control problems. However, theoretical work on GP-PSRL is limited. All known regret bounds either fail to achieve a tight dependence on a kernel-dependent quantity called the maximum information gain or fail to properly account for the fact that the set […]

technocracy

PolyFormer: learning efficient reformulations for scalable optimization under complex physical constraints

digitado ⋅ March 9, 2026

Real-world optimization problems are often constrained by complex physical laws that limit computational scalability. These constraints are inherently tied to complex regions, and thus learning models that incorporate physical and geometric knowledge, i.e., physics-informed machine learning (PIML), offer a promising pathway for efficient solution. Here, we introduce PolyFormer, which opens a new direction for PIML in prescriptive optimization tasks, where physical and geometric knowledge is not merely used to regularize learning models, but to simplify the problems themselves. […]

technocracy

SCL-GNN: Towards Generalizable Graph Neural Networks via Spurious Correlation Learning

digitado ⋅ March 9, 2026

Graph Neural Networks (GNNs) have demonstrated remarkable success across diverse tasks. However, their generalization capability is often hindered by spurious correlations between node features and labels in the graph. Our analysis reveals that GNNs tend to exploit imperceptible statistical correlations in training data, even when such correlations are unreliable for prediction. To address this challenge, we propose the Spurious Correlation Learning Graph Neural Network (SCL-GNN), a novel framework designed to enhance generalization on both Independent and Identically Distributed […]

technocracy

Server Virtualization in Cloud Computing: A Complete Guide

digitado ⋅ March 9, 2026

Have you ever thought about how cloud platforms can run thousands of apps on a small number of machines? Well, the answer is server virtualization in cloud computing. Instead of dedicating a physical server to a single workload, virtualization divides one server into numerous separate virtual servers. Each virtual server can run its own OS and programs, as if it were a standalone machine. This technology is a critical foundation of modern cloud computing. […]

technocracy

FedPrism: Adaptive Personalized Federated Learning under Non-IID Data

digitado ⋅ March 9, 2026

Federated Learning (FL) suffers significant performance degradation in real-world deployments characterized by moderate to extreme statistical heterogeneity (non-IID client data). While global aggregation strategies promote broad generalization, they often fail to capture the diversity of local data distributions, leading to suboptimal personalization. We address this problem with FedPrism, a framework that uses two main strategies. First, it uses a Prism Decomposition method that builds each client’s model from three parts: a global foundation, a shared group part for […]
