March 2026

Graph-GRPO: Training Graph Flow Models with Reinforcement Learning

digitado ⋅ 11 de March de 2026

Graph generation is a fundamental task with broad applications, such as drug discovery. Recently, discrete flow matching-based graph generation, aka, graph flow model (GFM), has emerged due to its superior performance and flexible sampling. However, effectively aligning GFMs with complex human preferences or task-specific objectives remains a significant challenge. In this paper, we propose Graph-GRPO, an online reinforcement learning (RL) framework for training GFMs under verifiable rewards. Our method makes two key contributions: (1) We derive an analytical […]

Ver mais

Like 0

Liked Liked

technocracy

Khatri-Rao Clustering for Data Summarization

digitado ⋅ 11 de March de 2026

arXiv:2603.06602v2 Announce Type: replace-cross Abstract: As datasets continue to grow in size and complexity, finding succinct yet accurate data summaries poses a key challenge. Centroid-based clustering, a widely adopted approach to address this challenge, finds informative summaries of datasets in terms of few prototypes, each representing a cluster in the data. Despite their wide adoption, the resulting data summaries often contain redundancies, limiting their effectiveness particularly in datasets characterized by a large number of underlying clusters. To overcome […]

Ver mais

Like 0

Liked Liked

technocracy

Non-Rectangular Average-Reward Robust MDPs: Optimal Policies and Their Transient Values

digitado ⋅ 11 de March de 2026

arXiv:2603.00945v3 Announce Type: replace-cross Abstract: We study non-rectangular robust Markov decision processes under the average-reward criterion, where the ambiguity set couples transition probabilities across states and the adversary commits to a stationary kernel for the entire horizon. We show that any history-dependent policy achieving sublinear expected regret uniformly over the ambiguity set is robust-optimal, and that the robust value admits a minimax representation as the infimum over the ambiguity set of the classical optimal gains, without requiring any […]

Ver mais

Like 0

Liked Liked

technocracy

Uncovering Social Network Activity Using Joint User and Topic Interaction

digitado ⋅ 11 de March de 2026

arXiv:2506.12842v2 Announce Type: replace-cross Abstract: The emergence of online social platforms, such as social networks and social media, has drastically affected the way people apprehend the information flows to which they are exposed. In such platforms, various information cascades spreading among users is the main force creating complex dynamics of opinion formation, each user being characterized by their own behavior adoption mechanism. Moreover, the spread of multiple pieces of information or beliefs in a networked population is rarely […]

Ver mais

Like 0

Liked Liked

technocracy

A Consequentialist Critique of Binary Classification Evaluation: Theory, Practice, and Tools

digitado ⋅ 11 de March de 2026

arXiv:2504.04528v3 Announce Type: replace-cross Abstract: Machine learning-supported decisions, such as ordering diagnostic tests or determining preventive custody, often require converting probabilistic forecasts into binary classifications. We adopt a consequentialist perspective from decision theory to argue that evaluation methods should prioritize forecast quality across thresholds and base rates. This motivates the use of proper scoring rules such as the Brier score and log loss. However, our empirical review of practices at major ML venues (ICML, FAccT, CHIL) reveals a […]

Ver mais

Like 0

Liked Liked

technocracy

Improving clustering quality evaluation in noisy Gaussian mixtures

digitado ⋅ 11 de March de 2026

arXiv:2503.00379v3 Announce Type: replace-cross Abstract: Clustering is a well-established technique in machine learning and data analysis, widely used across various domains. Cluster validity indices, such as the Average Silhouette Width, Calinski-Harabasz, and Davies-Bouldin indices, play a crucial role in assessing clustering quality when external ground truth labels are unavailable. However, these measures can be affected by different degrees of feature relevance, potentially leading to unreliable evaluations in high-dimensional or noisy data sets. We introduce a theoretically grounded Feature […]

Ver mais

Like 0

Liked Liked

technocracy

Robust Assortment Optimization from Observational Data

digitado ⋅ 11 de March de 2026

arXiv:2602.10696v2 Announce Type: replace Abstract: Assortment optimization is a fundamental challenge in modern retail and recommendation systems, where the goal is to select a subset of products that maximizes expected revenue under complex customer choice behaviors. While recent advances in data-driven methods have leveraged historical data to learn and optimize assortments, these approaches typically rely on strong assumptions — namely, the stability of customer preferences and the correctness of the underlying choice models. However, such assumptions frequently break […]

Ver mais

Like 0

Liked Liked

technocracy

An AI-powered Bayesian Generative Modeling Approach for Arbitrary Conditional Inference

digitado ⋅ 11 de March de 2026

arXiv:2601.05355v2 Announce Type: replace Abstract: Modern data analysis increasingly requires flexible conditional inference P(X_B | X_A) where (X_A, X_B) is an arbitrary partition of observed variable X. Existing approaches are either restricted to a fixed conditioning structure or depend strongly on the distribution of conditioning masks during training. To address these limitations, we introduce Bayesian generative modeling (BGM), a unified framework for arbitrary conditional inference. BGM learns a generative model of X via a stochastic iterative Bayesian updating […]

Ver mais

Like 0

Liked Liked

technocracy

Personalized Collaborative Learning with Affinity-Based Variance Reduction

digitado ⋅ 11 de March de 2026

arXiv:2510.16232v3 Announce Type: replace Abstract: Multi-agent learning faces a fundamental tension: leveraging distributed collaboration without sacrificing the personalization needed for diverse agents. This tension intensifies when aiming for full personalization while adapting to unknown heterogeneity levels — gaining collaborative speedup when agents are similar, without performance degradation when they are different. Embracing the challenge, we propose personalized collaborative learning (PCL), a novel framework for heterogeneous agents to collaboratively learn personalized solutions with seamless adaptivity. Through carefully designed bias […]

Ver mais

Like 0

Liked Liked

technocracy

Repulsive Monte Carlo on the sphere for the sliced Wasserstein distance

digitado ⋅ 11 de March de 2026

arXiv:2509.10166v2 Announce Type: replace Abstract: In this paper, we consider the problem of computing the integral of a function on the unit sphere, in any dimension, using Monte Carlo methods. Although the methods we present are general, our guiding thread is the sliced Wasserstein distance between two measures on $mathbb{R}^d$, which is precisely an integral on the $d$-dimensional sphere. The sliced Wasserstein distance (SW) has gained momentum in machine learning either as a proxy to the less computationally […]

Ver mais

Like 0

Liked Liked