March 2026

Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference

digitado ⋅ 10 de March de 2026

arXiv:2603.07887v1 Announce Type: cross Abstract: Inference-time methods that aggregate and prune multiple samples have emerged as a powerful paradigm for steering large language models, yet we lack any principled understanding of their accuracy-cost tradeoffs. In this paper, we introduce a route to rigorously study such approaches using the lens of *particle filtering* algorithms such as Sequential Monte Carlo (SMC). Given a base language model and a *process reward model* estimating expected terminal rewards, we ask: *how accurately can […]

Ver mais

Like 0

Liked Liked

technocracy

Cost-Driven Representation Learning for Linear Quadratic Gaussian Control: Part II

digitado ⋅ 10 de March de 2026

arXiv:2603.07437v1 Announce Type: cross Abstract: We study the problem of state representation learning for control from partial and potentially high-dimensional observations. We approach this problem via cost-driven state representation learning, in which we learn a dynamical model in a latent state space by predicting cumulative costs. In particular, we establish finite-sample guarantees on finding a near-optimal representation function and a near-optimal controller using the learned latent model for infinite-horizon time-invariant Linear Quadratic Gaussian (LQG) control. We study two […]

Ver mais

Like 0

Liked Liked

technocracy

Tree-Based Predictive Models for Noisy Input Data

digitado ⋅ 10 de March de 2026

arXiv:2603.07409v1 Announce Type: cross Abstract: Measurement error is prevalent across all domains of scientific research where only imprecise observations, rather than the true underlying values, can be obtained. For example, estimates of human microbiome diversity are based on small samples from a much larger, generally unobserved system and reflect both sampling error and technical variation. In high-noise settings like these, it becomes difficult to make accurate predictions and to summarize uncertainty. Methods have previously been proposed to accommodate […]

Ver mais

Like 0

Liked Liked

technocracy

A Distributed Gaussian Process Model for Multi-Robot Mapping

digitado ⋅ 10 de March de 2026

arXiv:2603.07351v1 Announce Type: cross Abstract: We propose DistGP: a multi-robot learning method for collaborative learning of a global function using only local experience and computation. We utilise a sparse Gaussian process (GP) model with a factorisation that mirrors the multi-robot structure of the task, and admits distributed training via Gaussian belief propagation (GBP). Our loopy model outperforms Tree-Structured GPs cite{bui2014tree} and can be trained online and in settings with dynamic connectivity. We show that such distributed, asynchronous training […]

Ver mais

Like 0

Liked Liked

technocracy

Variational Flow Maps: Make Some Noise for One-Step Conditional Generation

digitado ⋅ 10 de March de 2026

arXiv:2603.07276v1 Announce Type: cross Abstract: Flow maps enable high-quality image generation in a single forward pass. However, unlike iterative diffusion models, their lack of an explicit sampling trajectory impedes incorporating external constraints for conditional generation and solving inverse problems. We put forth Variational Flow Maps, a framework for conditional sampling that shifts the perspective of conditioning from “guiding a sampling path”, to that of “learning the proper initial noise”. Specifically, given an observation, we seek to learn a […]

Ver mais

Like 0

Liked Liked

technocracy

Conditional Rank-Rank Regression via Deep Conditional Transformation Models

digitado ⋅ 10 de March de 2026

arXiv:2603.07230v1 Announce Type: cross Abstract: Intergenerational mobility quantifies the transmission of socio-economic outcomes from parents to children. While rank-rank regression (RRR) is standard, adding covariates directly (RRRX) often yields parameters with unclear interpretation. Conditional rank-rank regression (CRRR) resolves this by using covariate-adjusted (conditional) ranks to measure within-group mobility. We improve and extend CRRR by estimating conditional ranks with a deep conditional transformation model (DCTM) and cross-fitting, enabling end-to-end conditional distribution learning with structural constraints and strong performance under […]

Ver mais

Like 0

Liked Liked

technocracy

Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts

digitado ⋅ 10 de March de 2026

arXiv:2603.07169v1 Announce Type: cross Abstract: Optimizing GPU kernels manually is a challenging and time-consuming task. With the rapid development of LLMs, automated GPU kernel optimization is gradually becoming a tangible reality. However, current LLM-driven automated optimization methods narrowly focus on machine learning applications, such as PyTorch operator optimization, while overlooking broader domains like sparse matrix operations in scientific computing. Extending to these broader applications brings new challenges for the benchmark and algorithm. Therefore, developing a general-purpose automated kernel […]

Ver mais

Like 0

Liked Liked

technocracy

Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers

digitado ⋅ 10 de March de 2026

arXiv:2603.07122v1 Announce Type: cross Abstract: In the training of neural networks, adaptive moment estimation (Adam) typically converges fast but exhibits suboptimal generalization performance. A widely accepted explanation for its defect in generalization is that it often tends to converge to sharp minima. To enhance its ability to find flat minima, we propose its new variant named inverse Adam (InvAdam). The key improvement of InvAdam lies in its parameter update mechanism, which is opposite to that of Adam. Specifically, […]

Ver mais

Like 0

Liked Liked

technocracy

Fr’echet regression of multivariate distributions with nonparanormal transport

digitado ⋅ 10 de March de 2026

arXiv:2603.07014v1 Announce Type: cross Abstract: Regression with distribution-valued responses and Euclidean predictors has gained increasing scientific relevance. While methodology for univariate distributional data has advanced rapidly in recent years, multivariate distributions, which additionally encode dependence across univariate marginals, have received less attention and pose computational and statistical challenges. In this work, we address these challenges with a new regression approach for multivariate distributional responses, in which distributions are modeled within the semiparametric nonparanormal family. By incorporating the nonparanormal […]

Ver mais

Like 0

Liked Liked

technocracy

Combinatorial Allocation Bandits with Nonlinear Arm Utility

digitado ⋅ 10 de March de 2026

arXiv:2603.07005v1 Announce Type: cross Abstract: A matching platform is a system that matches different types of participants, such as companies and job-seekers. In such a platform, merely maximizing the number of matches can result in matches being concentrated on highly popular participants, which may increase dissatisfaction among other participants, such as companies, and ultimately lead to their churn, reducing the platform’s profit opportunities. To address this issue, we propose a novel online learning problem, Combinatorial Allocation Bandits (CAB), […]

Ver mais

Like 0

Liked Liked