Frozen Policy Iteration: Computationally Efficient RL under Linear $Q^\pi$ Realizability for Deterministic Dynamics
arXiv:2603.00716v1 Announce Type: cross Abstract: We study computationally and statistically efficient reinforcement learning under the linear $Q^\pi$ realizability assumption, where every policy’s $Q$-function is linear in a given state-action feature representation. Prior methods in this setting are either computationally intractable or require (local) access to a simulator. In this paper, we propose a computationally efficient online RL algorithm, named Frozen Policy Iteration, under the linear $Q^\pi$ realizability setting that works for Markov Decision Processes (MDPs) with stochastic initial […]
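The linear $Q^\pi$ realizability assumption says that for every policy $\pi$ there exists a weight vector $w_\pi$ such that $Q^\pi(s,a) = \langle \phi(s,a), w_\pi \rangle$ for a known feature map $\phi$. A minimal sketch of what this structure looks like, with a hypothetical feature map `phi` and placeholder weights (not the paper's algorithm):

```python
# Sketch of linear Q^pi realizability: Q^pi(s, a) = <phi(s, a), w_pi>.
# The feature map and weights below are illustrative placeholders only.

def phi(state, action):
    """Hypothetical d = 3 state-action feature map."""
    return [1.0, float(state), float(state) * float(action)]

def q_value(state, action, w_pi):
    """Q^pi(s, a) as an inner product of features with policy-specific weights."""
    return sum(f * w for f, w in zip(phi(state, action), w_pi))

# Weights realizing some fixed policy pi's Q-function (hypothetical values).
w_pi = [0.5, -0.2, 1.0]
print(q_value(2, 1, w_pi))  # <[1, 2, 2], [0.5, -0.2, 1.0]> = approximately 2.1
```

The key point is that only the weight vector changes with the policy; the feature map is shared, which is what lets policy-evaluation reduce to estimating a $d$-dimensional vector.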