digitado – Page 139

Learning to Remember, Learn, and Forget in Attention-Based Models

digitado ⋅ 11 de February de 2026

arXiv:2602.09075v1 Announce Type: new Abstract: In-Context Learning (ICL) in transformers acts as an online associative memory and is believed to underpin their high performance on complex sequence processing tasks. However, in gated linear attention models, this memory has a fixed capacity and is prone to interference, especially for long sequences. We propose Palimpsa, a self-attention model that views ICL as a continual learning problem that must address a stability-plasticity dilemma. Palimpsa uses Bayesian metaplasticity, where the plasticity of […]

Ver mais

Like 0

Liked Liked

technocracy

Bayesian Modeling of Collatz Stopping Times: A Probabilistic Machine Learning Perspective

digitado ⋅ 6 de March de 2026

arXiv:2603.04479v1 Announce Type: new Abstract: We study the Collatz total stopping time $tau(n)$ over $nle 10^7$ from a probabilistic machine learning viewpoint. Empirically, $tau(n)$ is a skewed and heavily overdispersed count with pronounced arithmetic heterogeneity. We develop two complementary models. First, a Bayesian hierarchical Negative Binomial regression (NB2-GLM) predicts $tau(n)$ from simple covariates ($log n$ and residue class $n bmod 8$), quantifying uncertainty via posterior and posterior predictive distributions. Second, we propose a mechanistic generative approximation based on […]

Ver mais

Like 0

Liked Liked

technocracy

Coordinatewise Balanced Covering for Linear Gain Graphs, with an Application to Coset-List Min-2-Lin over Powers of Two

digitado ⋅ 22 de April de 2026

arXiv:2604.18661v1 Announce Type: new Abstract: We study a list-constrained extension of modular equation deletion over powers of two, called Coset-List Min-2-Lin$^{pm}$ over $mathbb{Z}/2^dmathbb{Z}$. Each variable is restricted to a dyadic coset $a+2^{ell}(mathbb{Z}/2^dmathbb{Z})$, each binary constraint is of the form $x_u=x_v$, $x_u=-x_v$, or $x_u=2x_v$, and the goal is to delete a minimum number of constraints so that the remaining system is satisfiable. This problem lies between the no-list case and the poorly understood fully conservative list setting. Our main […]

Ver mais

Like 0

Liked Liked

technocracy

Using Synthetic Data for Machine Learning-based Childhood Vaccination Prediction in Narok, Kenya

digitado ⋅ 10 de April de 2026

Background: Limited data utilization in low-resource settings poses a barrier to the vaccine delivery ecosystem, undermining efforts to achieve equitable immunization coverage. In nomadic populations, individuals face an increased risk of missing crucial vaccination doses as children. One such population is the Maasai in Narok County, Kenya, where the absence of high-volume, quality data hampers accurate coverage estimates, impedes efficient resource allocation, and weakens the ability to deliver timely interventions. Additionally, data privacy concerns are heightened in groups […]

Ver mais

Like 0

Liked Liked

technocracy

OpenAI Agents SDK improves governance with sandbox execution

digitado ⋅ 16 de April de 2026

OpenAI is introducing sandbox execution that allows enterprise governance teams to deploy automated workflows with controlled risk. Teams taking systems from prototype to production have faced difficult architectural compromises regarding where their operations occurred. Using model-agnostic frameworks offered initial flexibility but failed to fully utilise the capabilities of frontier models. Model-provider SDKs remained closer to the underlying model, but often lacked enough visibility into the control harness. To complicate matters further, managed agent APIs simplified the deployment process […]

Ver mais

Like 0

Liked Liked

technocracy

Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive Batching

digitado ⋅ 16 de January de 2026

arXiv:2601.10418v1 Announce Type: cross Abstract: We study tabular reinforcement learning problems with multiple steps of lookahead information. Before acting, the learner observes $ell$ steps of future transition and reward realizations: the exact state the agent would reach and the rewards it would collect under any possible course of action. While it has been shown that such information can drastically boost the value, finding the optimal policy is NP-hard, and it is common to apply one of two tractable […]

Ver mais

Like 0

Liked Liked

technocracy

What do Geometric Hallucination Detection Metrics Actually Measure?

digitado ⋅ 11 de February de 2026

arXiv:2602.09158v1 Announce Type: new Abstract: Hallucination remains a barrier to deploying generative models in high-consequence applications. This is especially true in cases where external ground truth is not readily available to validate model outputs. This situation has motivated the study of geometric signals in the internal state of an LLM that are predictive of hallucination and require limited external knowledge. Given that there are a range of factors that can lead model output to be called a hallucination […]

Ver mais

Like 0

Liked Liked

technocracy

Multimodal Structure Learning: Disentangling Shared and Specific Topology via Cross-Modal Graphical Lasso

digitado ⋅ 5 de April de 2026

Learning interpretable multimodal representations inherently relies on uncovering the conditional dependencies between heterogeneous features. However, sparse graph estimation techniques, such as Graphical Lasso (GLasso), to visual-linguistic domains is severely bottlenecked by high-dimensional noise, modality misalignment, and the confounding of shared versus category-specific topologies. In this paper, we propose Cross-Modal Graphical Lasso (CM-GLasso) that overcomes these fundamental limitations. By coupling a novel text-visualization strategy with a unified vision-language encoder, we strictly align multimodal features into a shared latent space. […]

Ver mais

Like 0

Liked Liked

technocracy

El día en que descubrimos que el móvil también se apagaba

digitado ⋅ 29 de June de 2026

El gran apagón ibérico de abril de 2025 nos dejó muchas imágenes: trenes parados, semáforos muertos, comercios incapaces de cobrar, ascensores detenidos, gente haciendo colas en cajeros o supermercados, y ciudades que de repente parecían haber retrocedido varias décadas. Pero, para muchos, lo más desconcertante no fue quedarse sin luz. Fue mirar el móvil, ese objeto que hemos convertido en brújula, radio, cartera, agenda, llave y cordón umbilical con el mundo, y descubrir que tampoco servía para nada. […]

Ver mais

Like 0

Liked Liked

technocracy

Casewise and Cellwise Robust Multilinear Principal Component Analysis

digitado ⋅ 18 de March de 2026

arXiv:2503.07327v2 Announce Type: replace-cross Abstract: Multilinear Principal Component Analysis (MPCA) is an important tool for analyzing tensor data. It performs dimension reduction similar to PCA for multivariate data. However, standard MPCA is sensitive to outliers. It is highly influenced by observations deviating from the bulk of the data, called casewise outliers, as well as by individual outlying cells in the tensors, so-called cellwise outliers. This latter type of outlier is highly likely to occur in tensor data, as […]

Ver mais

Like 0

Liked Liked