technocracy

Impact of Positional Encoding: Clean and Adversarial Rademacher Complexity for Transformers under In-Context Regression

digitado ⋅ 25 de March de 2026

arXiv:2512.09275v2 Announce Type: replace Abstract: Positional encoding (PE) is a core architectural component of Transformers, yet its impact on the Transformer’s generalization and robustness remains unclear. In this work, we provide the first generalization analysis for a single-layer Transformer under in-context regression that explicitly accounts for a completely trainable PE module. Our result shows that PE systematically enlarges the generalization gap. Extending to the adversarial setting, we derive the adversarial Rademacher generalization bound. We find that the gap […]

Ver mais

Like 0

Liked Liked

technocracy

Coverage and Rate Analysis of Follower-Based LEO Satellite Networks: A Stochastic Geometry Approach

digitado ⋅ 3 de April de 2026

arXiv:2604.01265v1 Announce Type: new Abstract: To mitigate inter-satellite interference and payload limits in LEO mega-constellations, satellite clusters, groups of small cooperative satellites have been proposed to improve performance and reduce interference. The typical configuration divides the cluster into a leader satellite with full processing and control capabilities and multiple simpler follower satellites that assist with coverage and throughput. These clusters enhance coverage and throughput, prompting interest in their performance gains and optimal deployment. Given that the spherical stochastic […]

Ver mais

Like 0

Liked Liked

technocracy

Sequential Group Composition: A Window into the Mechanics of Deep Learning

digitado ⋅ 3 de February de 2026

How do neural networks trained over sequences acquire the ability to perform structured operations, such as arithmetic, geometric, and algorithmic computation? To gain insight into this question, we introduce the sequential group composition task. In this task, networks receive a sequence of elements from a finite group encoded in a real vector space and must predict their cumulative product. The task can be order-sensitive and requires a nonlinear architecture to be learned. Our analysis isolates the roles of […]

Ver mais

Like 0

Liked Liked

technocracy

A Pure Hypothesis Test for Inhomogeneous Random Graph Models Based on a Kernelised Stein Discrepancy

digitado ⋅ 2 de April de 2026

arXiv:2505.21580v3 Announce Type: replace Abstract: Complex data are often represented as a graph, which in turn can often be viewed as a realisation of a random graph, such as an inhomogeneous random graph model (IRG). For general fast goodness-of-fit tests in high dimensions, kernelised Stein discrepancy (KSD) tests are a powerful tool. Here, we develop a KSD-type test for IRG models that can be carried out with a single observation of the network. The test applies to a […]

Ver mais

Like 0

Liked Liked

technocracy

Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows

digitado ⋅ 11 de February de 2026

Hierarchical goal-conditioned reinforcement learning (H-GCRL) provides a powerful framework for tackling complex, long-horizon tasks by decomposing them into structured subgoals. However, its practical adoption is hindered by poor data efficiency and limited policy expressivity, especially in offline or data-scarce regimes. In this work, Normalizing flow-based hierarchical implicit Q-learning (NF-HIQL), a novel framework that replaces unimodal gaussian policies with expressive normalizing flow policies at both the high- and low-levels of the hierarchy is introduced. This design enables tractable log-likelihood […]

Ver mais

Like 0

Liked Liked

technocracy

Text Generation

digitado ⋅ 30 de August de 2019

Early this year, OpenAI announced a very powerful language model they developed that can generate human-like text. While such announcements are usually followed by the release of a model to the public, this one suggested that such a powerful tool will pose a danger, and therefore only a smaller and less powerful version of it was released. Soon enough, in addition to the usual buzz on academic Twitter, the news made it to popular media – where it was described […]

Ver mais

Like 0

Liked Liked

technocracy

Envy-Free Allocation of Indivisible Goods via Noisy Queries

digitado ⋅ 9 de February de 2026

arXiv:2602.06361v1 Announce Type: cross Abstract: We introduce a problem of fairly allocating indivisible goods (items) in which the agents’ valuations cannot be observed directly, but instead can only be accessed via noisy queries. In the two-agent setting with Gaussian noise and bounded valuations, we derive upper and lower bounds on the required number of queries for finding an envy-free allocation in terms of the number of items, $m$, and the negative-envy of the optimal allocation, $Delta$. In particular, […]

Ver mais

Like 0

Liked Liked

technocracy

Leakage and Interpretability in Concept-Based Models

digitado ⋅ 25 de March de 2026

arXiv:2504.14094v3 Announce Type: replace-cross Abstract: Concept-based Models aim to improve interpretability by predicting high-level intermediate concepts, representing a promising approach for deployment in high-risk scenarios. However, they are known to suffer from information leakage, whereby models exploit unintended information encoded within the learned concepts. We introduce an information-theoretic framework to rigorously characterise and quantify leakage, and define two complementary measures: the concepts-task leakage (CTL) and interconcept leakage (ICL) scores. We show that these measures are strongly predictive of […]

Ver mais

Like 0

Liked Liked

technocracy

Vector-Valued Distributional Reinforcement Learning Policy Evaluation: A Hilbert Space Embedding Approach

digitado ⋅ 28 de January de 2026

arXiv:2601.18952v1 Announce Type: cross Abstract: We propose an (offline) multi-dimensional distributional reinforcement learning framework (KE-DRL) that leverages Hilbert space mappings to estimate the kernel mean embedding of the multi-dimensional value distribution under a proposed target policy. In our setting, the state-action variables are multi-dimensional and continuous. By mapping probability measures into a reproducing kernel Hilbert space via kernel mean embeddings, our method replaces Wasserstein metrics with an integral probability metric. This enables efficient estimation in multi-dimensional state-action spaces […]

Ver mais

Like 0

Liked Liked

technocracy

EvoMU: Evolutionary Machine Unlearning

digitado ⋅ 2 de February de 2026

Machine unlearning aims to unlearn specified training data (e.g. sensitive or copyrighted material). A prominent approach is to fine-tune an existing model with an unlearning loss that retains overall utility. The space of suitable unlearning loss functions is vast, making the search for an optimal loss function daunting. Additionally, there might not even exist a universally optimal loss function: differences in the structure and overlap of the forget and retain data can cause a loss to work well […]

Ver mais

Like 0

Liked Liked