Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings
arXiv:2508.11847v3 Announce Type: replace

Abstract: We propose a method for evaluating the robustness of widely used LLM ranking systems — variants of a Bradley–Terry model — to dropping a small, worst-case fraction of the preference data. Our approach is computationally fast and easy to adopt. When we apply our method to matchups from popular LLM ranking platforms, including Chatbot Arena and its derivatives, we find that the rankings of top-performing models can be remarkably sensitive to the removal of […]
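The abstract's core idea — that a Bradley–Terry ranking can flip when a handful of worst-case preferences are removed — can be illustrated with a toy sketch. The code below is not the paper's method (which is described as computationally fast; this brute-force greedy search is not): it simply fits Bradley–Terry scores by gradient ascent on the log-likelihood, then greedily drops the individual preferences whose removal most shrinks the top model's lead. All function names and the toy matchup data are illustrative assumptions.

```python
import numpy as np

def fit_bradley_terry(matchups, n_models, iters=1000, lr=2.0):
    """MLE of Bradley-Terry scores via gradient ascent on the log-likelihood.

    matchups: list of (winner, loser) index pairs.
    """
    theta = np.zeros(n_models)
    for _ in range(iters):
        grad = np.zeros(n_models)
        for w, l in matchups:
            p_w = 1.0 / (1.0 + np.exp(theta[l] - theta[w]))  # P(w beats l)
            grad[w] += 1.0 - p_w
            grad[l] -= 1.0 - p_w
        theta += lr * grad / len(matchups)
        theta -= theta.mean()  # scores are only identified up to a shift
    return theta

def greedy_rank_flip(matchups, n_models, k):
    """Greedily drop up to k preferences to try to dethrone the top model.

    Brute-force illustration only: refits the model once per candidate drop.
    """
    matchups = list(matchups)
    original_top = int(np.argmax(fit_bradley_terry(matchups, n_models)))
    dropped = []
    for _ in range(k):
        best_margin, best_idx = None, None
        for i in range(len(matchups)):
            trial = matchups[:i] + matchups[i + 1:]
            t = fit_bradley_terry(trial, n_models)
            # top model's lead over its closest rival after this drop
            margin = t[original_top] - np.max(np.delete(t, original_top))
            if best_margin is None or margin < best_margin:
                best_margin, best_idx = margin, i
        dropped.append(matchups.pop(best_idx))
    new_top = int(np.argmax(fit_bradley_terry(matchups, n_models)))
    return original_top, new_top, dropped

# Toy data: model 0 narrowly leads model 1 head-to-head (5-4);
# both beat model 2 equally often, so the top spot hinges on that 5-4 edge.
matchups = [(0, 1)] * 5 + [(1, 0)] * 4 + [(0, 2)] * 3 + [(1, 2)] * 3
orig, new, dropped = greedy_rank_flip(matchups, n_models=3, k=2)
print(orig, new, dropped)  # dropping 2 of 15 preferences flips the top rank
```

In this toy example, removing just two of fifteen recorded preferences is enough to change which model ranks first — the qualitative phenomenon the abstract reports for real leaderboard data, though the paper's actual algorithm for finding such worst-case subsets is presumably far more efficient than this exhaustive refitting.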