digitado

About digitado

https://www.digitado.com.br

Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

digitado ⋅ 6 March 2026

arXiv:2505.13770v2 Announce Type: replace-cross Abstract: Reliable causal inference is essential for making decisions in high-stakes areas like medicine, economics, and public policy. However, it remains unclear whether large language models (LLMs) can handle rigorous and trustworthy statistical causal inference. Current benchmarks usually involve simplified tasks. For example, these tasks might only ask LLMs to identify semantic causal relationships or draw conclusions directly from raw data. As a result, models may overlook important statistical pitfalls, such as Simpson’s paradox […]

technocracy

Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy

digitado ⋅ 6 March 2026

arXiv:2502.11682v2 Announce Type: replace-cross Abstract: Strong Differential Privacy (DP) and Optimization guarantees are two desirable properties for a method in Federated Learning (FL). However, existing algorithms do not achieve both properties at once: they either have optimal DP guarantees but rely on restrictive assumptions such as bounded gradients/bounded data heterogeneity, or they ensure strong optimization performance but lack DP guarantees. To address this gap in the literature, we propose and analyze a new method called Clip21-SGD2M based on […]

technocracy

Curse of Dimensionality in Neural Network Optimization

digitado ⋅ 6 March 2026

arXiv:2502.05360v3 Announce Type: replace-cross Abstract: This paper demonstrates that when a shallow neural network with a Lipschitz continuous activation function is trained using either empirical or population risk to approximate a target function that is $r$ times continuously differentiable on $[0,1]^d$, the population risk may not decay at a rate faster than $t^{-\frac{4r}{d-2r}}$, where $t$ denotes the time parameter of the gradient flow dynamics. This result highlights the presence of the curse of dimensionality in the optimization computation […]

technocracy

An Experimental Study on Fairness-aware Machine Learning for Credit Scoring Problems

digitado ⋅ 6 March 2026

arXiv:2412.20298v2 Announce Type: replace-cross Abstract: The digitalization of credit scoring has become essential for financial institutions and commercial banks, especially in the era of digital transformation. Machine learning techniques are commonly used to evaluate customers’ creditworthiness. However, the predicted outcomes of machine learning models can be biased toward protected attributes, such as race or gender. Numerous fairness-aware machine learning models and fairness measures have been proposed. Nevertheless, their performance in the context of credit scoring has not been […]

technocracy

Towards a Fairer Non-negative Matrix Factorization

digitado ⋅ 6 March 2026

arXiv:2411.09847v3 Announce Type: replace-cross Abstract: There has been a recent critical need to study fairness and bias in machine learning (ML) algorithms. Since there is clearly no one-size-fits-all solution to fairness, ML methods should be developed alongside bias mitigation strategies that are practical and approachable to the practitioner. Motivated by recent work on “fair” PCA, here we consider the more challenging method of non-negative matrix factorization (NMF) as both a showcasing example and a method that is important […]

technocracy

Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

digitado ⋅ 6 March 2026

arXiv:2402.03352v3 Announce Type: replace-cross Abstract: In this paper, we study zeroth-order algorithms for nonconvex minimax problems with coupled linear constraints under the deterministic and stochastic settings, which have attracted wide attention in machine learning, signal processing and many other fields in recent years, e.g., adversarial attacks in resource allocation problems and network flow problems etc. We propose two single-loop algorithms, namely the zeroth-order primal-dual alternating projected gradient (ZO-PDAPG) algorithm and the zeroth-order regularized momentum primal-dual projected gradient algorithm […]

technocracy

Conformal Graph Prediction with Z-Gromov Wasserstein Distances

digitado ⋅ 6 March 2026

arXiv:2603.02460v3 Announce Type: replace Abstract: Supervised graph prediction addresses regression problems where the outputs are structured graphs. Although several approaches exist for graph-valued prediction, principled uncertainty quantification remains limited. We propose a conformal prediction framework for graph-valued outputs, providing distribution-free coverage guarantees in structured output spaces. Our method defines nonconformity via the Z-Gromov-Wasserstein distance, instantiated in practice through Fused Gromov-Wasserstein (FGW), enabling permutation invariant comparison between predicted and candidate graphs. To obtain adaptive prediction sets, we introduce Score […]

technocracy

Latent-IMH: Efficient Bayesian Inference for Inverse Problems with Approximate Operators

digitado ⋅ 6 March 2026

arXiv:2601.20888v2 Announce Type: replace Abstract: We study sampling from posterior distributions in Bayesian linear inverse problems where $A$, the parameters to observables operator, is computationally expensive. In many applications, $A$ can be factored in a manner that facilitates the construction of a cost-effective approximation $\tilde{A}$. In this framework, we introduce Latent-IMH, a sampling method based on the Metropolis-Hastings independence (IMH) sampler. Latent-IMH first generates intermediate latent variables using the approximate $\tilde{A}$, and then refines them using the exact […]

technocracy

Symmetric Aggregation of Conformity Scores for Efficient Uncertainty Sets

digitado ⋅ 6 March 2026

arXiv:2512.06945v2 Announce Type: replace Abstract: Access to multiple predictive models trained for the same task, whether in regression or classification, is increasingly common in many applications. Aggregating their predictive uncertainties to produce reliable and efficient uncertainty quantification is therefore a critical but still underexplored challenge, especially within the framework of conformal prediction (CP). While CP methods can generate individual prediction sets from each model, combining them into a single, more informative set remains a challenging problem. To address […]

technocracy

Testing Most Influential Sets

digitado ⋅ 6 March 2026

arXiv:2510.20372v3 Announce Type: replace Abstract: Small influential data subsets can dramatically impact model conclusions, with a few data points overturning key findings. While recent work identifies these most influential sets, there is no formal way to tell when maximum influence is excessive rather than expected under natural random sampling variation. We address this gap by developing a principled framework for most influential sets. Focusing on linear least-squares, we derive a convenient exact influence formula and identify the extreme […]
