digitado

Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation

digitado ⋅ 4 de March de 2026

We study the Inverse Contextual Bandit (ICB) problem, in which a learner seeks to optimize a policy while an observer, who cannot access the learner’s rewards and only observes actions, aims to recover the underlying problem parameters. During the learning process, the learner’s behavior naturally transitions from exploration to exploitation, resulting in non-stationary action data that poses significant challenges for the observer. To address this issue, we propose a simple and effective framework called Two-Phase Suffix Imitation. The […]

Ver mais

Like 0

Liked Liked

technocracy

Falsifying Predictive Algorithm

digitado ⋅ 27 de January de 2026

arXiv:2601.17146v1 Announce Type: cross Abstract: Empirical investigations into unintended model behavior often show that the algorithm is predicting another outcome than what was intended. These exposes highlight the need to identify when algorithms predict unintended quantities – ideally before deploying them into consequential settings. We propose a falsification framework that provides a principled statistical test for discriminant validity: the requirement that an algorithm predict intended outcomes better than impermissible ones. Drawing on falsification practices from causal inference, econometrics, […]

Ver mais

Like 0

Liked Liked

technocracy

Design Stability in Adaptive Experiments: Implications for Treatment Effect Estimation

digitado ⋅ 1 de April de 2026

arXiv:2510.22351v2 Announce Type: replace-cross Abstract: We study the problem of estimating the average treatment effect (ATE) under sequentially adaptive treatment assignment mechanisms. In contrast to classical completely randomized designs, we consider a setting in which the probability of assigning treatment to each experimental unit may depend on prior assignments and observed outcomes. Within the potential outcomes framework, we propose and analyze two natural estimators for the ATE: the inverse propensity weighted (IPW) estimator and an augmented IPW (AIPW) […]

Ver mais

Like 0

Liked Liked

technocracy

Cardinality is Not Enough: Super Host Detection via Segmented Cardinality Estimation

digitado ⋅ 7 de April de 2026

arXiv:2604.02379v1 Announce Type: new Abstract: Accurately detecting super host that establishes connections to a large number of distinct peers is significant for mitigating web attacks and ensuring high quality of web service. Existing sketch-based approaches estimate the number of distinct connections called flow cardinality according to full IP addresses, while ignoring the fact that a malicious or victim super host often communicates with hosts within the same subnet, resulting in high false positive rates and low accuracy. Though […]

Ver mais

Like 0

Liked Liked

technocracy

Creating GIFs in Python using Pillow (PIL Fork)

digitado ⋅ 21 de August de 2018

I was working on a personal project the other day and I needed to create some images (frames) and save them as a playable GIF. Working in Python, I excepted to find an easy solution fast but oh boy did it take me too long to find it. Here I am now, creating a blog post to help future people looking to create gifs in Python.

Ver mais

Like 0

Liked Liked

technocracy

Mathematical minimalism

digitado ⋅ 13 de April de 2026

Andrzej Odrzywolek recently posted an article on arXiv showing that you can obtain all the elementary functions from just the function and the constant 1. The following equations, taken from the paper’s supplement, show how to bootstrap addition, subtraction, multiplication, and division from the elm function. See the paper and supplement for how to obtain constants like π and functions like square and square root, as well as the standard circular and hyperbolic functions. Related posts Bootstrapping a […]

Ver mais

Like 0

Liked Liked

technocracy

Thompson Sampling for Infinite-Horizon Discounted Decision Processes

digitado ⋅ 9 de April de 2026

arXiv:2405.08253v3 Announce Type: replace Abstract: This paper develops a viable notion of learning for sampling-based algorithms that applies in broader settings than previously considered. More specifically, we model a discounted infinite-horizon MDPs with Borel state and action spaces, whose rewards and transitions depend on an unknown parameter. To analyze adaptive learning algorithms based on sampling we introduce a general canonical probability space in this setting. Since standard definitions of regret are inadequate for policy evaluation in this setting, […]

Ver mais

Like 0

Liked Liked

technocracy

Another AT&T FirstNet user gets shocking $6,200 bill, at $2 per megabyte

digitado ⋅ 13 de March de 2026

If you’re an AT&T FirstNet customer and suddenly get hit with a $6,200 charge, the good news is that it’s probably a mistake and can be corrected. But actually getting the wrong charge wiped out might not be so easy. This has now happened at least twice. In December 2024, a Texas police officer received a $6,223 bill with a $6,194 charge for using 3.1GB of data. He said he had unlimited data but was charged incorrectly after […]

Ver mais

Like 0

Liked Liked

technocracy

Rank-Accuracy Trade-off for LoRA: A Gradient-Flow Analysis

digitado ⋅ 12 de February de 2026

arXiv:2602.10212v1 Announce Type: new Abstract: Previous empirical studies have shown that LoRA achieves accuracy comparable to full-parameter methods on downstream fine-tuning tasks, even for rank-1 updates. By contrast, the theoretical underpinnings of the dependence of LoRA’s accuracy on update rank remain relatively unexplored. In this work, we compare the accuracy of rank-r LoRA updates against full-parameter updates for fine-tuning tasks from a dynamical systems perspective. We perform gradient flow analysis in both full-rank and low-rank regimes to establish […]

Ver mais

Like 0

Liked Liked

technocracy

Score-matching-based Structure Learning for Temporal Data on Networks

digitado ⋅ 14 de April de 2026

arXiv:2412.07469v3 Announce Type: replace Abstract: Causal discovery is a crucial initial step in establishing causality from empirical data and background knowledge. Numerous algorithms have been developed for this purpose. Among them, the score-matching method has demonstrated superior performance across various evaluation metrics, particularly for the commonly encountered Additive Nonlinear Causal Models. However, current score-matching-based algorithms are primarily designed to analyze independent and identically distributed (i.i.d.) data. More importantly, they suffer from high computational complexity due to the pruning […]

Ver mais

Like 0

Liked Liked