January 2026

Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed

digitado ⋅ January 28, 2026

Safe Reinforcement Learning (RL) algorithms are typically evaluated under fixed training conditions. We investigate whether training-time safety guarantees transfer to deployment under distribution shift, using diabetes management as a safety-critical testbed. We benchmark safe RL algorithms on a unified clinical simulator and reveal a safety generalization gap: policies satisfying constraints during training frequently violate safety requirements on unseen patients. We demonstrate that test-time shielding, which filters unsafe actions using learned dynamics models, effectively restores safety across algorithms and […]


technocracy

High-dimensional learning dynamics of multi-pass Stochastic Gradient Descent in multi-index models

digitado ⋅ January 28, 2026

We study the learning dynamics of a multi-pass, mini-batch Stochastic Gradient Descent (SGD) procedure for empirical risk minimization in high-dimensional multi-index models with isotropic random data. In an asymptotic regime where the sample size $n$ and data dimension $d$ increase proportionally, for any sub-linear batch size $\kappa \asymp n^{\alpha}$ where $\alpha \in [0,1)$, and for a commensurate “critical” scaling of the learning rate, we provide an asymptotically exact characterization of the coordinate-wise dynamics of SGD. This characterization takes the form […]



MapPFN: Learning Causal Perturbation Maps in Context

digitado ⋅ January 28, 2026

Planning effective interventions in biological systems requires treatment-effect models that adapt to unseen biological contexts by identifying their specific underlying mechanisms. Yet single-cell perturbation datasets span only a handful of biological contexts, and existing methods cannot leverage new interventional evidence at inference time to adapt beyond their training data. To meta-learn a perturbation effect estimator, we present MapPFN, a prior-data fitted network (PFN) pretrained on synthetic data generated from a prior over causal perturbations. Given a set of […]



Google is Quietly Testing Voice Cloning & GitHub Imports in AI Studio

digitado ⋅ January 28, 2026

Key Highlights: Google appears to be all set for a massive push into native audio, and AI Studio is where the first signs are starting to show. Over the past few days, users digging through AI Studio have spotted an interesting new option in the interface. A hidden “Create Your Voice” option has been spotted in AI Studio. The news comes via Testing Catalog, which reports that when you select the Flash native audio preview model, powered by […]



An efficient, accurate, and interpretable machine learning method for computing probability of failure

digitado ⋅ January 28, 2026

We introduce a novel machine learning method called the Penalized Profile Support Vector Machine based on the Gabriel edited set for the computation of the probability of failure for a complex system as determined by a threshold condition on a computer model of system behavior. The method is designed to minimize the number of evaluations of the computer model while preserving the geometry of the decision boundary that determines the probability. It employs an adaptive sampling strategy designed […]



Adding dynamic features to an aggressively cached website

digitado ⋅ January 28, 2026

My blog uses aggressive caching: it sits behind Cloudflare with a 15-minute cache header, which guarantees it can survive even the largest traffic spike to any given page. I’ve recently added a couple of dynamic features that work in spite of that full-page caching. Here’s how those work. Edit links that are visible only to me: This is a Django site and I manage it through the Django admin. I have four types of content – entries, […]



Site catering to online criminals has been seized by the FBI

digitado ⋅ January 28, 2026

RAMP—the predominantly Russian-language online bazaar that billed itself as the “only place ransomware allowed”—had its dark web and clear web sites seized by the FBI as the agency tries to combat the growing scourge threatening critical infrastructure and organizations around the world. Visits to both sites on Wednesday returned pages that said the FBI had taken control of the RAMP domains, which mirrored each other. RAMP has been among the dwindling number of online crime forums to operate […]



Seven things to know about how Apple’s Creator Studio subscriptions work

digitado ⋅ January 28, 2026

Apple’s new Creator Studio subscription bundle officially launches today, offering access to a wide range of updated professional apps for an all-or-nothing price of $12.99 a month or $129 a year. Teachers and students can get the same apps for $2.99 a month, or $29.99 a year. The bundle includes either access to or enhanced features for a total of 10 Apple apps, though the base versions of several of these are available for free to all Mac […]



The Five Levels: from Spicy Autocomplete to the Dark Factory

digitado ⋅ January 28, 2026

Dan Shapiro proposes a five-level model of AI-assisted programming, inspired by the five (or rather six, it’s zero-indexed) levels of driving automation: Spicy autocomplete, aka original GitHub Copilot or copying and pasting snippets from ChatGPT. The coding intern, writing unimportant snippets and boilerplate with full human review. The junior developer, pair programming with the model but still reviewing every line. The developer. Most code is generated by […]



Stranded boys struggle to survive in Lord of the Flies trailer

digitado ⋅ January 28, 2026

BBC One has adapted William Golding’s classic 1954 novel Lord of the Flies into a new miniseries and just dropped the first trailer. The book has been adapted for film three times since its publication and also inspired the Emmy-nominated TV series Yellowjackets (renewed for its fourth and final season this year). This BBC miniseries apparently has the support of the Golding family and is expected to hew quite closely to the novel. (Spoilers for the 1954 novel […]
