May 2026

Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup

digitado ⋅ 15 de May de 2026

Zyphra, the San Francisco-based AI lab behind the ZAYA1 model family, released ZAYA1-8B-Diffusion-Preview — a preview of its early work in diffusion-language models. The release demonstrates that an existing autoregressive language model can be converted into a discrete diffusion model with no systematic loss of evaluation performance, while delivering substantial inference speedups on AMD hardware. https://www.zyphra.com/post/zaya1-8b-diffusion-preview The Problem With Autoregressive Decoding To understand why this matters, it helps to first understand how most language models generate text today. […]

Ver mais

Like 0

Liked Liked

technocracy

The OpenAI trial wraps up, and the Musk founder machine keeps spinning

digitado ⋅ 15 de May de 2026

The Musk v. Altman trial came to a close this week, and the final arguments kept circling back to one question: can we trust the people in charge of AI? All of this is playing out as SpaceX charges toward what could be one of the largest IPOs in American history, with a whole generation of founders already spinning out […]

Ver mais

Like 0

Liked Liked

technocracy

Three’s a party: US, China, and now Russia are on the prowl in GEO

digitado ⋅ 15 de May de 2026

The world’s leading space powers desperately want to know what the others are up to high above the equator. For more than a decade, the US military has operated a fleet of “inspector” satellites designed to sidle up to other spacecraft in geosynchronous orbit and take pictures. China started launching its satellites for a similar mission in 2018. Ars has written about these activities in geosynchronous orbit (GEO) before, but the last few months have seen a couple […]

Ver mais

Like 0

Liked Liked

technocracy

Ebola outbreak with uncommon strain erupts in Congo and Uganda; 65 deaths

digitado ⋅ 15 de May de 2026

The Africa Centres for Disease Control and Prevention on Friday confirmed an Ebola outbreak in the Northeastern Ituri province of the Democratic Republic of the Congo. Officials in Uganda subsequently reported that the deadly hemorrhagic disease had spilled over the border, with one “imported” confirmed case identified in Kampala, the capital. So far, the DRC has reported 246 suspected cases and 65 deaths, mainly in the Mongwalu and Rwampara health zones. Although it is now just being reported, […]

Ver mais

Like 0

Liked Liked

technocracy

Send the arXiv AI-generated slop, get a yearlong vacation from submissions

digitado ⋅ 15 de May de 2026

AI-generated slop has shown up everywhere, including in the peer-reviewed literature. Fake citations, unedited prompt responses, and nonsensical diagrams have all slipped past editors and peer reviewers, and it’s not always clear if there are any consequences for the people responsible. Now, it appears that a number of scientific fields will be enforcing rules against AI-generated problems even before peer review or journals get involved. One of the people involved in the physics and astronomy preprint server arXiv […]

Ver mais

Like 0

Liked Liked

technocracy

The Privacy Price of Tail-Risk Learning: Effective Tail Sample Size in Differentially Private CVaR Optimization

digitado ⋅ 15 de May de 2026

Differential privacy changes the effective sample size governing CVaR learning. For tail mass $τ$, the privacy-relevant sample size is not $n$, but $nτ$; equivalently, the effective private tail sample size is $εnτ$. Private CVaR excess risk decomposes into ordinary tail-risk statistical error and a privacy price. This decomposition is complete for scalar estimation and finite classes: scalar estimation has rate $Θ(B min{1,(nτ)^{-1/2}+(εnτ)^{-1}})$, and finite classes of size $M$ have rate $Θ(B min{1,sqrt{log(2M)/(nτ)}+log(2M)/(εnτ)})$. These complete rates hold under pure […]

Ver mais

Like 0

Liked Liked

technocracy

Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion [R]

digitado ⋅ 15 de May de 2026

Paper: https://arxiv.org/abs/2605.12825 Code: https://github.com/chiennv2000/orthrus Disclosure: co-author. Idea: Inject a trainable diffusion attention module into each layer of a frozen AR Transformer. Both heads share one KV cache. Diffusion head projects K=32 tokens in parallel; AR head verifies in a second pass and accepts the longest matching prefix. Output distribution is provably identical to the base model. Results: Up to 7.8× TPF, ~6× wall-clock on MATH-500. 16% of params trained, <1B tokens, 24h on 8×H200. vs. diffusion LMs (Dream, […]

Ver mais

Like 0

Liked Liked

technocracy

Imitation learning for clinical decision support in pediatric ECMO

digitado ⋅ 15 de May de 2026

Pediatric critical care is a dynamic, high-stakes process involving constant monitoring and adjustments in life-saving treatments. Modeling these interventions is crucial for effective decision support. To address the challenges of high complexity and data scarcity in pediatric Extracorporeal Membrane Oxygenation (ECMO), we frame clinical decision-making as learning to act from trajectories, i.e., imitation learning that learns action models from observational data, with a key feature that actions are not directly observed. We consider TabPFN, a recent transformer-based approach […]

Ver mais

Like 0

Liked Liked

technocracy

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control

digitado ⋅ 15 de May de 2026

Real-world control systems frequently operate under emph{piecewise stationary} conditions, where dynamics remain stable for extended periods before undergoing abrupt regime changes. Standard robust RL methods face a fundamental dilemma: a globally conservative policy wastes performance during stable periods, while a locally adaptive policy risks catastrophic failure when the regime changes undetected. We propose textbf{BAPR} (Bayesian Amnesic Piecewise-Robust SAC), which unifies Bayesian Online Change Detection (BOCD) with robust ensemble RL. The BAPR operator — a convex combination of mode-conditional […]

Ver mais

Like 0

Liked Liked

technocracy

Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking

digitado ⋅ 15 de May de 2026

Reinforcement learning (RL) allows vision-language-action (VLA) policies to generalize beyond their training distribution by optimizing directly for task success, but post-training is computationally expensive. A natural response has been to speed rollout collection through faster simulators and world models. In GRPO-based VLA RL, we find that the dominant cost lies elsewhere: gradient computation accounts for approximately 78% of wall-clock time per step in our runs, while rollout collection accounts for only 21%. Gradient cost dominates because much of […]

Ver mais

Like 0

Liked Liked