January 2026

Teach Diffusion Language Models to Learn from Their Own Mistakes

digitado ⋅ 10 de January de 2026

Masked Diffusion Language Models (DLMs) achieve significant speed by generating multiple tokens in parallel. However, this parallel sampling approach, especially when using fewer inference steps, will introduce strong dependency errors and cause quality to deteriorate rapidly as the generation step size grows. As a result, reliable self-correction becomes essential for maintaining high-quality multi-token generation. To address this, we propose Decoupled Self-Correction (DSC), a novel two-stage methodology. DSC first fully optimizes the DLM’s generative ability before freezing the model […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond TPC-H: Scaling IA2 for Real-World Database Optimization

digitado ⋅ 10 de January de 2026

Table of Links Abstract and 1. Introduction Related Works 2.1 Traditional Index Selection Approaches 2.2 RL-based Index Selection Approaches Index Selection Problem Methodology 4.1 Formulation of the DRL Problem 4.2 Instance-Aware Deep Reinforcement Learning for Efficient Index Selection System Framework of IA2 5.1 Preprocessing Phase 5.2 RL Training and Application Phase Experiments 6.1 Experimental Setting 6.2 Experimental Results 6.3 End-to-End Performance Comparison 6.4 Key Insights Conclusion and Future Work, and References 7 Conclusion and Future Work This study […]

Ver mais

Like 0

Liked Liked

technocracy

Adaptive Action Pruning: Scaling Index Selection for Unseen Workloads

digitado ⋅ 10 de January de 2026

Ver mais

Like 0

Liked Liked

technocracy

Beyond SWIRL: Scalable Database Indexing for Dynamic Workloads

digitado ⋅ 10 de January de 2026

Ver mais

Like 0

Liked Liked

technocracy

SpaceX gets FCC permission to launch another 7,500 Starlink satellites

digitado ⋅ 10 de January de 2026

SpaceX today received US permission to launch another 7,500 second-generation Starlink satellites, bringing its total authorization to 15,000 Gen2 satellites including those previously approved. “Under this grant, SpaceX is authorized to construct, deploy, and operate an additional 7,500 Gen2 Starlink satellites, bringing the total to 15,000 satellites worldwide,” the Federal Communications Commission announced today. “This expansion will enable SpaceX to deliver high-speed, low-latency Internet service globally, including enhanced mobile and supplemental coverage from space.” The FCC gave SpaceX permission […]

Ver mais

Like 0

Liked Liked

technocracy

Antimemetics: An Essential Field Guide

digitado ⋅ 10 de January de 2026

Memes have been part of the discourse since Richard Dawkins published The Selfish Gene in 1976. Dawkins defined “memes” as units of cultural transmission—ideas and concepts that spread through imitation. Any idea that replicates by passing from one mind to another is a meme. A meme can be a belief system, a set of behaviors, an ideology, a viral catchphrase, a fashion trend, a cultural artifact, or an urban legend. But if memes are defined by virality, antimemes […]

Ver mais

Like 0

Liked Liked

technocracy

ESA considers righting the wrongs of Ariane 6 by turning it into a Franken-rocket

digitado ⋅ 10 de January de 2026

It took a while, but a consensus has emerged in Europe that the continent’s space industry needs to develop reusable rockets. How to do it and how much to spend on it remain unresolved questions. Much of the discourse around reusable rockets in Europe has focused on developing a brand new rocket that might eventually replace the Ariane 6, which debuted less than two years ago but still uses the use it and lose it model embraced by […]

Ver mais

Like 0

Liked Liked

technocracy

Crossmodal search with Amazon Nova Multimodal Embeddings

digitado ⋅ 10 de January de 2026

Amazon Nova Multimodal Embeddings processes text, documents, images, video, and audio through a single model architecture. Available through Amazon Bedrock, the model converts different input modalities into numerical embeddings within the same vector space, supporting direct similarity calculations regardless of content type. We developed this unified model to reduce the need for separate embedding models, which complicate architectures, are difficult to maintain and operate, and further limit use cases to a one-dimensional approach. In this post, we explore […]

Ver mais

Like 0

Liked Liked

technocracy

Fly’s new Sprites.dev addresses both developer sandboxes and API sandboxes at the same time

digitado ⋅ 10 de January de 2026

New from Fly.io today: Sprites.dev. Here’s their blog post and YouTube demo. It’s an interesting new product that’s quite difficult to explain – Fly call it “Stateful sandbox environments with checkpoint & restore” but I see it as hitting two of my current favorite problems: a safe development environment for running coding agents and an API for running untrusted code in a secure sandbox. Disclosure: Fly sponsor some of my work. They did not ask me to write […]

Ver mais

Like 0

Liked Liked

technocracy

Monkey Jump : MoE-Style PEFT for Efficient Multi-Task Learning

digitado ⋅ 10 de January de 2026

Mixture-of-experts variants of parameter-efficient fine-tuning enable per-token specialization, but they introduce additional trainable routers and expert parameters, increasing memory usage and training cost. This undermines the core goal of parameter-efficient fine-tuning. We propose Monkey Jump, a method that brings mixture-of-experts-style specialization to parameter-efficient fine-tuning without introducing extra trainable parameters for experts or routers. Instead of adding new adapters as experts, Monkey Jump treats the adapters already present in each Transformer block (such as query, key, value, up, and […]

Ver mais

Like 0

Liked Liked