March 2026

Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models

digitado ⋅ 16 de March de 2026

Existing test-time scaling (TTS) methods for unified multimodal models (UMMs) in text-to-image (T2I) generation primarily rely on search or sampling strategies that produce only instance-level improvements, limiting the ability to learn from prior inferences and accumulate knowledge across similar prompts. To overcome these limitations, we propose Meta-TTRL, a metacognitive test-time reinforcement learning framework. Meta-TTRL performs test-time parameter optimization guided by model-intrinsic monitoring signals derived from the meta-knowledge of UMMs, achieving self-improvement and capability-level improvement at test time. Extensive […]

Ver mais

Like 0

Liked Liked

technocracy

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

digitado ⋅ 16 de March de 2026

We present the PokeAgent Challenge, a large-scale benchmark for decision-making research built on Pokemon’s multi-agent battle system and expansive role-playing game (RPG) environment. Partial observability, game-theoretic reasoning, and long-horizon planning remain open problems for frontier AI, yet few benchmarks stress all three simultaneously under realistic conditions. PokeAgent targets these limitations at scale through two complementary tracks: our Battling Track, which calls for strategic reasoning and generalization under partial observability in competitive Pokemon battles, and our Speedrunning Track, which […]

Ver mais

Like 0

Liked Liked

technocracy

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

digitado ⋅ 16 de March de 2026

Ver mais

Like 0

Liked Liked

technocracy

Self-Distillation of Hidden Layers for Self-Supervised Representation Learning

digitado ⋅ 16 de March de 2026

The landscape of self-supervised learning (SSL) is currently dominated by generative approaches (e.g., MAE) that reconstruct raw low-level data, and predictive approaches (e.g., I-JEPA) that predict high-level abstract embeddings. While generative methods provide strong grounding, they are computationally inefficient for high-redundancy modalities like imagery, and their training objective does not prioritize learning high-level, conceptual features. Conversely, predictive methods often suffer from training instability due to their reliance on the non-stationary targets of final-layer self-distillation. We introduce Bootleg, a […]

Ver mais

Like 0

Liked Liked

technocracy

F1 in China: I’ve never seen so many people in those grandstands

digitado ⋅ 16 de March de 2026

Formula 1 raced in China this past weekend, just a week after the sport kicked off its 2026 season in Australia. Most of the teams had a better handle on the sport’s complicated new cars in China, and the more traditional racetrack environment played better to the strengths of their hybrid power units, with enough hard braking zones to recharge batteries without having to sap engine power instead. We have a better idea of the grid’s current pecking […]

Ver mais

Like 0

Liked Liked

technocracy

Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing

digitado ⋅ 16 de March de 2026

While deep learning models that leverage local features have demonstrated significant potential for near-optimal routing in dense Euclidean graphs, they struggle to generalize well in sparse networks where topological irregularities require broader structural awareness. To address this limitation, we train a Cascaded Mixture of Experts (Ca-MoE) to solve the all-pairs near-shortest path (APNSP) routing problem. Our Ca-MoE is a modular two-tier architecture that supports the decision-making for forwarder selection with lower-tier experts relying on local features and upper-tier […]

Ver mais

Like 0

Liked Liked

technocracy

Introducing Disaggregated Inference on AWS powered by llm-d

digitado ⋅ 16 de March de 2026

We thank Greg Pereira and Robert Shaw from the llm-d team for their support in bringing llm-d to AWS. In the agentic and reasoning era, large language models (LLMs) generate 10x more tokens and compute through complex reasoning chains compared to single-shot replies. Agentic AI workflows also create highly variable demands and another exponential increase in processing, bogging down the inference process and degrading the user experience. As the world transitions from prototyping AI solutions to deploying AI at scale, […]

Ver mais

Like 0

Liked Liked

technocracy

Apple’s AirPods Max 2 bring H2 chip, boosted ANC in April for $549

digitado ⋅ 16 de March de 2026

Apple announced the AirPods Max 2 today, following up the original AirPods Max, which were announced in December 2020. The new model brings improved active noise cancellation (ANC) and other new features via an updated H2 chip. The AirPods Max 2 are available in the same five colorways as their predecessor. Credit: Apple Apple introduced the H2 with the AirPods Pro (2nd Generation), which came out in September 2022. The original AirPods Max released in 2021 with an H1, […]

Ver mais

Like 0

Liked Liked

technocracy

100 years later, where is Robert Goddard’s first liquid-fueled rocket?

digitado ⋅ 16 de March de 2026

It flew for only two seconds, but its impact is still felt a century later. Robert Goddard’s first liquid-fueled rocket, which lifted off from a snowy field on March 16, 1926, has been written about extensively. Earlier solid-fueled rockets existed, but liquid-fueled rockets promised the sustainability and control needed to send spacecraft and humans into Earth orbit and beyond. “The rocket’s reach was short, but it marked the moment that humanity entered a new era,” said Kevin Schindler, […]

Ver mais

Like 0

Liked Liked

technocracy

Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

digitado ⋅ 16 de March de 2026

Federated Learning (FL) preserves privacy by distributing training across devices. However, using DNNs is computationally intensive at the low-powered edge during inference. Edge deployment demands models that simultaneously optimize memory footprint and computational efficiency, a dilemma where conventional DNNs fail by exceeding resource limits. Traditional post-training binarization reduces model size but suffers from severe accuracy loss due to quantization errors. To address these challenges, we propose FedBNN, a rotation-aware binary neural network framework that learns binary representations directly […]

Ver mais

Like 0

Liked Liked