March 2026

Contextual Counterfactual Credit Assignment for Multi-Agent Reinforcement Learning in LLM Collaboration

digitado ⋅ 6 March 2026

Cooperative multi-agent reinforcement learning (MARL) systems powered by large language models (LLMs) are frequently optimized via sparse terminal-only feedback. This shared signal entangles upstream decisions, obstructing accurate decision-level credit assignment. To address this trajectory-level diffusion, we introduce Contextual Counterfactual Credit Assignment (C3). Instead of distributing rewards across an entire episode, C3 isolates the causal impact of individual messages by freezing the exact transcript-derived context, evaluating context-matched alternatives via fixed-continuation replay, and applying a leave-one-out (LOO) baseline. This localized […]
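As a rough illustration of the leave-one-out idea named in the teaser, the sketch below scores each alternative message against the mean return of the other alternatives sampled for the same frozen context. The excerpt does not give the paper's exact estimator, so the function name `loo_advantages` and the example returns are assumptions made here for illustration only.

```python
def loo_advantages(returns):
    """Leave-one-out advantages: each return minus the mean of the others.

    `returns` holds one terminal reward per alternative message, all
    evaluated from the same frozen context (hypothetical setup).
    """
    n = len(returns)
    if n < 2:
        raise ValueError("a LOO baseline needs at least two alternatives")
    total = sum(returns)
    # Baseline for item i is the mean of the remaining n - 1 returns.
    return [r - (total - r) / (n - 1) for r in returns]


# Hypothetical terminal rewards for four alternative messages.
returns = [1.0, 0.0, 0.0, 1.0]
advantages = loo_advantages(returns)
print(advantages)
```

Because every alternative is compared only with its siblings from the same context, the advantages sum to zero and alternatives that beat their peers receive positive credit.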


technocracy

Google’s new command-line tool can plug OpenClaw into your Workspace data

digitado ⋅ 6 March 2026

The command line is hot again. For some people, command lines were never not hot, of course, but it’s becoming more common now in the age of AI. Google launched a Gemini command-line tool last year, and now it has a new AI-centric command-line option for cloud products. The new Google Workspace CLI bundles the company’s existing cloud APIs into a package that makes it easy to integrate with a variety of AI tools, including OpenClaw. How do […]


technocracy

Feds take notice of iOS vulnerabilities exploited under mysterious circumstances

digitado ⋅ 6 March 2026

The Cybersecurity and Infrastructure Security Agency has ordered federal agencies to patch three critical iOS vulnerabilities that were exploited over a 10-month span in hacking campaigns conducted by three distinct groups. The hacking campaigns came to light on Thursday in a report published by Google. All three campaigns used Coruna, the name of an advanced hacking kit that amassed 23 separate iOS exploits into five potent exploit chains. While some of the vulnerabilities had been exploited as zero-days […]


technocracy

Multi-Agent Reinforcement Learning with Submodular Reward

digitado ⋅ 6 March 2026

In this paper, we study cooperative multi-agent reinforcement learning (MARL) where the joint reward exhibits submodularity, a natural property capturing diminishing marginal returns when adding agents to a team. Unlike standard MARL with additive rewards, submodular rewards model realistic scenarios where agent contributions overlap (e.g., multi-drone surveillance, collaborative exploration). We provide the first formal framework for this setting and develop algorithms with provable guarantees on sample efficiency and regret bounds. For known dynamics, our greedy policy […]
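To make the diminishing-returns property concrete, here is a small sketch using area coverage as the team reward, in the spirit of the multi-drone surveillance example. The drone names, footprints, and helper functions are hypothetical and are not taken from the paper.

```python
def coverage(team, footprints):
    """Team reward: number of distinct cells covered by the team."""
    covered = set()
    for agent in team:
        covered |= footprints[agent]
    return len(covered)


def marginal_gain(agent, team, footprints):
    """Extra coverage obtained by adding `agent` to `team`."""
    return coverage(team | {agent}, footprints) - coverage(team, footprints)


# Hypothetical footprints: the grid cells each drone can observe.
footprints = {
    "d1": {1, 2, 3},
    "d2": {3, 4},
    "d3": {2, 3, 4, 5},
}

small_team = frozenset({"d1"})
large_team = frozenset({"d1", "d2"})

gain_small = marginal_gain("d3", small_team, footprints)  # adds cells 4 and 5
gain_large = marginal_gain("d3", large_team, footprints)  # adds only cell 5
print(gain_small, gain_large)  # 2 1
```

Adding `d3` to the larger team helps less than adding it to the smaller one, because its footprint overlaps with cells the team already covers. An additive reward would assign `d3` the same contribution in both cases.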


technocracy

NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning

digitado ⋅ 6 March 2026

The growing scale of deep learning demands distributed training frameworks that jointly reason about parallelism, memory, and network topology. Prior work often relies on heuristic or topology-agnostic search, handling communication and memory separately. Without per-device memory awareness, these methods typically ensure feasibility post hoc by sharding parameters and activations across many devices, increasing synchronization, inflating communication, and underutilizing compute, which limits scalability and efficiency on real datacenter networks. We present NEST, a network-, compute-, and memory-aware device placement framework that […]


technocracy

Your Life as an RPG: Why Lifespans Feels Uncomfortably True

digitado ⋅ 6 March 2026

Simulation theory has been thrown around the internet for years, and its roots go back to the early days of computing. It has long been theorized that everything around us is the result of a computer-generated simulation, a fake reality that is fed to us and presented as real. The Matrix is the first popular reference to simulation theory, and it made the idea not only better understood by the general public but also more widely accepted. Technology Acceleration Moore’s Law […]


technocracy

Asteroid defense mission shifted the orbit of more than its target

digitado ⋅ 6 March 2026

On September 26, 2022, NASA’s Double Asteroid Redirection Test (DART) spacecraft crashed into a binary asteroid system. By intentionally ramming a probe into the 160-meter-wide moonlet named Dimorphos, the smaller of the two asteroids, humanity demonstrated that the kinetic impact method of planetary defense actually works. The immediate result was that Dimorphos’ orbital period around Didymos, its larger parent body, was slashed by 33 minutes. Of course, altering a moonlet’s local orbit doesn’t seem like enough to safeguard […]


technocracy

Boosting deep Reinforcement Learning using pretraining with Logical Options

digitado ⋅ 6 March 2026

Deep reinforcement learning agents are often misaligned, as they over-exploit early reward signals. Recently, several symbolic approaches have addressed these challenges by encoding sparse objectives along with aligned plans. However, purely symbolic architectures are complex to scale and difficult to apply to continuous settings. Hence, we propose a hybrid approach, inspired by humans’ ability to acquire new skills. We use a two-stage framework that injects symbolic structure into neural-based reinforcement learning agents without sacrificing the expressivity of deep […]


technocracy

How moss helped convict grave robbers of a Chicago cemetery

digitado ⋅ 6 March 2026

Back in 2009, residents were scandalized when employees at Burr Oak Cemetery in the Chicago suburb of Alsip were accused of exhuming old graves in order to resell the burial plots, unceremoniously dumping older remains in another area on the grounds. The perpetrators were tried and convicted in 2015, but the forensic evidence of the moss that helped convict them has now been detailed in a new paper published in the journal Forensic Sciences Research. It’s a follow-up […]


technocracy

Musk fails to block California data disclosure law he fears will ruin xAI

digitado ⋅ 6 March 2026

Elon Musk’s xAI has lost its bid for a preliminary injunction that would have temporarily blocked California from enforcing a law that requires AI firms to publicly share information about their training data. xAI had tried to argue that California’s Assembly Bill 2013 (AB 2013) forced AI firms to disclose carefully guarded trade secrets. The law requires AI developers whose models are accessible in the state to clearly explain which dataset sources were used to train models, when […]
