March 2026

Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration

digitado ⋅ March 4, 2026

Coordinating multiple autonomous agents to explore and serve spatially heterogeneous demand requires jointly learning unknown spatial patterns and planning trajectories that maximize task performance. Pure model-based approaches provide structured uncertainty estimates but lack adaptive policy learning, while deep reinforcement learning often suffers from poor sample efficiency when spatial priors are absent. This paper presents a hybrid belief-reinforcement learning (HBRL) framework to address this gap. In the first phase, agents construct spatial beliefs using a Log-Gaussian Cox Process (LGCP) […]


technocracy

Quoting Donald Knuth

digitado ⋅ March 4, 2026

Shock! Shock! I learned yesterday that an open problem I’d been working on for several weeks had just been solved by Claude Opus 4.6 – Anthropic’s hybrid reasoning model that had been released three weeks earlier! It seems that I’ll have to revise my opinions about “generative AI” one of these days. What a joy it is to learn not only that my conjecture has a nice solution but also to celebrate this dramatic advance in automatic deduction […]


technocracy

Link Decay Prediction in Affiliate Marketing: Turning “Alive” URLs into a Time Series Monitoring Problem

digitado ⋅ March 4, 2026

Author(s): Hernan M
Originally published on Towards AI.

Key Takeaways: Affiliate link health isn’t binary, even if most dashboards force it into green/red. A link can be “up” and still be quietly losing a third of your traffic. The strongest signal I’ve found is landing page arrival rate: across repeated tests, what fraction of attempts actually land on the intended page. Logged over time, that’s a time series with recognizable pre-failure shapes: gradual drift, volatility spikes, and […]
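As a rough illustration of the landing page arrival rate described above, the sketch below computes the fraction of test attempts that reached the intended page, both overall and over a sliding window so the values form a time series. The function names, the boolean outcome encoding, and the window size are assumptions made for this example; the article's own implementation is not shown in the excerpt.

```python
from collections import deque


def arrival_rate(outcomes):
    """Fraction of test attempts that landed on the intended page.

    `outcomes` is an iterable of truthy/falsy values, one per attempt
    (hypothetical encoding: True means the intended page was reached).
    """
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("no attempts recorded")
    return sum(1 for ok in outcomes if ok) / len(outcomes)


def rolling_arrival_rate(outcomes, window):
    """Arrival rate over the most recent `window` attempts, one value per attempt."""
    recent = deque(maxlen=window)  # old attempts fall out automatically
    series = []
    for ok in outcomes:
        recent.append(bool(ok))
        series.append(sum(recent) / len(recent))
    return series
```

A link that still resolves can show a falling rolling rate long before it fails outright, which is the kind of drift the excerpt refers to.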


technocracy

Spectacular New Discovery about the Digits of π

digitado ⋅ March 4, 2026

Everyone believes that the digits of constants such as π or √2 cannot be distinguished from a sequence of random bits. The first few trillion successfully pass all tests of randomness. However, proving that they indeed behave perfectly randomly is arguably one of the oldest and most difficult unsolved math conjectures. So far, nobody has succeeded in proving even the most basic facts for any of these constants, for instance: In the binary digits of π, is the proportion […]


technocracy

No fooling: NASA targets April 1 for Artemis II launch to the Moon

digitado ⋅ March 3, 2026

NASA has fixed the problem that forced it to remove the rocket for the Artemis II mission from its launch pad last month, but it will be a couple of weeks before officials are ready to move the vehicle back into the starting blocks at Kennedy Space Center in Florida. The 322-foot-tall (98-meter) rocket could have launched as soon as this week after it passed a key fueling test on February 21. During that test, NASA loaded the […]


technocracy

Downdetector, Speedtest sold to IT service-provider Accenture in $1.2B deal

digitado ⋅ March 3, 2026

IT consultant and services provider Accenture has agreed to buy Speedtest and Downdetector owner Ookla from Ziff Davis for $1.2 billion in cash. Accenture plans to integrate Ookla’s data products into its own offerings that are targeted at helping communications service providers, hyperscalers, government entities, and other types of customers “optimize … mission-critical Wi-Fi and 5G networks,” Accenture’s announcement today said. Ookla’s platform also includes Ekahau, which offers tools for troubleshooting and designing wireless networks, and RootMetrics, which […]


technocracy

FCC chair calls Paramount/WBD merger “a lot cleaner” than defunct Netflix deal

digitado ⋅ March 3, 2026

Paramount Skydance’s $111 billion purchase of Warner Bros. Discovery (WBD) has a notable supporter in Federal Communications Commission Chairman Brendan Carr. The FCC boss told CNBC today that the Paramount/WBD combination “is a lot cleaner” than the now-defunct Netflix deal to buy WBD. Netflix “would have had a very difficult path forward from a regulatory perspective” because of “the scope and scale” of the streaming service that would have been created by combining Netflix with WBD property HBO […]


technocracy

Gemini 3.1 Flash-Lite

digitado ⋅ March 3, 2026

Google’s latest model is an update to their inexpensive Flash-Lite family. At $0.25/million input tokens and $1.50/million output tokens, this is 1/8th the price of Gemini 3.1 Pro. It supports four different thinking levels (minimal, low, medium, high), so I had it output four different pelicans.

Tags: google, ai, generative-ai, llms, llm, gemini, llm-pricing, pelican-riding-a-bicycle, llm-release
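To make the quoted per-token prices concrete, here is a minimal sketch that turns them into a per-request cost. The two rates come from the post; the function name and the token counts in the usage note are hypothetical, and the sketch ignores any pricing tiers or discounts the provider may apply.

```python
# Prices quoted in the post, in US dollars per million tokens.
INPUT_USD_PER_MILLION = 0.25
OUTPUT_USD_PER_MILLION = 1.50


def request_cost_usd(input_tokens, output_tokens):
    """Estimated cost of one request at the quoted rates (hypothetical helper)."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000
```

For example, a request with 2,000 input tokens and 500 output tokens would cost about $0.00125 at these rates.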


technocracy

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

digitado ⋅ March 3, 2026

Frozen self-supervised representations often transfer well with only a few labels across many semantic tasks. We argue that a single geometric quantity, directional CDNV (decision-axis variance), sits at the core of two favorable behaviors: strong few-shot transfer within a task, and low interference across many tasks. We show that both emerge when variability along class-separating directions is small. First, we prove sharp non-asymptotic multiclass generalization bounds for downstream classification whose leading term is the directional CDNV. The bounds […]


technocracy

Q-Measure-Learning for Continuous State RL: Efficient Implementation and Convergence

digitado ⋅ March 3, 2026

We study reinforcement learning in infinite-horizon discounted Markov decision processes with continuous state spaces, where data are generated online from a single trajectory under a Markovian behavior policy. To avoid maintaining an infinite-dimensional, function-valued estimate, we propose the novel Q-Measure-Learning, which learns a signed empirical measure supported on visited state-action pairs and reconstructs an action-value estimate via kernel integration. The method jointly estimates the stationary distribution of the behavior chain and the Q-measure through coupled stochastic approximation, leading […]
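The reconstruction step mentioned in the abstract, recovering an action-value estimate from a signed measure by kernel integration, can be sketched for a finitely supported measure as a weighted kernel sum. Everything specific here is an assumption for illustration: the Gaussian kernel, the bandwidth, the tuple encoding of state-action pairs, and the function names. The abstract does not specify the paper's kernel or its update rule, and this sketch does not implement the learning procedure.

```python
import math


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian kernel between two state-action vectors (assumed kernel choice)."""
    squared_distance = sum((xi - yi) ** 2 for xi, yi in zip(x, y))
    return math.exp(-squared_distance / (2.0 * bandwidth ** 2))


def q_estimate(query, support, weights, bandwidth=1.0):
    """Integrate the kernel against a signed empirical measure.

    `support` holds the visited state-action pairs and `weights` their
    signed masses, so the integral reduces to a finite weighted sum.
    """
    return sum(
        w * gaussian_kernel(query, point, bandwidth)
        for point, w in zip(support, weights)
    )
```

Because the weights may be negative, the estimate can fall below zero even though the kernel itself is nonnegative.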
