digitado – Page 247

Learning under noisy supervision is governed by a feedback-truth gap

digitado ⋅ 18 February 2026

When feedback is absorbed faster than task structure can be evaluated, the learner will favor feedback over truth. A two-timescale model shows this feedback-truth gap is inevitable whenever the two rates differ and vanishes only when they match. We test this prediction across neural networks trained with noisy labels (30 datasets, 2,700 runs), human probabilistic reversal learning (N = 292), and human reward/punishment learning with concurrent EEG (N = 25). In each system, truth is defined operationally: held-out […]

technocracy

I built a multi-agent asteroid racing environment in Godot 4.6 and trained the pilots with RL

digitado ⋅ 13 April 2026

Hey, this is the second episode in a small series where I’m experimenting with reinforcement learning in Godot 4.6, hoping to build a game using it once I am confident enough. In this one I took the navigation setup from the first episode and turned it into a racing environment: 25 ships, checkpoints, asteroid fields, a timeout system, and elimination on collision. The agents don’t use scripted steering, racing lines, or hand-authored behavior. They only get observations, raw […]

technocracy

System-Technology Co-Optimization of Bitline Routing and Bonding Pathways in Monolithic 3D DRAM Architectures

digitado ⋅ 16 March 2026

arXiv:2603.12461v1. Abstract: 3D DRAM has emerged as a promising approach for continued density scaling, but its viability is limited by routing and hybrid bonding constraints to the periphery, which may degrade sensing margin, latency, and array efficiency. With device characteristics and array parasitics extracted from TCAD, SPICE simulations are performed with peri logic in a CMOS-Bonded-Array (CBA). The analysis shows that the bitline strap architecture with amorphous oxide semiconductor (AOS) selectors is essential to manage routing […]

technocracy

Learning Discriminative and Generalizable Anomaly Detector for Dynamic Graph with Limited Supervision

digitado ⋅ 23 February 2026

Dynamic graph anomaly detection (DGAD) is critical for many real-world applications but remains challenging due to the scarcity of labeled anomalies. Existing methods are either unsupervised or semi-supervised: unsupervised methods avoid the need for labeled anomalies but often produce an ambiguous boundary, whereas semi-supervised methods can overfit to the limited labeled anomalies and generalize poorly to unseen anomalies. To address this gap, we consider a largely underexplored problem in DGAD: learning a discriminative boundary from normal/unlabeled data, while leveraging […]

technocracy

Task-tailored Pre-processing: Fair Downstream Supervised Learning

digitado ⋅ 21 January 2026

arXiv:2601.11897v1. Abstract: Fairness-aware machine learning has recently attracted various communities to mitigate discrimination against certain societal groups in data-driven tasks. For fair supervised learning, particularly in pre-processing, there have been two main categories: data fairness and task-tailored fairness. The former directly finds an intermediate distribution among the groups, independent of the type of the downstream model, so a learned downstream classification/regression model returns similar predictive scores to individuals inputting the same covariates irrespective of their […]

technocracy

The most popular blogs of Hacker News in 2025

digitado ⋅ 2 January 2026

Michael Lynch maintains HN Popularity Contest, a site that tracks personal blogs on Hacker News and scores them based on how well they perform on that platform. The engine behind the project is the domain-meta.csv CSV on GitHub, a hand-curated list of known personal blogs with author, bio, and tag metadata, which Michael uses to separate out personal blog posts from other types of content. I came top […]

technocracy

Reinforcement learning-based dynamic cleaning scheduling framework for solar energy system

digitado ⋅ 8 March 2026

Advancing autonomous green technologies in solar photovoltaic (PV) systems is key to improving sustainability and efficiency in renewable energy production. This study presents a reinforcement learning (RL)-based framework to autonomously optimize the cleaning schedules of PV panels in arid regions, where soiling from dust and other airborne particles significantly reduces energy output. By employing advanced RL algorithms, Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC), the framework dynamically adjusts cleaning intervals based on uncertain environmental conditions. The proposed […]

technocracy

A principled framework for uncertainty decomposition in TabPFN

digitado ⋅ 5 February 2026

arXiv:2602.04596v1. Abstract: TabPFN is a transformer that achieves state-of-the-art performance on supervised tabular tasks by amortizing Bayesian prediction into a single forward pass. However, there is currently no method for uncertainty decomposition in TabPFN. Because it behaves, in an idealised limit, as a Bayesian in-context learner, we cast the decomposition challenge as a Bayesian predictive inference (BPI) problem. The main computational tool in BPI, predictive Monte Carlo, is challenging to apply here as it requires […]

technocracy

Mutually Causal Semantic Distillation Network for Zero-Shot Learning

digitado ⋅ 18 March 2026

Zero-shot learning (ZSL) aims to recognize unseen classes in the open world, guided by side information (e.g., attributes). Its key task is to infer the latent semantic knowledge between visual and attribute features on seen classes, and thus conduct a desirable semantic knowledge transfer from seen classes to unseen ones. Prior works simply utilize unidirectional attention in a weakly supervised manner to learn the spurious and limited latent semantic representations, which fail to effectively discover the intrinsic semantic […]

technocracy

NVIDIA Nemotron 3 Nano 30B MoE model is now available in Amazon SageMaker JumpStart

digitado ⋅ 11 February 2026

Today we’re excited to announce that the NVIDIA Nemotron 3 Nano 30B model with 3B active parameters is now generally available in the Amazon SageMaker JumpStart model catalog. You can accelerate innovation and deliver tangible business value with Nemotron 3 Nano on Amazon Web Services (AWS) without having to manage model deployment complexities. You can power your generative AI applications with Nemotron capabilities using the managed deployment capabilities offered by SageMaker JumpStart. Nemotron 3 Nano is a small […]
