digitado – Page 135

The Efficiency Wall: Why the Next 1,000x Leap Isn’t More GPUs

digitado ⋅ February 12, 2026

Author(s): Kapardhi kannekanti Originally published on Towards AI. The fundamental flaw in modern AI architecture, and the biological “hack” to solve it. We are currently witnessing a massive misallocation of capital in Silicon Valley and beyond. We are burning billions of dollars to build bigger “statues” — massive, frozen models that know everything but can do nothing in the real world without a constant tether to a massive server farm. The fundamental shift from rigid, “crystal” AI to […]

technocracy

Persistent Entropy as a Detector of Phase Transitions

digitado ⋅ February 11, 2026

arXiv:2602.09058v1 Announce Type: new Abstract: Persistent entropy (PE) is an information-theoretic summary statistic of persistence barcodes that has been widely used to detect regime changes in complex systems. Despite its empirical success, a general theoretical understanding of when and why persistent entropy reliably detects phase transitions has remained limited, particularly in stochastic and data-driven settings. In this work, we establish a general, model-independent theorem providing sufficient conditions under which persistent entropy provably separates two phases. We show that […]
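As a rough illustration of the statistic the abstract refers to, the sketch below computes persistent entropy under its usual definition: the Shannon entropy of the bar lengths of a barcode, normalized to sum to one. The function name `persistent_entropy`, the input format (a list of `(birth, death)` pairs), and the choice to skip infinite or zero-length bars are assumptions made for this example; it is not taken from the paper.

```python
import math

def persistent_entropy(barcode):
    """Shannon entropy of normalized bar lengths.

    `barcode` is a list of (birth, death) pairs. Bars with infinite death
    or non-positive length are skipped (an assumption of this sketch).
    """
    lengths = [
        death - birth
        for birth, death in barcode
        if math.isfinite(death) and death > birth
    ]
    total = sum(lengths)
    if total == 0:
        return 0.0  # no usable bars: define the entropy as zero
    # p_i = l_i / L, PE = -sum_i p_i * log(p_i)
    return -sum((l / total) * math.log(l / total) for l in lengths)
```

For a barcode whose bars all have the same length the value is the logarithm of the number of bars, and it shrinks as a few long bars start to dominate.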

technocracy

Active Query Synthesis for Preference Learning

digitado ⋅ May 25, 2026

Efficient learning of user preferences is crucial for many modern decision making systems but typically requires costly labeled data. Active learning reduces this cost, yet standard methods are computationally expensive due to pool-based evaluation. Further, most methods assume all query feedback is equally reliable, ignoring that pairwise queries between nearly identical or entirely dissimilar items yield ambiguous, low-confidence responses. To address the issue of feedback reliability, we introduce a novel confidence aware response model that explicitly accounts for […]

technocracy

On Fun for Teaching Large Programming Courses

digitado ⋅ January 16, 2026

arXiv:2601.09842v1 Announce Type: new Abstract: Teaching software development basics to hundreds of students in a frontal setting is cost-efficient and thus still common in universities. However, in a large lecture hall, students can easily get bored, distracted, and disengaged. The frontal setting can also frustrate lecturers since interaction opportunities are limited and hard to scale. Fun activities can activate students and, if well designed, can also help remember and reflect on abstract software development concepts. We present a […]

technocracy

NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning

digitado ⋅ March 7, 2026

Multi-agent reinforcement learning (MARL) is increasingly used to design learning-enabled agents that interact in shared environments. However, training MARL algorithms in general-sum games remains challenging: learning dynamics can become unstable, and convergence guarantees typically hold only in restricted settings such as two-player zero-sum or fully cooperative games. Moreover, when agents have heterogeneous and potentially conflicting preferences, it is unclear what system-level objective should guide learning. In this paper, we propose a new MARL pipeline called Near-Potential Policy Optimization […]

technocracy

World Model for non-linear control

digitado ⋅ June 25, 2026

I have a question: does the complexity of the training env (the playground) have any effect on RL agents? For example, if you are building a general Multi SAC agent, should you give it the ability to change its own size? submitted by /u/d13maxx

technocracy

Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

digitado ⋅ February 16, 2026

arXiv:2602.12430v1 Announce Type: new Abstract: The transition from monolithic language models to modular, skill-equipped agents marks a defining shift in how large language models (LLMs) are deployed in practice. Rather than encoding all procedural knowledge within model weights, agent skills — composable packages of instructions, code, and resources that agents load on demand — enable dynamic capability extension without retraining. It is formalized in a paradigm of progressive disclosure, portable skill definitions, and integration with the Model Context […]

technocracy

Commvault launches a ‘Ctrl-Z’ for cloud AI workloads

digitado ⋅ April 15, 2026

Enterprise cloud environments now have access to an undo feature for AI agents following the deployment of Commvault AI Protect. Autonomous software now roams across infrastructure, potentially deleting files, reading databases, spinning up server clusters, and even rewriting access policies. Commvault identified this governance issue and the data protection vendor has launched AI Protect, a system designed to discover, monitor, and forcefully roll back the actions of autonomous models operating inside AWS, Microsoft Azure, and Google Cloud. Traditional […]

technocracy

Trump FCC asks public to comment on whether ABC’s The View is a news show

digitado ⋅ May 22, 2026

The Federal Communications Commission is escalating its attack on ABC’s The View with a proceeding that seeks public comment on whether the talk show is a “bona fide news interview program.” The FCC Media Bureau today issued a public notice seeking opinions on whether The View qualifies for the bona fide news exemption to the equal-time rule, which requires equal time for opposing political candidates on non-news programming. The probe of The View is driven by Chairman Brendan […]

technocracy

Sparsified-Learning for High-Dimensional Heavy-Tailed Locally Stationary Time Series, Concentration and Oracle Inequalities

digitado ⋅ February 10, 2026

arXiv:2504.06477v2 Announce Type: replace Abstract: Sparse learning is ubiquitous in many machine learning tasks. It aims to regularize the goodness-of-fit objective by adding a penalty term to encode structural constraints on the model parameters. In this paper, we develop a flexible sparse learning framework tailored to high-dimensional heavy-tailed locally stationary time series (LSTS). The data-generating mechanism incorporates a regression function that changes smoothly over time and is observed under noise belonging to the class of sub-Weibull and regularly […]
