digitado – Page 108

Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

digitado ⋅ 23 de February de 2026

Scaling cooperative multi-agent reinforcement learning (MARL) is fundamentally limited by cross-agent noise: when agents share a common reward, the actions of all $N$ agents jointly determine each agent’s learning signal, so cross-agent noise grows with $N$. In the policy gradient setting, per-agent gradient estimate variance scales as $Θ(N)$, yielding sample complexity $mathcal{O}(N/ε)$. We observe that many domains — cloud computing, transportation, power systems — have differentiable analytical models that prescribe efficient system states. In this work, we propose […]

Ver mais

Like 0

Liked Liked

technocracy

Closed-form conditional diffusion models for data assimilation

digitado ⋅ 2 de April de 2026

arXiv:2603.21291v2 Announce Type: replace Abstract: We propose closed-form conditional diffusion models for data assimilation. Diffusion models use data to learn the score function (defined as the gradient of the log-probability density of a data distribution), allowing them to generate new samples from the data distribution by reversing a noise injection process. While it is common to train neural networks to approximate the score function, we leverage the analytical tractability of the score function to assimilate the states of […]

Ver mais

Like 0

Liked Liked

technocracy

EngineAD: A Real-World Vehicle Engine Anomaly Detection Dataset

digitado ⋅ 30 de March de 2026

arXiv:2603.25955v1 Announce Type: new Abstract: The progress of Anomaly Detection (AD) in safety-critical domains, such as transportation, is severely constrained by the lack of large-scale, real-world benchmarks. To address this, we introduce EngineAD, a novel, multivariate dataset comprising high-resolution sensor telemetry collected from a fleet of 25 commercial vehicles over a six-month period. Unlike synthetic datasets, EngineAD features authentic operational data labeled with expert annotations, distinguishing normal states from subtle indicators of incipient engine faults. We preprocess the […]

Ver mais

Like 0

Liked Liked

technocracy

Inside Cloud-Scale Systems: A Discussion with Abhinav Sharma

digitado ⋅ 7 de January de 2026

The evolution of cloud computing and artificial intelligence has fundamentally transformed how enterprises build and scale technology platforms. Modern cloud infrastructure must handle millions of concurrent users across dozens of geographic regions while maintaining security, reliability, and performance standards that meet the demands of government agencies and Fortune 500 companies alike. Engineers working at this scale navigate complex distributed systems, automate buildout pipelines, and build sophisticated monitoring frameworks that can detect potential outages before they impact customers. The […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Nonlinear Heterogeneity in Physical Kolmogorov-Arnold Networks

digitado ⋅ 20 de January de 2026

Physical neural networks typically train linear synaptic weights while treating device nonlinearities as fixed. We show the opposite – by training the synaptic nonlinearity itself, as in Kolmogorov-Arnold Network (KAN) architectures, we yield markedly higher task performance per physical resource and improved performance-parameter scaling than conventional linear weight-based networks, demonstrating ability of KAN topologies to exploit reconfigurable nonlinear physical dynamics. We experimentally realise physical KANs in silicon-on-insulator devices we term ‘Synaptic Nonlinear Elements’ (SYNEs), operating at room temperature, […]

Ver mais

Like 0

Liked Liked

technocracy

GraphQL Is the Native Language of AI Agents

digitado ⋅ 5 de April de 2026

Your APIs were designed for humans. That’s about to be a problem. When Facebook’s engineering team designed GraphQL in 2012, they were solving a mobile problem: REST endpoints were returning too much data over slow networks, and iOS clients were paying the cost in latency. The solution — let the client declare exactly what it needs, enforce that contract through a typed schema, and expose everything about the API through introspection — turned out to solve a different problem entirely, one Facebook couldn’t […]

Ver mais

Like 0

Liked Liked

technocracy

Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

digitado ⋅ 4 de March de 2026

arXiv:2603.02353v1 Announce Type: new Abstract: Writing is a foundational literacy skill that underpins effective communication, fosters critical thinking, facilitates learning across disciplines, and enables individuals to organize and articulate complex ideas. Consequently, writing assessment plays a vital role in evaluating language proficiency, communicative effectiveness, and analytical reasoning. The rapid advancement of large language models (LLMs) has made it increasingly easy to generate coherent, high-quality essays, raising significant concerns about the authenticity of student-submitted work. This chapter first provides […]

Ver mais

Like 0

Liked Liked

technocracy

Joint Learning of Hierarchical Neural Options and Abstract World Model

digitado ⋅ 2 de February de 2026

Building agents that can perform new skills by composing existing skills is a long-standing goal of AI agent research. Towards this end, we investigate how to efficiently acquire a sequence of skills, formalized as hierarchical neural options. However, existing model-free hierarchical reinforcement algorithms need a lot of data. We propose a novel method, which we call AgentOWL (Option and World model Learning Agent), that jointly learns — in a sample efficient way — an abstract world model (abstracting […]

Ver mais

Like 0

Liked Liked

technocracy

DART: aDaptive Accept RejecT for non-linear top-K subset identification

digitado ⋅ 16 de February de 2026

arXiv:2011.07687v2 Announce Type: replace-cross Abstract: We consider the bandit problem of selecting $K$ out of $N$ arms at each time step. The reward can be a non-linear function of the rewards of the selected individual arms. The direct use of a multi-armed bandit algorithm requires choosing among $binom{N}{K}$ options, making the action space large. To simplify the problem, existing works on combinatorial bandits {typically} assume feedback as a linear function of individual rewards. In this paper, we prove […]

Ver mais

Like 0

Liked Liked

technocracy

Scalable Uncertainty Quantification for Black-Box Density-Based Clustering

digitado ⋅ 4 de March de 2026

arXiv:2603.03188v1 Announce Type: new Abstract: We introduce a novel framework for uncertainty quantification in clustering. By combining the martingale posterior paradigm with density-based clustering, uncertainty in the estimated density is naturally propagated to the clustering structure. The approach scales effectively to high-dimensional and irregularly shaped data by leveraging modern neural density estimators and GPU-friendly parallel computation. We establish frequentist consistency guarantees and validate the methodology on synthetic and real data.

Ver mais

Like 0

Liked Liked