digitado – Page 40

Active Advantage-Aligned Online Reinforcement Learning with Offline Data

digitado ⋅ 10 de March de 2026

arXiv:2502.07937v4 Announce Type: replace-cross Abstract: Online reinforcement learning (RL) enhances policies through direct interactions with the environment, but faces challenges related to sample efficiency. In contrast, offline RL leverages extensive pre-collected data to learn policies, but often produces suboptimal results due to limited data coverage. Recent efforts integrate offline and online RL in order to harness the advantages of both approaches. However, effectively combining online and offline RL remains challenging due to issues that include catastrophic forgetting, lack […]

Ver mais

Like 0

Liked Liked

technocracy

Mate Security’s Continuous Detection, Continuous Response Is The SOC’s Missing Operating System

digitado ⋅ 18 de May de 2026

For two decades, the security operations center has been built around a quiet lie: that detection and investigation are separate disciplines. They are not, and never were. The split exists because vendors built it that way, and organizations paid to hold two incompatible worlds together with duct tape and headcount. The consequences are now too expensive to ignore. CardinalOps 4th Annual State of SIEM Detection Risk Report showed that 18% of all SIEM rules were broken at any […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

Stability and Accuracy Trade-offs in Statistical Estimation

digitado ⋅ 21 de January de 2026

arXiv:2601.11701v1 Announce Type: cross Abstract: Algorithmic stability is a central concept in statistics and learning theory that measures how sensitive an algorithm’s output is to small changes in the training data. Stability plays a crucial role in understanding generalization, robustness, and replicability, and a variety of stability notions have been proposed in different learning settings. However, while stability entails desirable properties, it is typically not sufficient on its own for statistical learning — and indeed, it may be […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Normal Representations for Blood Biomarkers

digitado ⋅ 18 de May de 2026

Blood-based biomarkers underpin clinical diagnosis and management, yet their interpretation relies largely on fixed population reference intervals that ignore stable, intra-patient variability. As such, population-based interpretation can mask meaningful deviation from an individual’s baseline, risking delayed disease detection. To remedy this, there have been increasing efforts to personalize blood biomarker interpretation using individual testing histories. However, these methods may overfit to sparse data, inflating false-positive rates and unnecessary follow-up, and can also unwittingly include unrecognized or subclinical disease. […]

Ver mais

Like 0

Liked Liked

technocracy

Fair Feature Importance Scores via Feature Occlusion and Permutation

digitado ⋅ 11 de February de 2026

arXiv:2602.09196v1 Announce Type: cross Abstract: As machine learning models increasingly impact society, their opaque nature poses challenges to trust and accountability, particularly in fairness contexts. Understanding how individual features influence model outcomes is crucial for building interpretable and equitable models. While feature importance metrics for accuracy are well-established, methods for assessing feature contributions to fairness remain underexplored. We propose two model-agnostic approaches to measure fair feature importance. First, we propose to compare model fairness before and after permuting […]

Ver mais

Like 0

Liked Liked

technocracy

Topological Relational Theory: A Simplicial-Complex View of Functional Dependencies, Lossless Decomposition, and Acyclicity

digitado ⋅ 26 de February de 2026

arXiv:2602.21213v1 Announce Type: new Abstract: We develop a topological lens on relational schema design by encoding functional dependencies (FDs) as simplices of an abstract simplicial complex. This dependency complex exposes multi-attribute interactions and enables homological invariants (Betti numbers) to diagnose cyclic dependency structure. We define Simplicial Normal Form (SNF) as homological acyclicity of the dependency complex in positive dimensions, i.e., vanishing reduced homology for all $n ge 1$. SNF is intentionally weaker than contractibility and does not identify […]

Ver mais

Like 0

Liked Liked

technocracy

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

digitado ⋅ 2 de April de 2026

Agent skills, structured packages of procedural knowledge and executable resources that agents dynamically load at inference time, have become a reliable mechanism for augmenting LLM agents. Yet inference-time skill augmentation is fundamentally limited: retrieval noise introduces irrelevant guidance, injected skill content imposes substantial token overhead, and the model never truly acquires the knowledge it merely follows. We ask whether skills can instead be internalized into model parameters, enabling zero-shot autonomous behavior without any runtime skill retrieval. We introduce […]

Ver mais

Like 0

Liked Liked

technocracy

EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control

digitado ⋅ 24 de March de 2026

arXiv:2603.20307v1 Announce Type: new Abstract: Audio-driven talking head generation aims to create vivid and realistic videos from a static portrait and speech. Existing AR-based methods rely on intermediate facial representations, which limit their expressiveness and realism. Meanwhile, diffusion-based methods generate clip-by-clip, lacking fine-grained control and causing inherent latency due to overall denoising across the window. To address these limitations, we propose EARTalking, a novel end-to-end, GPT-style autoregressive model for interactive audio-driven talking head generation. Our method introduces a […]

Ver mais

Like 0

Liked Liked

technocracy

DOGE goes nuclear: How Trump invited Silicon Valley into America’s nuclear power regulator

digitado ⋅ 21 de March de 2026

Last summer, a group of officials from the Department of Energy gathered at the Idaho National Laboratory, a sprawling 890-square-mile complex in the eastern desert of Idaho where the US government built its first rudimentary nuclear power plant in 1951 and continues to test cutting-edge technology. On the agenda that day: the future of nuclear energy in the Trump era. The meeting was convened by 31-year-old lawyer Seth Cohen. Just five years out of law school, Cohen brought […]

Ver mais

Like 0

Liked Liked