digitado – Page 459

I trained a DQN agent to solve drone intercept cost optimization — here’s what it figured out on its own

digitado ⋅ 2 de April de 2026

Built a drone interception environment from scratch in Pygame — no OpenAI Gym dependency. State vector is 10-dimensional, tracking 2 nearest drones with angle error, predicted position 15 steps ahead, distance, and vertical speed. Reward structure is where it gets interesting: Hit: +10 Building destroyed: -20 Shot fired: -0.5 Drone escaped: -5 The -0.5 firing penalty forces the agent to learn ammo conservation. What emerged: under low swarm density it fires aggressively, under high density it becomes selective. […]

Ver mais

Like 0

Liked Liked

technocracy

Attention-Guided Flow-Matching for Sparse 3D Geological Generation

digitado ⋅ 15 de April de 2026

arXiv:2604.09700v1 Announce Type: new Abstract: Constructing high-resolution 3D geological models from sparse 1D borehole and 2D surface data is a highly ill-posed inverse problem. Traditional heuristic and implicit modeling methods fundamentally fail to capture non-linear topological discontinuities under extreme sparsity, often yielding unrealistic artifacts. Furthermore, while deep generative architectures like Diffusion Models have revolutionized continuous domains, they suffer from severe representation collapse when conditioned on sparse categorical grids. To bridge this gap, we propose 3D-GeoFlow, the first Attention-Guided […]

Ver mais

Like 0

Liked Liked

technocracy

Functional Consciousness: A Proxy Metric Using Self-Models

digitado ⋅ 21 de April de 2026

This paper proposes Functional Consciousness (FC) as a measurable architectural property: the observable capacity of a system to access and reason about internal representations of its own states. We introduce a computationally tractable metric on FC that operationalizes core tenets of major consciousness theories through self-models and their associated reasoning power, measured through informational richness and state-space expansion under inference. The resulting Functional Consciousness Score (FCS) is applied to benchmark systems with known internal structure, including a Waymo […]

Ver mais

Like 0

Liked Liked

technocracy

Physics-constrained Gaussian Processes for Predicting Shockwave Hugoniot Curves

digitado ⋅ 13 de January de 2026

arXiv:2601.06655v1 Announce Type: cross Abstract: A physics-constrained Gaussian Process regression framework is developed for predicting shocked material states along the Hugoniot curve using data from a small number of shockwave simulations. The proposed Gaussian process employs a probabilistic Taylor series expansion in conjunction with the Rankine-Hugoniot jump conditions between the various shocked material states to construct a thermodynamically consistent covariance function. This leads to the formulation of an optimization problem over a small number of interpretable hyperparameters and […]

Ver mais

Like 0

Liked Liked

technocracy

Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning

digitado ⋅ 9 de April de 2026

Offline multi-agent reinforcement learning (MARL) aims to learn the optimal joint policy from pre-collected datasets, requiring a trade-off between maximizing global returns and mitigating distribution shift from offline data. Recent studies use diffusion or flow generative models to capture complex joint policy behaviors among agents; however, they typically rely on multi-step iterative sampling, thereby reducing training and inference efficiency. Although further research improves sampling efficiency through methods like distillation, it remains sensitive to the behavior regularization coefficient. To […]

Ver mais

Like 0

Liked Liked

technocracy

Governing frontier general-purpose AI in the public sector: adaptive risk management and policy capacity under uncertainty through 2030

digitado ⋅ 9 de April de 2026

arXiv:2604.06215v1 Announce Type: new Abstract: The governance of frontier general-purpose artificial intelligence has become a public-sector problem of institutional design, not merely a technical issue of model performance. Recent evidence indicates that AI capabilities are advancing rapidly, though unevenly, while knowledge about harms, safeguards, and effective interventions remains partial and lagged. This combination creates a difficult policy condition: governments must decide under uncertainty, across multiple plausible trajectories of progress through 2030, and in environments where adoption outcomes depend […]

Ver mais

Like 0

Liked Liked

technocracy

DeEscalWild: A Real-World Benchmark for Automated De-Escalation Training with SLMs

digitado ⋅ 16 de April de 2026

arXiv:2604.13075v1 Announce Type: new Abstract: Effective de-escalation is critical for law enforcement safety and community trust, yet traditional training methods lack scalability and realism. While Large Language Models (LLMs) enable dynamic, open-ended simulations, their substantial computational footprint renders them impractical for deployment on the lightweight, portable hardware required for immersive field training. Small Language Models (SLMs) offer a viable real-time alternative but suffer from a critical scarcity of high-quality, domain-specific training data. To bridge this gap, we present […]

Ver mais

Like 0

Liked Liked

technocracy

A Sled Dog’s Final Loyalty

digitado ⋅ 16 de March de 2026

:::info Astounding Stories of Super-Science July, 2008, by Astounding Stories is part of HackerNoon’s Book Blog Post series. You can jump to any chapter in this book here. The Call of the Wild – Who Has Won to Mastership Astounding Stories of Super-Science July 2008: The Call of the Wild – Who Has Won to Mastership By Jack London ::: “Eh? Wot I say? I spik true w’en I say dat Buck two devils.” This was François’s speech next […]

Ver mais

Like 0

Liked Liked

technocracy

General Convex Agreement with Near-Optimal Communication

digitado ⋅ 26 de February de 2026

arXiv:2602.21411v1 Announce Type: new Abstract: Convex Agreement (CA) strengthens Byzantine Agreement (BA) by requiring the output agreed upon to lie in the convex hull of the honest parties’ inputs. This validity condition is motivated by practical aggregation tasks (e.g., robust learning or sensor fusion) where honest inputs need not coincide but should still constrain the decision. CA inherits BA lower bounds, and optimal synchronous round complexity is easy to obtain (e.g., via Byzantine Broadcast). The main challenge is […]

Ver mais

Like 0

Liked Liked

technocracy

Neuro-Oracle: A Trajectory-Aware Agentic RAG Framework for Interpretable Epilepsy Surgical Prognosis

digitado ⋅ 18 de April de 2026

arXiv:2604.14216v1 Announce Type: new Abstract: Predicting post-surgical seizure outcomes in pharmacoresistant epilepsy is a clinical challenge. Conventional deep-learning approaches operate on static, single-timepoint pre-operative scans, omitting longitudinal morphological changes. We propose emph{Neuro-Oracle}, a three-stage framework that: (i) distils pre-to-post-operative MRI changes into a compact 512-dimensional trajectory vector using a 3D Siamese contrastive encoder; (ii) retrieves historically similar surgical trajectories from a population archive via nearest-neighbour search; and (iii) synthesises a natural-language prognosis grounded in the retrieved evidence using […]

Ver mais

Like 0

Liked Liked