digitado – Page 239

DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns

digitado ⋅ 19 March 2026

arXiv:2603.16969v1 (announce type: new). Abstract: This paper presents DeepStage, a deep reinforcement learning (DRL) framework for adaptive, stage-aware defense against Advanced Persistent Threats (APTs). The enterprise environment is modeled as a partially observable Markov decision process (POMDP), where host provenance and network telemetry are fused into unified provenance graphs. Building on our prior work, StageFinder, a graph neural encoder and an LSTM-based stage estimator infer probabilistic attacker stages aligned with the MITRE ATT&CK framework. These stage beliefs, combined […]

technocracy

Diffusion-based Generative Machine Learning Model for Predicting Crack Propagation in Aluminum Nitride at the Atomic Scale

digitado ⋅ 13 March 2026

Predicting atomic-scale crack propagation in aluminum nitride (AlN) is critical for semiconductor reliability but remains prohibitively expensive via molecular dynamics (MD). We develop a diffusion-based generative machine learning model that predicts this propagation by conditioning solely on initial microstructure embeddings. Trained on MD simulations of single-crack systems, the model achieves a significant speedup while accurately forecasting dynamic fracture processes, including stress-driven crack initiation, crack branching, and atomic-scale bridging ligaments. Crucially, it […]

technocracy

Learning to Predict, Discover, and Reason in High-Dimensional Discrete Event Sequences

digitado ⋅ 17 March 2026

Electronic control units (ECUs) embedded within modern vehicles generate a large number of asynchronous events known as diagnostic trouble codes (DTCs). These discrete events form complex temporal sequences that reflect the evolving health of the vehicle’s subsystems. In the automotive industry, domain experts manually group these codes into higher-level error patterns (EPs) using Boolean rules to characterize system faults and ensure safety. However, as vehicle complexity grows, this manual process becomes increasingly costly, error-prone, and difficult to scale. […]

technocracy

Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning

digitado ⋅ 8 January 2026

arXiv:2601.03320v1 (announce type: new). Abstract: On-policy reinforcement learning (RL), particularly Proximal Policy Optimization (PPO) and Group Relative Policy Optimization (GRPO), has become the dominant paradigm for fine-tuning large language models (LLMs). While policy ratio clipping stabilizes training, this heuristic hard constraint incurs a fundamental cost: it indiscriminately truncates gradients from high-return yet high-divergence actions, suppressing rare but highly informative “eureka moments” in complex reasoning. Moreover, once data becomes slightly stale, hard clipping renders it unusable, leading to severe […]
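
The gradient truncation the abstract describes can be seen in the standard PPO clipped surrogate. The sketch below is a minimal illustration of that baseline only, not the method proposed in the paper; the function names and the clip range `eps=0.2` are assumptions chosen for the example.

```python
def ppo_clip_surrogate(ratio, advantage, eps=0.2):
    """Per-sample PPO clipped surrogate: min(r * A, clip(r, 1 - eps, 1 + eps) * A).

    `eps=0.2` is an assumed, commonly used clip range, not a value from the paper.
    """
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped_ratio * advantage)


def surrogate_grad_wrt_ratio(ratio, advantage, eps=0.2, h=1e-6):
    """Central finite-difference estimate of d(surrogate) / d(ratio)."""
    upper = ppo_clip_surrogate(ratio + h, advantage, eps)
    lower = ppo_clip_surrogate(ratio - h, advantage, eps)
    return (upper - lower) / (2.0 * h)


if __name__ == "__main__":
    # Inside the clip range the gradient equals the advantage.
    print(surrogate_grad_wrt_ratio(ratio=1.0, advantage=2.0))
    # A high-return action whose ratio has drifted above 1 + eps gets no gradient.
    print(surrogate_grad_wrt_ratio(ratio=1.5, advantage=2.0))
    # A negative-advantage action with the same ratio is still penalized.
    print(surrogate_grad_wrt_ratio(ratio=1.5, advantage=-2.0))
```

In this sketch a positive-advantage sample with a ratio above `1 + eps` lands on the flat part of the surrogate, so it contributes nothing to the update regardless of how large its return is.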

technocracy

Fighting for the health of the planet with AI

digitado ⋅ 22 December 2025

For Priya Donti, childhood trips to India were more than an opportunity to visit extended family. The biennial journeys activated in her a motivation that continues to shape her research and her teaching. Contrasting her family home in Massachusetts, Donti — now the Silverman Family Career Development Professor in the MIT Department of Electrical Engineering and Computer Science (EECS) and a principal investigator at the MIT Laboratory for Information and Decision Systems — was struck by the disparities […]

technocracy

Variational Learning of Fractional Posteriors

digitado ⋅ 29 March 2026

We introduce a novel one-parameter variational objective that lower bounds the data evidence and enables the estimation of approximate fractional posteriors. We extend this framework to hierarchical construction and Bayes posteriors, offering a versatile tool for probabilistic modelling. We demonstrate two cases where gradients can be obtained analytically and a simulation study on mixture models showing that our fractional posteriors can be used to achieve better calibration compared to posteriors from the conventional variational bound. When applied to […]

technocracy

Public transport challenges and technology-assisted accessibility for visually impaired elderly residents in urban environments

digitado ⋅ 23 January 2026

arXiv:2601.15291v1 (announce type: new). Abstract: Independent navigation is a core aspect of maintaining social participation and individual health for vulnerable populations. While historic cities such as Edinburgh, as the capital of Scotland, often feature well-established public transport systems, urban accessibility challenges remain and are exacerbated by a complex landscape, especially for groups with multiple vulnerabilities such as the blind elderly. With limited research examining how real-time data feeds and developments in artificial intelligence can enhance navigation aids, we […]

technocracy

DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants

digitado ⋅ 9 March 2026

arXiv:2512.00252v3 (announce type: replace). Abstract: Data assimilation (DA) is a cornerstone of scientific and engineering applications, combining model forecasts with sparse and noisy observations to estimate latent system states. Classical high-dimensional DA methods, such as the ensemble Kalman filter, rely on Gaussian approximations that are violated for complex dynamics or observation operators. To address this limitation, we introduce DAISI, a scalable filtering algorithm built on flow-based generative models that enables flexible probabilistic inference using data-driven priors. The core […]

technocracy

BIASEDTALES-ML: A Multilingual Dataset for Analyzing Narrative Attribute Distributions in LLM-Generated Stories

digitado ⋅ 21 April 2026

arXiv:2604.17008v1 (announce type: new). Abstract: Large Language Models (LLMs) are increasingly used to generate narrative content, including children’s stories, which play an important role in social and cultural learning. Despite growing interest in AI safety and alignment, most existing evaluations focus primarily on English, leaving the cross-lingual generalization of aligned behavior underexplored. In this work, we introduce BiasedTales-ML, a large-scale parallel corpus of approximately 350,000 children’s stories generated across eight typologically and culturally diverse languages using a full-permutation […]

technocracy

A Technical Report on the Second Place Solution for the CIKM 2025 AnalytiCup Competition

digitado ⋅ 12 January 2026

arXiv:2601.05259v1 (announce type: new). Abstract: In this work, we address the challenge of multilingual category relevance judgment in e-commerce search, where traditional ensemble-based systems improve accuracy but at the cost of heavy training, inference, and maintenance complexity. To overcome this limitation, we propose a simplified yet effective framework that leverages prompt engineering with Chain-of-Thought task decomposition to guide reasoning within a single large language model. Specifically, our approach decomposes the relevance judgment process into four interpretable subtasks: translation, […]
