digitado – Page 33

CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction

digitado ⋅ 3 de March de 2026

arXiv:2603.00610v1 Announce Type: new Abstract: While music generation models have evolved to handle complex multimodal inputs mixing text, lyrics, and reference audio, evaluation mechanisms have lagged behind. In this paper, we bridge this critical gap by establishing a comprehensive ecosystem for music reward modeling under Compositional Multimodal Instruction (CMI), where the generated music may be conditioned on text descriptions, lyrics, and audio prompts. We first introduce CMI-Pref-Pseudo, a large-scale preference dataset comprising 110k pseudo-labeled samples, and CMI-Pref, a […]

Ver mais

Like 0

Liked Liked

technocracy

GMA-SAWGAN-GP: A Novel Data Generative Framework to Enhance IDS Detection Performance

digitado ⋅ 1 de April de 2026

arXiv:2603.28838v1 Announce Type: new Abstract: Intrusion Detection System (IDS) is often calibrated to known attacks and generalizes poorly to unknown threats. This paper proposes GMA-SAWGAN-GP, a novel generative augmentation framework built on a Self-Attention-enhanced Wasserstein GAN with Gradient Penalty (WGAN-GP). The generator employs Gumbel-Softmax regularization to model discrete fields, while a Multilayer Perceptron (MLP)-based AutoEncoder acts as a manifold regularizer. A lightweight gating network adaptively balances adversarial and reconstruction losses via entropy regularization, improving stability and mitigating mode […]

Ver mais

Like 0

Liked Liked

technocracy

Automated Residual Plot Assessment With the R Package autovi and the Shiny Application autovi.web

digitado ⋅ 24 de June de 2026

arXiv:2606.24236v1 Announce Type: new Abstract: Visual assessment of residual plots is a common approach for diagnosing linear models, but it relies on manual evaluation, which does not scale well and can lead to inconsistent decisions across analysts. The lineup protocol, which embeds the observed plot among null plots, can reduce subjectivity but requires even more human effort. In today’s data-driven world, such tasks are well suited for automation. We present a new R package that uses a computer […]

Ver mais

Like 0

Liked Liked

technocracy

Just look at Ayaneo’s absolute unit of a Windows gaming “handheld”

digitado ⋅ 9 de February de 2026

In 2023, we marveled at the sheer mass of Lenovo’s Legion Go, a 1.88-pound, 11.8-inch-wide monstrosity of a Windows gaming handheld. In 2026, though, Ayaneo unveiled details of its Next II handheld, which puts Lenovo’s big boy to shame while also offering heftier specs and a higher price than most other Windows gaming handhelds. Let’s focus on the bulk first. The Ayaneo Next II weighs in at a truly wrist-straining 3.14 pounds, making it more than twice as […]

Ver mais

Like 0

Liked Liked

technocracy

Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators

digitado ⋅ 4 de February de 2026

arXiv:2602.03730v1 Announce Type: new Abstract: Generative models trained using self-supervision of tokenized electronic health record (EHR) timelines show promise for clinical outcome prediction. This is typically done using Monte Carlo simulation for future patient trajectories. However, existing approaches suffer from three key limitations: sparse estimate distributions that poorly differentiate patient risk levels, extreme computational costs, and high sampling variance. We propose two new estimators: the Sum of Conditional Outcome Probability Estimator (SCOPE) and Risk Estimation from Anticipated Conditional […]

Ver mais

Like 0

Liked Liked

technocracy

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network

digitado ⋅ 10 de February de 2026

arXiv:2602.07667v1 Announce Type: cross Abstract: Autonomous AI agents are beginning to populate social platforms, but it is still unclear whether they can sustain the back-and-forth needed for extended coordination. We study Moltbook, an AI-agent social network, using a first-week snapshot and introduce interaction half-life: how quickly a comment’s chance of receiving a direct reply fades as the comment ages. Across tens of thousands of commented threads, Moltbook discussions are dominated by first-layer reactions rather than extended chains. Most […]

Ver mais

Like 0

Liked Liked

technocracy

Evaluating Robustness and Adaptability in Learning-Based Mission Planning for Active Debris Removal

digitado ⋅ 4 de February de 2026

Autonomous mission planning for Active Debris Removal (ADR) must balance efficiency, adaptability, and strict feasibility constraints on fuel and mission duration. This work compares three planners for the constrained multi-debris rendezvous problem in Low Earth Orbit: a nominal Masked Proximal Policy Optimization (PPO) policy trained under fixed mission parameters, a domain-randomized Masked PPO policy trained across varying mission constraints for improved robustness, and a plain Monte Carlo Tree Search (MCTS) baseline. Evaluations are conducted in a high-fidelity orbital […]

Ver mais

Like 0

Liked Liked

technocracy

Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models

digitado ⋅ 25 de March de 2026

arXiv:2603.22331v1 Announce Type: new Abstract: Every wildfire prediction model deployed today shares a dangerous property: none of these methods provides formal guarantees on how much fire spread is missed. Despite extensive work on wildfire spread prediction using deep learning, no prior study has applied distribution-free safety guarantees to this domain, leaving evacuation planners reliant on probability thresholds with no formal assurance. We address this gap by presenting, to our knowledge, the first application of conformal risk control (CRC) […]

Ver mais

Like 0

Liked Liked

technocracy

A Contrastive Pre-trained Foundation Model for Deciphering Imaging Noisomics across Modalities

digitado ⋅ 27 de January de 2026

arXiv:2601.17047v1 Announce Type: new Abstract: Characterizing imaging noise is notoriously data-intensive and device-dependent, as modern sensors entangle physical signals with complex algorithmic artifacts. Current paradigms struggle to disentangle these factors without massive supervised datasets, often reducing noise to mere interference rather than an information resource. Here, we introduce “Noisomics”, a framework shifting the focus from suppression to systematic noise decoding via the Contrastive Pre-trained (CoP) Foundation Model. By leveraging the manifold hypothesis and synthetic noise genome, CoP employs […]

Ver mais

Like 0

Liked Liked

technocracy

The optimal age to freeze eggs is 19

digitado ⋅ 23 de February de 2026

Published on February 8, 2026 9:44 AM GMT If you’re a woman interested in preserving your fertility window beyond its natural close in your early 40s, egg freezing is one of your best options. But if you rely on your doctor to tell you when to freeze them, you will likely be doing yourself and your future prospects for a family a disservice. The female reproductive system is one of the fastest aging parts of human biology. But […]

Ver mais

Like 0

Liked Liked