April 2026

From Public-Key Linting to Operational Post-Quantum X.509 Assurance for ML-KEM and ML-DSA: Registry-Driven Policy, Mutation-Based Evaluation, and Import Validation

digitado ⋅ 21 de April de 2026

arXiv:2604.17003v1 Announce Type: new Abstract: Final FIPS and PKIX standards for ML-KEM and ML-DSA fix the normative floor, but operational assurance in post-quantum X.509 still depends on accountable checks across certificate-profile semantics, SubjectPublicKeyInfo representation, and private-key-container import. We present a workflow-centric assurance framework for ML-KEM and ML-DSA in the narrow executable profile pkix-core. The framework reifies 17 final-standards requirements into an assurance registry indexed by owner, stage, detector kind, normative strength, and mode-specific action; groups them into three […]

Ver mais

Like 0

Liked Liked

technocracy

Intelligent Drill-Down: Large Language Model-Driven Drill-Down Technique for Human-AI Collaborative Visual Exploration

digitado ⋅ 21 de April de 2026

arXiv:2604.17002v1 Announce Type: new Abstract: In visual analytics, applying filters to drill-down and extract higher-value insights is a common and important data analysis method. When the drill-down space becomes excessively large, analysts may lose orientation, leading to decreased efficiency in the drill-down process. To tackle these challenges, we propose the Intelligent Drill-Down Framework, in which a large language model (LLM) facilitates the generation of visual insights, leverages user interaction data to interpret user intent, and generates appropriate drill-down […]

Ver mais

Like 0

Liked Liked

technocracy

Inductive Convolution Nuclear Norm Minimization for Tensor Completion with Arbitrary Sampling

digitado ⋅ 21 de April de 2026

arXiv:2604.17001v1 Announce Type: new Abstract: The recently established Convolution Nuclear Norm Minimization (CNNM) addresses the problem of textit{tensor completion with arbitrary sampling} (TCAS), which involves restoring a tensor from a subset of its entries sampled in an arbitrary manner. Despite its promising performance, the optimization procedure of CNNM needs performing Singular Value Decomposition (SVD) multiple times, which is computationally expensive and hard to parallelize. To address the issue, we reformulate the optimization objective of CNNM from the perspective […]

Ver mais

Like 0

Liked Liked

technocracy

SPS: Steering Probability Squeezing for Better Exploration in Reinforcement Learning for Large Language Models

digitado ⋅ 21 de April de 2026

arXiv:2604.16995v1 Announce Type: new Abstract: Reinforcement learning (RL) has emerged as a promising paradigm for training reasoning-oriented models by leveraging rule-based reward signals. However, RL training typically tends to improve single-sample success rates (i.e., Pass@1) while offering limited exploration of diverse reasoning trajectories, which is crucial for multi-sample performance (i.e., Pass@k). Our preliminary analysis reveals that this limitation stems from a fundamental squeezing effect, whereby probability mass is excessively concentrated on a narrow subset of high-reward trajectories, restricting […]

Ver mais

Like 0

Liked Liked

technocracy

Rule-VLN: Bridging Perception and Compliance via Semantic Reasoning and Geometric Rectification

digitado ⋅ 21 de April de 2026

arXiv:2604.16993v1 Announce Type: new Abstract: As embodied AI transitions to real-world deployment, the success of the Vision-and-Language Navigation (VLN) task tends to evolve from mere reachability to social compliance. However, current agents suffer from a “goal-driven trap”, prioritizing physical geometry (“can I go?”) over semantic rules (“may I go?”), frequently overlooking subtle regulatory constraints. To bridge this gap, we establish Rule-VLN, the first large-scale urban benchmark for rule-compliant navigation. Spanning a massive 29k-node environment, it injects 177 diverse […]

Ver mais

Like 0

Liked Liked

technocracy

Bolzano: Case Studies in LLM-Assisted Mathematical Research

digitado ⋅ 21 de April de 2026

arXiv:2604.16989v1 Announce Type: new Abstract: We report new results on six problems in mathematics and theoretical computer science, produced with the assistance of Bolzano, an open-source multi-agent LLM system. Bolzano orchestrates rounds of interaction between parallel prover agents and a verifier agent while maintaining a persistent knowledge base that is carried across rounds. Classified using the significance-autonomy taxonomy of Feng et al., four of the six results reach the level of publishable research, and three of the six […]

Ver mais

Like 0

Liked Liked

technocracy

In-Context Learning Under Regime Change

digitado ⋅ 21 de April de 2026

arXiv:2604.16988v1 Announce Type: new Abstract: Non-stationary sequences arise naturally in control, forecasting, and decision-making. The data-generating process shifts at unknown times, and models must detect the change, discard or downweight obsolete evidence, and adapt to new dynamics on the fly. Transformer-based foundation models increasingly rely on in-context learning for time series forecasting, tabular prediction, and continuous control. As these models are deployed in non-stationary environments, understanding their ability to detect and adapt to regime shifts is important. We […]

Ver mais

Like 0

Liked Liked

technocracy

DVAR: Adversarial Multi-Agent Debate for Video Authenticity Detection

digitado ⋅ 21 de April de 2026

arXiv:2604.16987v1 Announce Type: new Abstract: The rapid evolution of video generation technologies poses a significant challenge to media forensics, as conventional detection methods often fail to generalize beyond their training distributions. To address this, we propose DVAR (Debate-based Video Authenticity Reasoning), a training-free framework that reformulates video detection as a structured multi-agent forensic reasoning process. Moving beyond the paradigm of pattern matching, DVAR orchestrates a competition between a Generative Hypothesis Agent and a Natural Mechanism Agent. Through iterative […]

Ver mais

Like 0

Liked Liked

technocracy

Shift schema drift left: policy-aware compile-time contracts for typed JVM and Spark pipelines

digitado ⋅ 21 de April de 2026

arXiv:2604.16986v1 Announce Type: new Abstract: Schema drift in data pipelines is often caught only when a job touches real data. Typed-Dataset layers close part of this gap but require wholesale adoption; table-level enforcement systems close another part but operate at write time against a stored schema. We present a small Scala 3 framework that occupies the seam: it proves producer-to-contract structural compatibility under explicit policies at compile time, derives Spark schemas from the same contract types, and re-checks […]

Ver mais

Like 0

Liked Liked

technocracy

Adverse-to-the-eXtreme Panoptic Segmentation: URVIS 2026 Study and Benchmark

digitado ⋅ 21 de April de 2026

arXiv:2604.16984v1 Announce Type: new Abstract: This paper presents the report of the URVIS 2026 challenge on adverse-to-extreme panoptic segmentation. As the first challenge of its kind, it attracted 17 registered participants and 47 submissions, with 4 teams reaching the final phase. The challenge is based on the MUSES dataset, a multi-sensor benchmark for panoptic segmentation in adverse-to-extreme weather, including RGB frame camera, LiDAR, radar, and event camera data. Weighted Panoptic Quality (wPQ) is designed and adopted as the […]

Ver mais

Like 0

Liked Liked