March 2026

Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring

digitado ⋅ 31 de March de 2026

arXiv:2603.27389v1 Announce Type: cross Abstract: Reinforcement learning algorithms assume that observations satisfy the Markov property, yet real-world sensors frequently violate this assumption through correlated noise, latency, or partial observability. Standard performance metrics conflate Markov breakdowns with other sources of suboptimality, leaving practitioners without diagnostic tools for such violations. This paper introduces a prediction-based scoring method that quantifies non-Markovian structure in observation trajectories. A random forest first removes nonlinear Markov-compliant dynamics; ridge regression then tests whether historical observations reduce […]

Ver mais

Like 0

Liked Liked

technocracy

A Firefly Algorithm for Mixed-Variable Optimization Based on Hybrid Distance Modeling

digitado ⋅ 31 de March de 2026

arXiv:2603.26792v1 Announce Type: new Abstract: Several real-world optimization problems involve mixed-variable search spaces, where continuous, ordinal, and categorical decision variables coexist. However, most population-based metaheuristic algorithms are designed for either continuous or discrete optimization problems and do not naturally handle heterogeneous variable types. In this paper, we propose an adaptation of the Firefly Algorithm for mixed-variable optimization problems (FAmv). The proposed method relies on a modified distance-based attractiveness mechanism that integrates continuous and discrete components within a unified […]

Ver mais

Like 0

Liked Liked

technocracy

CRISP: Characterizing Relative Impact of Scholarly Publications

digitado ⋅ 31 de March de 2026

arXiv:2603.26791v1 Announce Type: new Abstract: Assessing a cited paper’s impact is typically done by analyzing its citation context in isolation within the citing paper. While this focuses on the most directly relevant text, it prevents relative comparisons across all the works a paper cites. We propose CRISP, which instead jointly ranks all cited papers within a citing paper using large language models (LLMs). To mitigate LLMs’ positional bias, we rank each list three times in a randomized order […]

Ver mais

Like 0

Liked Liked

technocracy

Elucidating the Design Space of Flow Matching for Cellular Microscopy

digitado ⋅ 31 de March de 2026

arXiv:2603.26790v1 Announce Type: new Abstract: Flow-matching generative models are increasingly used to simulate cell responses to biological perturbations. However, the design space for building such models is large and underexplored. We systematically analyse the design space of flow matching models for cell-microscopy images, finding that many popular techniques are unnecessary and can even hurt performance. We develop a simple, stable, and scalable recipe which we use to train our foundation model. We scale our model to two orders […]

Ver mais

Like 0

Liked Liked

technocracy

Confidence Matters: Uncertainty Quantification and Precision Assessment of Deep Learning-based CMR Biomarker Estimates Using Scan-rescan Data

digitado ⋅ 31 de March de 2026

arXiv:2603.26789v1 Announce Type: new Abstract: The performance of deep learning (DL) methods for the analysis of cine cardiovascular magnetic resonance (CMR) is typically assessed in terms of accuracy, overlooking precision. In this work, uncertainty estimation techniques, namely deep ensemble, test-time augmentation, and Monte Carlo dropout, are applied to a state-of-the-art DL pipeline for cardiac functional biomarker estimation, and new distribution-based metrics are proposed for the assessment of biomarker precision. The model achieved high accuracy (average Dice 87%) and […]

Ver mais

Like 0

Liked Liked

technocracy

ReMemNav: A Rethinking and Memory-Augmented Framework for Zero-Shot Object Navigation

digitado ⋅ 31 de March de 2026

arXiv:2603.26788v1 Announce Type: new Abstract: Zero-shot object navigation requires agents to locate unseen target objects in unfamiliar environments without prior maps or task-specific training which remains a significant challenge. Although recent advancements in vision-language models(VLMs) provide promising commonsense reasoning capabilities for this task, these models still suffer from spatial hallucinations, local exploration deadlocks, and a disconnect between high-level semantic intent and low-level control. In this regard, we propose a novel hierarchical navigation framework named ReMemNav, which seamlessly integrates […]

Ver mais

Like 0

Liked Liked

technocracy

Brain-Inspired Multimodal Spiking Neural Network for Image-Text Retrieval

digitado ⋅ 31 de March de 2026

arXiv:2603.26787v1 Announce Type: new Abstract: Spiking neural networks (SNNs) have recently shown strong potential in unimodal visual and textual tasks, yet building a directly trained, low-energy, and high-performance SNN for multimodal applications such as image-text retrieval (ITR) remains highly challenging. Existing artificial neural network (ANN)-based methods often pursue richer unimodal semantics using deeper and more complex architectures, while overlooking cross-modal interaction, retrieval latency, and energy efficiency. To address these limitations, we present a brain-inspired Cross-Modal Spike Fusion network […]

Ver mais

Like 0

Liked Liked

technocracy

A Step Toward Federated Pretraining of Multimodal Large Language Models

digitado ⋅ 31 de March de 2026

arXiv:2603.26786v1 Announce Type: new Abstract: The rapid evolution of Multimodal Large Language Models (MLLMs) is bottlenecked by the saturation of high-quality public data, while vast amounts of diverse multimodal data remain inaccessible in privacy-sensitive silos. Federated Learning (FL) offers a promising solution to unlock these distributed resources, but existing research focuses predominantly on fine-tuning, leaving the foundational pre-training phase largely unexplored. In this paper, we formally introduce the Federated MLLM Alignment (Fed-MA) task, a lightweight pre-training paradigm that […]

Ver mais

Like 0

Liked Liked

technocracy

HighlightBench: Benchmarking Markup-Driven Table Reasoning in Scientific Documents

digitado ⋅ 31 de March de 2026

arXiv:2603.26784v1 Announce Type: new Abstract: Visual markups such as highlights, underlines, and bold text are common in table-centric documents. Although multimodal large language models (MLLMs) have made substantial progress in document understanding, their ability to treat such cues as explicit logical directives remains under-explored. More importantly, existing evaluations cannot distinguish whether a model fails to see the markup or fails to reason with it. This creates a key blind spot in assessing markup-conditioned behavior over tables. To address […]

Ver mais

Like 0

Liked Liked

technocracy

Can We Change the Stroke Size for Easier Diffusion?

digitado ⋅ 31 de March de 2026

arXiv:2603.26783v1 Announce Type: new Abstract: Diffusion models can be challenged in the low signal-to-noise regime, where they have to make pixel-level predictions despite the presence of high noise. The geometric intuition is akin to using the finest stroke for oil painting throughout, which may be ineffective. We therefore study stroke-size control as a controlled intervention that changes the effective roughness of the supervised target, predictions and perturbations across timesteps, in an attempt to ease the low signal-to-noise challenge. […]

Ver mais

Like 0

Liked Liked