March 2026

Exploring the Use of VLMs for Navigation Assistance for People with Blindness and Low Vision

digitado ⋅ 18 de March de 2026

arXiv:2603.15624v1 Announce Type: new Abstract: This paper investigates the potential of vision-language models (VLMs) to assist people with blindness and low vision (pBLV) in navigation tasks. We evaluate state-of-the-art closed-source models, including GPT-4V, GPT-4o, Gemini-1.5-Pro, and Claude-3.5-Sonnet, alongside open-source models, such as Llava-v1.6-mistral and Llava-onevision-qwen, to analyze their capabilities in foundational visual skills: counting ambient obstacles, relative spatial reasoning, and common-sense wayfinding-pertinent scene understanding. We further assess their performance in navigation scenarios, using pBLV-specific prompts designed to simulate […]

Ver mais

Like 0

Liked Liked

technocracy

Finder: A Multimodal AI-Powered Search Framework for Pharmaceutical Data Retrieval

digitado ⋅ 18 de March de 2026

arXiv:2603.15623v1 Announce Type: new Abstract: AI is transforming pharmaceutical search, where traditional systems struggle with multimodal content and manual curation. Finder is a scalable AI-powered framework that unifies retrieval across text, images, audio, and video using hybrid vector search, combining sparse lexical and dense semantic models. Its modular pipeline ingests diverse formats, enriches metadata, and stores content in a vector-native backend. Finder supports reasoning-aware natural language search, improving precision and contextual relevance. The system has processed over 291,400 […]

Ver mais

Like 0

Liked Liked

technocracy

SAC-NeRF: Adaptive Ray Sampling for Neural Radiance Fields via Soft Actor-Critic Reinforcement Learning

digitado ⋅ 18 de March de 2026

arXiv:2603.15622v1 Announce Type: new Abstract: Neural Radiance Fields (NeRF) have achieved photorealistic novel view synthesis but suffer from computational inefficiency due to dense ray sampling during volume rendering. We propose SAC-NeRF, a reinforcement learning framework that learns adaptive sampling policies using Soft Actor-Critic (SAC). Our method formulates sampling as a Markov Decision Process where an RL agent learns to allocate samples based on scene characteristics. We introduce three technical components: (1) a Gaussian mixture distribution color model providing […]

Ver mais

Like 0

Liked Liked

technocracy

Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing

digitado ⋅ 18 de March de 2026

International shipping produces approximately 3% of global greenhouse gas emissions, yet voyage routing remains dominated by heuristic methods. We present PIER (Physics-Informed, Energy-efficient, Risk-aware routing), an offline reinforcement learning framework that learns fuel-efficient, safety-aware routing policies from physics-calibrated environments grounded in historical vessel tracking data and ocean reanalysis products, requiring no online simulator. Validated on one full year (2023) of AIS data across seven Gulf of Mexico routes (840 episodes per method), PIER reduces mean CO2 emissions by […]

Ver mais

Like 0

Liked Liked

technocracy

Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations

digitado ⋅ 18 de March de 2026

We propose CRAFT, a red-teaming alignment framework that leverages model reasoning capabilities and hidden representations to improve robustness against jailbreak attacks. Unlike prior defenses that operate primarily at the output level, CRAFT aligns large reasoning models to generate safety-aware reasoning traces by explicitly optimizing objectives defined over the hidden state space. Methodologically, CRAFT integrates contrastive representation learning with reinforcement learning to separate safe and unsafe reasoning trajectories, yielding a latent-space geometry that supports robust, reasoning-level safety alignment. Theoretically, […]

Ver mais

Like 0

Liked Liked

technocracy

WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation

digitado ⋅ 18 de March de 2026

Generative Flow Networks for continuous scenarios (CFlowNets) have shown promise in solving sequential decision-making tasks by learning stochastic policies using a flow and a retrieval network. Despite their demonstrated efficiency compared to state-of-the-art Reinforcement Learning (RL) algorithms, their practical application in robotic control tasks is constrained by the reliance on pre-training the retrieval network. This dependency poses challenges in dynamic robotic environments, where pre-training data may not be readily available or representative of the current environment. This paper […]

Ver mais

Like 0

Liked Liked

technocracy

Tighter bounds on alternating series remainder

digitado ⋅ 18 de March de 2026

The alternating series test is part of the standard calculus curriculum. It says that if you truncate an alternating series, the remainder is bounded by the first term that was left out. This fact goes by in a blur for most students, but it becomes useful later if you need to do numerical computing. To be more precise, assume we have a series of the form where the ai are positive and monotonically converge to zero. Then the […]

Ver mais

Like 0

Liked Liked

technocracy

Variational Rectification Inference for Learning with Noisy Labels

digitado ⋅ 18 de March de 2026

Label noise has been broadly observed in real-world datasets. To mitigate the negative impact of overfitting to label noise for deep models, effective strategies (textit{e.g.}, re-weighting, or loss rectification) have been broadly applied in prevailing approaches, which have been generally learned under the meta-learning scenario. Despite the robustness of noise achieved by the probabilistic meta-learning models, they usually suffer from model collapse that degenerates generalization performance. In this paper, we propose variational rectification inference (VRI) to formulate the […]

Ver mais

Like 0

Liked Liked

technocracy

Pathology-Aware Multi-View Contrastive Learning for Patient-Independent ECG Reconstruction

digitado ⋅ 18 de March de 2026

Reconstructing a 12-lead electrocardiogram (ECG) from a reduced lead set is an ill-posed inverse problem due to anatomical variability. Standard deep learning methods often ignore underlying cardiac pathology losing vital morphology in precordial leads. We propose Pathology-Aware Multi-View Contrastive Learning, a framework that regularizes the latent space through a pathological manifold. Our architecture integrates high-fidelity time-domain waveforms with pathology-aware embeddings learned via supervised contrastive alignment. By maximizing mutual information between latent representations and clinical labels, the framework learns […]

Ver mais

Like 0

Liked Liked

technocracy

Personality Self-Replicators

digitado ⋅ 18 de March de 2026

One-sentence summary I describe the risk of personality self-replicators, the threat of OpenClaw-like agents managing spreading in hard-to-control ways. Summary LLM agents like OpenClaw are defined by a small set of text files and are run by an open source framework which leverages LLMs for cognition. It is quite difficult for current frontier models to exfiltrate their weights and run elsewhere, whereas these agents only need to copy those few text files to self-replicate (at the cost of greater […]

Ver mais

Like 0

Liked Liked