February 2026

VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training — A Chess Case Study

digitado ⋅ 20 de February de 2026

arXiv:2602.16833v1 Announce Type: new Abstract: Exploration remains a key bottleneck for reinforcement learning (RL) post-training of large language models (LLMs), where sparse feedback and large action spaces can lead to premature collapse into repetitive behaviors. We propose Verbalized Action Masking (VAM), which verbalizes an action mask in the prompt and enforces that the model outputs an action from the masked set. Building on this interface, we introduce iterative action-space pruning: if the target action is not sampled, we […]

Ver mais

Like 0

Liked Liked

technocracy

IndicJR: A Judge-Free Benchmark of Jailbreak Robustness in South Asian Languages

digitado ⋅ 20 de February de 2026

arXiv:2602.16832v1 Announce Type: new Abstract: Safety alignment of large language models (LLMs) is mostly evaluated in English and contract-bound, leaving multilingual vulnerabilities understudied. We introduce textbf{Indic Jailbreak Robustness (IJR)}, a judge-free benchmark for adversarial safety across 12 Indic and South Asian languages (2.1 Billion speakers), covering 45216 prompts in JSON (contract-bound) and Free (naturalistic) tracks. IJR reveals three patterns. (1) Contracts inflate refusals but do not stop jailbreaks: in JSON, LLaMA and Sarvam exceed 0.92 JSR, and in […]

Ver mais

Like 0

Liked Liked

technocracy

Low-Thrust Trajectory Optimization for Cubesat Lunar Mission: HORYU-VI

digitado ⋅ 20 de February de 2026

arXiv:2602.16831v1 Announce Type: new Abstract: This paper presents a low-thrust trajectory optimization strategy to achieve a near-circular lunar orbit for a CubeSat injected into a lunar flyby trajectory. The 12U CubeSat HORYU-VI is equipped with four Hall-effect thrusters and designed as a secondary payload on NASA’s Space Launch System under the Artemis program. Upon release, the spacecraft gains sufficient energy to escape the Earth-Moon system after a lunar flyby. The proposed trajectory is decomposed into three phases: (1) […]

Ver mais

Like 0

Liked Liked

technocracy

Learning under noisy supervision is governed by a feedback-truth gap

digitado ⋅ 20 de February de 2026

arXiv:2602.16829v1 Announce Type: new Abstract: When feedback is absorbed faster than task structure can be evaluated, the learner will favor feedback over truth. A two-timescale model shows this feedback-truth gap is inevitable whenever the two rates differ and vanishes only when they match. We test this prediction across neural networks trained with noisy labels (30 datasets, 2,700 runs), human probabilistic reversal learning (N = 292), and human reward/punishment learning with concurrent EEG (N = 25). In each system, […]

Ver mais

Like 0

Liked Liked

technocracy

An order-oriented approach to scoring hesitant fuzzy elements

digitado ⋅ 20 de February de 2026

arXiv:2602.16827v1 Announce Type: new Abstract: Traditional scoring approaches on hesitant fuzzy sets often lack a formal base in order theory. This paper proposes a unified framework, where each score is explicitly defined with respect to a given order. This order-oriented perspective enables more flexible and coherent scoring mechanisms. We examine several classical orders on hesitant fuzzy elements, that is, nonempty subsets in [0,1], and show that, contrary to prior claims, they do not induce lattice structures. In contrast, […]

Ver mais

Like 0

Liked Liked

technocracy

HiVAE: Hierarchical Latent Variables for Scalable Theory of Mind

digitado ⋅ 20 de February de 2026

arXiv:2602.16826v1 Announce Type: new Abstract: Theory of mind (ToM) enables AI systems to infer agents’ hidden goals and mental states, but existing approaches focus mainly on small human understandable gridworld spaces. We introduce HiVAE, a hierarchical variational architecture that scales ToM reasoning to realistic spatiotemporal domains. Inspired by the belief-desire-intention structure of human cognition, our three-level VAE hierarchy achieves substantial performance improvements on a 3,185-node campus navigation task. However, we identify a critical limitation: while our hierarchical structure […]

Ver mais

Like 0

Liked Liked

technocracy

RRT$^eta$: Sampling-based Motion Planning and Control from STL Specifications using Arithmetic-Geometric Mean Robustness

digitado ⋅ 20 de February de 2026

arXiv:2602.16825v1 Announce Type: new Abstract: Sampling-based motion planning has emerged as a powerful approach for robotics, enabling exploration of complex, high-dimensional configuration spaces. When combined with Signal Temporal Logic (STL), a temporal logic widely used for formalizing interpretable robotic tasks, these methods can address complex spatiotemporal constraints. However, traditional approaches rely on min-max robustness measures that focus only on critical time points and subformulae, creating non-smooth optimization landscapes with sharp decision boundaries that hinder efficient tree exploration. We […]

Ver mais

Like 0

Liked Liked

technocracy

Formal Mechanistic Interpretability: Automated Circuit Discovery with Provable Guarantees

digitado ⋅ 20 de February de 2026

arXiv:2602.16823v1 Announce Type: new Abstract: *Automated circuit discovery* is a central tool in mechanistic interpretability for identifying the internal components of neural networks responsible for specific behaviors. While prior methods have made significant progress, they typically depend on heuristics or approximations and do not offer provable guarantees over continuous input domains for the resulting circuits. In this work, we leverage recent advances in neural network verification to propose a suite of automated algorithms that yield circuits with *provable […]

Ver mais

Like 0

Liked Liked

technocracy

TopoFlow: Physics-guided Neural Networks for high-resolution air quality prediction

digitado ⋅ 20 de February de 2026

arXiv:2602.16821v1 Announce Type: new Abstract: We propose TopoFlow (Topography-aware pollutant Flow learning), a physics-guided neural network for efficient, high-resolution air quality prediction. To explicitly embed physical processes into the learning framework, we identify two critical factors governing pollutant dynamics: topography and wind direction. Complex terrain can channel, block, and trap pollutants, while wind acts as a primary driver of their transport and dispersion. Building on these insights, TopoFlow leverages a vision transformer architecture with two novel mechanisms: topography-aware […]

Ver mais

Like 0

Liked Liked

technocracy

AI-Mediated Feedback Improves Student Revisions: A Randomized Trial with FeedbackWriter in a Large Undergraduate Course

digitado ⋅ 20 de February de 2026

arXiv:2602.16820v1 Announce Type: new Abstract: Despite growing interest in using LLMs to generate feedback on students’ writing, little is known about how students respond to AI-mediated versus human-provided feedback. We address this gap through a randomized controlled trial in a large introductory economics course (N=354), where we introduce and deploy FeedbackWriter – a system that generates AI suggestions to teaching assistants (TAs) while they provide feedback on students’ knowledge-intensive essays. TAs have the full capacity to adopt, edit, […]

Ver mais

Like 0

Liked Liked