February 2026

Intent Laundering: AI Safety Datasets Are Not What They Seem

digitado ⋅ 20 de February de 2026

arXiv:2602.16729v1 Announce Type: new Abstract: We systematically evaluate the quality of widely used AI safety datasets from two perspectives: in isolation and in practice. In isolation, we examine how well these datasets reflect real-world attacks based on three key properties: driven by ulterior intent, well-crafted, and out-of-distribution. We find that these datasets overrely on “triggering cues”: words or phrases with overt negative/sensitive connotations that are intended to trigger safety mechanisms explicitly, which is unrealistic compared to real-world attacks. […]

Ver mais

Like 0

Liked Liked

technocracy

Mobility-Aware Cache Framework for Scalable LLM-Based Human Mobility Simulation

digitado ⋅ 20 de February de 2026

arXiv:2602.16727v1 Announce Type: new Abstract: Large-scale human mobility simulation is critical for applications such as urban planning, epidemiology, and transportation analysis. Recent works treat large language models (LLMs) as human agents to simulate realistic mobility behaviors using structured reasoning, but their high computational cost limits scalability. To address this, we design a mobility-aware cache framework named MobCache that leverages reconstructible caches to enable efficient large-scale human mobility simulations. It consists of: (1) a reasoning component that encodes each […]

Ver mais

Like 0

Liked Liked

technocracy

Guiding LLM-Based Human Mobility Simulation with Mobility Measures from Shared Data

digitado ⋅ 20 de February de 2026

arXiv:2602.16726v1 Announce Type: new Abstract: Large-scale human mobility simulation is critical for many science domains such as urban science, epidemiology, and transportation analysis. Recent works treat large language models (LLMs) as human agents to simulate realistic mobility trajectories by modeling individual-level cognitive processes. However, these approaches generate individual mobility trajectories independently, without any population-level coordination mechanism, and thus fail to capture the emergence of collective behaviors. To address this issue, we design M2LSimu, a mobility measures-guided multi-prompt adjustment […]

Ver mais

Like 0

Liked Liked

technocracy

Is Mamba Reliable for Medical Imaging?

digitado ⋅ 20 de February de 2026

arXiv:2602.16723v1 Announce Type: new Abstract: State-space models like Mamba offer linear-time sequence processing and low memory, making them attractive for medical imaging. However, their robustness under realistic software and hardware threat models remains underexplored. This paper evaluates Mamba on multiple MedM-NIST classification benchmarks under input-level attacks, including white-box adversarial perturbations (FGSM/PGD), occlusion-based PatchDrop, and common acquisition corruptions (Gaussian noise and defocus blur) as well as hardware-inspired fault attacks emulated in software via targeted and random bit-flip injections into […]

Ver mais

Like 0

Liked Liked

technocracy

A Real-Time Approach to Autonomous CAN Bus Reverse Engineering

digitado ⋅ 20 de February de 2026

arXiv:2602.16722v1 Announce Type: new Abstract: This paper introduces a real-time method for reverse engineering a vehicle’s CAN bus without prior knowledge of the vehicle or its CAN system. By comparing inertial measurement and CAN data during significant vehicle events, the method accurately identified the CAN channels associated with the accelerator pedal, brake pedal, and steering wheel. Utilizing an IMU, CAN module, and event-driven software architecture, the system was validated using prerecorded serialized data from previous studies. This data, […]

Ver mais

Like 0

Liked Liked

technocracy

Speech to Speech Synthesis for Voice Impersonation

digitado ⋅ 20 de February de 2026

arXiv:2602.16721v1 Announce Type: new Abstract: Numerous models have shown great success in the fields of speech recognition as well as speech synthesis, but models for speech to speech processing have not been heavily explored. We propose Speech to Speech Synthesis Network (STSSN), a model based on current state of the art systems that fuses the two disciplines in order to perform effective speech to speech style transfer for the purpose of voice impersonation. We show that our proposed […]

Ver mais

Like 0

Liked Liked

technocracy

APEX-SQL: Talking to the data via Agentic Exploration for Text-to-SQL

digitado ⋅ 20 de February de 2026

arXiv:2602.16720v1 Announce Type: new Abstract: Text-to-SQL systems powered by Large Language Models have excelled on academic benchmarks but struggle in complex enterprise environments. The primary limitation lies in their reliance on static schema representations, which fails to resolve semantic ambiguity and scale effectively to large, complex databases. To address this, we propose APEX-SQL, an Agentic Text-to-SQL Framework that shifts the paradigm from passive translation to agentic exploration. Our framework employs a hypothesis-verification loop to ground model reasoning in […]

Ver mais

Like 0

Liked Liked

technocracy

GPU-Accelerated Algorithms for Graph Vector Search: Taxonomy, Empirical Study, and Research Directions

digitado ⋅ 20 de February de 2026

arXiv:2602.16719v1 Announce Type: new Abstract: Approximate Nearest Neighbor Search (ANNS) underpins many large-scale data mining and machine learning applications, with efficient retrieval increasingly hinging on GPU acceleration as dataset sizes grow. Although graph-based approaches represent the state of the art in approximate nearest neighbor search, there is a lack of systematic understanding regarding their optimization for modern GPU architectures and their end-to-end effectiveness in practical scenarios. In this work, we present a comprehensive survey and experimental study of […]

Ver mais

Like 0

Liked Liked

technocracy

UPER: Efficient Utility-driven Partially-ordered Episode Rule Mining

digitado ⋅ 20 de February de 2026

arXiv:2602.16718v1 Announce Type: new Abstract: Episode mining is a fundamental problem in analyzing a sequence of numerous events. For discovering strong relationships between events in a complex event sequence, episode rule mining is proposed. However, both the episode and episode rules have strict requirements for the order of events. Hence, partially-ordered episode rule mining (POERM) is designed to loosen the constraints on the ordering, i.e., events in the antecedents and consequents of the rule can be unordered, and […]

Ver mais

Like 0

Liked Liked

technocracy

Guided Exploration of Sequential Rules

digitado ⋅ 20 de February de 2026

arXiv:2602.16717v1 Announce Type: new Abstract: In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results efficiently and flexibly, many methods have been proposed that rely on specific evaluation metrics (i.e., ensuring results meet minimum threshold requirements). A key issue with these methods, however, is that they generate many sequential rules that are irrelevant to users. Such rules not […]

Ver mais

Like 0

Liked Liked