Jailbreaking Large Language Models through Iterative Tool-Disguised Attacks via Reinforcement Learning
arXiv:2601.05466v1
Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse applications; however, they remain critically vulnerable to jailbreak attacks that elicit harmful responses violating human values and safety guidelines. Despite extensive research on defense mechanisms, existing safeguards prove insufficient against sophisticated adversarial strategies. In this work, we propose iMIST (interactive Multi-step Progressive Tool-disguised Jailbreak Attack), a novel adaptive jailbreak method that exploits vulnerabilities in current defense mechanisms. iMIST disguises malicious queries as […]