January 2026

Jailbreak-Zero: A Path to Pareto Optimal Red Teaming for Large Language Models

digitado ⋅ January 8, 2026

arXiv:2601.03265v1. Abstract: This paper introduces Jailbreak-Zero, a novel red teaming methodology that shifts the paradigm of Large Language Model (LLM) safety evaluation from a constrained example-based approach to a more expansive and effective policy-based framework. By leveraging an attack LLM to generate a high volume of diverse adversarial prompts and then fine-tuning this attack model with a preference dataset, Jailbreak-Zero achieves Pareto optimality across the crucial objectives of policy coverage, attack strategy diversity, and prompt […]

Internal Reasoning vs. External Control: A Thermodynamic Analysis of Sycophancy in Large Language Models

digitado ⋅ January 8, 2026

arXiv:2601.03263v1. Abstract: Large Language Models frequently exhibit sycophancy, prioritizing user agreeableness over correctness. We investigate whether this requires external regulation or can be mitigated by internal reasoning alone. Using CAP-GSM8K (N=500), an adversarial dataset, we evaluate internal (CoT) versus external (RCA) mechanisms across GPT-3.5, GPT-4o, and GPT-5.1. Our results reveal the structural limits of internal reasoning: it causes performance collapse in weak models (the Prioritization Paradox) and leaves an 11.4% final output gap in frontier […]

Roles of MLLMs in Visually Rich Document Retrieval for RAG: A Survey

digitado ⋅ January 8, 2026

arXiv:2601.03262v1. Abstract: Visually rich documents (VRDs) challenge retrieval-augmented generation (RAG) with layout-dependent semantics, brittle OCR, and evidence spread across complex figures and structured tables. This survey examines how Multimodal Large Language Models (MLLMs) are being used to make VRD retrieval practical for RAG. We organize the literature into three roles: Modality-Unifying Captioners, Multimodal Embedders, and End-to-End Representers. We compare these roles along retrieval granularity, information fidelity, latency and index size, and compatibility with reranking and […]

DeepResearch-Slice: Bridging the Retrieval-Utilization Gap via Explicit Text Slicing

digitado ⋅ January 8, 2026

arXiv:2601.03261v1. Abstract: Deep Research agents predominantly optimize search policies to maximize retrieval probability. However, we identify a critical bottleneck: the retrieval-utilization gap, where models fail to use gold evidence even after it is retrieved, due to context blindness in noisy environments. To bridge this gap, we propose DeepResearch-Slice, a simple yet effective neuro-symbolic framework. Unlike implicit attention, our approach predicts precise span indices to perform a deterministic hard filter before reasoning. Extensive evaluations across six […]
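
The idea of a deterministic hard filter over predicted span indices can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes spans are half-open character offsets `(start, end)` into the retrieved text, and the names `merge_spans` and `hard_filter` are hypothetical. How the spans are predicted is outside the sketch.

```python
def merge_spans(spans, length):
    """Clamp spans to [0, length], drop empty ones, and merge overlaps.

    Assumption: spans are half-open character offsets (start, end).
    """
    clamped = sorted((max(0, s), min(length, e)) for s, e in spans)
    merged = []
    for start, end in clamped:
        if start >= end:
            continue  # empty or inverted span after clamping
        if merged and start <= merged[-1][1]:
            # Overlaps or touches the previous span: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def hard_filter(text, spans, sep="\n"):
    """Keep only the text covered by the spans; everything else is dropped."""
    return sep.join(text[s:e] for s, e in merge_spans(spans, len(text)))


if __name__ == "__main__":
    retrieved = "abcdefghij"
    predicted = [(0, 3), (2, 5), (8, 10)]  # hypothetical model output
    print(hard_filter(retrieved, predicted))
```

Because the filter is plain slicing, the same spans always yield the same reduced context, which is what distinguishes it from soft, attention-based selection.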

SciNetBench: A Relation-Aware Benchmark for Scientific Literature Retrieval Agents

digitado ⋅ January 8, 2026

arXiv:2601.03260v1. Abstract: The rapid development of AI agents has spurred the development of advanced research tools, such as Deep Research. Achieving this requires a nuanced understanding of the relations within scientific literature, which surpasses the scope of keyword-based or embedding-based retrieval. Existing retrieval agents mainly focus on content-level similarities and are unable to decode critical relational dynamics, such as identifying corroborating or conflicting studies or tracing technological lineages, all of which are essential for a […]

LLMDiRec: LLM-Enhanced Intent Diffusion for Sequential Recommendation

digitado ⋅ January 8, 2026

arXiv:2601.03259v1. Abstract: Existing sequential recommendation models, even advanced diffusion-based approaches, often struggle to capture the rich semantic intent underlying user behavior, especially for new users or long-tail items. This limitation stems from their reliance on ID-based embeddings, which lack semantic grounding. We introduce LLMDiRec, a new approach that addresses this gap by integrating Large Language Models (LLMs) into an intent-aware diffusion model. Our approach combines collaborative signals from ID embeddings with rich semantic representations from […]

Enhancing Retrieval-Augmented Generation with Two-Stage Retrieval: FlashRank Reranking and Query Expansion

digitado ⋅ January 8, 2026

arXiv:2601.03258v1. Abstract: Retrieval-Augmented Generation (RAG) couples a retriever with a large language model (LLM) to ground generated responses in external evidence. While this framework enhances factuality and domain adaptability, it faces a key bottleneck: balancing retrieval recall with limited LLM context. Retrieving too few passages risks missing critical context, while retrieving too many overwhelms the prompt window, diluting relevance and increasing cost. We propose a two-stage retrieval pipeline that integrates LLM-driven query expansion to improve […]
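
The recall-then-precision shape of a two-stage pipeline can be sketched in a few lines. This is an illustration under loud assumptions, not the paper's system: it calls neither an LLM nor FlashRank. Query expansions are passed in as plain strings, token overlap stands in for the first-stage retriever, and Jaccard similarity stands in for the reranker. All function names are hypothetical.

```python
def tokens(text):
    """Lowercased whitespace tokens as a set (toy tokenizer)."""
    return set(text.lower().split())


def first_stage(queries, corpus, n_candidates):
    """Recall-oriented stage: score each passage by its best token
    overlap with any query variant, then keep the top n_candidates."""
    scored = []
    for idx, passage in enumerate(corpus):
        score = max(len(tokens(q) & tokens(passage)) for q in queries)
        if score > 0:
            scored.append((score, idx))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:n_candidates]]


def rerank(query, corpus, candidates, top_k):
    """Precision-oriented stage: rescore candidates against the
    original query only (Jaccard is a stand-in for a real reranker)."""
    def jaccard(a, b):
        union = a | b
        return len(a & b) / len(union) if union else 0.0

    q = tokens(query)
    ranked = sorted(candidates,
                    key=lambda idx: (-jaccard(q, tokens(corpus[idx])), idx))
    return ranked[:top_k]


def two_stage(query, expansions, corpus, n_candidates=10, top_k=3):
    """Retrieve broadly with the expanded queries, then rerank narrowly."""
    candidates = first_stage([query] + expansions, corpus, n_candidates)
    return [corpus[i] for i in rerank(query, corpus, candidates, top_k)]


if __name__ == "__main__":
    corpus = [
        "reranking improves precision in rag",
        "query expansion improves recall",
        "cooking pasta at home",
    ]
    # The expansion string is hand-written here; in the abstract it is LLM-driven.
    print(two_stage("rag reranking", ["rerankers improve recall"], corpus, top_k=2))
```

The split mirrors the trade-off in the abstract: `n_candidates` controls how much recall the first stage buys, while `top_k` caps what actually reaches the prompt window.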

5 Underrated Libraries & Frameworks for AI Engineers to Learn in 2026

digitado ⋅ January 8, 2026

In the fast-moving world of AI, it is easy to get distracted by the flashiest models: everyone is talking about Gemini, GPT, Claude, or Grok. But for AI engineers building actual production systems, the model is just one small piece of a much larger, more complicated puzzle. To build a robust AI application, you need to solve distinct engineering challenges: inference latency, observability, user interfaces, agentic orchestration, and memory management. Here are 5 underrated libraries/frameworks (plus a bonus) that […]

FedKDX: Federated Learning with Negative Knowledge Distillation for Enhanced Healthcare AI Systems

digitado ⋅ January 8, 2026

This paper introduces FedKDX, a federated learning framework that addresses limitations in healthcare AI through Negative Knowledge Distillation (NKD). Unlike existing approaches that focus solely on positive knowledge transfer, FedKDX captures both target and non-target information to improve model generalization in healthcare applications. The framework integrates multiple knowledge transfer techniques (traditional knowledge distillation, contrastive learning, and NKD) within a unified architecture that maintains privacy while reducing communication costs. Through experiments on healthcare datasets (SLEEP, UCI-HAR, and PAMAP2), FedKDX demonstrates […]

Improving Semi-Supervised Contrastive Learning via Entropy-Weighted Confidence Integration of Anchor-Positive Pairs

digitado ⋅ January 8, 2026

Conventional semi-supervised contrastive learning methods assign pseudo-labels only to samples whose highest predicted class probability exceeds a predefined threshold, and then perform supervised contrastive learning using those selected samples. In this study, we propose a novel loss function that estimates the confidence of each sample based on the entropy of its predicted probability distribution and applies confidence-based adaptive weighting. This approach enables pseudo-label assignment even to samples that were previously excluded from training and facilitates contrastive learning that […]
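
Entropy-based confidence weighting can be illustrated with a small sketch. The excerpt does not give the exact loss, so the formula below is an assumption: confidence is taken as one minus the Shannon entropy normalized by its maximum, log K, and per-sample losses are averaged with those confidences as weights. The function names are hypothetical.

```python
import math


def entropy_confidence(probs):
    """Confidence in [0, 1] from a predicted class distribution.

    Assumed form: 1 - H(p) / log(K), so a uniform prediction scores 0
    and a one-hot prediction scores 1.
    """
    k = len(probs)
    if k < 2:
        raise ValueError("need at least two classes")
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return 1.0 - entropy / math.log(k)


def weighted_mean_loss(losses, confidences):
    """Confidence-weighted average of per-sample losses.

    Every sample contributes, scaled by its confidence, instead of
    being dropped by a hard probability threshold.
    """
    total = sum(confidences)
    if total == 0.0:
        return 0.0
    return sum(w * l for w, l in zip(confidences, losses)) / total


if __name__ == "__main__":
    peaked = entropy_confidence([0.9, 0.05, 0.05])
    flat = entropy_confidence([0.4, 0.3, 0.3])
    print(peaked > flat)  # a sharper prediction earns a larger weight
```

Under this assumed form, a sample that a fixed threshold would have discarded still participates in training, just with a smaller weight.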
