digitado – Page 179

Neural solver for Wasserstein Geodesics and optimal transport dynamics

digitado ⋅ 26 February 2026

arXiv:2602.22003v1 Announce Type: cross Abstract: In recent years, the machine learning community has increasingly embraced the optimal transport (OT) framework for modeling distributional relationships. In this work, we introduce a sample-based neural solver for computing the Wasserstein geodesic between a source and target distribution, along with the associated velocity field. Building on the dynamical formulation of the optimal transport (OT) problem, we recast the constrained optimization as a minimax problem, using deep neural networks to approximate the relevant […]

technocracy

Automated Reproducibility Has a Problem Statement Problem

digitado ⋅ 9 January 2026

arXiv:2601.04226v1 Announce Type: new Abstract: Background. Reproducibility is essential to the scientific method, but reproduction is often a laborious task. Recent works have attempted to automate this process and relieve researchers of this workload. However, due to varying definitions of reproducibility, a clear problem statement is missing. Objectives. Create a generalisable problem statement, applicable to any empirical study. We hypothesise that we can represent any empirical study using a structure based on the scientific method and that this […]

technocracy

Multi-User Large Language Model Agents

digitado ⋅ 13 April 2026

arXiv:2604.08567v1 Announce Type: new Abstract: Large language models (LLMs) and LLM-based agents are increasingly deployed as assistants in planning and decision making, yet most existing systems are implicitly optimized for a single-principal interaction paradigm, in which the model is designed to satisfy the objectives of one dominant user whose instructions are treated as the sole source of authority and utility. However, as they are integrated into team workflows and organizational tools, they are increasingly required to serve multiple […]

technocracy

FRESCO: Benchmarking and Optimizing Re-rankers for Evolving Semantic Conflict in Retrieval-Augmented Generation

digitado ⋅ 18 April 2026

arXiv:2604.14227v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) is a key approach to mitigating the temporal staleness of large language models (LLMs) by grounding responses in up-to-date evidence. Within the RAG pipeline, re-rankers play a pivotal role in selecting the most useful documents from retrieved candidates. However, existing benchmarks predominantly evaluate re-rankers in static settings and do not adequately assess performance under evolving information — a critical gap, as real-world systems often must choose among temporally different pieces […]

technocracy

Latent-Augmented Discrete Diffusion Models

digitado ⋅ 25 February 2026

arXiv:2510.18114v2 Announce Type: replace-cross Abstract: Discrete diffusion models have emerged as a powerful class of models and a promising route to fast language generation, but practical implementations typically rely on factored reverse transitions that ignore cross-token dependencies and degrade performance in the few-step regime. We propose Latent-Augmented Discrete Diffusion (LADD), which introduces a learnable auxiliary latent channel and performs diffusion over the joint (token, latent) space. The latent variables provide an intermediate representation that can express joint structure […]

technocracy

Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language Models

digitado ⋅ 7 April 2026

arXiv:2604.02485v1 Announce Type: new Abstract: Confirmation bias, the tendency to seek evidence that supports rather than challenges one’s belief, hinders one’s reasoning ability. We examine whether large language models (LLMs) exhibit confirmation bias by adapting the rule-discovery study from human psychology: given a sequence of three numbers (a “triple”), an agent engages in an interactive feedback loop where it (1) proposes a new triple, (2) receives feedback on whether it satisfies the hidden rule, and (3) guesses the […]

technocracy

Lemon Agent Technical Report

digitado ⋅ 10 February 2026

arXiv:2602.07092v1 Announce Type: new Abstract: Recent advanced LLM-powered agent systems have exhibited their remarkable capabilities in tackling complex, long-horizon tasks. Nevertheless, they still suffer from inherent limitations in resource efficiency, context management, and multimodal perception. Based on these observations, Lemon Agent is introduced, a multi-agent orchestrator-worker system built on a newly proposed AgentCortex framework, which formalizes the classic Planner-Executor-Memory paradigm through an adaptive task execution mechanism. Our system integrates a hierarchical self-adaptive scheduling mechanism that operates at both […]

technocracy

Locally Coherent Parallel Decoding in Diffusion Language Models

digitado ⋅ 24 March 2026

arXiv:2603.20216v1 Announce Type: new Abstract: Diffusion language models (DLMs) have emerged as a promising alternative to autoregressive (AR) models, offering sub-linear generation latency and bidirectional capabilities that are particularly appealing for code generation and editing. Achieving sub-linear latency in discrete DLMs requires predicting multiple tokens in parallel. However, standard DLMs sample tokens independently from conditional marginal distributions, failing to capture the joint dependencies among concurrently generated tokens. As a result, they often lead to syntactic inconsistencies and break […]

technocracy

Provably Safe Generative Sampling with Constricting Barrier Functions

digitado ⋅ 26 February 2026

arXiv:2602.21429v1 Announce Type: new Abstract: Flow-based generative models, such as diffusion models and flow matching models, have achieved remarkable success in learning complex data distributions. However, a critical gap remains for their deployment in safety-critical domains: the lack of formal guarantees that generated samples will satisfy hard constraints. We address this by proposing a safety filtering framework that acts as an online shield for any pre-trained generative model. Our key insight is to cooperate with the generative process […]

technocracy

Soldier won $410K in Polymarket bets on timing of Maduro capture, US alleges

digitado ⋅ 24 April 2026

A US Army soldier was arrested for insider trading after being accused of making prediction-market wagers on the timing of the military’s capture of Venezuelan President Nicolás Maduro. Army soldier Gannon Ken Van Dyke made a profit of nearly $410,000 by making bets on Polymarket, and he was indicted on charges of unlawful use of confidential government information for personal gain, theft of nonpublic government information, commodities fraud, wire fraud, and making an unlawful monetary transaction, the Department […]
