digitado – Page 23

Reinforcement Learning with LLM-Guided Action Spaces for Synthesizable Lead Optimization

digitado ⋅ 9 de April de 2026

Lead optimization in drug discovery requires improving therapeutic properties while ensuring that proposed molecular modifications correspond to feasible synthetic routes. Existing approaches either prioritize property scores without enforcing synthesizability, or rely on expensive enumeration over large reaction networks, while direct application of Large Language Models (LLMs) frequently produces chemically invalid structures. We introduce MolReAct, a framework that formulates lead optimization as a Markov Decision Process over a synthesis-constrained action space defined by validated reaction templates. A tool-augmented LLM […]

Ver mais

Like 0

Liked Liked

technocracy

Privacy utility trade offs for parameter estimation in degree heterogeneous higher order networks

digitado ⋅ 5 de February de 2026

arXiv:2602.03948v1 Announce Type: new Abstract: In sensitive applications involving relational datasets, protecting information about individual links from adversarial queries is of paramount importance. In many such settings, the available data are summarized solely through the degrees of the nodes in the network. We adopt the $beta$ model, which is the prototypical statistical model adopted for this form of aggregated relational information, and study the problem of minimax-optimal parameter estimation under both local and central differential privacy constraints. We […]

Ver mais

Like 0

Liked Liked

technocracy

Quoting Steve Yegge

digitado ⋅ 30 de January de 2026

Getting agents using Beads requires much less prompting, because Beads now has 4 months of “Desire Paths” design, which I’ve talked about before. Beads has evolved a very complex command-line interface, with 100+ subcommands, each with many sub-subcommands, aliases, alternate syntaxes, and other affordances. The complicated Beads CLI isn’t for humans; it’s for agents. What I did was make their hallucinations real, over and over, by implementing whatever I saw the agents trying to do with Beads, until […]

Ver mais

Like 0

Liked Liked

technocracy

Control Barrier Functions with Audio Risk Awareness for Robot Safe Navigation on Construction Sites

digitado ⋅ 16 de February de 2026

arXiv:2602.12416v1 Announce Type: new Abstract: Construction automation increasingly requires autonomous mobile robots, yet robust autonomy remains challenging on construction sites. These environments are dynamic and often visually occluded, which complicates perception and navigation. In this context, valuable information from audio sources remains underutilized in most autonomy stacks. This work presents a control barrier function (CBF)-based safety filter that provides safety guarantees for obstacle avoidance while adapting safety margins during navigation using an audio-derived risk cue. The proposed framework […]

Ver mais

Like 0

Liked Liked

technocracy

The Constructive Lie: Why Telling Your LLM the Wrong Answer Makes It Smarter

digitado ⋅ 29 de December de 2025

Author(s): Adham Khaled Originally published on Towards AI. Stop asking your AI to “think step-by-step.” Start asking it, “Why is this wrong?” We have all been there. You ask an LLM a complex logic question. It starts confidently. Step 1 looks good. Step 2 is plausible. But by Step 3, it has made a tiny arithmetic error or a slight logic leap. Source: Research PaperThe article discusses the concept of the “Constructive Lie” and its potential to enhance […]

Ver mais

Like 0

Liked Liked

technocracy

Discretization-free Bayesian inverse problems in distribution spaces

digitado ⋅ 12 de February de 2026

arXiv:2602.10247v1 Announce Type: new Abstract: The Bayesian approach to inverse problems provides a practical way to solve ill-posed problems by augmenting the observation model with prior information. Due to the measure-theoretic underpinnings, the approach has raised theoretical interest, leading to a rather comprehensive description in infinite-dimensional function spaces. The goal of this article is to bridge the infinite-dimensional theory for linear inverse problems in distribution spaces and associated computational inverse problems without resorting to a discrete approximation of […]

Ver mais

Like 0

Liked Liked

technocracy

The monotonicity of the Franz-Parisi potential is equivalent with Low-degree MMSE lower bounds

digitado ⋅ 23 de March de 2026

arXiv:2603.20070v1 Announce Type: cross Abstract: Over the last decades, two distinct approaches have been instrumental to our understanding of the computational complexity of statistical estimation. The statistical physics literature predicts algorithmic hardness through local stability and monotonicity properties of the Franz–Parisi (FP) potential cite{franz1995recipes,franz1997phase}, while the mathematically rigorous literature characterizes hardness via the limitations of restricted algorithmic classes, most notably low-degree polynomial estimators cite{hopkins2017efficient}. For many inference models, these two perspectives yield strikingly consistent predictions, giving rise to […]

Ver mais

Like 0

Liked Liked

technocracy

AI Agents in 2026: The Data Problem No One Mentions

digitado ⋅ 19 de January de 2026

Author(s): Ahmed M. Abdelfattah Originally published on Towards AI. Why vendors promise 3–5 employee productivity but Forrester finds 0% improvement and what your data infrastructure needs before deployment works Google Cloud claims AI agents deliver productivity equivalent to hiring 3–5 employees. Forrester’s analysis shows 0% actual improvement. Vendor demos run on sanitized test data. Your production systems have 10+ years of fragmented data scattered across tools that don’t talk to each other.The article explores the disparity between vendors’ […]

Ver mais

Like 0

Liked Liked

technocracy

Belief Dynamics for Detecting Behavioral Shifts in Safe Collaborative Manipulation

digitado ⋅ 8 de April de 2026

arXiv:2604.04967v1 Announce Type: new Abstract: Robots operating in shared workspaces must maintain safe coordination with other agents whose behavior may change during task execution. When a collaborating agent switches strategy mid-episode, continuing under outdated assumptions can lead to unsafe actions and increased collision risk. Reliable detection of such behavioral regime changes is therefore critical. We study regime-switch detection under controlled non-stationarity in ManiSkill shared-workspace manipulation tasks. Across ten detection methods and five random seeds, enabling detection reduces post-switch […]

Ver mais

Like 0

Liked Liked

technocracy

TrueBrief: Faithful Summarization through Small Language Models

digitado ⋅ 9 de January de 2026

arXiv:2601.04212v1 Announce Type: new Abstract: Large language models (LLMs) have exhibited remarkable proficiency in generating high-quality text; however, their propensity for producing hallucinations poses a significant challenge for their deployment in security-critical domains. In this work, we present TrueBrief, an end-to-end framework specifically designed to enhance the faithfulness of small LLMs (SLMs) primarily for the task of text summarization through a preference-optimization paradigm. Central to our framework is a data generation module that facilitates controlled hallucination injection to […]

Ver mais

Like 0

Liked Liked