digitado – Page 92

Dynamic Personality Adaptation in Large Language Models via State Machines

digitado ⋅ 25 de February de 2026

The inability of Large Language Models (LLMs) to modulate their personality expression in response to evolving dialogue dynamics hinders their performance in complex, interactive contexts. We propose a model-agnostic framework for dynamic personality simulation that employs state machines to represent latent personality states, where transition probabilities are dynamically adapted to the conversational context. Part of our architecture is a modular pipeline for continuous personality scoring that evaluates dialogues along latent axes while remaining agnostic to the specific personality […]

Ver mais

Like 0

Liked Liked

technocracy

When Small Variations Become Big Failures: Reliability Challenges in Compute-in-Memory Neural Accelerators

digitado ⋅ 5 de March de 2026

arXiv:2603.03491v1 Announce Type: new Abstract: Compute-in-memory (CiM) architectures promise significant improvements in energy efficiency and throughput for deep neural network acceleration by alleviating the von Neumann bottleneck. However, their reliance on emerging non-volatile memory devices introduces device-level non-idealities-such as write variability, conductance drift, and stochastic noise-that fundamentally challenge reliability, predictability, and safety, especially in safety-critical applications. This talk examines the reliability limits of CiM-based neural accelerators and presents a series of techniques that bridge device physics, architecture, and […]

Ver mais

Like 0

Liked Liked

technocracy

Sinkhorn-Drifting Generative Models

digitado ⋅ 16 de March de 2026

arXiv:2603.12366v1 Announce Type: new Abstract: We establish a theoretical link between the recently proposed “drifting” generative dynamics and gradient flows induced by the Sinkhorn divergence. In a particle discretization, the drift field admits a cross-minus-self decomposition: an attractive term toward the target distribution and a repulsive/self-correction term toward the current model, both expressed via one-sided normalized Gibbs kernels. We show that Sinkhorn divergence yields an analogous cross-minus-self structure, but with each term defined by entropic optimal-transport couplings obtained […]

Ver mais

Like 0

Liked Liked

technocracy

DinoDS isn’t “more scraped data.” It’s behavior engineering for LLMs.

digitado ⋅ 14 de April de 2026

I don’t think the interesting question anymore is “how much data did you scrape?” It’s: what exact model behavior did you engineer? That’s how we’ve been thinking about DinoDS. Not as one giant text pile, but as narrower training slices for things like: retrieval judgment grounded answering fixed structured output action / connector behavior safety boundaries The raw data matters, obviously. But the real value feels more and more like: task design, workflow realism, and how clearly the […]

Ver mais

Like 0

Liked Liked

technocracy

Create a Website Without Code: How Fabricate Turns Conversations Into Full-Stack Apps

digitado ⋅ 6 de March de 2026

In 2026, you don’t need a developer, a designer, or a $50K budget to build a professional web application. You just need to describe what you want. The idea of building a website without writing a single line of code is not new. Platforms like Wix, Squarespace, and WordPress have offered drag-and-drop builders for over a decade. But there has always been a ceiling. You could build a marketing site, maybe a simple blog, but the moment you […]

Ver mais

Like 0

Liked Liked

technocracy

BONSAI: Bayesian Optimization with Natural Simplicity and Interpretability

digitado ⋅ 10 de February de 2026

arXiv:2602.07144v1 Announce Type: cross Abstract: Bayesian optimization (BO) is a popular technique for sample-efficient optimization of black-box functions. In many applications, the parameters being tuned come with a carefully engineered default configuration, and practitioners only want to deviate from this default when necessary. Standard BO, however, does not aim to minimize deviation from the default and, in practice, often pushes weakly relevant parameters to the boundary of the search space. This makes it difficult to distinguish between important […]

Ver mais

Like 0

Liked Liked

technocracy

Learning False Discovery Rate Control via Model-Based Neural Networks

digitado ⋅ 6 de February de 2026

arXiv:2602.05798v1 Announce Type: cross Abstract: Controlling the false discovery rate (FDR) in high-dimensional variable selection requires balancing rigorous error control with statistical power. Existing methods with provable guarantees are often overly conservative, creating a persistent gap between the realized false discovery proportion (FDP) and the target FDR level. We introduce a learning-augmented enhancement of the T-Rex Selector framework that narrows this gap. Our approach replaces the analytical FDP estimator with a neural network trained solely on diverse synthetic […]

Ver mais

Like 0

Liked Liked

technocracy

Permutation-based Inference for Variational Learning of Directed Acyclic Graphs

digitado ⋅ 17 de February de 2026

arXiv:2402.02644v4 Announce Type: replace-cross Abstract: Estimating the structure of Bayesian networks as directed acyclic graphs (DAGs) from observational data is a fundamental challenge, particularly in causal discovery. Bayesian approaches excel by quantifying uncertainty and addressing identifiability, but key obstacles remain: (i) representing distributions over DAGs and (ii) estimating a posterior in the underlying combinatorial space. We introduce PIVID, a method that jointly infers a distribution over permutations and DAGs using variational inference and continuous relaxations of discrete distributions. […]

Ver mais

Like 0

Liked Liked

technocracy

Robust Federated Learning via Byzantine Filtering over Encrypted Updates

digitado ⋅ 5 de February de 2026

Federated Learning (FL) aims to train a collaborative model while preserving data privacy. However, the distributed nature of this approach still raises privacy and security issues, such as the exposure of sensitive data due to inference attacks and the influence of Byzantine behaviors on the trained model. In particular, achieving both secure aggregation and Byzantine resilience remains challenging, as existing solutions often address these aspects independently. In this work, we propose to address these challenges through a novel […]

Ver mais

Like 0

Liked Liked

technocracy

Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning

digitado ⋅ 21 de February de 2026

Personal AI agents incur substantial cost via repeated LLM calls. We show existing caching methods fail: GPTCache achieves 37.9% accuracy on real benchmarks; APC achieves 0-12%. The root cause is optimizing for the wrong property — cache effectiveness requires key consistency and precision, not classification accuracy. We observe cache-key evaluation reduces to clustering evaluation and apply V-measure decomposition to separate these on n=8,682 points across MASSIVE, BANKING77, CLINC150, and NyayaBench v2, our new 8,514-entry multilingual agentic dataset (528 […]

Ver mais

Like 0

Liked Liked