digitado

Control Reinforcement Learning: Token-Level Mechanistic Analysis via Learned SAE Feature Steering

digitado ⋅ 11 February 2026

Sparse autoencoders (SAEs) decompose language model activations into interpretable features, but existing methods reveal only which features activate, not which change model outputs when amplified. We introduce Control Reinforcement Learning (CRL), which trains a policy to select SAE features for steering at each token, producing interpretable intervention logs: the learned policy identifies features that change model outputs when amplified. Adaptive Feature Masking encourages diverse feature discovery while preserving single-feature interpretability. The framework yields new analysis capabilities: branch point […]

technocracy

Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

digitado ⋅ 9 February 2026

arXiv:2602.06584v1. Standard chain-of-thought reasoning generates a solution in a single forward pass, committing irrevocably to each token and lacking a mechanism to recover from early errors. We introduce Inference-Time Rethinking, a generative framework that enables iterative self-correction by decoupling declarative latent thought vectors from procedural generation. We factorize reasoning into a continuous latent thought vector (what to reason about) and a decoder that verbalizes the trace conditioned on this vector (how to reason). Beyond […]

technocracy

Cost-Driven Representation Learning for Linear Quadratic Gaussian Control: Part II

digitado ⋅ 8 March 2026

We study the problem of state representation learning for control from partial and potentially high-dimensional observations. We approach this problem via cost-driven state representation learning, in which we learn a dynamical model in a latent state space by predicting cumulative costs. In particular, we establish finite-sample guarantees on finding a near-optimal representation function and a near-optimal controller using the learned latent model for infinite-horizon time-invariant Linear Quadratic Gaussian (LQG) control. We study two approaches to cost-driven representation learning, […]

technocracy

Microscopic Structure of Random 3-SAT: A Discrete Geometric Approach to Phase Transitions and Algorithmic Complexity

digitado ⋅ 2 March 2026

arXiv:2602.23411v1. The structural phase transitions and computational complexity of random 3-SAT instances are traditionally described using thermodynamic analogies from statistical physics, such as Replica Symmetry Breaking and energy landscapes. While providing profound macroscopic insights, these theories lack a discrete microscopic structure. In this paper, we propose a complementary, strictly discrete geometric model that maps these phenomena directly to the combinatorial topology of an $N$-dimensional Boolean hypercube. By defining the problem space purely through valid […]

technocracy

Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning

digitado ⋅ 9 February 2026

In hierarchical multi-label classification, a persistent challenge is enabling model predictions to reach deeper levels of the hierarchy for more detailed or fine-grained classifications. This difficulty partly arises from the natural rarity of certain classes (or hierarchical nodes) and the hierarchical constraint that ensures child nodes are almost always less frequent than their parents. To address this, we propose a weighted loss objective for neural networks that combines node-wise imbalance weighting with focal weighting components, the latter leveraging […]

technocracy

How to Make Email Marketing Work for You

digitado ⋅ 2 January 2026

Email marketing only works if messages reach the inbox. Deliverability testing identifies spam triggers, broken links, and authentication issues. Tools like MailGenius help marketers optimize campaigns, improve engagement, protect sender reputation, and maximize ROI by ensuring emails are seen, not lost to spam or promotions tabs.

technocracy

How I Test an AI Support Agent: A Practical Testing Pyramid

digitado ⋅ 12 March 2026

A walkthrough of the six testing layers I use to catch regressions, policy drift, hallucinations, and adversarial exploits in a B2B SaaS support agent — with an open-source repo you can fork and try yourself. I built an AI support agent. It looks up invoices, checks subscriptions, drafts MFA resets, escalates tickets, and refuses prompt injections — all against a real SQLite database and a local documentation corpus. It uses the OpenAI API for reasoning and tool calling. Then I asked: how do […]

technocracy

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

digitado ⋅ 23 February 2026

We study uplift estimation for combinatorial treatments. Uplift measures the pure incremental causal effect of an intervention (e.g., sending a coupon or a marketing message) on user behavior, modeled as a conditional individual treatment effect. Many real-world interventions are combinatorial: a treatment is a policy that specifies context-dependent action distributions rather than a single atomic label. Although recent work considers structured treatments, most methods rely on categorical or opaque encodings, limiting robustness and generalization to rare or newly […]

technocracy

What the heck is wrong with our AI overlords?

digitado ⋅ 7 April 2026

I don’t—thankfully—have to follow every statement that Sam Altman, the CEO of OpenAI, makes about the world. Many of these statements seem more like “hustles” or “pitches” than attempts to speak thoughtfully about the future. Even if they are genuine statements of belief, they often read like a teenager’s first sci-fi novel, written under the influence of weed and way too much Star Trek. Consider, for instance, Altman’s blog post “A Gentle Singularity,” published last year and read […]

technocracy

The Founder’s Guide to Choosing “Boring” Software That Won’t Betray You Later

digitado ⋅ 13 February 2026

Most founders have a version of this story. Personally, I’ve watched this happen (and made this mistake myself): the tool looks great in a demo, then headcount grows and reality hits. You pick a modern, impressive-looking tool. The demo is smooth. The UI feels fast. Everyone’s excited. For a while, it works. Then the company grows. You hire more people. Someone leaves. Finance asks for historical data. An auditor wants proof of approvals. A manager needs limited access. […]
