February 2026

MonoLoss: A Training Objective for Interpretable Monosemantic Representations

digitado ⋅ 16 de February de 2026

arXiv:2602.12403v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) decompose polysemantic neural representations, where neurons respond to multiple unrelated concepts, into monosemantic features that capture single, interpretable concepts. However, standard training objectives only weakly encourage this decomposition, and existing monosemanticity metrics require pairwise comparisons across all dataset samples, making them inefficient during training and evaluation. We study a recent MonoScore metric and derive a single-pass algorithm that computes exactly the same quantity, but with a cost that grows linearly, […]

Ver mais

Like 0

Liked Liked

technocracy

AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning

digitado ⋅ 16 de February de 2026

arXiv:2602.12402v1 Announce Type: new Abstract: Analog and mixed-signal (AMS) integrated circuits (ICs) lie at the core of modern computing and communications systems. However, despite the continued rise in design complexity, advances in AMS automation remain limited. This reflects the central challenge in developing a generalized optimization method applicable across diverse circuit design spaces, many of which are distinct, constrained, and non-differentiable. To address this, our work casts circuit design as a graph generation problem and introduces a novel […]

Ver mais

Like 0

Liked Liked

technocracy

ZeroDiff++: Substantial Unseen Visual-semantic Correlation in Zero-shot Learning

digitado ⋅ 16 de February de 2026

arXiv:2602.12401v1 Announce Type: new Abstract: Zero-shot Learning (ZSL) enables classifiers to recognize classes unseen during training, commonly via generative two stage methods: (1) learn visual semantic correlations from seen classes; (2) synthesize unseen class features from semantics to train classifiers. In this paper, we identify spurious visual semantic correlations in existing generative ZSL worsened by scarce seen class samples and introduce two metrics to quantify spuriousness for seen and unseen classes. Furthermore, we point out a more critical […]

Ver mais

Like 0

Liked Liked

technocracy

Secrecy and Verifiability: An Introduction to Electronic Voting

digitado ⋅ 16 de February de 2026

arXiv:2602.12398v1 Announce Type: new Abstract: Democracies are built upon secure and reliable voting systems. Electronic voting systems seek to replace ballot papers and boxes with computer hardware and software. Proposed electronic election schemes have been subjected to scrutiny, with researchers spotting inherent faults and weaknesses. Inspired by physical voting systems, we argue that any electronic voting system needs two essential properties: ballot secrecy and verifiability. These properties seemingly work against each other. An election scheme that is a […]

Ver mais

Like 0

Liked Liked

technocracy

What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

digitado ⋅ 16 de February de 2026

arXiv:2602.12395v1 Announce Type: new Abstract: Reinforcement learning (RL) with verifiable rewards has become a standard post-training stage for boosting visual reasoning in vision-language models, yet it remains unclear what capabilities RL actually improves compared with supervised fine-tuning as cold-start initialization (IN). End-to-end benchmark gains conflate multiple factors, making it difficult to attribute improvements to specific skills. To bridge the gap, we propose a Frankenstein-style analysis framework including: (i) functional localization via causal probing; (ii) update characterization via parameter […]

Ver mais

Like 0

Liked Liked

technocracy

Synthetic Interaction Data for Scalable Personalization in Large Language Models

digitado ⋅ 16 de February de 2026

arXiv:2602.12394v1 Announce Type: new Abstract: Personalized prompting offers large opportunities for deploying large language models (LLMs) to diverse users, yet existing prompt optimization methods primarily focus on task-level optimization while largely overlooking user-specific preferences and latent constraints of individual users. This gap is primarily due to (i) the absence of high-quality, privacy-sensitive data that capture personalized user-LLM interactions at scale, and (ii) the lack of robust reward signals for individual preferences. To overcome existing data limitations, we introduce […]

Ver mais

Like 0

Liked Liked

technocracy

Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models

digitado ⋅ 16 de February de 2026

arXiv:2602.12393v1 Announce Type: new Abstract: DragDiffusion is a diffusion-based method for interactive point-based image editing that enables users to manipulate images by directly dragging selected points. The method claims that accurate spatial control can be achieved by optimizing a single diffusion latent at an intermediate timestep, together with identity-preserving fine-tuning and spatial regularization. This work presents a reproducibility study of DragDiffusion using the authors’ released implementation and the DragBench benchmark. We reproduce the main ablation studies on diffusion […]

Ver mais

Like 0

Liked Liked

technocracy

High-dimensional Level Set Estimation with Trust Regions and Double Acquisition Functions

digitado ⋅ 16 de February de 2026

arXiv:2602.12391v1 Announce Type: new Abstract: Level set estimation (LSE) classifies whether an unknown function’s value exceeds a specified threshold for given inputs, a fundamental problem in many real-world applications. In active learning settings with limited initial data, we aim to iteratively acquire informative points to construct an accurate classifier for this task. In high-dimensional spaces, this becomes challenging where the search volume grows exponentially with increasing dimensionality. We propose TRLSE, an algorithm for high-dimensional LSE, which identifies and […]

Ver mais

Like 0

Liked Liked

technocracy

Rational Neural Networks have Expressivity Advantages

digitado ⋅ 16 de February de 2026

arXiv:2602.12390v1 Announce Type: new Abstract: We study neural networks with trainable low-degree rational activation functions and show that they are more expressive and parameter-efficient than modern piecewise-linear and smooth activations such as ELU, LeakyReLU, LogSigmoid, PReLU, ReLU, SELU, CELU, Sigmoid, SiLU, Mish, Softplus, Tanh, Softmin, Softmax, and LogSoftmax. For an error target of $varepsilon>0$, we establish approximation-theoretic separations: Any network built from standard fixed activations can be uniformly approximated on compact domains by a rational-activation network with only […]

Ver mais

Like 0

Liked Liked

technocracy

Evolving Beyond Snapshots: Harmonizing Structure and Sequence via Entity State Tuning for Temporal Knowledge Graph Forecasting

digitado ⋅ 16 de February de 2026

arXiv:2602.12389v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) forecasting requires predicting future facts by jointly modeling structural dependencies within each snapshot and temporal evolution across snapshots. However, most existing methods are stateless: they recompute entity representations at each timestamp from a limited query window, leading to episodic amnesia and rapid decay of long-term dependencies. To address this limitation, we propose Entity State Tuning (EST), an encoder-agnostic framework that endows TKG forecasters with persistent and continuously evolving entity […]

Ver mais

Like 0

Liked Liked