technocracy

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

digitado ⋅ 23 de February de 2026

arXiv:2602.17684v1 Announce Type: new Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) has driven recent progress in code large language models by leveraging execution-based feedback from unit tests, but its scalability is fundamentally constrained by the availability and reliability of high-quality test cases. We propose CodeScaler, an execution-free reward model designed to scale both reinforcement learning training and test-time inference for code generation. CodeScaler is trained on carefully curated preference data derived from verified code problems and incorporates syntax-aware […]

Ver mais

Like 0

Liked Liked

technocracy

A Practical Guide to Streaming Continual Learning

digitado ⋅ 2 de March de 2026

Continual Learning (CL) and Streaming Machine Learning (SML) study the ability of agents to learn from a stream of non-stationary data. Despite sharing some similarities, they address different and complementary challenges. While SML focuses on rapid adaptation after changes (concept drifts), CL aims to retain past knowledge when learning new tasks. After a brief introduction to CL and SML, we discuss Streaming Continual Learning (SCL), an emerging paradigm providing a unifying solution to real-world problems, which may require […]

Ver mais

Like 0

Liked Liked

technocracy

Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

digitado ⋅ 31 de March de 2026

arXiv:2603.26676v1 Announce Type: new Abstract: Current frontier AI safety evaluations emphasize static benchmarks, third-party annotations, and red-teaming. In this position paper, we argue that AI safety research should focus on human-centered evaluations that measure harmful capability uplift: the marginal increase in a user’s ability to cause harm with a frontier model beyond what conventional tools already enable. We frame harmful capability uplift as a core AI safety metric, ground it in prior social science research, and provide concrete […]

Ver mais

Like 0

Liked Liked

technocracy

AI Won’t Fix Your Broken IAM Data

digitado ⋅ 17 de March de 2026

Artificial intelligence is rapidly becoming the default answer to almost every security challenge. Boards are asking about it. CISOs are budgeting for it. Vendors are rebranding around it. But in identity and access management (IAM), AI is only as good as the data you feed it, and in most enterprises, that data is far from complete. IBM’s Cost of a Data Breach Report found that compromised credentials remain one of the most common initial attack vectors in breaches, […]

Ver mais

Like 0

Liked Liked

technocracy

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

digitado ⋅ 18 de February de 2026

arXiv:2602.15198v1 Announce Type: new Abstract: Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a unique safety problem when individual agents form a coalition and emph{collude} to pursue secondary goals and degrade the joint objective. In this paper, we present Colosseum, a framework for auditing LLM agents’ collusive behavior in multi-agent settings. We ground how agents cooperate through a Distributed Constraint Optimization Problem (DCOP) and measure collusion via […]

Ver mais

Like 0

Liked Liked

technocracy

Transfer Learning for Meta-analysis Under Covariate Shift

digitado ⋅ 3 de April de 2026

Randomized controlled trials often do not represent the populations where decisions are made, and covariate shift across studies can invalidate standard IPD meta-analysis and transport estimators. We propose a placebo-anchored transport framework that treats source-trial outcomes as abundant proxy signals and target-trial placebo outcomes as scarce, high-fidelity gold labels to calibrate baseline risk. A low-complexity (sparse) correction anchors proxy outcome models to the target population, and the anchored models are embedded in a cross-fitted doubly robust learner, yielding […]

Ver mais

Like 0

Liked Liked

technocracy

VQQA: An Agentic Approach for Video Evaluation and Quality Improvement

digitado ⋅ 16 de March de 2026

arXiv:2603.12310v1 Announce Type: new Abstract: Despite rapid advancements in video generation models, aligning their outputs with complex user intent remains challenging. Existing test-time optimization methods are typically either computationally expensive or require white-box access to model internals. To address this, we present VQQA (Video Quality Question Answering), a unified, multi-agent framework generalizable across diverse input modalities and video generation tasks. By dynamically generating visual questions and using the resulting Vision-Language Model (VLM) critiques as semantic gradients, VQQA replaces […]

Ver mais

Like 0

Liked Liked

technocracy

MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning

digitado ⋅ 26 de February de 2026

arXiv:2602.21442v1 Announce Type: new Abstract: The recent field of neural algorithmic reasoning (NAR) studies the ability of graph neural networks (GNNs) to emulate classical algorithms like Bellman-Ford, a phenomenon known as algorithmic alignment. At the same time, recent advances in large language models (LLMs) have spawned the study of mechanistic interpretability, which aims to identify granular model components like circuits that perform specific computations. In this work, we introduce Mechanistic Interpretability for Neural Algorithmic Reasoning (MINAR), an efficient […]

Ver mais

Like 0

Liked Liked

technocracy

Project Aletheia: Verifier-Guided Distillation of Backtracking for Small Language Models

digitado ⋅ 22 de January de 2026

arXiv:2601.14290v1 Announce Type: new Abstract: Small Language Models (SLMs, under 10B parameters) are attractive for private, on-device deployment, yet they frequently fail on strict constraint-satisfaction problems due to linear, overconfident reasoning traces that do not recover from early mistakes. We introduce Verifier-Guided Distillation, a training protocol that transfers the process of error repair – explicit conflict detection and backtracking – rather than only correct final answers. By training a 7B model on verified reasoning traces that include mistakes […]

Ver mais

Like 0

Liked Liked

technocracy

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

digitado ⋅ 10 de March de 2026

arXiv:2603.06615v1 Announce Type: new Abstract: For multivariate co-generation in scientific applications, we advocate pairwise block rather than joint modeling of all variables. This design mitigates the computational burden and data imbalance. To this end, we propose an Annealed Co-Generation (ACG) framework that replaces high-dimensional diffusion modeling with a low-dimensional diffusion model, which enables multivariate co-generation by composing pairwise variable generations. We first train an unconditional diffusion model over causal variables that are disentangled into pairs. At inference time, […]

Ver mais

Like 0

Liked Liked