February 2026

Alignment-Weighted DPO: A principled reasoning approach to improve safety alignment

digitado ⋅ 26 de February de 2026

arXiv:2602.21346v1 Announce Type: new Abstract: Recent advances in alignment techniques such as Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and Direct Preference Optimization (DPO) have improved the safety of large language models (LLMs). However, these LLMs remain vulnerable to jailbreak attacks that disguise harmful intent through indirect or deceptive phrasing. Using causal intervention, we empirically demonstrate that this vulnerability stems from shallow alignment mechanisms that lack deep reasoning, often rejecting harmful prompts without truly understanding why […]

Ver mais

Like 0

Liked Liked

technocracy

UnlinkableDFL: a Practical Mixnet Protocol for Churn-Tolerant Decentralized FL Model Sharing

digitado ⋅ 26 de February de 2026

arXiv:2602.21343v1 Announce Type: new Abstract: Decentralized Federated Learning (DFL) eliminates the need for a central aggregator, but it can expose communication patterns that reveal participant identities. This work presents UnlinkableDFL, a DFL framework that combines a peer-based mixnet with fragment-based model aggregation to ensure unlinkability in fully decentralized settings. Model updates are divided into encrypted fragments, sent over separate multi-hop paths, and aggregated without using any identity information. A theoretical analysis indicates that relay and end-to-end unlinkability improve […]

Ver mais

Like 0

Liked Liked

technocracy

Scaling View Synthesis Transformers

digitado ⋅ 26 de February de 2026

arXiv:2602.21341v1 Announce Type: new Abstract: Geometry-free view synthesis transformers have recently achieved state-of-the-art performance in Novel View Synthesis (NVS), outperforming traditional approaches that rely on explicit geometry modeling. Yet the factors governing their scaling with compute remain unclear. We present a systematic study of scaling laws for view synthesis transformers and derive design principles for training compute-optimal NVS models. Contrary to prior findings, we show that encoder-decoder architectures can be compute-optimal; we trace earlier negative results to suboptimal […]

Ver mais

Like 0

Liked Liked

technocracy

HiPPO Zoo: Explicit Memory Mechanisms for Interpretable State Space Models

digitado ⋅ 26 de February de 2026

arXiv:2602.21340v1 Announce Type: new Abstract: Representing the past in a compressed, efficient, and informative manner is a central problem for systems trained on sequential data. The HiPPO framework, originally proposed by Gu & Dao et al., provides a principled approach to sequential compression by projecting signals onto orthogonal polynomial (OP) bases via structured linear ordinary differential equations. Subsequent works have embedded these dynamics in state space models (SSMs), where HiPPO structure serves as an initialization. Nonlinear successors of […]

Ver mais

Like 0

Liked Liked

technocracy

A Benchmark to Assess Common Ground in Human-AI Collaboration

digitado ⋅ 26 de February de 2026

arXiv:2602.21337v1 Announce Type: new Abstract: AI is becoming increasingly integrated into everyday life, both in professional work environments and in leisure and entertainment contexts. This integration requires AI to move beyond acting as an assistant for informational or transactional tasks toward a genuine collaborative partner. Effective collaboration, whether between humans or between humans and AI, depends on establishing and maintaining common ground: shared beliefs, assumptions, goals, and situational awareness that enable coordinated action and efficient repair of misunderstandings. […]

Ver mais

Like 0

Liked Liked

technocracy

Autonomous Satellite Rendezvous via Hybrid Feedback Optimization

digitado ⋅ 26 de February de 2026

arXiv:2602.21334v1 Announce Type: new Abstract: As satellites have proliferated, interest has increased in autonomous rendezvous, proximity operations, and docking (ARPOD). A fundamental challenge in these tasks is the uncertainties when operating in space, e.g., in measurements of satellites’ states, which can make future states difficult to predict. Another challenge is that satellites’ onboard processors are typically much slower than their terrestrial counterparts. Therefore, to address these challenges we propose to solve an ARPOD problem with feedback optimization, which […]

Ver mais

Like 0

Liked Liked

technocracy

HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles

digitado ⋅ 26 de February de 2026

arXiv:2602.21333v1 Announce Type: new Abstract: Controllable driving scene generation is critical for realistic and scalable autonomous driving simulation, yet existing approaches struggle to jointly achieve photorealism and precise control. We introduce HorizonForge, a unified framework that reconstructs scenes as editable Gaussian Splats and Meshes, enabling fine-grained 3D manipulation and language-driven vehicle insertion. Edits are rendered through a noise-aware video diffusion process that enforces spatial and temporal consistency, producing diverse scene variations in a single feed-forward pass without per-trajectory […]

Ver mais

Like 0

Liked Liked

technocracy

Two NP-hard Extensions of the Spearman Footrule even for a Small Constant Number of Voters

digitado ⋅ 26 de February de 2026

arXiv:2602.21332v1 Announce Type: new Abstract: The Spearman footrule is a voting rule that takes as input voter preferences expressed as rankings. It outputs a ranking that minimizes the sum of the absolute differences between the position of each candidate in the ranking and in the voters’ preferences. In this paper, we study the computational complexity of two extensions of the Spearman footrule when the number of voters is a small constant. The first extension, introduced by Pascual et […]

Ver mais

Like 0

Liked Liked

technocracy

CableRobotGraphSim: A Graph Neural Network for Modeling Partially Observable Cable-Driven Robot Dynamics

digitado ⋅ 26 de February de 2026

arXiv:2602.21331v1 Announce Type: new Abstract: General-purpose simulators have accelerated the development of robots. Traditional simulators based on first-principles, however, typically require full-state observability or depend on parameter search for system identification. This work presents texttt{CableRobotGraphSim}, a novel Graph Neural Network (GNN) model for cable-driven robots that aims to address shortcomings of prior simulation solutions. By representing cable-driven robots as graphs, with the rigid-bodies as nodes and the cables and contacts as edges, this model can quickly and accurately […]

Ver mais

Like 0

Liked Liked

technocracy

Efficient Opportunistic Approachability

digitado ⋅ 26 de February de 2026

arXiv:2602.21328v1 Announce Type: new Abstract: We study the problem of opportunistic approachability: a generalization of Blackwell approachability where the learner would like to obtain stronger guarantees (i.e., approach a smaller set) when their adversary limits themselves to a subset of their possible action space. Bernstein et al. (2014) introduced this problem in 2014 and presented an algorithm that guarantees sublinear approachability rates for opportunistic approachability. However, this algorithm requires the ability to produce calibrated online predictions of the […]

Ver mais

Like 0

Liked Liked