Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling
arXiv:2603.22563v1

Abstract: Preference-based fine-tuning has become an important component in training large language models, and the data used at this stage may contain sensitive user information. A central question is how to design a differentially private pipeline that is well suited to the distinct structure of reinforcement learning from human feedback. We propose a privacy-preserving framework that imposes differential privacy only on reward learning and derives the final policy from the resulting private reward model. […]
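The appeal of this decoupling is that the privacy cost is paid once, during reward-model training: because differential privacy is closed under post-processing, any policy optimized against the frozen private reward model inherits the same guarantee. The following is a minimal, self-contained sketch of that first stage, not the authors' implementation: a toy feature-based reward model trained on a Bradley-Terry preference loss with DP-SGD (per-example gradient clipping plus Gaussian noise). All names, dimensions, and hyperparameters here are illustrative assumptions.

```python
# Sketch of DP reward learning for decoupled RLHF (illustrative, not the paper's code).
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

DIM, CLIP, SIGMA, LR, STEPS, BATCH = 16, 1.0, 1.0, 0.1, 100, 32  # assumed toy values

# Toy reward model: scores a fixed-size (prompt, response) feature vector.
reward_model = nn.Sequential(nn.Linear(DIM, 64), nn.Tanh(), nn.Linear(64, 1))
params = list(reward_model.parameters())

def pairwise_loss(chosen, rejected):
    # Bradley-Terry preference loss: push the chosen response's reward above the rejected one's.
    return -F.logsigmoid(reward_model(chosen) - reward_model(rejected)).mean()

for step in range(STEPS):
    # Synthetic preference pairs stand in for sensitive user data.
    chosen = torch.randn(BATCH, DIM) + 0.5
    rejected = torch.randn(BATCH, DIM)

    # DP-SGD: clip each example's gradient to norm CLIP, sum, then add Gaussian noise.
    grad_sum = [torch.zeros_like(p) for p in params]
    for i in range(BATCH):
        loss_i = pairwise_loss(chosen[i : i + 1], rejected[i : i + 1])
        grads = torch.autograd.grad(loss_i, params)
        norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
        scale = (CLIP / (norm + 1e-12)).clamp(max=1.0)
        for acc, g in zip(grad_sum, grads):
            acc.add_(g, alpha=float(scale))
    with torch.no_grad():
        for p, acc in zip(params, grad_sum):
            noisy = acc + SIGMA * CLIP * torch.randn_like(acc)
            p.add_(noisy, alpha=-LR / BATCH)

# Stage two (not shown): freeze reward_model and optimize a policy against it with any
# RL algorithm; by the post-processing property of DP, no further noise is needed there.
reward_model.eval()
```

Note that only the reward-learning loop touches the preference data, so it is the only place noise is injected; the subsequent policy-optimization stage can use non-private machinery without weakening the privacy guarantee.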