February 2026

Risk-Sensitive Exponential Actor Critic

digitado ⋅ 10 de February de 2026

arXiv:2602.07202v1 Announce Type: new Abstract: Model-free deep reinforcement learning (RL) algorithms have achieved tremendous success on a range of challenging tasks. However, safety concerns remain when these methods are deployed on real-world applications, necessitating risk-aware agents. A common utility for learning such risk-aware agents is the entropic risk measure, but current policy gradient methods optimizing this measure must perform high-variance and numerically unstable updates. As a result, existing risk-sensitive model-free approaches are limited to simple tasks and tabular […]

Ver mais

Like 0

Liked Liked

technocracy

BadSNN: Backdoor Attacks on Spiking Neural Networks via Adversarial Spiking Neuron

digitado ⋅ 10 de February de 2026

arXiv:2602.07200v1 Announce Type: new Abstract: Spiking Neural Networks (SNNs) are energy-efficient counterparts of Deep Neural Networks (DNNs) with high biological plausibility, as information is transmitted through temporal spiking patterns. The core element of an SNN is the spiking neuron, which converts input data into spikes following the Leaky Integrate-and-Fire (LIF) neuron model. This model includes several important hyperparameters, such as the membrane potential threshold and membrane time constant. Both the DNNs and SNNs have proven to be exploitable […]

Ver mais

Like 0

Liked Liked

technocracy

Condition Matters in Full-head 3D GANs

digitado ⋅ 10 de February de 2026

arXiv:2602.07198v1 Announce Type: new Abstract: Conditioning is crucial for stable training of full-head 3D GANs. Without any conditioning signal, the model suffers from severe mode collapse, making it impractical to training. However, a series of previous full-head 3D GANs conventionally choose the view angle as the conditioning input, which leads to a bias in the learned 3D full-head space along the conditional view direction. This is evident in the significant differences in generation quality and diversity between the […]

Ver mais

Like 0

Liked Liked

technocracy

Lite-BD: A Lightweight Black-box Backdoor Defense via Reviving Multi-Stage Image Transformations

digitado ⋅ 10 de February de 2026

arXiv:2602.07197v1 Announce Type: new Abstract: Deep Neural Networks (DNNs) are vulnerable to backdoor attacks. Due to the nature of Machine Learning as a Service (MLaaS) applications, black-box defenses are more practical than white-box methods, yet existing purification techniques suffer from key limitations: a lack of justification for specific transformations, dataset dependency, high computational overhead, and a neglect of frequency-domain transformations. This paper conducts a preliminary study on various image transformations, identifying down-upscaling as the most effective backdoor trigger […]

Ver mais

Like 0

Liked Liked

technocracy

Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility

digitado ⋅ 10 de February de 2026

arXiv:2602.07195v1 Announce Type: new Abstract: Interactive computational notebooks (e.g., Jupyter notebooks) are widely used in machine learning engineering (MLE) to program and share end-to-end pipelines, from data preparation to model training and evaluation. However, environment erosion-the rapid evolution of hardware and software ecosystems for machine learning-has rendered many published MLE notebooks non-reproducible in contemporary environments, hindering code reuse and scientific progress. To quantify this gap, we study 12,720 notebooks mined from 79 popular Kaggle competitions: only 35.4% remain […]

Ver mais

Like 0

Liked Liked

technocracy

“Death” of a Chatbot: Investigating and Designing Toward Psychologically Safe Endings for Human-AI Relationships

digitado ⋅ 10 de February de 2026

arXiv:2602.07193v1 Announce Type: new Abstract: Millions of users form emotional attachments to AI companions like Character.AI, Replika, and ChatGPT. When these relationships end through model updates, safety interventions, or platform shutdowns, users receive no closure, reporting grief comparable to human loss. As regulations mandate protections for vulnerable users, discontinuation events will accelerate, yet no platform has implemented deliberate end-of-“life” design. Through grounded theory analysis of AI companion communities, we find that discontinuation is a sense-making process shaped by […]

Ver mais

Like 0

Liked Liked

technocracy

Systematic Performance Assessment of Deep Material Networks for Multiscale Material Modeling

digitado ⋅ 10 de February de 2026

arXiv:2602.07192v1 Announce Type: new Abstract: Deep Material Networks (DMNs) are structure-preserving, mechanistic machine learning models that embed micromechanical principles into their architectures, enabling strong extrapolation capabilities and significant potential to accelerate multiscale modeling of complex microstructures. A key advantage of these models is that they can be trained exclusively on linear elastic data and then generalized to nonlinear inelastic regimes during online prediction. Despite their growing adoption, systematic evaluations of their performance across the full offline-online pipeline remain […]

Ver mais

Like 0

Liked Liked

technocracy

HALO: A Fine-Grained Resource Sharing Quantum Operating System

digitado ⋅ 10 de February de 2026

arXiv:2602.07191v1 Announce Type: new Abstract: As quantum computing enters the cloud era, thousands of users must share access to a small number of quantum processors. Users need to wait minutes to days to start their jobs, which only takes a few seconds for execution. Current quantum cloud platforms employ a fair-share scheduler, as there is no way to multiplex a quantum computer among multiple programs at the same time, leaving many qubits idle and significantly under-utilizing the hardware. […]

Ver mais

Like 0

Liked Liked

technocracy

Long-Context Long-Form Question Answering for Legal Domain

digitado ⋅ 10 de February de 2026

arXiv:2602.07190v1 Announce Type: new Abstract: Legal documents have complex document layouts involving multiple nested sections, lengthy footnotes and further use specialized linguistic devices like intricate syntax and domain-specific vocabulary to ensure precision and authority. These inherent characteristics of legal documents make question answering challenging, and particularly so when the answer to the question spans several pages (i.e. requires long-context) and is required to be comprehensive (i.e. a long-form answer). In this paper, we address the challenges of long-context […]

Ver mais

Like 0

Liked Liked

technocracy

Latent Target Score Matching, with an application to Simulation-Based Inference

digitado ⋅ 10 de February de 2026

arXiv:2602.07189v1 Announce Type: new Abstract: Denoising score matching (DSM) for training diffusion models may suffer from high variance at low noise levels. Target Score Matching (TSM) mitigates this when clean data scores are available, providing a low-variance objective. In many applications clean scores are inaccessible due to the presence of latent variables, leaving only joint signals exposed. We propose Latent Target Score Matching (LTSM), an extension of TSM to leverage joint scores for low-variance supervision of the marginal […]

Ver mais

Like 0

Liked Liked