February 2026

Learning Mixture Density via Natural Gradient Expectation Maximization

digitado ⋅ 11 de February de 2026

Mixture density networks are neural networks that produce Gaussian mixtures to represent continuous multimodal conditional densities. Standard training procedures involve maximum likelihood estimation using the negative log-likelihood (NLL) objective, which suffers from slow convergence and mode collapse. In this work, we improve the optimization of mixture density networks by integrating their information geometry. Specifically, we interpret mixture density networks as deep latent-variable models and analyze them through an expectation maximization framework, which reveals surprising theoretical connections to natural […]

Ver mais

Like 0

Liked Liked

technocracy

Neuro-symbolic Action Masking for Deep Reinforcement Learning

digitado ⋅ 11 de February de 2026

Deep reinforcement learning (DRL) may explore infeasible actions during training and execution. Existing approaches assume a symbol grounding function that maps high-dimensional states to consistent symbolic representations and a manually specified action masking techniques to constrain actions. In this paper, we propose Neuro-symbolic Action Masking (NSAM), a novel framework that automatically learn symbolic models, which are consistent with given domain constraints of high-dimensional states, in a minimally supervised manner during the DRL process. Based on the learned symbolic […]

Ver mais

Like 0

Liked Liked

technocracy

Roughness-Informed Federated Learning

digitado ⋅ 11 de February de 2026

Federated Learning (FL) enables collaborative model training across distributed clients while preserving data privacy, yet faces challenges in non-independent and identically distributed (non-IID) settings due to client drift, which impairs convergence. We propose RI-FedAvg, a novel FL algorithm that mitigates client drift by incorporating a Roughness Index (RI)-based regularization term into the local objective, adaptively penalizing updates based on the fluctuations of local loss landscapes. This paper introduces RI-FedAvg, leveraging the RI to quantify the roughness of high-dimensional […]

Ver mais

Like 0

Liked Liked

technocracy

Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning

digitado ⋅ 11 de February de 2026

Imitation Learning (IL) enables robots to learn complex skills from demonstrations without explicit task modeling, but it typically requires large amounts of demonstrations, creating significant collection costs. Prior work has investigated using flow as an intermediate representation to enable the use of human videos as a substitute, thereby reducing the amount of required robot demonstrations. However, most prior work has focused on the flow, either on the object or on specific points of the robot/hand, which cannot describe […]

Ver mais

Like 0

Liked Liked

technocracy

When Gradient Clipping Becomes a Control Mechanism for Differential Privacy in Deep Learning

digitado ⋅ 11 de February de 2026

Privacy-preserving training on sensitive data commonly relies on differentially private stochastic optimization with gradient clipping and Gaussian noise. The clipping threshold is a critical control knob: if set too small, systematic over-clipping induces optimization bias; if too large, injected noise dominates updates and degrades accuracy. Existing adaptive clipping methods often depend on per-example gradient norm statistics, adding computational overhead and introducing sensitivity to datasets and architectures. We propose a control-driven clipping strategy that adapts the threshold using a […]

Ver mais

Like 0

Liked Liked

technocracy

Contrastive Learning for Multi Label ECG Classification with Jaccard Score Based Sigmoid Loss

digitado ⋅ 11 de February de 2026

Recent advances in large language models (LLMs) have enabled the development of multimodal medical AI. While models such as MedGemini achieve high accuracy on VQA tasks like USMLE MM, their performance on ECG based tasks remains limited, and some models, such as MedGemma, do not support ECG data at all. Interpreting ECGs is inherently challenging, and diagnostic accuracy can vary depending on the interpreter’s experience. Although echocardiography provides rich diagnostic information, it requires specialized equipment and personnel, limiting […]

Ver mais

Like 0

Liked Liked

technocracy

What Makes Value Learning Efficient in Residual Reinforcement Learning?

digitado ⋅ 11 de February de 2026

Residual reinforcement learning (RL) enables stable online refinement of expressive pretrained policies by freezing the base and learning only bounded corrections. However, value learning in residual RL poses unique challenges that remain poorly understood. In this work, we identify two key bottlenecks: cold start pathology, where the critic lacks knowledge of the value landscape around the base policy, and structural scale mismatch, where the residual contribution is dwarfed by the base action. Through systematic investigation, we uncover the […]

Ver mais

Like 0

Liked Liked

technocracy

Statistical Inference and Learning for Shapley Additive Explanations (SHAP)

digitado ⋅ 11 de February de 2026

The SHAP (short for Shapley additive explanation) framework has become an essential tool for attributing importance to variables in predictive tasks. In model-agnostic settings, SHAP uses the concept of Shapley values from cooperative game theory to fairly allocate credit to the features in a vector $X$ based on their contribution to an outcome $Y$. While the explanations offered by SHAP are local by nature, learners often need global measures of feature importance in order to improve model explainability […]

Ver mais

Like 0

Liked Liked

technocracy

Generalized Robust Adaptive-Bandwidth Multi-View Manifold Learning in High Dimensions with Noise

digitado ⋅ 11 de February de 2026

Multiview datasets are common in scientific and engineering applications, yet existing fusion methods offer limited theoretical guarantees, particularly in the presence of heterogeneous and high-dimensional noise. We propose Generalized Robust Adaptive-Bandwidth Multiview Diffusion Maps (GRAB-MDM), a new kernel-based diffusion geometry framework for integrating multiple noisy data sources. The key innovation of GRAB-MDM is a {view}-dependent bandwidth selection strategy that adapts to the geometry and noise level of each view, enabling a stable and principled construction of multiview diffusion […]

Ver mais

Like 0

Liked Liked

technocracy

Transformers v5 – Hugging Face’s Next Big Leap in Simple and Powerful AI Models

digitado ⋅ 11 de February de 2026

Author(s): Aniket Sanyal Originally published on Towards AI. Transformers v5 – Hugging Face’s Next Big Leap in Simple and Powerful AI Models Hugging Face has unveiled Transformers v5, the latest major release of its popular open-source library that powers many AI models and applications. It’s been five years since version 4 was introduced, and the growth has been staggering — daily installations have exploded from about 20,000 to over 3 million per day, with total installs now past […]

Ver mais

Like 0

Liked Liked