digitado – Page 160

JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models

digitado ⋅ 17 April 2026

Adapter-based methods have become a cost-effective approach to continual learning (CL) for Large Language Models (LLMs), by sequentially learning a low-rank update matrix for each task. To mitigate catastrophic forgetting, state-of-the-art approaches impose constraints on new adapters with respect to the previous ones, by targeting either subspace or coordinate-wise interference. In this paper, we propose JumpLoRA, a novel framework to adaptively induce sparsity in the Low-Rank Adaptation (LoRA) blocks through the use of JumpReLU gating. The method achieves […]

technocracy

From pixels to planning: Earth AI for nature restoration

digitado ⋅ 16 June 2026

Climate & Sustainability

technocracy

Biased Generalization in Diffusion Models

digitado ⋅ 5 March 2026

arXiv:2603.03469v1. Abstract: Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce novel samples, with evaluation largely driven by held-out performance and perceived sample quality. In practice, training is often stopped at the minimum of the test loss, taken as an operational indicator of generalization. We challenge this viewpoint by identifying a phase of biased generalization during training, in which the model continues to decrease […]

technocracy

Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation

digitado ⋅ 16 February 2026

arXiv:2406.04112v3. Abstract: While overparameterization in machine learning models offers great benefits in terms of optimization and generalization, it also leads to increased computational requirements as model sizes grow. In this work, we show that by leveraging the inherent low-dimensional structures of data and compressible dynamics within the model parameters, we can reap the benefits of overparameterization without the computational burdens. In practice, we demonstrate the effectiveness of this approach for deep low-rank matrix completion as […]

technocracy

Interpretable Early Warnings using Machine Learning in an Online Game-experiment

digitado ⋅ 23 March 2026

arXiv:2502.09880v2. Abstract: Stemming from physics and later applied to other fields such as ecology, the theory of critical transitions suggests that some regime shifts are preceded by statistical early warning signals. Reddit's r/place experiment, a large-scale social game, provides a unique opportunity to test these signals consistently across thousands of subsystems undergoing critical transitions. In r/place, millions of users collaboratively created "compositions", or pixel-art drawings, in which transitions occur when one composition rapidly replaces another. […]

technocracy

3DRealHead: Few-Shot Detailed Head Avatar

digitado ⋅ 16 April 2026

arXiv:2604.13171v1. Abstract: The human face is central to communication. For immersive applications, the digital presence of a person should mirror the physical reality, capturing the user's idiosyncrasies and detailed facial expressions. However, current 3D head avatar methods often struggle to faithfully reproduce identity and facial expressions, despite having multi-view data or learned priors. Learning priors that capture the diversity of human appearances, especially for regions with highly person-specific features, like the mouth and teeth […]

technocracy

DDCL: Deep Dual Competitive Learning: A Differentiable End-to-End Framework for Unsupervised Prototype-Based Representation Learning

digitado ⋅ 2 April 2026

A persistent structural weakness in deep clustering is the disconnect between feature learning and cluster assignment. Most architectures invoke an external clustering step, typically k-means, to produce pseudo-labels that guide training, preventing the backbone from directly optimising for cluster quality. This paper introduces Deep Dual Competitive Learning (DDCL), the first fully differentiable end-to-end framework for unsupervised prototype-based representation learning. The core contribution is architectural: the external k-means is replaced by an internal Dual Competitive Layer (DCL) that generates […]

technocracy

Fairness May Backfire: When Leveling-Down Occurs in Fair Machine Learning

digitado ⋅ 6 March 2026

As machine learning (ML) systems increasingly shape access to credit, jobs, and other opportunities, the fairness of algorithmic decisions has become a central concern. Yet it remains unclear when enforcing fairness constraints in these systems genuinely improves outcomes for affected groups or instead leads to "leveling down," making one or both groups worse off. We address this question in a unified, population-level (Bayes) framework for binary classification under prevalent group fairness notions. Our Bayes approach is distribution-free and […]

technocracy

Efficient Planning in Reinforcement Learning via Model Introspection

digitado ⋅ 7 February 2026

Reinforcement learning and classical planning are typically seen as two distinct problems, with differing formulations necessitating different solutions. Yet, when humans are given a task, regardless of the way it is specified, they can often derive the additional information needed to solve the problem efficiently. The key to this ability is introspection: by reasoning about their internal models of the problem, humans directly synthesize additional task-relevant information. In this paper, we propose that this introspection can be thought […]

technocracy

A Novel Edge-Assisted Quantum-Classical Hybrid Framework for Crime Pattern Learning and Classification

digitado ⋅ 8 April 2026

Crime pattern analysis is critical for law enforcement and predictive policing, yet the surge in criminal activities from rapid urbanization creates high-dimensional, imbalanced datasets that challenge traditional classification methods. This study presents a quantum-classical comparison framework for crime analytics, evaluating four computational paradigms: quantum models, classical baseline machine learning models, and two hybrid quantum-classical architectures. Using 16-year Bangladesh crime statistics, we systematically assess classification performance and computational efficiency under rigorous cross-validation methods. Experimental results show that quantum-inspired approaches, […]
