NeuroLip: An Event-driven Spatiotemporal Learning Framework for Cross-Scene Lip-Motion-based Visual Speaker Recognition

digitado ⋅ 17 April 2026

Visual speaker recognition based on lip motion offers a silent, hands-free, and behavior-driven biometric solution that remains effective even when acoustic cues are unavailable. Compared to traditional methods that rely heavily on appearance-dependent representations, lip motion encodes subject-specific behavioral dynamics driven by consistent articulation patterns and muscle coordination, offering inherent stability across environmental changes. However, capturing these robust, fine-grained dynamics is challenging for conventional frame-based cameras due to motion blur and low dynamic range. To exploit the intrinsic […]

technocracy

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

digitado ⋅ 8 April 2026

arXiv:2604.04988v1. Abstract: Modern deployment often requires trading accuracy for efficiency under tight CPU and memory constraints, yet common compression proxies such as parameter count or FLOPs do not reliably predict wall-clock inference time. In particular, unstructured sparsity can reduce model storage while failing to accelerate (and sometimes slightly slowing down) standard CPU execution due to irregular memory access and sparse kernel overhead. Motivated by this gap between compression and acceleration, we study a practical, ordered […]

technocracy

Contrastive Self-Supervised Learning As Neural Manifold Packing

digitado ⋅ 6 January 2026

arXiv:2506.13717v2. Abstract: Contrastive self-supervised learning based on point-wise comparisons has been widely studied for vision tasks. In the visual cortex of the brain, neuronal responses to distinct stimulus classes are organized into geometric structures known as neural manifolds. Accurate classification of stimuli can be achieved by effectively separating these manifolds, akin to solving a packing problem. We introduce Contrastive Learning As Manifold Packing (CLAMP), a self-supervised framework that recasts representation learning as a manifold packing […]

technocracy

Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities

digitado ⋅ 13 January 2026

arXiv:2601.06913v1. Abstract: We study the multinomial logit (MNL) contextual bandit problem for sequential assortment selection. Although most existing research assumes utility functions to be linear in item features, this linearity assumption restricts the modeling of intricate interactions between items and user preferences. A recent work (Zhang & Luo, 2024) has investigated general utility function classes, yet its method faces fundamental trade-offs between computational tractability and statistical efficiency. To address this limitation, we propose a computationally […]

technocracy

China’s Moonshot AI raises $2B at $20B valuation as demand for open-source AI skyrockets

digitado ⋅ 7 May 2026

Moonshot’s annualized recurring revenue topped $200 million in April, driven by rapid growth in paid subscriptions and API usage.

technocracy

Matrix Manifold Neural Networks++

digitado ⋅ 6 January 2026

arXiv:2405.19206v2. Abstract: Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and natural language processing tasks. One of the key factors that contribute to the success of these networks is that spherical and hyperbolic manifolds have the rich algebraic structures of gyrogroups and gyrovector spaces. This enables principled and effective […]

technocracy

Neural Backward Filtering Forward Guiding

digitado ⋅ 2 February 2026

arXiv:2601.23030v1. Abstract: Inference in non-linear continuous stochastic processes on trees is challenging, particularly when observations are sparse (leaf-only) and the topology is complex. Exact smoothing via Doob’s $h$-transform is intractable for general non-linear dynamics, while particle-based methods degrade in high dimensions. We propose Neural Backward Filtering Forward Guiding (NBFFG), a unified framework for both discrete transitions and continuous diffusions. Our method constructs a variational posterior by leveraging an auxiliary linear-Gaussian process. This auxiliary process yields […]

technocracy

Mobility-Aware Cache Framework for Scalable LLM-Based Human Mobility Simulation

digitado ⋅ 20 February 2026

arXiv:2602.16727v1. Abstract: Large-scale human mobility simulation is critical for applications such as urban planning, epidemiology, and transportation analysis. Recent works treat large language models (LLMs) as human agents to simulate realistic mobility behaviors using structured reasoning, but their high computational cost limits scalability. To address this, we design a mobility-aware cache framework named MobCache that leverages reconstructible caches to enable efficient large-scale human mobility simulations. It consists of: (1) a reasoning component that encodes each […]

technocracy

Self-Supervised Learning from Structural Invariance

digitado ⋅ 2 February 2026

Joint-embedding self-supervised learning (SSL), the key paradigm for unsupervised representation learning from visual data, learns from invariances between semantically-related data pairs. We study the one-to-many mapping problem in SSL, where each datum may be mapped to multiple valid targets. This arises when data pairs come from naturally occurring generative processes, e.g., successive video frames. We show that existing methods struggle to flexibly capture this conditional uncertainty. As a remedy, we introduce a latent variable to account for this […]

technocracy

On the Fragility of AI Agent Collusion

digitado ⋅ 24 March 2026

arXiv:2603.20281v1. Abstract: Recent work shows that pricing with symmetric LLM agents leads to algorithmic collusion. We show that collusion is fragile under the heterogeneity typical of real deployments. In a stylized repeated-pricing model, heterogeneity in patience or data access reduces the set of collusive equilibria. Experiments with open-source LLM agents (totaling over 2,000 compute hours) align with these predictions: patience heterogeneity reduces price lift from 22% to 10% above competitive levels; asymmetric data access, to […]
