digitado

A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models

digitado ⋅ 30 de January de 2026

arXiv:2601.20896v1 Announce Type: new Abstract: Self-supervised learning (SSL) has transformed speech processing, yet its reliance on massive pre-training datasets remains a bottleneck. While robustness is often attributed to scale and diversity, the role of the data distribution is less understood. We systematically examine how curated subsets of pre-training data influence Automatic Speech Recognition (ASR) performance. Surprisingly, optimizing for acoustic, speaker, or linguistic diversity yields no clear improvements over random sampling. Instead, we find that prioritizing the longest utterances […]

Ver mais

Like 0

Liked Liked

technocracy

5 Underrated Libraries & Frameworks for AI Engineers to Learn in 2026

digitado ⋅ 8 de January de 2026

In the fast-moving world of AI, we often get distracted by the flashiest models where everyone is talking about Gemini, GPT, Claude, or Grok models. But for AI Engineers building actual production systems, the model is just one small piece of a much larger complicated puzzle. To build a robust AI application, you need to solve distinct engineering challenges: inference latency, observability, user interfaces, agentic orchestration, and memory management. Here are 5 underrated libraries/frameworks (plus a bonus) that […]

Ver mais

Like 0

Liked Liked

technocracy

Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

digitado ⋅ 29 de January de 2026

Recent progress in geospatial foundation models highlights the importance of learning general-purpose representations for real-world locations, particularly points-of-interest (POIs) where human activity concentrates. Existing approaches, however, focus primarily on place identity derived from static textual metadata, or learn representations tied to trajectory context, which capture movement regularities rather than how places are actually used (i.e., POI’s function). We argue that POI function is a missing but essential signal for general POI representations. We introduce Mobility-Embedded POIs (ME-POIs), a […]

Ver mais

Like 0

Liked Liked

technocracy

Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

digitado ⋅ 2 de February de 2026

arXiv:2601.22297v1 Announce Type: new Abstract: The reasoning abilities of large language models (LLMs) have been substantially improved by reinforcement learning with verifiable rewards (RLVR). At test time, collaborative reasoning through Multi-Agent Debate (MAD) has emerged as a promising approach for enhancing LLM performance. However, current RLVR methods typically train LLMs to solve problems in isolation, without explicitly preparing them to synthesize and benefit from different rationales that arise during debate. In this work, we propose Self-Debate Reinforcement Learning […]

Ver mais

Like 0

Liked Liked

technocracy

Online monotone density estimation and log-optimal calibration

digitado ⋅ 10 de February de 2026

arXiv:2602.08927v1 Announce Type: new Abstract: We study the problem of online monotone density estimation, where density estimators must be constructed in a predictable manner from sequentially observed data. We propose two online estimators: an online analogue of the classical Grenander estimator, and an expert aggregation estimator inspired by exponential weighting methods from the online learning literature. In the well-specified stochastic setting, where the underlying density is monotone, we show that the expected cumulative log-likelihood gap between the online […]

Ver mais

Like 0

Liked Liked

technocracy

Model-Free Monte Carlo-like Policy Evaluation

digitado ⋅ 31 de March de 2010

We propose an algorithm for estimating the finite-horizon expected return of a closed loop control policy from an a priori given (off-policy) sample of one-step transitions. It averages cumulated rewards along a set of “broken trajectories” made of one-step transitions selected from the sample on the basis of the control policy. Under some Lipschitz continuity assumptions on the system dynamics, reward function and control policy, we provide bounds on the bias and variance of the estimator that depend […]

Ver mais

Like 0

Liked Liked

technocracy

From Events to Trending: A Multi-Stage Hotspots Detection Method Based on Generative Query Indexing

digitado ⋅ 12 de January de 2026

arXiv:2601.05258v1 Announce Type: new Abstract: LLM-based conversational systems have become a popular gateway for information access, yet most existing chatbots struggle to handle news-related trending queries effectively. To improve user experience, an effective trending query detection method is urgently needed to enable differentiated processing of such target traffic. However, current research on trending detection tailored to the dialogue system scenario remains largely unexplored, and methods designed for traditional search engines often underperform in conversational contexts due to radically […]

Ver mais

Like 0

Liked Liked

technocracy

Screen, Match, and Cache: A Training-Free Causality-Consistent Reference Frame Framework for Human Animation

digitado ⋅ 2 de February de 2026

arXiv:2601.22160v1 Announce Type: new Abstract: Human animation aims to generate temporally coherent and visually consistent videos over long sequences, yet modeling long-range dependencies while preserving frame quality remains challenging. Inspired by the human ability to leverage past observations for interpreting ongoing actions, we propose FrameCache, a training-free three-stage framework consisting of Screen, Cache, and Match. In the Screen stage, a multi-dimensional, quality-aware mechanism with adaptive thresholds dynamically selects informative frames; the Cache stage maintains a reference pool using […]

Ver mais

Like 0

Liked Liked

technocracy

Google: Don’t make “bite-sized” content for LLMs if you care about search rank

digitado ⋅ 9 de January de 2026

Search engine optimization, or SEO, is a big business. While some SEO practices are useful, much of the day-to-day SEO wisdom you see online amounts to superstition. An increasingly popular approach geared toward LLMs called “content chunking” may fall into that category. In the latest installment of Google’s Search Off the Record podcast, John Mueller and Danny Sullivan say that breaking content down into bite-sized chunks for LLMs like Gemini is a bad idea. You’ve probably seen websites […]

Ver mais

Like 0

Liked Liked

technocracy

Statistical Inference for Explainable Boosting Machines

digitado ⋅ 26 de January de 2026

Explainable boosting machines (EBMs) are popular "glass-box" models that learn a set of univariate functions using boosting trees. These achieve explainability through visualizations of each feature’s effect. However, unlike linear model coefficients, uncertainty quantification for the learned univariate functions requires computationally intensive bootstrapping, making it hard to know which features truly matter. We provide an alternative using recent advances in statistical inference for gradient boosting, deriving methods for statistical inference as well as end-to-end theoretical guarantees. Using a […]

Ver mais

Like 0

Liked Liked