digitado – Page 504

Beyond Caption-Based Queries for Video Moment Retrieval

digitado ⋅ 4 de March de 2026

arXiv:2603.02363v1 Announce Type: new Abstract: In this work, we investigate the degradation of existing VMR methods, particularly of DETR architectures, when trained on caption-based queries but evaluated on search queries. For this, we introduce three benchmarks by modifying the textual queries in three public VMR datasets — i.e., HD-EPIC, YouCook2 and ActivityNet-Captions. Our analysis reveals two key generalization challenges: (i) A language gap, arising from the linguistic under-specification of search queries, and (ii) a multi-moment gap, caused by […]

Ver mais

Like 0

Liked Liked

technocracy

Online Continual Learning for Time Series: a Natural Score-driven Approach

digitado ⋅ 21 de January de 2026

arXiv:2601.12931v1 Announce Type: cross Abstract: Online continual learning (OCL) methods adapt to changing environments without forgetting past knowledge. Similarly, online time series forecasting (OTSF) is a real-world problem where data evolve in time and success depends on both rapid adaptation and long-term memory. Indeed, time-varying and regime-switching forecasting models have been extensively studied, offering a strong justification for the use of OCL in these settings. Building on recent work that applies OCL to OTSF, this paper aims to […]

Ver mais

Like 0

Liked Liked

technocracy

I Built a NumPy-Like Library in Pure JavaScript: This Is Exactly How I Did It

digitado ⋅ 28 de March de 2026

While learning data science with Python, one library impressed me more than any other: NumPy. NumPy contains many essential numerical and statistical functions that make working with data much easier. Instead of writing complex mathematical logic from scratch, you can simply call built-in functions such as mean, sum, or dot. As I continued learning, a question came to my mind: “Can I build something similar using pure JavaScript?” This curiosity led me to explore how NumPy works internally […]

Ver mais

Like 0

Liked Liked

technocracy

The life of a prescription at Amazon Pharmacy

digitado ⋅ 30 de September de 2024

The life of a prescription at Amazon Pharmacy From pricing estimation and regulatory compliance to inventory management and chatbot assistants, machine learning models help Amazon Pharmacy customers stay healthy and save time and money. Conversational AI Alexandre Alves Anita Vila September 30, 01:32 PM October 02, 11:42 AM Pharmacies play a vital role in ensuring patients health, but the process of dispensing medications is far more complex than it may appear. At Amazon Pharmacy, we are using artificial […]

Ver mais

Like 0

Liked Liked

technocracy

Towards a Theoretical Understanding to the Generalization of RLHF

digitado ⋅ 26 de January de 2026

arXiv:2601.16403v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) and its variants have emerged as the dominant approaches for aligning Large Language Models with human intent. While empirically effective, the theoretical generalization properties of these methods in high-dimensional settings remain to be explored. To this end, we build the generalization theory on RLHF of LLMs under the linear reward model, through the framework of algorithmic stability. In contrast to the existing works built upon the consistency […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond Real Data: Synthetic Data through the Lens of Regularization

digitado ⋅ 2 de April de 2026

arXiv:2510.08095v2 Announce Type: replace Abstract: Synthetic data can improve generalization when real data is scarce, but excessive reliance may introduce distributional mismatches that degrade performance. In this paper, we present a learning-theoretic framework to quantify the trade-off between synthetic and real data. Our approach leverages algorithmic stability to derive generalization error bounds, characterizing the optimal synthetic-to-real data ratio that minimizes expected test error as a function of the Wasserstein distance between the real and synthetic distributions. We motivate […]

Ver mais

Like 0

Liked Liked

technocracy

Functional Futures: Carp with Erik Svedäng

digitado ⋅ 14 de July de 2022

In this month’s episode of Functional Futures, our guest is Erik Svedäng, a game designer who has created many board and video games. Among them is Else Heart.Break(), a puzzle video game with its own programming language. He is also the creator of Carp, a statically-typed lisp for real-time applications. In the episode, we talk about game design, game development, and how Carp enables developers to build performant games while keeping true to functional programming idioms. As always, […]

Ver mais

Like 0

Liked Liked

technocracy

SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model

digitado ⋅ 30 de January de 2026

arXiv:2601.21031v1 Announce Type: new Abstract: Current foundation model for photoplethysmography (PPG) signals is challenged by the intrinsic redundancy and noise of the signal. Standard masked modeling often yields trivial solutions while contrastive methods lack morphological precision. To address these limitations, we propose a Statistical-prior Informed Generative Masking Architecture (SIGMA-PPG), a generative foundation model featuring a Prior-Guided Adversarial Masking mechanism, where a reinforcement learning-driven teacher leverages statistical priors to create challenging learning paths that prevent overfitting to noise. We […]

Ver mais

Like 0

Liked Liked

technocracy

A Learning-Based Hybrid Decision Framework for Matching Systems with User Departure Detection

digitado ⋅ 25 de February de 2026

In matching markets such as kidney exchanges and freight exchanges, delayed matching has been shown to improve overall market efficiency. The benefits of delay are highly sensitive to participants’ sojourn times and departure behavior, and delaying matches can impose significant costs, including longer waiting times and increased market congestion. These competing effects make fixed matching policies inherently inflexible in dynamic environments. We propose a learning-based Hybrid framework that adaptively combines immediate and delayed matching. The framework continuously collects […]

Ver mais

Like 0

Liked Liked

technocracy

Test-Time Training Provably Improves Transformers as In-context Learners

digitado ⋅ 24 de February de 2026

arXiv:2503.11842v2 Announce Type: replace-cross Abstract: Test-time training (TTT) methods explicitly update the weights of a model to adapt to the specific test instance, and they have found success in a variety of settings, including most recently language modeling and reasoning. To demystify this success, we investigate a gradient-based TTT algorithm for in-context learning, where we train a transformer model on the in-context demonstrations provided in the test prompt. Specifically, we provide a comprehensive theoretical characterization of linear transformers […]

Ver mais

Like 0

Liked Liked