digitado

Fine-Tuning Language Models to Know What They Know

digitado ⋅ 4 de February de 2026

arXiv:2602.02605v1 Announce Type: new Abstract: Metacognition is a critical component of intelligence, specifically regarding the awareness of one’s own knowledge. While humans rely on shared internal memory for both answering questions and reporting their knowledge state, this dependency in LLMs remains underexplored. This study proposes a framework to measure metacognitive ability $d_{rm{type2}}’$ using a dual-prompt method, followed by the introduction of Evolution Strategy for Metacognitive Alignment (ESMA) to bind a model’s internal knowledge to its explicit behaviors. ESMA […]

Ver mais

Like 0

Liked Liked

technocracy

Simple Baselines are Competitive with Code Evolution

digitado ⋅ 20 de February de 2026

arXiv:2602.16805v1 Announce Type: new Abstract: Code evolution is a family of techniques that rely on large language models to search through possible computer programs by evolving or mutating existing code. Many proposed code evolution pipelines show impressive performance but are often not compared to simpler baselines. We test how well two simple baselines do over three domains: finding better mathematical bounds, designing agentic scaffolds, and machine learning competitions. We find that simple baselines match or exceed much more […]

Ver mais

Like 0

Liked Liked

technocracy

A first-of-its-kind experiment to measure the impact of out-of-home advertising

digitado ⋅ 21 de May de 2025

A first-of-its-kind experiment to measure the impact of out-of-home advertising By combining surveys with ads targeted to metro and commuter rail lines, Amazon researchers identify the fraction of residents of different neighborhoods exposed to the ads and measure ad effectiveness. Economics Egor Abramov May 21, 02:47 PM May 21, 02:47 PM Amazon promotes its products and services through many advertising channels, and we naturally want to know how much bang we get for our buck. We thus try […]

Ver mais

Like 0

Liked Liked

technocracy

Online Learning with Limited Information in the Sliding Window Model

digitado ⋅ 8 de January de 2026

arXiv:2601.03533v1 Announce Type: new Abstract: Motivated by recent work on the experts problem in the streaming model, we consider the experts problem in the sliding window model. The sliding window model is a well-studied model that captures applications such as traffic monitoring, epidemic tracking, and automated trading, where recent information is more valuable than older data. Formally, we have $n$ experts, $T$ days, the ability to query the predictions of $q$ experts on each day, a limited amount […]

Ver mais

Like 0

Liked Liked

technocracy

Verdict: Yes, you should go see Project Hail Mary as soon as possible

digitado ⋅ 11 de March de 2026

First, in the plainest language, before we get to anything else, Project Hail Mary is a fantastic film. It does right by its source material, and it also easily stands on its own for folks who haven’t read the book. It comes out on March 20, and if you’re a regular Ars Technica reader, you will almost certainly enjoy the crap out of it. Go see it as soon as you can, and see it in a theater […]

Ver mais

Like 0

Liked Liked

technocracy

From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning

digitado ⋅ 29 de January de 2026

Advances in multi-modal large language models (MLLMs) have inspired time series understanding and reasoning tasks, that enable natural language querying over time series, producing textual analyses of complex temporal dynamics. Recent attempts hybridize numerical time series with their visualized plots, facilitating precise value reasoning and visual structure comprehension for comprehensive time series understanding of MLLMs. However, effective cross-modal integration remains challenging due to fine-grained temporal misalignment across modalities and severe entanglement between shared and modality-specific semantics, which hinder […]

Ver mais

Like 0

Liked Liked

technocracy

The Context Advantage: How Palantir AIP Operates the Modern Enterprise

digitado ⋅ 15 de January de 2026

Over the last couple of years, most conversations about AI have focused on model size, speed, or how many parameters a system can fit into memory. These are useful metrics, but they do not explain why some organisations see operational results while others remain stuck in experimentation. The difference is not the model. The difference is context. It is similar to how we once compared phones by processor speed. Faster chips looked impressive, but they never explained why one […]

Ver mais

Like 0

Liked Liked

technocracy

NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning

digitado ⋅ 10 de March de 2026

arXiv:2603.06798v1 Announce Type: cross Abstract: The growing scale of deep learning demands distributed training frameworks that jointly reason about parallelism, memory, and network topology. Prior works often rely on heuristic or topology-agnostic search, handling communication and memory separately. Without per-device memory awareness, these methods typically ensure feasibility post hoc by sharding parameters and activations across many devices, increasing synchronization, inflating communication, and underutilizing compute-limiting scalability and efficiency on real datacenter networks. We present NEST, a network-, compute-, and […]

Ver mais

Like 0

Liked Liked

technocracy

Echo State Networks for Time Series Forecasting: Hyperparameter Sweep and Benchmarking

digitado ⋅ 5 de February de 2026

arXiv:2602.03912v1 Announce Type: new Abstract: This paper investigates the forecasting performance of Echo State Networks (ESNs) for univariate time series forecasting using a subset of the M4 Forecasting Competition dataset. Focusing on monthly and quarterly time series with at most 20 years of historical data, we evaluate whether a fully automatic, purely feedback-driven ESN can serve as a competitive alternative to widely used statistical forecasting methods. The study adopts a rigorous two-stage evaluation approach: a Parameter dataset is […]

Ver mais

Like 0

Liked Liked

technocracy

VillageNet: Graph-based, Easily-interpretable, Unsupervised Clustering for Broad Biomedical Applications

digitado ⋅ 24 de February de 2026

arXiv:2501.10471v2 Announce Type: replace-cross Abstract: Clustering large high-dimensional datasets with diverse variable is essential for extracting high-level latent information from these datasets. Here, we developed an unsupervised clustering algorithm, we call “Village-Net”. Village-Net is specifically designed to effectively cluster high-dimension data without priori knowledge on the number of existing clusters. The algorithm operates in two phases: first, utilizing K-Means clustering, it divides the dataset into distinct subsets we refer to as “villages”. Next, a weighted network is created, […]

Ver mais

Like 0

Liked Liked