digitado – Page 114

Lost in Aggregation: The Causal Interpretation of the IV Estimand

digitado ⋅ 21 de January de 2026

arXiv:2601.12120v1 Announce Type: cross Abstract: Instrumental variable based estimation of a causal effect has emerged as a standard approach to mitigate confounding bias in the social sciences and epidemiology, where conducting randomized experiments can be too costly or impossible. However, justifying the validity of the instrument often poses a significant challenge. In this work, we highlight a problem generally neglected in arguments for instrumental variable validity: the presence of an ”aggregate treatment variable”, where the treatment (e.g., education, […]

Ver mais

Like 0

Liked Liked

technocracy

The Illusion of Generalization: Re-examining Tabular Language Model Evaluation

digitado ⋅ 5 de February de 2026

arXiv:2602.04031v1 Announce Type: new Abstract: Tabular Language Models (TLMs) have been claimed to achieve emergent generalization for tabular prediction. We conduct a systematic re-evaluation of Tabula-8B as a representative TLM, utilizing 165 datasets from the UniPredict benchmark. Our investigation reveals three findings. First, binary and categorical classification achieve near-zero median lift over majority-class baselines and strong aggregate performance is driven entirely by quartile classification tasks. Second, top-performing datasets exhibit pervasive contamination, including complete train-test overlap and task-level leakage […]

Ver mais

Like 0

Liked Liked

technocracy

Deploy AI agents on Amazon Bedrock AgentCore using GitHub Actions

digitado ⋅ 16 de January de 2026

Recently, AWS announced Amazon Bedrock AgentCore, a flexible service that helps developers seamlessly create and manage AI agents across different frameworks and models, whether hosted on Amazon Bedrock or other environments. Specifically, AgentCore Runtime provides a secure, serverless, and purpose-built hosting environment for deploying and running AI agents or tools. AgentCore Runtime is framework agnostic, working seamlessly with popular frameworks like LangGraph, Strands, and CrewAI for deploying your AI agents and tools with automatic scaling and built-in security. […]

Ver mais

Like 0

Liked Liked

technocracy

Universal Sequence Preconditioning

digitado ⋅ 29 de January de 2026

arXiv:2502.06545v5 Announce Type: replace-cross Abstract: We study the problem of preconditioning in sequential prediction. From the theoretical lens of linear dynamical systems, we show that convolving the target sequence corresponds to applying a polynomial to the hidden transition matrix. Building on this insight, we propose a universal preconditioning method that convolves the target with coefficients from orthogonal polynomials such as Chebyshev or Legendre. We prove that this approach reduces regret for two distinct prediction algorithms and yields the […]

Ver mais

Like 0

Liked Liked

technocracy

Parametric RDT approach to computational gap of symmetric binary perceptron

digitado ⋅ 16 de January de 2026

arXiv:2601.10628v1 Announce Type: new Abstract: We study potential presence of statistical-computational gaps (SCG) in symmetric binary perceptrons (SBP) via a parametric utilization of emph{fully lifted random duality theory} (fl-RDT) [96]. A structural change from decreasingly to arbitrarily ordered $c$-sequence (a key fl-RDT parametric component) is observed on the second lifting level and associated with emph{satisfiability} ($alpha_c$) — emph{algorithmic} ($alpha_a$) constraints density threshold change thereby suggesting a potential existence of a nonzero computational gap $SCG=alpha_c-alpha_a$. The second level estimate […]

Ver mais

Like 0

Liked Liked

technocracy

CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning

digitado ⋅ 15 de January de 2026

Offline Reinforcement Learning (RL) enables policy optimization from static datasets but is inherently vulnerable to backdoor attacks. Existing attack strategies typically struggle against safety-constrained algorithms (e.g., CQL) due to inefficient random poisoning and the use of easily detectable Out-of-Distribution (OOD) triggers. In this paper, we propose CS-GBA (Critical Sample-based Gradient-guided Backdoor Attack), a novel framework designed to achieve high stealthiness and destructiveness under a strict budget. Leveraging the theoretical insight that samples with high Temporal Difference (TD) errors […]

Ver mais

Like 0

Liked Liked

technocracy

Post Title

digitado ⋅ 6 de July de 2026

Artificial intelligence is now part of everyday scientific research. Researchers rely on artificial intelligence to search through huge piles of papers, write code, analyze datasets, and draft reports. But, running a full research project usually means switching between all sorts of tools for search, coding, data analysis, and documentation. Synthetic Sciences built an open source platform called OpenScience to put everything researchers need in one place, basically a workspace powered by artificial intelligence that keeps things simple, transparent, […]

Ver mais

Like 0

Liked Liked

technocracy

Detecting and Fixing ‘Dead Neurons’ in Foundation Models

digitado ⋅ 28 de October de 2025

TL;DR Dead neurons silently waste compute and reduce effective model capacity in foundation models. Simple visualizations of the activation frequency make neuron health measurable. Dead neurons can be brought back to life by swapping activation functions or implementing synaptic stripping. It is crucial for foundation model training success to proactively monitor neuron health with audits and alerts. In neural networks, some neurons end up outputting near-zero activations across all inputs. These so-called “dead neurons” degrade model capacity because […]

Ver mais

Like 0

Liked Liked

technocracy

F1 in Miami: That’s what it looks like when an upgrade works

digitado ⋅ 4 de May de 2026

After an unanticipated five-week break in the season, Formula One resumed action this past weekend in Miami. Held at a temporary circuit around Hard Rock Stadium, the event is emblematic of the Liberty era of F1: a turbocharged marketing extravaganza crammed full of hospitality suites with ticket prices as high as $95,000. It might be miles from the sea—the original plans to race across a bridge over Biscayne Bay did not survive contact with locals—but the sport is […]

Ver mais

Like 0

Liked Liked

technocracy

The Rise of Large Language Models and the Direction and Impact of US Federal Research Funding

digitado ⋅ 23 de January de 2026

arXiv:2601.15485v1 Announce Type: new Abstract: Federal research funding shapes the direction, diversity, and impact of the US scientific enterprise. Large language models (LLMs) are rapidly diffusing into scientific practice, holding substantial promise while raising widespread concerns. Despite growing attention to AI use in scientific writing and evaluation, little is known about how the rise of LLMs is reshaping the public funding landscape. Here, we examine LLM involvement at key stages of the federal funding pipeline by combining two […]

Ver mais

Like 0

Liked Liked