digitado – Page 77

Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation

digitado ⋅ May 22, 2026

arXiv:2605.21491v1 Announce Type: new Abstract: As language models accelerate scientific research by automating hypothesis generation and implementation, a new bottleneck emerges: evaluating and filtering hundreds of AI-generated ideas without exhaustive experimentation. We ask whether LMs can learn to forecast the empirical success of research ideas before any experiments are run. We study comparative empirical forecasting: given a benchmark-specific research goal and two candidate ideas, predict which will achieve better benchmark performance. We construct a dataset of 11,488 idea […]


technocracy

Phase transition in causal representation: flip frequency, not penalty severity, is the key variable

digitado ⋅ March 12, 2026

Posting a specific finding from a larger project that I think is relevant here. We ran a 7×6 parameter sweep over (flip_mean, penalty) in an evolutionary simulation of causal capacity emergence. The result surprised us: there is a sharp phase transition between flip_mean=80 and flip_mean=200 that is almost entirely independent of penalty severity. Below the boundary: equilibrium causal capacity 0.46–0.60. Above it: 0.30–0.36, regardless of whether the penalty is -2 or -30. The implication for RL environment design: […]


technocracy

Accurate Residues for Floating-Point Debugging

digitado ⋅ April 9, 2026

arXiv:2604.06258v1 Announce Type: new Abstract: Floating-point arithmetic is error-prone and unintuitive. Floating-point debuggers instrument programs to monitor floating-point arithmetic at run time and flag numerical issues. They estimate residues, i.e., the difference between actual floating-point and ideal real values, for every floating-point value in the program. Prior work explores various approaches for computing these residues accurately and efficiently. Unfortunately, the most efficient methods, based on “error-free transformations”, have a high rate of false reports, while the most accurate […]
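As a minimal illustration of the "error-free transformation" idea named in the abstract (a sketch, not the paper's method), Knuth's TwoSum recovers the exact rounding error of a single floating-point addition, i.e. the residue of that one operation. The function name `two_sum` and the sample inputs are illustrative assumptions; the identity holds for IEEE 754 doubles in round-to-nearest mode when no overflow occurs.

```python
from fractions import Fraction


def two_sum(a: float, b: float) -> tuple[float, float]:
    """Return (s, e) where s is the rounded sum a + b and e is its rounding error.

    In round-to-nearest arithmetic without overflow, s + e equals a + b exactly.
    """
    s = a + b                      # rounded floating-point sum
    b_virtual = s - a              # the part of b that actually made it into s
    a_virtual = s - b_virtual      # the part of a that actually made it into s
    e = (a - a_virtual) + (b - b_virtual)  # what was lost to rounding
    return s, e


# 1e-16 is below half an ulp of 1.0, so it is rounded away in the sum
# but reappears in the error term.
s, e = two_sum(1.0, 1e-16)

# Verify the identity with exact rational arithmetic.
exact = Fraction(1.0) + Fraction(1e-16)
assert Fraction(s) + Fraction(e) == exact
```

This handles a single addition only; tracking residues through an entire program is the harder problem the abstract addresses.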


technocracy

When the Gravity Gates Opened at Windy Corner

digitado ⋅ May 25, 2026

Astounding Stories of Super-Science May 2001, by Astounding Stories, is part of HackerNoon’s Book Blog Post series. A ROOM WITH A VIEW – Chapter VIII – Medieval, by E. M. Forster. The drawing-room curtains at Windy Corner had been pulled to meet, for the carpet was new and deserved protection from […]


technocracy

Can PPO learn through “Imagination” similar to Dreamer?

digitado ⋅ March 9, 2026

Hi everyone, I’ve been diving into the Dreamer paper recently, and I found the concept of learning a policy through “imagination” (within a latent world model) absolutely fascinating. This got me wondering: Can the PPO (Proximal Policy Optimization) algorithm also be trained through imagination? Specifically, instead of interacting with a real environment, could we plug PPO into a learned world model to update its policy? I’d love to hear your thoughts on the technical feasibility or if there are […]


technocracy

Single-Turn LLM Reformulation Powered Multi-Stage Hybrid Re-Ranking for Tip-of-the-Tongue Known-Item Retrieval

digitado ⋅ February 12, 2026

arXiv:2602.10321v1 Announce Type: new Abstract: Retrieving known items from vague descriptions, Tip-of-the-Tongue (ToT) retrieval, remains a significant challenge. We propose using a single call to a generic 8B-parameter LLM for query reformulation, bridging the gap between ill-formed ToT queries and specific information needs. This method is particularly effective where standard Pseudo-Relevance Feedback fails due to poor initial recall. Crucially, our LLM is not fine-tuned for ToT or specific domains, demonstrating that gains stem from our prompting strategy rather […]


technocracy

Cosm: Collective Switched Motion for Fast and Accurate Sparse Ising Optimization

digitado ⋅ June 1, 2026

arXiv:2605.30355v1 Announce Type: new Abstract: We introduce Collective Switched Motion (Cosm), a heuristic algorithm for solving sparse Ising-type optimization problems. Cosm combines locally interacting continuous circular variables with global coordination rules that facilitate collective dynamics. Pairwise interactions occur sequentially over a set of conflict-free edge partitions, resulting in an interaction network that switches periodically. Unlike conventional gradient-based approaches, Cosm enables structured, non-gradient dynamics that promote exploration beyond local minima. A correlated perturbation mechanism helps enable collective variable rotations. […]


technocracy

High-dimensional analysis of ridge regression for non-identically distributed data with a variance profile

digitado ⋅ February 13, 2026

arXiv:2403.20200v4 Announce Type: replace-cross Abstract: High-dimensional linear regression has been thoroughly studied in the context of independent and identically distributed data. We propose to investigate high-dimensional regression models for independent but non-identically distributed data. To this end, we suppose that the set of observed predictors (or features) is a random matrix with a variance profile and with dimensions growing at a proportional rate. Assuming a random effect model, we study the predictive risk of the ridge estimator for […]


technocracy

Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model

digitado ⋅ January 30, 2026

Although cross-domain few-shot learning (CDFSL) for hyper-spectral image (HSI) classification has attracted significant research interest, existing works often rely on an unrealistic data augmentation procedure in the form of external noise to enlarge the sample size, thus greatly simplifying the issue of data scarcity. They involve a large number of parameters for model updates, being prone to the overfitting problem. To the best of our knowledge, none has explored the strength of the foundation model, having strong generalization […]


technocracy

Want an oxygen-rich atmosphere? Stuff oxygen’s friends in the mantle.

digitado ⋅ May 26, 2026

Planet Earth has some pretty great qualities going for it. (Negative reviews mostly revolve around the staff and clientele.) Pretty high on the list of positives is a richly oxygenated atmosphere. But that’s something that evolved and built up over a couple billion years, only eventually resulting in a world conducive to animal life like us. Scientists have many ideas about what could have caused oxygen to increase, and it seems that a number of them are probably […]
