digitado – Page 72

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

digitado ⋅ 2 de February de 2026

Progressive Learning (PL) reduces pre-training computational overhead by gradually increasing model scale. While prior work has extensively explored depth expansion, width expansion remains significantly understudied, with the few existing methods limited to the early stages of training. However, expanding width during the mid-stage is essential for maximizing computational savings, yet it remains a formidable challenge due to severe training instabilities. Empirically, we show that naive initialization at this stage disrupts activation statistics, triggering loss spikes, while copy-based initialization […]

Ver mais

Like 0

Liked Liked

technocracy

A Survey of Reinforcement Learning For Economics

digitado ⋅ 9 de March de 2026

This survey (re)introduces reinforcement learning methods to economists. The curse of dimensionality limits how far exact dynamic programming can be effectively applied, forcing us to rely on suitably "small" problems or our ability to convert "big" problems into smaller ones. While this reduction has been sufficient for many classical applications, a growing class of economic models resists such reduction. Reinforcement learning algorithms offer a natural, sample-based extension of dynamic programming, extending tractability to problems with high-dimensional states, continuous […]

Ver mais

Like 0

Liked Liked

technocracy

xjb: Fast Float to String Algorithm

digitado ⋅ 1 de April de 2026

Efficiently and accurately converting floating-point numbers to decimal strings is a critical challenge in numerical computation and data exchange. While existing algorithms like Ryu, Dragonbox, and Schubfach satisfy the Steele-White (SW) principle for accuracy, they often suffer from performance bottlenecks due to branch prediction failures and high-precision multiplication overhead. This paper presents a novel floating-point to string conversion algorithm called “xjb”, an optimized variant of the Schubfach algorithm designed to deliver superior performance for IEEE754 single-precision (binary32) and […]

Ver mais

Like 0

Liked Liked

technocracy

BIRD: A Museum Open Dataset Combining Behavior Patterns and Identity Types to Better Model Visitors’ Experience

digitado ⋅ 13 de February de 2026

arXiv:2602.11160v1 Announce Type: new Abstract: Lack of data is a recurring problem in Artificial Intelligence, as it is essential for training and validating models. This is particularly true in the field of cultural heritage, where the number of open datasets is relatively limited and where the data collected does not always allow for holistic modeling of visitors’ experience due to the fact that data are ad hoc (i.e. restricted to the sole characteristics required for the evaluation of […]

Ver mais

Like 0

Liked Liked

technocracy

Non-Intrusive Multimodal AI Framework for Detecting Hazardous Misdeclared Cargo in Maritime Containers

digitado ⋅ 5 de January de 2026

Misdeclared hazardous cargo inside sealed maritime containers continues to drive ship fires, port disruptions, and avoidable losses for crews, carriers, and coastal communities. Ports and carriers already use non-intrusive inspection (NII) imaging and document checks, but these systems are often treated as separate queues rather than a single integrated decision system. This paper proposes a practical multimodal framework that fuses radiographic sensing with shipping document intelligence to flag hazardous misdeclaration risks without opening a container. The approach combines […]

Ver mais

Like 0

Liked Liked

technocracy

Utilizing Class Separation Distance for the Evaluation of Corruption Robustness of Machine Learning Classifiers

digitado ⋅ 19 de January de 2026

arXiv:2206.13405v2 Announce Type: replace-cross Abstract: Robustness is a fundamental pillar of Machine Learning (ML) classifiers, substantially determining their reliability. Methods for assessing classifier robustness are therefore essential. In this work, we address the challenge of evaluating corruption robustness in a way that allows comparability and interpretability on a given dataset. We propose a test data augmentation method that uses a robustness distance $epsilon$ derived from the datasets minimal class separation distance. The resulting MSCR (minimal separation corruption robustness) […]

Ver mais

Like 0

Liked Liked

technocracy

Biology-based brain model matches animals in learning, enables new discovery

digitado ⋅ 23 de January de 2026

A new computational model of the brain based closely on its biology and physiology not only learned a simple visual category learning task exactly as well as lab animals, but even enabled the discovery of counterintuitive activity by a group of neurons that researchers working with animals to perform the same task had not noticed in their data before, says a team of scientists at Dartmouth College, MIT, and the State University of New York at Stony Brook. […]

Ver mais

Like 0

Liked Liked

technocracy

Japan lost a 5-ton navigation satellite when it fell off a rocket during launch

digitado ⋅ 28 de January de 2026

If you’re in the space business long enough, you learn there are numerous ways a rocket can fail. I’ve written my share of stories about misbehaving rockets and the extensive investigations that usually—but not always—reveal what went wrong. But I never expected to write this story. Maybe this was a failure of my own imagination. I’m used to writing about engine malfunctions, staging issues, guidance glitches, or structural failures. Last April, Ars reported on the bizarre failure of […]

Ver mais

Like 0

Liked Liked

technocracy

Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

digitado ⋅ 11 de March de 2026

Machine learning models can represent climate processes that are nonlocal in horizontal space, height, and time, often by combining information across these dimensions in highly nonlinear ways. While this can improve predictive skill, it makes learned relationships difficult to interpret and prone to overfitting as the extent of nonlocal information grows. We address this challenge by introducing data-driven integration kernels, a framework that adds structure to nonlocal operator learning by explicitly separating nonlocal information aggregation from local nonlinear […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Probabilities of Causation with Mask-Augmented Data

digitado ⋅ 11 de February de 2026

arXiv:2505.17133v2 Announce Type: replace Abstract: Probabilities of causation play a central role in modern decision making. Tian and Pearl first introduced formal definitions and derived tight bounds for three binary probabilities of causation, such as the probability of necessity and sufficiency (PNS). However, estimating these probabilities requires both experimental and observational distributions specific to each subpopulation, which are often unreliable or impractical to obtain from limited population-level data. To solve this problem, we propose two machine learning models: […]

Ver mais

Like 0

Liked Liked