digitado – Page 25

So yeah, I vibe-coded a log colorizer—and I feel good about it

digitado ⋅ 4 February 2026

I can’t code. I know, I know—these days, that sounds like an excuse. Anyone can code, right?! Grab some tutorials, maybe an O’Reilly book, download an example project, and jump in. It’s just a matter of learning how to break your project into small steps that you can make the computer do, then memorizing a bit of syntax. Nothing about that is hard! Perhaps you can sense my sarcasm (and sympathize with my lack of time to learn […]

technocracy

Uncertainty-Aware PCA for Arbitrarily Distributed Data Modeled by Gaussian Mixture Models

digitado ⋅ 15 January 2026

arXiv:2508.13990v2. Announce type: replace. Abstract: Multidimensional data is often associated with uncertainties that are not well-described by normal distributions. In this work, we describe how such distributions can be projected to a low-dimensional space using uncertainty-aware principal component analysis (UAPCA). We propose to model multidimensional distributions using Gaussian mixture models (GMMs) and derive the projection from a general formulation that allows projecting arbitrary probability density functions. The low-dimensional projections of the densities exhibit more details about the distributions […]

technocracy

Multimodal Deep Learning for Early Prediction of Patient Deterioration in the ICU: Integrating Time-Series EHR Data with Clinical Notes

digitado ⋅ 16 March 2026

Early identification of patients at risk for clinical deterioration in the intensive care unit (ICU) remains a critical challenge. Delayed recognition of impending adverse events, including mortality, vasopressor initiation, and mechanical ventilation, contributes to preventable morbidity and mortality. We present a multimodal deep learning approach that combines structured time-series data (vital signs and laboratory values) with unstructured clinical notes to predict patient deterioration within 24 hours. Using the MIMIC-IV database, we constructed a cohort of 74,822 ICU stays […]

technocracy

No More Guessing: a Verifiable Gradient Inversion Attack in Federated Learning

digitado ⋅ 16 April 2026

Gradient inversion attacks threaten client privacy in federated learning by reconstructing training samples from clients’ shared gradients. Gradients aggregate contributions from multiple records and existing attacks may fail to disentangle them, yielding incorrect reconstructions with no intrinsic way to certify success. In vision and language, attackers may fall back on human inspection to judge reconstruction plausibility, but this is far less feasible for numerical tabular records, fueling the impression that tabular data is less vulnerable. We challenge this […]

technocracy

GSM8K-Platinum: Revealing Performance Gaps in Frontier LLMs

digitado ⋅ 6 March 2025

Recently, we introduced Platinum Benchmarks as a step toward quantifying the reliability of large language models (LLMs). In that work, we revised older benchmarks to minimize label noise, such as ambiguous or mislabeled examples, and showed that frontier LLMs still make genuine errors on simple questions. For example, as part of that work we revised a 300-problem subset of GSM8K, a dataset of grade school math word problems, and found that all […]

technocracy

Feedback Does Not Increase the Capacity of Approximately Memoryless Surjective POST Channels

digitado ⋅ 11 March 2026

arXiv:2603.08886v1. Announce type: new. Abstract: We study a class of finite-state channels, known as POST channels, in which the previous channel output serves as the current state. A POST channel is deemed approximately memoryless when the state-dependent transition matrices are sufficiently close to one another. For this family of channels, under a surjectivity condition on the associated memoryless reference channel, we show that the feedback capacity coincides with the non-feedback capacity. Consequently, for almost all approximately memoryless POST […]

technocracy

Monotone Optimisation with Learned Projections

digitado ⋅ 28 January 2026

Monotone optimisation problems admit specialised global solvers such as the Polyblock Outer Approximation (POA) algorithm, but these methods typically require explicit objective and constraint functions. In many applications, these functions are only available through data, making POA difficult to apply directly. We introduce an algorithm-aware learning approach that integrates learned models into POA by directly predicting its projection primitive via the radial inverse, avoiding the costly bisection procedure used in standard POA. We propose Homogeneous-Monotone Radial Inverse (HM-RI) […]

technocracy

Rare Earths Are Rare in the US Only Because We Choose To Export Environmental Challenges

digitado ⋅ 8 December 2025

As many commentators have said, rare earths are not particularly rare. Via source, here is an estimate of their abundance in the Earth’s surface: [figure not reproduced]. Note, by the way, that the Y-axis is logarithmic, so small changes in vertical position can mean a factor of 10 or more difference in concentration. But the rare earths are not unreasonably far off fairly common industrial metals like lead, nickel, copper, and molybdenum, and are well more common than gold, silver, and […]

technocracy

LOLGORITHM: Funny Comment Generation Agent For Short Videos

digitado ⋅ 15 April 2026

arXiv:2604.09729v2. Announce type: new. Abstract: Short-form video platforms have become central to multimedia information dissemination, where comments play a critical role in driving engagement, propagation, and algorithmic feedback. However, existing approaches, including video summarization and live-streaming danmaku generation, fail to produce authentic comments that conform to platform-specific cultural and linguistic norms. In this paper, we present LOLGORITHM, a novel modular multi-agent framework for stylized short-form video comment generation. LOLGORITHM supports six controllable comment styles and comprises […]

technocracy

When Can LLMs Learn to Reason with Weak Supervision?

digitado ⋅ 20 April 2026

Large language models have achieved significant reasoning improvements through reinforcement learning with verifiable rewards (RLVR). Yet as model capabilities grow, constructing high-quality reward signals becomes increasingly difficult, making it essential to understand when RLVR can succeed under weaker forms of supervision. We conduct a systematic empirical study across diverse model families and reasoning domains under three weak supervision settings: scarce data, noisy rewards, and self-supervised proxy rewards. We find that generalization is governed by training reward saturation dynamics: […]
