ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning
Deep generative models can mitigate data scarcity and privacy concerns by producing synthetic training data, but in low-data, imbalanced tabular settings they struggle to fully learn the complex data distribution. We argue that striving for the full joint distribution may be overkill; for greater data efficiency, models should prioritize learning the conditional distribution $P(y \mid \bm{X})$, as suggested by recent theoretical analysis. We therefore address this limitation with \textbf{ReTabSyn}, a \textbf{Re}inforced \textbf{Tab}ular \textbf{Syn}thesis pipeline that provides direct feedback on […]