technocracy

Ordered Local Momentum for Asynchronous Distributed Learning under Arbitrary Delays

digitado ⋅ 18 de January de 2026

Momentum SGD (MSGD) serves as a foundational optimizer in training deep models due to momentum’s key role in accelerating convergence and enhancing generalization. Meanwhile, asynchronous distributed learning is crucial for training large-scale deep models, especially when the computing capabilities of the workers in the cluster are heterogeneous. To reduce communication frequency, local updates are widely adopted in distributed learning. However, how to implement asynchronous distributed MSGD with local updates remains unexplored. To solve this problem, we propose a […]

Ver mais

Like 0

Liked Liked

technocracy

OpenAI introduces new ‘Trusted Contact’ safeguard for cases of possible self-harm

digitado ⋅ 7 de May de 2026

The company is expanding its efforts to protect ChatGPT users in cases where conversations may turn to self-harm.

Ver mais

Like 0

Liked Liked

technocracy

Optimism Stabilizes Thompson Sampling for Adaptive Inference

digitado ⋅ 6 de February de 2026

arXiv:2602.06014v1 Announce Type: cross Abstract: Thompson sampling (TS) is widely used for stochastic multi-armed bandits, yet its inferential properties under adaptive data collection are subtle. Classical asymptotic theory for sample means can fail because arm-specific sample sizes are random and coupled with the rewards through the action-selection rule. We study this phenomenon in the $K$-armed Gaussian bandit and identify emph{optimism} as a key mechanism for restoring emph{stability}, a sufficient condition for valid asymptotic inference requiring each arm’s pull […]

Ver mais

Like 0

Liked Liked

technocracy

JAWS: Enhancing Long-term Rollout of Neural Operators via Spatially-Adaptive Jacobian Regularization

digitado ⋅ 9 de March de 2026

arXiv:2603.05538v1 Announce Type: new Abstract: Data-driven surrogate models improve the efficiency of simulating continuous dynamical systems, yet their autoregressive rollouts are often limited by instability and spectral blow-up. While global regularization techniques can enforce contractive dynamics, they uniformly damp high-frequency features, introducing a contraction-dissipation dilemma. Furthermore, long-horizon trajectory optimization methods that explicitly correct drift are bottlenecked by memory constraints. In this work, we propose Jacobian-Adaptive Weighting for Stability (JAWS), a probabilistic regularization strategy designed to mitigate these limitations. […]

Ver mais

Like 0

Liked Liked

technocracy

Outliers Detection in PySpark #2 – Interquartile Range

digitado ⋅ 15 de July de 2019

In the first part, I talked about what Data Quality, Anomaly Detection and Outliers Detection are and what’s the difference between outliers detection and novelty detection. In this part, I will talk about a very known and easy method to detect outliers called Interquartile Range. Introduction The Interquartile Range method, also known as IQR, was developed by John Widler Turky, an American mathematician best known for development of the FFT algorithm and box plot.

Ver mais

Like 0

Liked Liked

technocracy

Modeling and Control for UAV with Off-center Slung Load

digitado ⋅ 8 de January de 2026

arXiv:2601.03386v1 Announce Type: new Abstract: Unmanned aerial vehicle (UAV) with slung load system is a classic air transportation system. In practical applications, the suspension point of the slung load does not always align with the center of mass (CoM) of the UAV due to mission requirements or mechanical interference. This offset creates coupling in the system’s nonlinear dynamics which leads to a complicated motion control problem. In existing research, modeling of the system are performed about the UAV’s […]

Ver mais

Like 0

Liked Liked

technocracy

Build a Custom Lead Enrichment Layer to Find Signal in Noise

digitado ⋅ 30 de January de 2026

Photo by DS stories: https://www.pexels.com/photo/photo-of-a-red-meeple-near-blue-meeple-pieces-6990039/ A Technical Guide to Building Custom Signal Detection That Standard Tools Can’t Provide About This Deep-Dive This is a technical walkthrough on custom lead enrichment — a challenge I’ve seen sales teams face when standard tools don’t track their specific buying signals. This is one approach that works for teams with clear signal-to-pipeline correlation data. It’s not the only approach, and it’s not right for everyone. I’ll show you when it makes sense and when it doesn’t. This […]

Ver mais

Like 0

Liked Liked

technocracy

OpenAI Codex system prompt includes explicit directive to “never talk about goblins”

digitado ⋅ 29 de April de 2026

The system prompt for OpenAI’s Codex CLI contains a perplexing and repeated warning for the most recent GPT model to “never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query.” The explicit operational warning was made public last week as part of the latest open source code for Codex CLI that OpenAI posted on GitHub. The prohibition is repeated twice in a 3,500-plus […]

Ver mais

Like 0

Liked Liked

technocracy

New Amazon Nova image- and video-generating models

digitado ⋅ 4 de December de 2024

New Amazon Nova image- and video-generating models Amazon Nova Canvas and Amazon Nova Reel use diffusion transformers to deliver studio-quality visual content. Computer vision Ying Wang Xiaohan Fei December 04, 09:15 AM December 05, 02:00 PM Yesterday at Amazon Web Services annual re:Invent conference, Amazon CEO Andy Jassy announced Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. The Amazon Nova models include understanding models in three different sizes for […]

Ver mais

Like 0

Liked Liked

technocracy

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

digitado ⋅ 27 de March de 2026

arXiv:2603.24750v1 Announce Type: new Abstract: Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective […]

Ver mais

Like 0

Liked Liked