digitado – Page 348

Anchored Decoding: Provably Reducing Copyright Risk for Any Language Model

digitado ⋅ 10 de February de 2026

arXiv:2602.07120v1 Announce Type: new Abstract: Modern language models (LMs) tend to memorize portions of their training data and emit verbatim spans. When the underlying sources are sensitive or copyright-protected, such reproduction raises issues of consent and compensation for creators and compliance risks for developers. We propose Anchored Decoding, a plug-and-play inference-time method for suppressing verbatim copying: it enables decoding from any risky LM trained on mixed-license data by keeping generation in bounded proximity to a permissively trained safe […]

Ver mais

Like 0

Liked Liked

technocracy

Domain-Skewed Federated Learning with Feature Decoupling and Calibration

digitado ⋅ 15 de March de 2026

Federated learning (FL) allows distributed clients to collaboratively train a global model in a privacy-preserving manner. However, one major challenge is domain skew, where clients’ data originating from diverse domains may hinder the aggregated global model from learning a consistent representation space, resulting in poor generalizable ability in multiple domains. In this paper, we argue that the domain skew is reflected in the domain-specific biased features of each client, causing the local model’s representations to collapse into a […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond Augmented-Action Surrogates for Multi-Expert Learning-to-Defer

digitado ⋅ 10 de April de 2026

Learning-to-Defer routes each input to the expert that minimizes expected cost, but it assumes that the information available to every expert is fixed at decision time. Many modern systems violate this assumption: after selecting an expert, one may also choose what additional information that expert should receive, such as retrieved documents, tool outputs, or escalation context. We study this problem and call it Learning-to-Defer with advice. We show that a broad family of natural separated surrogates, which learn […]

Ver mais

Like 0

Liked Liked

technocracy

Get Started with Rust: Generics

digitado ⋅ 3 de January de 2023

Get Started with Rust: Generics Generic programming allows programmers to write general algorithms that work with arbitrary types. It reduces code duplication and provides type safety, which enables us to write more concise and clean code. This approach is also known as parametric polymorphism or templates in other languages. Rust supports two types of generic code: Compile-time generics, similar to C++ templates. Run-time generics, similar to virtual functions in C++ and generics in Java. Serokell has a broad […]

Ver mais

Like 0

Liked Liked

technocracy

VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models

digitado ⋅ 22 de January de 2026

arXiv:2601.14354v1 Announce Type: new Abstract: Joint Embedding Predictive Architectures (JEPA) offer a scalable paradigm for self-supervised learning by predicting latent representations rather than reconstructing high-entropy observations. However, existing formulations rely on textit{deterministic} regression objectives, which mask probabilistic semantics and limit its applicability in stochastic control. In this work, we introduce emph{Variational JEPA (VJEPA)}, a textit{probabilistic} generalization that learns a predictive distribution over future latent states via a variational objective. We show that VJEPA unifies representation learning with Predictive […]

Ver mais

Like 0

Liked Liked

technocracy

Research taste is a skill nobody talks about. How do you develop it without collaborators? [D]

digitado ⋅ 24 de April de 2026

if you’ve ever built an elegant, complex ML pipeline to solve something a 10-line prompt could’ve handled… this is for you. i’ve been thinking about what separates people who do useful research from people who do impressive-looking research. it’s almost always the problems you choose rather than raw technical skill. here’s the mental model i’ve landed on. every problem kind of follows these steps: find a clear problem people actually care about try the dumbest solution first. can […]

Ver mais

Like 0

Liked Liked

technocracy

Obscuring P2P nodes with Dandelion

digitado ⋅ 8 de December de 2025

The weakest link in the privacy of cryptocurrency transactions is often outside the blockchain. There are technologies such as stealth addresses and subaddresses to try to thwart attempts to link transactions to individuals. They do a good job of anonymizing transaction data, but the weak link may be metadata, as is often the case. Cryptocurrency nodes circulate transaction data using a peer-to-peer network. An entity running multiple nodes can compare when data arrived at each of its nodes […]

Ver mais

Like 0

Liked Liked

technocracy

ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation

digitado ⋅ 9 de January de 2026

arXiv:2601.04327v1 Announce Type: new Abstract: Parallel programming is central to HPC and AI, but producing code that is correct and fast remains challenging, especially for OpenMP GPU offload, where data movement and tuning dominate. Autonomous coding agents can compile, test, and profile on target hardware, but outputs are brittle without domain scaffolding. We present ParaCodex, an HPC-engineer workflow that turns a Codex-based agent into an autonomous OpenMP GPU offload system using staged hotspot analysis, explicit data planning, correctness […]

Ver mais

Like 0

Liked Liked

technocracy

Quoting Martin Fowler

digitado ⋅ 18 de February de 2026

LLMs are eating specialty skills. There will be less use of specialist front-end and back-end developers as the LLM-driving skills become more important than the details of platform usage. Will this lead to a greater recognition of the role of Expert Generalists? Or will the ability of LLMs to write lots of code mean they code around the silos rather than eliminating them? — Martin Fowler, tidbits from the Thoughtworks Future of Software Development Retreat, via HN) Tags: […]

Ver mais

Like 0

Liked Liked

technocracy

Which Crypto Could Be The Better Investment Now: MUTM or DOGE?

digitado ⋅ 2 de January de 2026

The crypto market is no longer driven purely by hype. As investors look ahead to 2026, the focus has shifted toward timing, utility, and long-term sustainability. In this article, we take a closer look at Dogecoin (DOGE) and Mutuum Finance (MUTM)—two very different cryptocurrencies at very different stages—to assess which may offer the stronger investment opportunity in today’s market environment. Dogecoin (DOGE) Dogecoin remains one of the most recognizable cryptocurrencies in the market, largely due to its meme-driven […]

Ver mais

Like 0

Liked Liked