digitado

About digitado

https://www.digitado.com.br

Dynamics of Stochastic Momentum with Sparse Updates in High Dimensions

digitado ⋅ 29 May 2026

arXiv:2605.28961v1. Abstract: Existing theory of momentum assumes that gradients arrive at every parameter at a roughly constant rate, an assumption violated in practice by heavy-tailed data distributions and modern architectures. We theoretically analyze the dynamics of two tractable models of momentum under sparse updates: a least squares model with sparse inputs and a logistic regression model with a rare class. Both admit exact closed-form second-moment dynamics whose high-dimensional limits we characterize across three scaling exponents […]

technocracy

From Context Shift to Stylistic Collapse: Why Training Objectives Matter More Than Scale

digitado ⋅ 29 May 2026

arXiv:2605.28826v1. Abstract: In modern LLMs, linguistic features function not as stylistic artifacts but as probes of probability mass, allocated under training alignment objectives. Language models trained with contemporary pipelines exhibit severe reshaping of linguistic features, leading to extreme language re-distribution. While previous stylometric analyses explored linguistic differences between AI-generated and human texts, we focus on the reshaping plaguing the LLM training pipeline itself. We analyze 17 models (410M-100B+ parameters) across 24 linguistically-motivated probes, documenting that […]

MechELK: A Mechanistic Interpretability Framework for Eliciting Latent Knowledge in Large Language Models

digitado ⋅ 29 May 2026

arXiv:2605.28825v1. Abstract: Large language models (LLMs) frequently encode factual and reasoning knowledge in their internal representations that is not faithfully reflected in their surface-level outputs, a phenomenon known as latent knowledge. Existing approaches to eliciting latent knowledge, such as Contrastive Consistency Search (CCS), rely on contrastive activation patterns and struggle with complex multi-step reasoning tasks, while mechanistic interpretability tools have primarily been used to understand model behavior rather than to extract hidden knowledge. We […]

A Modular Architecture for Typologically Controlled Lexicon Generation

digitado ⋅ 29 May 2026

arXiv:2605.28824v1. Abstract: Constructing artificial lexicons that are pronounceable, typologically plausible, and semantically structured remains an open challenge in computational linguistics. Existing conlang generators either lack formal phonotactic guarantees or delegate generation to opaque, non-reproducible LLM-based pipelines. We propose a modular framework that samples phoneme inventories from PHOIBLE, generates word forms under interchangeable phonological grammars (deterministic, OT, and MaxEnt), and assigns meanings via a Swadesh–Leipzig–Jakarta ontology with explicit form–meaning alignment. Evaluation on character n-gram perplexity, log-likelihood, […]

What are They Thinking? Delineation, Probing and Tracking of Concepts in LLMs

digitado ⋅ 29 May 2026

arXiv:2605.28823v1. Abstract: As the influence of LLMs expands, it is imperative to gain insight into their decisions. One way to do that is to develop probes that detect the presence or absence of a broad set of concepts within the embeddings computed in an LLM – which is what we might say a model is “thinking” about. Such probes should be low-cost and easily applicable to any LLM, so that monitoring for many concepts is […]

Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment

digitado ⋅ 29 May 2026

arXiv:2605.28822v1. Abstract: Defect grading of power transmission equipment (DGPTE) is crucial to the stability of electric energy transmission. Although existing machine learning methods exhibit strong capabilities in defect detection, they struggle to integrate expert experience and to handle class imbalance in the more fine-grained task of defect grading. To address this issue, this paper introduces a novel defect grading framework based on a multimodal large language model (MLLM). Specifically, this approach maximizes the commercial MLLMs’ potential […]

datasette 1.0a31

digitado ⋅ 29 May 2026

Release: datasette 1.0a31. Another significant alpha release, with two new headline features. Datasette now offers users with the necessary permissions the ability both to execute write queries against their database and to save stored queries (renamed from “canned queries”), either privately or for use by other members of their Datasette instance. There’s more detail in “SQL write queries and stored queries in Datasette 1.0a31” on the Datasette blog, which now has three posts introducing new features since the […]

Strengthening societal resilience with Rosalind Biodefense

digitado ⋅ 29 May 2026

OpenAI launches Rosalind Biodefense, expanding trusted access to GPT-Rosalind for vetted developers and U.S. government partners advancing biodefense, public health, and pandemic preparedness through frontier AI.

The Trick Behind the AI Magic: Explain AI to Your Manager in Plain English

digitado ⋅ 29 May 2026

Maybe you already know what AI is. But then your manager, a friend, or your mom asks you to explain it, and suddenly it becomes harder than expected. This article is my attempt to explain AI in plain words — the way I would explain it to someone who does not care about tokens, weights, gradients, or architecture diagrams. So, let me try. We want to speed up our businesses with AI. We trust it to help with […]

308 Blog Posts To Learn About Founder Stories

digitado ⋅ 29 May 2026

Let’s learn about Founder Stories via these 308 free blog posts. They are ordered by HackerNoon reader engagement data. Visit the Learn Repo or LearnRepo.com to find the most read blog posts about any technology. Founders, very often, solve problems that we don’t know we have yet.

1. Building a Gaming Metaverse on 750 Acres of Land in Costa Rica: Alóki is based on an intricate relationship with the 750 acres of jungle in Costa Rica.
2. The […]
