Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots
arXiv:2603.23682v1 Announce Type: new

Abstract: The rapid adoption of large language models (LLMs) in education raises profound challenges for assessment design. To adapt assessments to the presence of LLM-based tools, it is crucial to characterize the strengths and weaknesses of LLMs in a generalizable, valid, and reliable manner. However, current LLM evaluations often rely on descriptive statistics derived from benchmarks, and little research applies theory-grounded measurement methods to characterize LLM capabilities relative to human learners in ways that […]