Know When You’re Wrong: Aligning Confidence with Correctness for LLM Error Detection
arXiv:2603.06604v1 Announce Type: new

Abstract: As large language models (LLMs) are increasingly deployed in critical decision-making systems, the lack of reliable methods to measure their uncertainty poses a fundamental trustworthiness risk. We introduce a normalized confidence score based on the probabilities of output anchor tokens: classification labels for structured tasks, and Yes/No self-evaluation responses for open-ended generation. This enables direct detection of errors and hallucinations with minimal overhead and without external validation. We make three key contributions. First, we propose […]
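The abstract does not specify the exact normalization, so the following is a minimal Python sketch of the idea under one plausible reading: take the model's log-probabilities at the answer position for a small set of anchor tokens (the classification labels, or "Yes"/"No" for self-evaluation) and renormalize them to sum to 1. The function name normalized_confidence, the 0.7 threshold, and the example log-probability values are illustrative assumptions, not details from the paper.

    import math

    def normalized_confidence(token_logprobs: dict[str, float],
                              anchor_tokens: list[str]) -> dict[str, float]:
        # Renormalize the model's probability mass over the anchor tokens
        # only, so the scores sum to 1 regardless of how much probability
        # the model assigned to unrelated tokens at that position.
        # (Assumed normalization; the paper may define it differently.)
        probs = {t: math.exp(token_logprobs[t]) for t in anchor_tokens}
        total = sum(probs.values())
        return {t: p / total for t, p in probs.items()}

    # Hypothetical Yes/No self-evaluation of an open-ended answer: flag
    # the answer as a likely error or hallucination if the normalized
    # "Yes" (i.e. "answer is correct") score falls below a threshold.
    scores = normalized_confidence({"Yes": -0.36, "No": -2.10}, ["Yes", "No"])
    print(scores)                      # roughly {'Yes': 0.85, 'No': 0.15}
    is_suspect = scores["Yes"] < 0.7   # threshold chosen for illustration

On this reading, the "minimal overhead, without external validation" claim follows directly: the score needs only the token log-probabilities that most LLM APIs already return, with no extra model calls or reference answers beyond the self-evaluation prompt itself.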