digitado – Page 48

A Consistency-Centric Approach to Set-Based Optimization with Multiple Models of Unranked Fidelity

digitado ⋅ 7 de May de 2026

arXiv:2605.04051v1 Announce Type: new Abstract: In complex real-world settings, optimization is challenged by the presence of diverse models of differing fidelity. In many optimization problems, a single model is treated as the most accurate representation of the underlying system, while other models are evaluated primarily by their agreement with this presumed most accurate model. Yet in real-world applications, model accuracy is rarely known a priori and assuming a single most accurate model can be misleading. This paper addresses […]

Ver mais

Like 0

Liked Liked

technocracy

Your Sentence Has a Secret Structure. Here’s How GPT Sees It.

digitado ⋅ 3 de March de 2026

Author(s): Rohini Joshi Originally published on Towards AI. Image Generated by ChatGPT The sentence “dog bites man” and “man bites dog” contain the exact same words. A Transformer without positional encoding would treat them as identical. Here’s how modern LLMs learn word order and then decide which words actually matter. The previous article here, explained how embeddings convert words into numbers, vectors in a high-dimensional space where distance reflects meaning. But embeddings alone have a problem. They represent […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond ATE: Multi-Criteria Design for A/B Testing

digitado ⋅ 13 de February de 2026

arXiv:2509.05864v2 Announce Type: replace-cross Abstract: In the era of large-scale AI deployment and high-stakes clinical trials, adaptive experimentation faces a “trilemma” of conflicting objectives: minimizing cumulative regret (welfare loss during the experiment), maximizing the estimation accuracy of heterogeneous treatment effects (CATE), and ensuring differential privacy (DP) for participants. Existing literature typically optimizes these metrics in isolation or under restrictive parametric assumptions. In this work, we study the multi-objective design of adaptive experiments in a general non-parametric setting. First, […]

Ver mais

Like 0

Liked Liked

technocracy

Microsoft vows to cover full power costs for energy-hungry AI data centers

digitado ⋅ 13 de January de 2026

On Tuesday, Microsoft announced a new initiative called “Community-First AI Infrastructure” that commits the company to paying full electricity costs for its data centers and refusing to seek local property tax reductions. As demand for generative AI services has increased over the past year, Big Tech companies have been racing to spin up massive new data centers for serving chatbots and image generators that can have profound economic effects on the surrounding areas where they are located. Among […]

Ver mais

Like 0

Liked Liked

technocracy

Latent-Augmented Discrete Diffusion Models

digitado ⋅ 25 de February de 2026

arXiv:2510.18114v2 Announce Type: replace-cross Abstract: Discrete diffusion models have emerged as a powerful class of models and a promising route to fast language generation, but practical implementations typically rely on factored reverse transitions that ignore cross-token dependencies and degrade performance in the few-step regime. We propose Latent-Augmented Discrete Diffusion (LADD), which introduces a learnable auxiliary latent channel and performs diffusion over the joint (token, latent) space. The latent variables provide an intermediate representation that can express joint structure […]

Ver mais

Like 0

Liked Liked

technocracy

Hybrid Vision Transformer_GAN Attribute Neutralizer for Mitigating Bias in Chest X_Ray Diagnosis

digitado ⋅ 23 de January de 2026

arXiv:2601.15490v1 Announce Type: new Abstract: Bias in chest X-ray classifiers frequently stems from sex- and age-related shortcuts, leading to systematic underdiagnosis of minority subgroups. Previous pixel-space attribute neutralizers, which rely on convolutional encoders, lessen but do not fully remove this attribute leakage at clinically usable edit strengths. This study evaluates whether substituting the U-Net convolutional encoder with a Vision Transformer backbone in the Attribute-Neutral Framework can reduce demographic attribute leakage while preserving diagnostic accuracy. A data-efficient Image Transformer […]

Ver mais

Like 0

Liked Liked

technocracy

Dynamic Quantization Error Propagation in Encoder-Decoder ASR Quantization

digitado ⋅ 7 de January de 2026

arXiv:2601.02455v1 Announce Type: new Abstract: Running Automatic Speech Recognition (ASR) models on memory-constrained edge devices requires efficient compression. While layer-wise post-training quantization is effective, it suffers from error accumulation, especially in encoder-decoder architectures. Existing solutions like Quantization Error Propagation (QEP) are suboptimal for ASR due to the model’s heterogeneity, processing acoustic features in the encoder while generating text in the decoder. To address this, we propose Fine-grained Alpha for Dynamic Quantization Error Propagation (FADE), which adaptively controls the […]

Ver mais

Like 0

Liked Liked

technocracy

Continual uncertainty learning

digitado ⋅ 19 de February de 2026

Robust control of mechanical systems with multiple uncertainties remains a fundamental challenge, particularly when nonlinear dynamics and operating-condition variations are intricately intertwined. While deep reinforcement learning (DRL) combined with domain randomization has shown promise in mitigating the sim-to-real gap, simultaneously handling all sources of uncertainty often leads to sub-optimal policies and poor learning efficiency. This study formulates a new curriculum-based continual learning framework for robust control problems involving nonlinear dynamical systems in which multiple sources of uncertainty are […]

Ver mais

Like 0

Liked Liked

technocracy

Building Composable Safety and Performance Layers for Agents in Rust

digitado ⋅ 18 de March de 2026

As AI agents move from prototypes to production systems, a recurring challenge has emerged: how do you enforce safety, optimize performance, and handle sensitive data consistently across every inference call — without scattering that logic throughout your agent code? This is the problem we set out to solve with AutoAgents, an open-source AI agent framework written in Rust. Our latest feature, LLM Pipelines, introduces composable middleware layers for LLM inference — an approach inspired by how the web […]

Ver mais

Like 0

Liked Liked