digitado – Page 290

Enhancing sample efficiency in reinforcement-learning-based flow control: replacing the critic with an adaptive reduced-order model

digitado ⋅ 5 de April de 2026

Model-free deep reinforcement learning (DRL) methods suffer from poor sample efficiency. To overcome this limitation, this work introduces an adaptive reduced-order-model (ROM)-based reinforcement learning framework for active flow control. In contrast to conventional actor–critic architectures, the proposed approach leverages a ROM to estimate the gradient information required for controller optimization. The design of the ROM structure incorporates physical insights. The ROM integrates a linear dynamical system and a neural ordinary differential equation (NODE) for estimating the nonlinearity in […]

Ver mais

Like 0

Liked Liked

technocracy

AI-Driven Clinical Decision Support System for Enhanced Diabetes Diagnosis and Management

digitado ⋅ 13 de February de 2026

arXiv:2602.11237v1 Announce Type: new Abstract: Identifying type 2 diabetes mellitus can be challenging, particularly for primary care physicians. Clinical decision support systems incorporating artificial intelligence (AI-CDSS) can assist medical professionals in diagnosing type 2 diabetes with high accuracy. This study aimed to assess an AI-CDSS specifically developed for the diagnosis of type 2 diabetes by employing a hybrid approach that integrates expert-driven insights with machine learning techniques. The AI-CDSS was developed (training dataset: n = 650) and tested […]

Ver mais

Like 0

Liked Liked

technocracy

Meet Bruce, the “beak-jousting” parrot

digitado ⋅ 20 de April de 2026

Bruce the kea—a species of alpine parrot native to New Zealand—lost his upper beak in an accident as a young bird. But that hasn’t stopped him from becoming the dominant male in his kea community (known as a “circus”) at the Willowbank Wildlife Reserve. According to a new paper published in the journal Current Biology, Bruce achieved his alpha status via a unique fighting method, essentially “jousting” with what remains of his beak. Researchers already knew Bruce was […]

Ver mais

Like 0

Liked Liked

technocracy

Accurate Residues for Floating-Point Debugging

digitado ⋅ 9 de April de 2026

arXiv:2604.06258v1 Announce Type: new Abstract: Floating-point arithmetic is error-prone and unintuitive. Floating-point debuggers instrument programs to monitor floating-point arithmetic at run time and flag numerical issues. They estimate residues, i.e., the difference between actual floating-point and ideal real values, for every floating-point value in the program. Prior work explores various approaches for computing these residues accurately and efficiently. Unfortunately, the most efficient methods, based on “error-free transformations”, have a high rate of false reports, while the most accurate […]

Ver mais

Like 0

Liked Liked

technocracy

IoT-Dynamic Indoor Localization Leveraging Transfer Learning Techniques

digitado ⋅ 15 de April de 2026

With the rapid growth of location-based services (LBS) in the Internet of Things (IoT), fingerprint-based indoor localization has attracted attention for its high accuracy. However, environmental changes degrade signal stability, and traditional methods require frequent site surveys, leading to high labor and time costs. In response, we propose an adaptive Bluetooth Low Energy (BLE) indoor localization system that relies on an updated fingerprint to address these issues. We integrate the Domain Adaptation Localization (DALoc) method into the system. […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

Machine learning based radiative parameterization scheme and its performance in operational reforecast experiments

digitado ⋅ 20 de January de 2026

Radiation is typically the most time-consuming physical process in numerical models. One solution is to use machine learning methods to simulate the radiation process to improve computational efficiency. From an operational standpoint, this study investigates critical limitations inherent to hybrid forecasting frameworks that embed deep neural networks into numerical prediction models, with a specific focus on two fundamental bottlenecks: coupling compatibility and long-term integration stability. A residual convolutional neural network is employed to approximate the Rapid Radiative Transfer […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Equivariant Neural-Augmented Object Dynamics From Few Interactions

digitado ⋅ 4 de May de 2026

Learning data-efficient object dynamics models for robotic manipulation remains challenging, especially for deformable objects. A popular approach is to model objects as sets of 3D particles and learn their motion using graph neural networks. In practice, this is not enough to maintain physical feasibility over long horizons and may require large amounts of interaction data to learn. We introduce PIEGraph, a novel approach to combining analytical physics and data-driven models to capture object dynamics for both rigid and […]

Ver mais

Like 0

Liked Liked

technocracy

A variational approach to dimension-free self-normalized concentration

digitado ⋅ 6 de February de 2026

arXiv:2508.06483v2 Announce Type: replace-cross Abstract: We study the self-normalized concentration of vector-valued stochastic processes. We focus on bounds for “sub-$psi$” processes, a well-known and quite general class of process that encompasses a wide variety of well-known tail conditions (including sub-exponential, sub-Gaussian, sub-gamma, sub-Poisson, and several heavy-tailed settings without a moment generating function such as symmetric or bounded 2nd or 3rd moments). Our results recover and generalize the influential bound of de la Pe~na et al. [20] (proved again […]

Ver mais

Like 0

Liked Liked

technocracy

CoMI-IRL: Contrastive Multi-Intention Inverse Reinforcement Learning

digitado ⋅ 7 de February de 2026

Inverse Reinforcement Learning (IRL) seeks to infer reward functions from expert demonstrations. When demonstrations originate from multiple experts with different intentions, the problem is known as Multi-Intention IRL (MI-IRL). Recent deep generative MI-IRL approaches couple behavior clustering and reward learning, but typically require prior knowledge of the number of true behavioral modes $K^*$. This reliance on expert knowledge limits their adaptability to new behaviors, and only enables analysis related to the learned rewards, and not across the behavior […]

Ver mais

Like 0

Liked Liked