February 2026

Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows

digitado ⋅ February 11, 2026

Hierarchical goal-conditioned reinforcement learning (H-GCRL) provides a powerful framework for tackling complex, long-horizon tasks by decomposing them into structured subgoals. However, its practical adoption is hindered by poor data efficiency and limited policy expressivity, especially in offline or data-scarce regimes. This work introduces normalizing flow-based hierarchical implicit Q-learning (NF-HIQL), a novel framework that replaces unimodal Gaussian policies with expressive normalizing flow policies at both the high and low levels of the hierarchy. This design enables tractable log-likelihood […]
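To make the "tractable log-likelihood" point concrete, here is a minimal sketch of the change-of-variables rule that normalizing flows rely on. It is not the NF-HIQL architecture: it assumes a single one-dimensional affine layer, and the names `affine_flow_logprob`, `shift`, and `log_scale` are hypothetical. One affine layer is itself just a Gaussian; practical flows stack several non-linear invertible layers, but the log-density is computed by the same rule.

```python
import math


def standard_normal_logpdf(z: float) -> float:
    """Log-density of the base distribution N(0, 1)."""
    return -0.5 * (z * z + math.log(2.0 * math.pi))


def affine_flow_logprob(x: float, shift: float, log_scale: float) -> float:
    """Exact log-density of x under the flow x = shift + exp(log_scale) * z.

    Change of variables: log p_x(x) = log p_z(z) + log|dz/dx|,
    and for this layer log|dz/dx| = -log_scale.
    (Illustrative single layer, not the paper's model.)
    """
    z = (x - shift) * math.exp(-log_scale)  # invert the flow
    return standard_normal_logpdf(z) - log_scale


def gaussian_logpdf(x: float, mu: float, sigma: float) -> float:
    """Closed-form Gaussian log-density, used only as a cross-check."""
    return (-0.5 * ((x - mu) / sigma) ** 2
            - math.log(sigma)
            - 0.5 * math.log(2.0 * math.pi))
```

Because the inverse and the Jacobian term are both available in closed form, the log-density is exact rather than approximated, which is the property the excerpt refers to.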

technocracy

Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards

digitado ⋅ February 11, 2026

Reinforcement learning with verifiable rewards has driven recent advances in LLM post-training, in particular for reasoning. Policy optimization algorithms generate a number of responses for a given prompt and then effectively weight the corresponding gradients depending on the rewards. The most popular algorithms, including GRPO, DAPO, and RLOO, focus on ambiguous prompts, i.e., prompts with intermediate success probability, while down-weighting gradients from very easy and very hard prompts. In this paper, we consider asymmetric prompt weightings that assign […]
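The focus on ambiguous prompts can be seen in a small sketch, assuming binary 0/1 rewards and a group of sampled responses per prompt. The function names are hypothetical, and the use of the population standard deviation and `eps` in the GRPO-style variant is an assumption, since implementations differ. This illustrates the baseline weighting only, not the asymmetric scheme the paper proposes.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """GRPO-style advantages: center by the group mean, scale by the group std.

    Assumption: population std plus a small eps to avoid division by zero.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def leave_one_out_advantages(rewards):
    """RLOO-style advantages: reward minus the mean reward of the other samples."""
    n = len(rewards)
    total = sum(rewards)
    return [r - (total - r) / (n - 1) for r in rewards]


def total_weight(advantages):
    """Sum of |advantage|, a rough proxy for how much a prompt moves the policy."""
    return sum(abs(a) for a in advantages)


easy = [1, 1, 1, 1, 1, 1, 1, 1]       # always solved
ambiguous = [1, 0, 1, 0, 1, 0, 1, 0]  # solved half the time
hard = [0, 0, 0, 0, 0, 0, 0, 0]       # never solved

for name, rewards in [("easy", easy), ("ambiguous", ambiguous), ("hard", hard)]:
    print(name,
          total_weight(group_relative_advantages(rewards)),
          total_weight(leave_one_out_advantages(rewards)))
```

When every response in a group receives the same reward, both variants yield all-zero advantages, so prompts that are always or never solved contribute no gradient, while the mixed group does.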

technocracy

Learning to Compose for Cross-domain Agentic Workflow Generation

digitado ⋅ February 11, 2026

Automatically generating agentic workflows (executable operator graphs or code that orchestrate reasoning, verification, and repair) has become a practical way to solve complex tasks beyond what single-pass LLM generation can reliably handle. Yet what constitutes a good workflow depends heavily on the task distribution and the available operators. Under domain shift, current systems typically rely on iterative workflow refinement to discover a feasible workflow from a large workflow space, incurring high iteration costs and yielding unstable, […]

technocracy

Smart home PSA: Apple’s “new architecture” for Home app becomes mandatory today

digitado ⋅ February 11, 2026

In 2022, Apple announced it was adopting a “new Home architecture” for its smart home ecosystem to improve its performance and reliability and make it possible to support different kinds of accessories. Although it was mostly an invisible update when it worked properly, some users who attempted to switch to the new architecture when it first rolled out in iOS 16.2 ran into slow or unresponsive devices and other problems, prompting Apple to pause the rollout and re-release […]

technocracy

Statistical Learning Analysis of Physics-Informed Neural Networks

digitado ⋅ February 11, 2026

We study the training and performance of physics-informed learning for initial and boundary value problems (IBVPs) with physics-informed neural networks (PINNs) from a statistical learning perspective. Specifically, we restrict ourselves to parameterizations with hard initial and boundary condition constraints and reformulate the problem of estimating PINN parameters as a statistical learning problem. From this perspective, the physics penalty on the IBVP residuals can be better understood not as a regularizing term but as an infinite source of indirect […]

technocracy

MerLin: A Discovery Engine for Photonic and Hybrid Quantum Machine Learning

digitado ⋅ February 11, 2026

Identifying where quantum models may offer practical benefits in near-term quantum machine learning (QML) requires moving beyond isolated algorithmic proposals toward systematic and empirical exploration across models, datasets, and hardware constraints. We introduce MerLin, an open-source framework designed as a discovery engine for photonic and hybrid quantum machine learning. MerLin integrates optimized strong simulation of linear optical circuits into standard PyTorch and scikit-learn workflows, enabling end-to-end differentiable training of quantum layers. MerLin is […]

technocracy

Direct Learning of Calibration-Aware Uncertainty for Neural PDE Surrogates

digitado ⋅ February 11, 2026

Neural PDE surrogates are often deployed in data-limited or partially observed regimes where downstream decisions depend on calibrated uncertainty in addition to low prediction error. Existing approaches obtain uncertainty through ensemble replication, fixed stochastic noise such as dropout, or post hoc calibration. Cross-regularized uncertainty learns uncertainty parameters during training using gradients routed through a held-out regularization split. The predictor is optimized on the training split for fit, while low-dimensional uncertainty controls are optimized on the regularization split to […]

technocracy

cysqlite – a new sqlite driver

digitado ⋅ February 11, 2026

Charles Leifer has been maintaining pysqlite3 – a fork of the Python standard library’s sqlite3 module that makes it much easier to run upgraded SQLite versions – since 2018. He’s been working on a ground-up Cython rewrite called cysqlite for almost as long, but it’s finally at a stage where it’s ready for people to try out. The biggest change from the sqlite3 module involves transactions. Charles explains his discomfort with the […]

technocracy

Motion Capture is Not the Target Domain: Scaling Synthetic Data for Learning Motion Representations

digitado ⋅ February 11, 2026

Synthetic data offers a compelling path to scalable pretraining when real-world data is scarce, but models pretrained on synthetic data often fail to transfer reliably to deployment settings. We study this problem in full-body human motion, where large-scale data collection is infeasible but essential for wearable-based Human Activity Recognition (HAR), and where synthetic motion can be generated from motion-capture-derived representations. We pretrain motion time-series models using such synthetic data and evaluate their transfer across diverse downstream HAR tasks. […]

technocracy

Toward Reliable Tea Leaf Disease Diagnosis Using Deep Learning Model: Enhancing Robustness With Explainable AI and Adversarial Training

digitado ⋅ February 11, 2026

Tea is a valuable asset for the economy of Bangladesh, so tea cultivation plays an important role in boosting the economy. These valuable plants are vulnerable to various kinds of leaf infections, which may reduce yield and lower quality. Detecting these diseases manually is difficult: it takes time and is prone to error. Therefore, the purpose of the study is to develop an automated deep learning model for tea […]
