February 2026

HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation

digitado ⋅ 4 de February de 2026

Humanoid robots can suffer significant performance drops under small changes in dynamics, task specifications, or environment setup. We propose HoRD, a two-stage learning framework for robust humanoid control under domain shift. First, we train a high-performance teacher policy via history-conditioned reinforcement learning, where the policy infers latent dynamics context from recent state–action trajectories to adapt online to diverse randomized dynamics. Second, we perform online distillation to transfer the teacher’s robust control capabilities into a transformer-based student policy that […]

Ver mais

Like 0

Liked Liked

technocracy

Performative Learning Theory

digitado ⋅ 4 de February de 2026

Performative predictions influence the very outcomes they aim to forecast. We study performative predictions that affect a sample (e.g., only existing users of an app) and/or the whole population (e.g., all potential app users). This raises the question of how well models generalize under performativity. For example, how well can we draw insights about new app users based on existing users when both of them react to the app’s predictions? We address this question by embedding performative predictions […]

Ver mais

Like 0

Liked Liked

technocracy

Blockchain Federated Learning for Sustainable Retail: Reducing Waste through Collaborative Demand Forecasting

digitado ⋅ 4 de February de 2026

Effective demand forecasting is crucial for reducing food waste. However, data privacy concerns often hinder collaboration among retailers, limiting the potential for improved predictive accuracy. In this study, we explore the application of Federated Learning (FL) in Sustainable Supply Chain Management (SSCM), with a focus on the grocery retail sector dealing with perishable goods. We develop a baseline predictive model for demand forecasting and waste assessment in an isolated retailer scenario. Subsequently, we introduce a Blockchain-based FL model, […]

Ver mais

Like 0

Liked Liked

technocracy

The Next Generation of Cybersecurity Protection for Healthcare

digitado ⋅ 4 de February de 2026

In a world where ransomware strikes, attempts at data manipulation, and system-crippling intrusions against hospitals are on the rise, the healthcare sector has reached a breaking point. These are no longer attacks on mere data; they are on ventilators, on imaging machines, on diagnostic algorithms, and on the very systems that sustain patients’ lives. On a national level across the United States, even a single breach has the power to halt surgeries, divert ambulances, and put lives in […]

Ver mais

Like 0

Liked Liked

technocracy

Ghulam Murtaza: From Rural Pakistan To Amazon Automation

digitado ⋅ 4 de February de 2026

Ghulam Murtaza emerged from Khairpur, a rural town in Pakistan, where his first interactions with technology began at age ten by digitizing his father’s construction ledgers. While most viewed technology as recreational, Murtaza saw computers as tools to solve real problems. His academic path later included a Master of Science in Robotics from the University at Buffalo, and he is currently a Control System Lead at Amazon’s CBRE in Oregon. Murtaza’s expertise spans robotics, industrial automation, custom data […]

Ver mais

Like 0

Liked Liked

technocracy

Mosaic Learning: A Framework for Decentralized Learning with Model Fragmentation

digitado ⋅ 4 de February de 2026

Decentralized learning (DL) enables collaborative machine learning (ML) without a central server, making it suitable for settings where training data cannot be centrally hosted. We introduce Mosaic Learning, a DL framework that decomposes models into fragments and disseminates them independently across the network. Fragmentation reduces redundant communication across correlated parameters and enables more diverse information propagation without increasing communication cost. We theoretically show that Mosaic Learning (i) shows state-of-the-art worst-case convergence rate, and (ii) leverages parameter correlation in […]

Ver mais

Like 0

Liked Liked

technocracy

RISE: Interactive Visual Diagnosis of Fairness in Machine Learning Models

digitado ⋅ 4 de February de 2026

Evaluating fairness under domain shift is challenging because scalar metrics often obscure exactly where and how disparities arise. We introduce textit{RISE} (Residual Inspection through Sorted Evaluation), an interactive visualization tool that converts sorted residuals into interpretable patterns. By connecting residual curve structures to formal fairness notions, RISE enables localized disparity diagnosis, subgroup comparison across environments, and the detection of hidden fairness issues. Through post-hoc analysis, RISE exposes accuracy-fairness trade-offs that aggregate statistics miss, supporting more informed model selection.

Ver mais

Like 0

Liked Liked

technocracy

Imposing Boundary Conditions on Neural Operators via Learned Function Extensions

digitado ⋅ 4 de February de 2026

Neural operators have emerged as powerful surrogates for the solution of partial differential equations (PDEs), yet their ability to handle general, highly variable boundary conditions (BCs) remains limited. Existing approaches often fail when the solution operator exhibits strong sensitivity to boundary forcings. We propose a general framework for conditioning neural operators on complex non-homogeneous BCs through function extensions. Our key idea is to map boundary data to latent pseudo-extensions defined over the entire spatial domain, enabling any standard […]

Ver mais

Like 0

Liked Liked

technocracy

Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning

digitado ⋅ 4 de February de 2026

Managing agent thought and observation during multi-turn agent-environment interactions is an emerging strategy to improve agent efficiency. However, existing studies treat the entire interaction trajectories equally, overlooking the thought necessity and observation utility varies across turns. To this end, we first conduct quantitative investigations into how thought and observation affect agent effectiveness and efficiency. Based on our findings, we propose Agent-Omit, a unified training framework that empowers LLM agents to adaptively omit redundant thoughts and observations. Specifically, we […]

Ver mais

Like 0

Liked Liked

technocracy

Multi Objective Design Optimization of Non Pneumatic Passenger Car Tires Using Finite Element Modeling, Machine Learning, and Particle swarm Optimization and Bayesian Optimization Algorithms

digitado ⋅ 4 de February de 2026

Non Pneumatic tires offer a promising alternative to pneumatic tires. However, their discontinuous spoke structures present challenges in stiffness tuning, durability, and high speed vibration. This study introduces an integrated generative design and machine learning driven framework to optimize UPTIS type spoke geometries for passenger vehicles. Upper and lower spoke profiles were parameterized using high order polynomial representations, enabling the creation of approximately 250 generative designs through PCHIP based geometric variation. Machine learning models like KRR for stiffness […]

Ver mais

Like 0

Liked Liked