digitado

C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents

digitado ⋅ 25 March 2026

Safe navigation in complex environments remains a central challenge for reinforcement learning (RL) in robotics. This paper introduces Continuous Space-Time Empowerment for Physics-informed (C-STEP) safe RL, a novel measure of agent-centric safety tailored to deterministic, continuous domains. This measure can be used to design physics-informed intrinsic rewards by augmenting positive navigation reward functions. The reward incorporates the agent's internal states (e.g., initial velocity) and forward dynamics to differentiate safe from risky behavior. By integrating C-STEP with navigation rewards, […]

technocracy

Context Structure Reshapes the Representational Geometry of Language Models

digitado ⋅ 2 February 2026

arXiv:2601.22364v1 Announce Type: new Abstract: Large Language Models (LLMs) have been shown to organize the representations of input sequences into straighter neural trajectories in their deep layers, which has been hypothesized to facilitate next-token prediction via linear extrapolation. Language models can also adapt to diverse tasks and learn new structure in context, and recent work has shown that this in-context learning (ICL) can be reflected in representational changes. Here we bring these two lines of research together to […]

Learning Recursive Multi-Scale Representations for Irregular Multivariate Time Series Forecasting

digitado ⋅ 25 February 2026

Irregular Multivariate Time Series (IMTS) are characterized by uneven intervals between consecutive timestamps, which carry sampling-pattern information that is valuable for learning temporal and variable dependencies. In addition, IMTS often exhibit diverse dependencies across multiple time scales. However, many existing multi-scale IMTS methods use resampling to obtain the coarse series, which can alter the original timestamps and disrupt the sampling-pattern information. To address this challenge, we propose ReIMTS, a Recursive multi-scale modeling approach for Irregular Multivariate […]

Structure Detection for Contextual Reinforcement Learning

digitado ⋅ 13 January 2026

Contextual Reinforcement Learning (CRL) tackles the problem of solving a set of related Contextual Markov Decision Processes (CMDPs) that vary across different context variables. Traditional approaches (independent training and multi-task learning) struggle with either excessive computational costs or negative transfer. A recently proposed multi-policy approach, Model-Based Transfer Learning (MBTL), has demonstrated effectiveness by strategically selecting a few tasks to train on and transferring zero-shot to the rest. However, CMDPs encompass a wide range of problems, exhibiting structural properties that vary from problem to problem. […]

Soft-Radial Projection for Constrained End-to-End Learning

digitado ⋅ 4 February 2026

arXiv:2602.03461v1 Announce Type: cross Abstract: Integrating hard constraints into deep learning is essential for safety-critical systems. Yet existing constructive layers that project predictions onto constraint boundaries face a fundamental bottleneck: gradient saturation. By collapsing exterior points onto lower-dimensional surfaces, standard orthogonal projections induce rank-deficient Jacobians, which nullify gradients orthogonal to active constraints and hinder optimization. We introduce Soft-Radial Projection, a differentiable reparameterization layer that circumvents this issue through a radial mapping from Euclidean space into the interior of […]

You can finally change the goofy Gmail address you chose years ago

digitado ⋅ 31 March 2026

Someone is celebrating a birthday tomorrow—it’s Gmail. The iconic email service debuted 22 years ago on April 1, forever altering what people expected from free email. But 22 years is a long time, and the username you chose when you finally got your hands on an invite in 2004 may not have stood the test of time. Starting today, Google will let US-based users ditch an old username without creating a new account. Google started testing this option […]

MIT’s Recursive Language Models Just Killed Context Limits

digitado ⋅ 8 January 2026

MIT Just Killed the Context Window: Recursive Language Models Are the Future. The End of “Context Rot” and the Dawn of Truly Infinite AI Reasoning. Figure: a comparison of GPT-5 and a corresponding RLM on three long-context tasks of increasing complexity: S-NIAH (find a single fact hidden in a huge pile of text), OOLONG (understand a complicated story where all the details connect), and OOLONG-Pairs (compare two different complicated stories with each other). For each task, input length is scaled from 2^13 […]

A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time Series: Closed-Form Estimation and Rolling-Window Inference

digitado ⋅ 3 March 2026

arXiv:2507.07469v3 Announce Type: replace Abstract: We introduce Galerkin-ARIMA and Galerkin-SARIMA, a projection-based extension of classical ARIMA/SARIMA that replaces rigid linear lag operators with low-dimensional Galerkin basis expansions while preserving the familiar AR-MA decomposition. Experiments on synthetic series and on quarterly GDP and daily S&P 500 returns show that Galerkin-SARIMA matches or improves forecast accuracy relative to classical ARIMA/SARIMA. Estimation is closed-form via a two-stage least-squares procedure, and the closed-form two-stage estimator enables efficient rolling-window re-estimation while preserving the […]

SWE-Bench authors reflect on the state of LLM agents at NeurIPS 2024

digitado ⋅ 14 January 2025

The SWE-bench task measures AI agents on software engineering tasks at the level of a GitHub issue. It was one of the most important benchmarks for tracking the progress of agents on software engineering tasks in 2024. We caught up with two of its creators, Ofir Press and Carlos E. Jimenez, who shared their ideas on the state of LLM-backed agents.

Single-Round Scalable Analytic Federated Learning

digitado ⋅ 31 March 2026

arXiv:2512.03336v2 Announce Type: replace-cross Abstract: Federated Learning (FL) is plagued by two key challenges: high communication overhead and performance collapse on heterogeneous (non-IID) data. Analytic FL (AFL) provides a single-round, data distribution invariant solution, but is limited to linear models. Subsequent non-linear approaches, like DeepAFL, regain accuracy but sacrifice the single-round benefit. In this work, we break this trade-off. We propose SAFLe, a framework that achieves scalable non-linear expressivity by introducing a structured head of bucketed features and […]
