digitado

In-Context Reinforcement Learning From Suboptimal Historical Data

digitado ⋅ 28 de January de 2026

Transformer models have achieved remarkable empirical successes, largely due to their in-context learning capabilities. Inspired by this, we explore training an autoregressive transformer for in-context reinforcement learning (ICRL). In this setting, we initially train a transformer on an offline dataset consisting of trajectories collected from various RL tasks, and then fix and use this transformer to create an action policy for new RL tasks. Notably, we consider the setting where the offline dataset contains trajectories sampled from suboptimal […]

Ver mais

Like 0

Liked Liked

technocracy

Dynamics of Stochastic Momentum with Sparse Updates in High Dimensions

digitado ⋅ 29 de May de 2026

arXiv:2605.28961v1 Announce Type: new Abstract: Existing theory of momentum assumes that gradients arrive at every parameter at a roughly constant rate, an assumption violated in practice by heavy-tailed data distributions and modern architectures. We theoretically analyze the dynamics of two tractable models of momentum under sparse updates: a least squares model with sparse inputs and a logistic regression model with a rare class. Both admit exact closed-form second-moment dynamics whose high-dimensional limits we characterize across three scaling exponents […]

Ver mais

Like 0

Liked Liked

technocracy

MADE: Benchmark Environments for Closed-Loop Materials Discovery

digitado ⋅ 30 de January de 2026

arXiv:2601.20996v1 Announce Type: new Abstract: Existing benchmarks for computational materials discovery primarily evaluate static predictive tasks or isolated computational sub-tasks. While valuable, these evaluations neglect the inherently iterative and adaptive nature of scientific discovery. We introduce MAterials Discovery Environments (MADE), a novel framework for benchmarking end-to-end autonomous materials discovery pipelines. MADE simulates closed-loop discovery campaigns in which an agent or algorithm proposes, evaluates, and refines candidate materials under a constrained oracle budget, capturing the sequential and resource-limited nature […]

Ver mais

Like 0

Liked Liked

technocracy

Grasp as You Dream: Imitating Functional Grasping from Generated Human Demonstrations

digitado ⋅ 10 de April de 2026

arXiv:2604.07517v1 Announce Type: new Abstract: Building generalist robots capable of performing functional grasping in everyday, open-world environments remains a significant challenge due to the vast diversity of objects and tasks. Existing methods are either constrained to narrow object/task sets or rely on prohibitively large-scale data collection to capture real-world variability. In this work, we present an alternative approach, GraspDreamer, a method that leverages human demonstrations synthesized by visual generative models (VGMs) (e.g., video generation models) to enable zero-shot […]

Ver mais

Like 0

Liked Liked

technocracy

Quoting Craig Mod

digitado ⋅ 13 de March de 2026

Simply put: It’s a big mess, and no off-the-shelf accounting software does what I need. So after years of pain, I finally sat down last week and started to build my own. It took me about five days. I am now using the best piece of accounting software I’ve ever used. It’s blazing fast. Entirely local. Handles multiple currencies and pulls daily (historical) conversion rates. It’s able to ingest any CSV I throw at it and represent it […]

Ver mais

Like 0

Liked Liked

technocracy

O-DSS: An Open Dynamic Spectrum Sharing Framework for Cellular-Radar Coexistence in Mid-band Frequencies

digitado ⋅ 7 de January de 2026

arXiv:2601.02571v1 Announce Type: new Abstract: The growing demand for mid-band spectrum necessitates efficient Dynamic Spectrum Sharing (DSS) to ensure coexistence between cellular networks and incumbent radar systems. Existing Spectrum Access System (SAS) frameworks rely on fixed Environmental Sensing Capability (ESC) sensors, which are latency-prone and inflexible. This paper introduces O-DSS, an O-RAN-compliant, Machine Learning (ML)-driven DSS framework that enables real-time cellular-radar coexistence in mid-band frequencies with shipborne and fast-moving airborne radars. O-DSS integrates radar detection from low-overhead Key […]

Ver mais

Like 0

Liked Liked

technocracy

Clustered random forests with correlated data for optimal estimation and inference under potential covariate shift

digitado ⋅ 26 de January de 2026

arXiv:2503.12634v2 Announce Type: replace-cross Abstract: We develop Clustered Random Forests, a random forests algorithm for clustered data, arising from independent groups that exhibit within-cluster dependence. The leaf-wise predictions for each decision tree making up clustered random forests takes the form of a weighted least squares estimator, which leverage correlations between observations for improved prediction accuracy and tighter confidence intervals when performing inference. We show that approximately linear time algorithms exist for fitting classes of clustered random forests, matching […]

Ver mais

Like 0

Liked Liked

technocracy

Build custom code-based evaluators in Amazon Bedrock AgentCore

digitado ⋅ 18 de May de 2026

Special thanks to everyone who contributed to this launch: Stephanie Yuan, Lefan Zhang, Ritvika Pillai, Irene Wang, Carter Williams, T.J Ariyawansa, Gitika Jha, Shoaib Javed and the product leadership from Vivek Singh. Moving prototype agents to production requires measuring quality across multiple dimensions. Amazon Bedrock AgentCore Evaluations provides large language model (LLM)-as-a-Judge checks and extensible code-based evaluators that capture domain-specific requirements you need for assessing your agentic application. In financial services and specialized domains, the critical quality dimensions […]

Ver mais

Like 0

Liked Liked

technocracy

Explore LLM-enabled Tools to Facilitate Imaginal Exposure Exercises for Social Anxiety

digitado ⋅ 30 de March de 2026

arXiv:2603.25933v1 Announce Type: new Abstract: Social anxiety (SA) is a prevalent mental health challenge that significantly impacts daily social interactions. Imaginal Exposure (IE), a Cognitive Behavioral Therapy (CBT) technique involving imagined anxiety-provoking scenarios, is effective but underutilized, in part because traditional IE homework requires clients to construct and sustain clinically relevant fear narratives. In this work, we explore the feasibility of an LLM-enabled tool that supports IE by generating vivid, personalized exposure scripts. We first co-designed ImaginalExpoBot with […]

Ver mais

Like 0

Liked Liked

technocracy

Control in Hedonic Games

digitado ⋅ 24 de February de 2026

arXiv:2602.18506v1 Announce Type: new Abstract: We initiate the study of control in hedonic games, where an external actor influences coalition formation by adding or deleting agents. We consider three basic control goals (1) enforcing that an agent is not alone (NA); (2) enforcing that a pair of agents is in the same coalition (PA); (3) enforcing that all agents are in the same grand coalition (GR), combined with two control actions: adding agents (AddAg) or deleting agents (DelAg). […]

Ver mais

Like 0

Liked Liked