February 2026

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

digitado ⋅ 26 de February de 2026

arXiv:2602.21320v1 Announce Type: new Abstract: Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to open-ended self-evolution toward superintelligent systems. In this paper, we propose Tool-R0 framework for training general […]

Ver mais

Like 0

Liked Liked

technocracy

Uncertainty-Aware Diffusion Model for Multimodal Highway Trajectory Prediction via DDIM Sampling

digitado ⋅ 26 de February de 2026

arXiv:2602.21319v1 Announce Type: new Abstract: Accurate and uncertainty-aware trajectory prediction remains a core challenge for autonomous driving, driven by complex multi-agent interactions, diverse scene contexts and the inherently stochastic nature of future motion. Diffusion-based generative models have recently shown strong potential for capturing multimodal futures, yet existing approaches such as cVMD suffer from slow sampling, limited exploitation of generative diversity and brittle scenario encodings. This work introduces cVMDx, an enhanced diffusion-based trajectory prediction framework that improves efficiency, robustness […]

Ver mais

Like 0

Liked Liked

technocracy

Shared Nature, Unique Nurture: PRISM for Pluralistic Reasoning via In-context Structure Modeling

digitado ⋅ 26 de February de 2026

arXiv:2602.21317v1 Announce Type: new Abstract: Large Language Models (LLMs) are converging towards a singular Artificial Hivemind, where shared Nature (pre-training priors) result in a profound collapse of distributional diversity, limiting the distinct perspectives necessary for creative exploration and scientific discovery. To address this, we propose to equip models with inference-time Nurture (individualized epistemic trajectories) using Epistemic Evolution paradigm, progressing through explore, internalize, and express. We instantiate this via PRISM (Pluralistic Reasoning via In-context Structure Modeling), a model-agnostic system […]

Ver mais

Like 0

Liked Liked

technocracy

Unified Complementarity-Based Contact Modeling and Planning for Soft Robots

digitado ⋅ 26 de February de 2026

arXiv:2602.21316v1 Announce Type: new Abstract: Soft robots were introduced in large part to enable safe, adaptive interaction with the environment, and this interaction relies fundamentally on contact. However, modeling and planning contact-rich interactions for soft robots remain challenging: dense contact candidates along the body create redundant constraints and rank-deficient LCPs, while the disparity between high stiffness and low friction introduces severe ill-conditioning. Existing approaches rely on problem-specific approximations or penalty-based treatments. This letter presents a unified complementarity-based framework […]

Ver mais

Like 0

Liked Liked

technocracy

Precedence-Constrained Decision Trees and Coverings

digitado ⋅ 26 de February de 2026

arXiv:2602.21312v1 Announce Type: new Abstract: This work considers a number of optimization problems and reductive relations between them. The two main problems we are interested in are the emph{Optimal Decision Tree} and emph{Set Cover}. We study these two fundamental tasks under precedence constraints, that is, if a test (or set) $X$ is a predecessor of $Y$, then in any feasible decision tree $X$ needs to be an ancestor of $Y$ (or respectively, if $Y$ is added to set […]

Ver mais

Like 0

Liked Liked

technocracy

SymTorch: A Framework for Symbolic Distillation of Deep Neural Networks

digitado ⋅ 26 de February de 2026

arXiv:2602.21307v1 Announce Type: new Abstract: Symbolic distillation replaces neural networks, or components thereof, with interpretable, closed-form mathematical expressions. This approach has shown promise in discovering physical laws and mathematical relationships directly from trained deep learning models, yet adoption remains limited due to the engineering barrier of integrating symbolic regression into deep learning workflows. We introduce SymTorch, a library that automates this distillation by wrapping neural network components, collecting their input-output behavior, and approximating them with human-readable equations via […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Deformable Object Manipulation Using Task-Level Iterative Learning Control

digitado ⋅ 26 de February de 2026

arXiv:2602.21302v1 Announce Type: new Abstract: Dynamic manipulation of deformable objects is challenging for humans and robots because they have infinite degrees of freedom and exhibit underactuated dynamics. We introduce a Task-Level Iterative Learning Control method for dynamic manipulation of deformable objects. We demonstrate this method on a non-planar rope manipulation task called the flying knot. Using a single human demonstration and a simplified rope model, the method learns directly on hardware without reliance on large amounts of demonstration […]

Ver mais

Like 0

Liked Liked

technocracy

Robust AI Evaluation through Maximal Lotteries

digitado ⋅ 26 de February de 2026

arXiv:2602.21297v1 Announce Type: new Abstract: The standard way to evaluate language models on subjective tasks is through pairwise comparisons: an annotator chooses the “better” of two responses to a prompt. Leaderboards aggregate these comparisons into a single Bradley-Terry (BT) ranking, forcing heterogeneous preferences into a total order and violating basic social-choice desiderata. In contrast, social choice theory provides an alternative approach called maximal lotteries, which aggregates pairwise preferences without imposing any assumptions on their structure. However, we show […]

Ver mais

Like 0

Liked Liked

technocracy

Heterogeneous Memory Design Exploration for AI Accelerators with a Gain Cell Memory Compiler

digitado ⋅ 26 de February de 2026

arXiv:2602.21278v1 Announce Type: new Abstract: As memory increasingly dominates system cost and energy, heterogeneous on-chip memory systems that combine technologies with complementary characteristics are becoming essential. Gain Cell RAM (GCRAM) offers higher density, lower power, and tunable retention, expanding the design space beyond conventional SRAM. To this end, we create an OpenGCRAM compiler supporting both SRAM and GCRAM. It generates macro-level designs and layouts for commercial CMOS processes and characterizes area, delay, and power across user-defined configurations. The […]

Ver mais

Like 0

Liked Liked

technocracy

StoryTailor:A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives

digitado ⋅ 26 de February de 2026

arXiv:2602.21273v1 Announce Type: new Abstract: Generating multi-frame, action-rich visual narratives without fine-tuning faces a threefold tension: action text faithfulness, subject identity fidelity, and cross-frame background continuity. We propose StoryTailor, a zero-shot pipeline that runs on a single RTX 4090 (24 GB) and produces temporally coherent, identity-preserving image sequences from a long narrative prompt, per-subject references, and grounding boxes. Three synergistic modules drive the system: Gaussian-Centered Attention (GCA) to dynamically focus on each subject core and ease grounding-box overlaps; […]

Ver mais

Like 0

Liked Liked