January 2026

From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning

digitado ⋅ 29 January 2026

Advances in multi-modal large language models (MLLMs) have inspired time series understanding and reasoning tasks that enable natural language querying over time series and produce textual analyses of complex temporal dynamics. Recent attempts hybridize numerical time series with their visualized plots, facilitating both precise value reasoning and visual structure comprehension for comprehensive time series understanding by MLLMs. However, effective cross-modal integration remains challenging due to fine-grained temporal misalignment across modalities and severe entanglement between shared and modality-specific semantics, which hinder […]

technocracy

Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning

digitado ⋅ 29 January 2026

Large Reasoning Models (LRMs) achieve explicit chain-of-thought expansion by imitating the deep-thinking behaviors of humans, demonstrating excellent performance in complex task scenarios. However, the deep-thinking mode often leads to unnecessarily lengthy reasoning and resource inefficiency when handling simple tasks. This overthinking phenomenon may arise from the generation preference triggered by the reward function during post-training. Existing research attempts to mitigate overthinking from the perspective of prompt design or model training, but generally underestimates the importance of task difficulty […]

Statsformer: Validated Ensemble Learning with LLM-Derived Semantic Priors

digitado ⋅ 29 January 2026

We introduce Statsformer, a principled framework for integrating large language model (LLM)-derived knowledge into supervised statistical learning. Existing approaches are limited in adaptability and scope: they either inject LLM guidance as an unvalidated heuristic, which is sensitive to LLM hallucination, or embed semantic information within a single fixed learner. Statsformer overcomes both limitations through a guardrailed ensemble architecture. We embed LLM-derived feature priors within an ensemble of linear and nonlinear learners, adaptively calibrating their influence via cross-validation. This […]

Learning to Optimize Job Shop Scheduling Under Structural Uncertainty

digitado ⋅ 29 January 2026

The Job-Shop Scheduling Problem (JSSP), under various forms of manufacturing uncertainty, has recently attracted considerable research attention. Most existing studies focus on parameter uncertainty, such as variable processing times, and typically adopt the actor-critic framework. In this paper, we explore a different but prevalent form of uncertainty in JSSP: structural uncertainty. Structural uncertainty arises when a job may follow one of several routing paths, and the selection is determined not by policy, but by situational factors (e.g., the […]

Hebbian Learning with Global Direction

digitado ⋅ 29 January 2026

The backpropagation algorithm has driven the remarkable success of deep neural networks, but its lack of biological plausibility and high computational cost have motivated the ongoing search for alternative training methods. Hebbian learning has attracted considerable interest as a biologically plausible alternative to backpropagation. Nevertheless, its exclusive reliance on local information, without consideration of global task objectives, fundamentally limits its scalability. Inspired by the biological synergy between neuromodulators and local plasticity, we introduce a novel model-agnostic Global-guided Hebbian Learning […]

Factored Causal Representation Learning for Robust Reward Modeling in RLHF

digitado ⋅ 29 January 2026

A reliable reward model is essential for aligning large language models with human preferences through reinforcement learning from human feedback. However, standard reward models are susceptible to spurious features that are not causally related to human labels. This can lead to reward hacking, where high predicted reward does not translate into better behavior. In this work, we address this problem from a causal perspective by proposing a factored representation learning framework that decomposes the model’s contextual embedding into […]

The TechBeat: Benchmarking 1B Vectors with Low Latency and High Throughput (1/29/2026)

digitado ⋅ 29 January 2026

How are you, hacker? 🪐 Want to know what’s trending right now? The Techbeat by HackerNoon has got you covered with fresh content from our trending stories of the day! Claude Code Launches Teleport Workflow: Start Anywhere, Continue Everywhere, by @proflead [4 min read]. AI Doesn’t Mean the End of Work for Us, by @bernard [4 min read]: I believe that AI’s impact and future pathways are overstated […]

Heterogeneous Vertiport Selection Optimization for On-Demand Air Taxi Services: A Deep Reinforcement Learning Approach

digitado ⋅ 29 January 2026

Urban Air Mobility (UAM) has emerged as a transformative solution to alleviate urban congestion by utilizing low-altitude airspace, thereby reducing pressure on ground transportation networks. To enable truly efficient and seamless door-to-door travel experiences, UAM requires close integration with existing ground transportation infrastructure. However, current research on optimal integrated routing strategies for passengers in air-ground mobility systems remains limited and lacks systematic exploration. To address this gap, we first propose a unified optimization model that integrates strategy […]

Is there an AI-playable RTS? (or a turn-based one)

digitado ⋅ 29 January 2026

Hi, I’ve done plenty of RL projects: AlphaZero (checkers), a self-driving racecar with SAC, some classic Gymnasium environments with DQN. The problem is, always, the environment. Playing checkers? Need to implement a checkers environment. Racecar? Need to write a car simulator (really difficult, actually), and so on. I’d love to give a (mini) RTS a try, like AlphaStar, but I’m not Google and I don’t have a custom version of SC2 … MicroRTS is dead and […]

Few-Shot Learning for Dynamic Operations of Automated Electric Taxi Fleets under Evolving Charging Infrastructure: A Meta-Deep Reinforcement Learning Approach

digitado ⋅ 29 January 2026

With the rapid expansion of electric vehicles (EVs) and charging infrastructure, the effective management of Autonomous Electric Taxi (AET) fleets faces a critical challenge in environments with dynamic and uncertain charging availability. While most existing research assumes a static charging network, this simplification creates a significant gap between theoretical models and real-world operations. To bridge this gap, we propose GAT-PEARL, a novel meta-reinforcement learning framework that learns an adaptive operational policy. Our approach integrates a graph attention network […]
