January 2026

Call2Instruct: Automated Pipeline for Generating Q&A Datasets from Call Center Recordings for LLM Fine-Tuning

digitado ⋅ 22 de January de 2026

arXiv:2601.14263v1 Announce Type: new Abstract: The adaptation of Large-Scale Language Models (LLMs) to specific domains depends on high-quality fine-tuning datasets, particularly in instructional format (e.g., Question-Answer – Q&A). However, generating these datasets, particularly from unstructured sources such as call center audio recordings, poses a significant challenge due to the noisy and disorganized nature of the data. This paper presents a solution to this challenge by offering an end-to-end automated pipeline for generating Q&A instructional datasets from such recordings. […]

Ver mais

Like 0

Liked Liked

technocracy

Intelligent Power Grid Design Review via Active Perception-Enabled Multimodal Large Language Models

digitado ⋅ 22 de January de 2026

arXiv:2601.14261v1 Announce Type: new Abstract: The intelligent review of power grid engineering design drawings is crucial for power system safety. However, current automated systems struggle with ultra-high-resolution drawings due to high computational demands, information loss, and a lack of holistic semantic understanding for design error identification. This paper proposes a novel three-stage framework for intelligent power grid drawing review, driven by pre-trained Multimodal Large Language Models (MLLMs) through advanced prompt engineering. Mimicking the human expert review process, the […]

Ver mais

Like 0

Liked Liked

technocracy

End-to-End Transformer Acceleration Through Processing-in-Memory Architectures

digitado ⋅ 22 de January de 2026

arXiv:2601.14260v1 Announce Type: new Abstract: Transformers have become central to natural language processing and large language models, but their deployment at scale faces three major challenges. First, the attention mechanism requires massive matrix multiplications and frequent movement of intermediate results between memory and compute units, leading to high latency and energy costs. Second, in long-context inference, the key-value cache (KV cache) can grow unpredictably and even surpass the model’s weight size, creating severe memory and bandwidth bottlenecks. Third, […]

Ver mais

Like 0

Liked Liked

technocracy

A Cloud-Based Cross-Modal Transformer for Emotion Recognition and Adaptive Human-Computer Interaction

digitado ⋅ 22 de January de 2026

arXiv:2601.14259v1 Announce Type: new Abstract: Emotion recognition is a fundamental component of next-generation human-computer interaction (HCI), enabling machines to perceive, understand, and respond to users’ affective states. However, existing systems often rely on single-modality analysis such as facial expressions, speech tone, or textual sentiment, resulting in limited robustness and poor generalization in real-world environments. To address these challenges, this study proposes a Cloud-Based Cross-Modal Transformer (CMT) framework for multimodal emotion recognition and adaptive human-computer interaction. The proposed model […]

Ver mais

Like 0

Liked Liked

technocracy

SOSControl: Enhancing Human Motion Generation through Saliency-Aware Symbolic Orientation and Timing Control

digitado ⋅ 22 de January de 2026

arXiv:2601.14258v1 Announce Type: new Abstract: Traditional text-to-motion frameworks often lack precise control, and existing approaches based on joint keyframe locations provide only positional guidance, making it challenging and unintuitive to specify body part orientations and motion timing. To address these limitations, we introduce the Salient Orientation Symbolic (SOS) script, a programmable symbolic framework for specifying body part orientations and motion timing at keyframes. We further propose an automatic SOS extraction pipeline that employs temporally-constrained agglomerative clustering for frame […]

Ver mais

Like 0

Liked Liked

technocracy

Machine Failure Detection Based on Projected Quantum Models

digitado ⋅ 22 de January de 2026

Detecting machine failures promptly is of utmost importance in industry for maintaining efficiency and minimizing downtime. This paper introduces a failure detection algorithm based on quantum computing and a statistical change-point detection approach. Our method leverages the potential of projected quantum feature maps to enhance the precision of anomaly detection in machine monitoring systems. We empirically validate our approach on benchmark multi-dimensional time series datasets as well as on a real-world dataset comprising IoT sensor readings from operational […]

Ver mais

Like 0

Liked Liked

technocracy

An Empirical Study on Ensemble-Based Transfer Learning Bayesian Optimisation with Mixed Variable Types

digitado ⋅ 22 de January de 2026

Bayesian optimisation is a sample efficient method for finding a global optimum of expensive black-box objective functions. Historic datasets from related problems can be exploited to help improve performance of Bayesian optimisation by adapting transfer learning methods to various components of the Bayesian optimisation pipeline. In this study we perform an empirical analysis of various ensemble-based transfer learning Bayesian optimisation methods and pipeline components. We expand on previous work in the literature by contributing some specific pipeline components, […]

Ver mais

Like 0

Liked Liked

technocracy

Hippocampus model implementing a Turing machine

digitado ⋅ 22 de January de 2026

Hippocampus Model Implementing a Turing Machine Abstract This paper presents a spiking semantic neural network augmented with a hippocampus-inspired memory model. Together, these components implement a Turing-machine–like computational system with internal memory represented directly within the neural substrate. Unlike previous neural approaches to Turing machine emulation, where the memory tape is implemented as an external structure or using single neurons, the proposed model stores memory internally using neural clusters. This design increases behavioral flexibility and enables direct control over […]

Ver mais

Like 0

Liked Liked

technocracy

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

digitado ⋅ 22 de January de 2026

Large language models (LLMs) can call tools effectively, yet they remain brittle in multi-turn execution: following a tool call error, smaller models often degenerate into repetitive invalid re-invocations, failing to interpret error feedback and self-correct. This brittleness hinders reliable real-world deployment, where the execution errors are inherently inevitable during tool interaction procedures. We identify a key limitation of current approaches: standard reinforcement learning (RL) treats errors as sparse negative rewards, providing no guidance on how to recover, while […]

Ver mais

Like 0

Liked Liked

technocracy

Deep Learning for Perishable Inventory Systems with Human Knowledge

digitado ⋅ 22 de January de 2026

Managing perishable products with limited lifetimes is a fundamental challenge in inventory management, as poor ordering decisions can quickly lead to stockouts or excessive waste. We study a perishable inventory system with random lead times in which both the demand process and the lead time distribution are unknown. We consider a practical setting where orders are placed using limited historical data together with observed covariates and current system states. To improve learning efficiency under limited data, we adopt […]

Ver mais

Like 0

Liked Liked