digitado

About digitado

https://www.digitado.com.br

Posts by :

Disentangled Dual-Branch Graph Learning for Conversational Emotion Recognition

digitado ⋅ 18 de April de 2026

arXiv:2604.14204v1 Announce Type: new Abstract: Multimodal emotion recognition in conversations aims to infer utterance-level emotions by jointly modeling textual, acoustic, and visual cues within context. Despite recent progress, key challenges remain, including redundant cross-modal information, imperfect semantic alignment, and insufficient modeling of high-order speaker interactions. To address these issues, we propose a framework that combines dual-space feature disentanglement with dual-branch graph learning. A shared encoder and modality-specific encoders are used to separate modality-invariant and modality-specific representations. The invariant […]

Ver mais

Like 0

Liked Liked

technocracy

Energetic Resilience under Temporal Logic Specifications

digitado ⋅ 18 de April de 2026

arXiv:2604.14203v1 Announce Type: new Abstract: In environments with uncertainties or undesirable influences, control systems can require additional energy to achieve their task while remaining resilient to these influences. In this paper, we present an energetic resilience metric that quantifies the maximal additional energy used by a system under undesired effects, while satisfying complex specifications encoded through temporal logic. We prove that this metric satisfies properties that enable its computation even for compositions of these specifications, thus allowing considerations […]

Ver mais

Like 0

Liked Liked

technocracy

MixAtlas: Uncertainty-aware Data Mixture Optimization for Multimodal LLM Midtraining

digitado ⋅ 18 de April de 2026

arXiv:2604.14198v1 Announce Type: new Abstract: Domain reweighting can improve sample efficiency and downstream generalization, but data-mixture optimization for multimodal midtraining remains largely unexplored. Current multimodal training recipes tune mixtures along a single dimension, typically data format or task type. We introduce MixAtlas, a method that produces benchmark-targeted data recipes that can be inspected, adapted, and transferred to new corpora. MixAtlas decomposes the training corpus along two axes: image concepts (10 visual-domain clusters discovered via CLIP embeddings) and task […]

Ver mais

Like 0

Liked Liked

technocracy

The PICCO Framework for Large Language Model Prompting: A Taxonomy and Reference Architecture for Prompt Structure

digitado ⋅ 18 de April de 2026

arXiv:2604.14197v1 Announce Type: new Abstract: Large language model (LLM) performance depends heavily on prompt design, yet prompt construction is often described and applied inconsistently. Our purpose was to derive a reference framework for structuring LLM prompts. This paper presents PICCO, a framework derived through a rigorous synthesis of 11 previously published prompting frameworks identified through a multi-database search. The analysis yields two main contributions. First, it proposes a taxonomy that distinguishes prompt frameworks, prompt elements, prompt generation, prompting […]

Ver mais

Like 0

Liked Liked

technocracy

QualiaNet: An Experience-Before-Inference Network

digitado ⋅ 18 de April de 2026

arXiv:2604.14193v1 Announce Type: new Abstract: Human 3D vision involves two distinct stages: an Experience Module, where stereo depth is extracted relative to fixation, and an Inference Module, where this experience is interpreted to estimate 3D scene properties. Paradoxically, although our experience of stereo vision does not provide us with distance information, it does affect our inferences about visual scale. We propose the Inference Module exploits a natural scene statistic: near scenes produce vivid disparity gradients, while far scenes […]

Ver mais

Like 0

Liked Liked

technocracy

Attention to Mamba: A Recipe for Cross-Architecture Distillation

digitado ⋅ 18 de April de 2026

arXiv:2604.14191v1 Announce Type: new Abstract: State Space Models (SSMs) such as Mamba have become a popular alternative to Transformer models, due to their reduced memory consumption and higher throughput at generation compared to their Attention-based counterparts. On the other hand, the community has built up a considerable body of knowledge on how to train Transformers, and many pretrained Transformer models are readily available. To facilitate the adoption of SSMs while leveraging existing pretrained Transformers, we aim to identify […]

Ver mais

Like 0

Liked Liked

technocracy

End-to-End Learning-based Operation of Integrated Energy Systems for Buildings and Data Centers

digitado ⋅ 18 de April de 2026

arXiv:2604.14184v1 Announce Type: new Abstract: Buildings and data centers (DCs) are energy-intensive sectors, playing a critical role to achieve the low-carbon and sustainable energy transition targets. To this end, integrated energy system (IES) that incorporates diverse renewables, energy generation, conversion, and storage technologies to enable coordinated multi-energy supply have been widely investigated for both buildings and DCs. However, few works consider the two sectors jointly within IES to exploit their substantial synergistic benefits. Meanwhile, the operational optimization of […]

Ver mais

Like 0

Liked Liked

technocracy

Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model

digitado ⋅ 18 de April de 2026

arXiv:2604.14180v1 Announce Type: new Abstract: We train a 318M-parameter Transformer language model from scratch on a curated corpus of 1.56 billion tokens of pure Classical Chinese, with zero English characters or Arabic numerals. Through systematic out-of-distribution (OOD) testing, we investigate whether the model can distinguish known from unknown inputs, and crucially, whether it can express this distinction in its generated text. We find a clear dissociation between internal and external uncertainty. Internally, the model exhibits a perplexity jump […]

Ver mais

Like 0

Liked Liked

technocracy

An Underexplored Frontier: Large Language Models for Rare Disease Patient Education and Communication — A scoping review

digitado ⋅ 18 de April de 2026

arXiv:2604.14179v1 Announce Type: new Abstract: Rare diseases affect over 300 million people worldwide and are characterized by complex care pathways, limited clinical expertise, and substantial unmet communication needs throughout the long patient journey. Recent advances in large language models (LLMs) offer new opportunities to support patient education and communication, yet their application in rare diseases remains unclear. We conducted a scoping review of studies published between January 2022 and March 2026 across major databases, identifying 12 studies on […]

Ver mais

Like 0

Liked Liked

technocracy

Simulating Human Cognition: Heartbeat-Driven Autonomous Thinking Activity Scheduling for LLM-based AI systems

digitado ⋅ 18 de April de 2026

arXiv:2604.14178v1 Announce Type: new Abstract: Large Language Model (LLM) agents have demonstrated remarkable capabilities in reasoning and tool use, yet they often suffer from rigid, reactive control flows that limit their adaptability and efficiency. Most existing frameworks rely on fixed pipelines or failure-triggered reflection, causing agents to act impulsively or correct errors only after they occur. In this paper, we introduce Heartbeat-Driven Autonomous Thinking Activity Scheduling, a mechanism that enables proactive, adaptive, and continuous self-regulation. Mirroring the natural […]

Ver mais

Like 0

Liked Liked