February 2026

CALM: Class-Conditional Sparse Attention Vectors for Large Audio-Language Models

digitado ⋅ 10 de February de 2026

arXiv:2602.07077v1 Announce Type: new Abstract: Large audio-language models (LALMs) exhibit strong zero-shot capabilities in multiple downstream tasks, such as audio question answering (AQA) and abstract reasoning; however, these models still lag behind specialized models for certain discriminative tasks (e.g., audio classification). Recent studies show that sparse subsets of attention heads within an LALM can serve as strong discriminative feature extractors for downstream tasks such as classification via simple voting schemes. However, these methods assign uniform weights to all […]

Ver mais

Like 0

Liked Liked

technocracy

Airspace-aware Contingency Landing Planning

digitado ⋅ 10 de February de 2026

arXiv:2602.07074v1 Announce Type: new Abstract: This paper develops a real-time, search-based aircraft contingency landing planner that minimizes traffic disruptions while accounting for ground risk. The airspace model captures dense air traffic departure and arrival flows, helicopter corridors, and prohibited zones and is demonstrated with a Washington, D.C., area case study. Historical Automatic Dependent Surveillance-Broadcast (ADS-B) data are processed to estimate air traffic density. A low-latency computational geometry algorithm generates proximity-based heatmaps around high-risk corridors and restricted regions. Airspace […]

Ver mais

Like 0

Liked Liked

technocracy

Pro-ZD: A Transferable Graph Neural Network Approach for Proactive Zero-Day Threats Mitigation

digitado ⋅ 10 de February de 2026

arXiv:2602.07073v1 Announce Type: new Abstract: In today’s enterprise network landscape, the combination of perimeter and distributed firewall rules governs connectivity. To address challenges arising from increased traffic and diverse network architectures, organizations employ automated tools for firewall rule and access policy generation. Yet, effectively managing risks arising from dynamically generated policies, especially concerning critical asset exposure, remains a major challenge. This challenge is amplified by evolving network structures due to trends like remote users, bring-your-own devices, and cloud […]

Ver mais

Like 0

Liked Liked

technocracy

AgentSpawn: Adaptive Multi-Agent Collaboration Through Dynamic Spawning for Long-Horizon Code Generation

digitado ⋅ 10 de February de 2026

arXiv:2602.07072v1 Announce Type: new Abstract: Long-horizon code generation requires sustained context and adaptive expertise across domains. Current multi-agent systems use static workflows that cannot adapt when runtime analysis reveals unanticipated complexity. We propose AgentSpawn, an architecture enabling dynamic agent collaboration through: (1) automatic memory transfer during spawning, (2) adaptive spawning policies triggered by runtime complexity metrics, and (3) coherence protocols for concurrent modifications. AgentSpawn addresses five critical gaps in existing research around memory continuity, skill inheritance, task resumption, […]

Ver mais

Like 0

Liked Liked

technocracy

Artificial Intelligence in Open Source Software Engineering: A Foundation for Sustainability

digitado ⋅ 10 de February de 2026

arXiv:2602.07071v1 Announce Type: new Abstract: Open-source software (OSS) is foundational to modern digital infrastructure, yet this context for group work continues to struggle to ensure sufficient contributions in many critical cases. This literature review explores how artificial intelligence (AI) is being leveraged to address critical challenges to OSS sustainability, including maintaining contributor engagement, securing funding, ensuring code quality and security, fostering healthy community dynamics, and preventing project abandonment. Synthesizing recent interdisciplinary research, the paper identifies key applications of […]

Ver mais

Like 0

Liked Liked

technocracy

Hybrid Dual-Path Linear Transformations for Efficient Transformer Architectures

digitado ⋅ 10 de February de 2026

arXiv:2602.07070v1 Announce Type: new Abstract: Standard Transformer architectures rely heavily on dense linear transformations, treating feature projection as a monolithic, full-rank operation. We argue that this formulation is inefficient and lacks the structural inductive bias necessary for distinguishing between local feature preservation and global context integration. To address this, we introduce the Hybrid Dual-Path Linear (HDPL) operator, which decomposes the affine transformation into two topologically distinct pathways: a sparse block-diagonal component for high-rank local processing, and a low-rank […]

Ver mais

Like 0

Liked Liked

technocracy

Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolution

digitado ⋅ 10 de February de 2026

arXiv:2602.07069v1 Announce Type: new Abstract: Diffusion-based super-resolution can synthesize rich details, but models trained on synthetic paired data often fail on real-world LR images due to distribution shifts. We propose Bird-SR, a bidirectional reward-guided diffusion framework that formulates super-resolution as trajectory-level preference optimization via reward feedback learning (ReFL), jointly leveraging synthetic LR-HR pairs and real-world LR images. For structural fidelity easily affected in ReFL, the model is directly optimized on synthetic pairs at early diffusion steps, which also […]

Ver mais

Like 0

Liked Liked

technocracy

Contactless estimation of continuum displacement and mechanical compressibility from image series using a deep learning based framework

digitado ⋅ 10 de February de 2026

arXiv:2602.07065v1 Announce Type: new Abstract: Contactless and non-invasive estimation of mechanical properties of physical media from optical observations is of interest for manifold engineering and biomedical applications, where direct physical measurements are not possible. Conventional approaches to the assessment of image displacement and non-contact material probing typically rely on time-consuming iterative algorithms for non-rigid image registration and constitutive modelling using discretization and iterative numerical solving techniques, such as Finite Element Method (FEM) and Finite Difference Method (FDM), which […]

Ver mais

Like 0

Liked Liked

technocracy

Exploring Physical Intelligence Emergence via Omni-Modal Architecture and Physical Data Engine

digitado ⋅ 10 de February de 2026

arXiv:2602.07064v1 Announce Type: new Abstract: Physical understanding remains brittle in omni-modal models because key physical attributes are visually ambiguous and sparsely represented in web-scale data. We present OmniFysics, a compact omni-modal model that unifies understanding across images, audio, video, and text, with integrated speech and image generation. To inject explicit physical knowledge, we build a physical data engine with two components. FysicsAny produces physics-grounded instruction–image supervision by mapping salient objects to verified physical attributes through hierarchical retrieval over […]

Ver mais

Like 0

Liked Liked

technocracy

Video-based Music Generation

digitado ⋅ 10 de February de 2026

arXiv:2602.07063v1 Announce Type: new Abstract: As the volume of video content on the internet grows rapidly, finding a suitable soundtrack remains a significant challenge. This thesis presents EMSYNC (EMotion and SYNChronization), a fast, free, and automatic solution that generates music tailored to the input video, enabling content creators to enhance their productions without composing or licensing music. Our model creates music that is emotionally and rhythmically synchronized with the video. A core component of EMSYNC is a novel […]

Ver mais

Like 0

Liked Liked