digitado

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy

digitado ⋅ 27 de March de 2026

arXiv:2603.24690v1 Announce Type: new Abstract: In-context Learning enables training-free adaptation via demonstrations but remains highly sensitive to example selection and formatting. In unified multimodal models spanning understanding and generation, this sensitivity is exacerbated by cross-modal interference and varying cognitive demands. Consequently, In-context Learning efficacy is often non-monotonic and highly task-dependent. To diagnose these behaviors, we introduce a six-level capability-oriented taxonomy that categorizes the functional role of demonstrations from basic perception to high-order discernment. Guided by this cognitive framework, […]

Ver mais

Like 0

Liked Liked

technocracy

Get ready for new Macs and iPads: Apple announces “Special Experience” on March 4

digitado ⋅ 16 de February de 2026

It may be more tempting to take that aging Mac you’ve been coddling and put it out to pasture soon. Apple has announced an event for March 4, which in usual Apple fashion, it has branded a “Special Apple Experience.” Also in usual Apple fashion, it has not come out and said what it’s going to be announcing. We have a pretty good idea, though. The event will kick off at 9AM ET on March 4—Ars will be […]

Ver mais

Like 0

Liked Liked

technocracy

From LLMs to Agents in Programming: The Impact of Providing an LLM with a Compiler

digitado ⋅ 21 de January de 2026

arXiv:2601.12146v1 Announce Type: new Abstract: Large Language Models have demonstrated a remarkable capability in natural language and program generation and software development. However, the source code generated by the LLMs does not always meet quality requirements and may fail to compile. Therefore, many studies evolve into agents that can reason about the problem before generating the source code for the solution. The goal of this paper is to study the degree to which such agents benefit from access […]

Ver mais

Like 0

Liked Liked

technocracy

ESSAM: A Novel Competitive Evolution Strategies Approach to Reinforcement Learning for Memory Efficient LLMs Fine-Tuning

digitado ⋅ 3 de February de 2026

arXiv:2602.01003v1 Announce Type: new Abstract: Reinforcement learning (RL) has become a key training step for improving mathematical reasoning in large language models (LLMs), but it often has high GPU memory usage, which makes it hard to use in settings with limited resources. To reduce these issues, we propose Evolution Strategies with Sharpness-Aware Maximization (ESSAM), a full parameter fine-tuning framework that tightly combines the zero-order search in parameter space from Evolution Strategies (ES) with the Sharpness-Aware Maximization (SAM) to […]

Ver mais

Like 0

Liked Liked

technocracy

IA2 Preprocessing: Establishing the Foundation for Index Selection

digitado ⋅ 6 de January de 2026

Table of Links Abstract and 1. Introduction Related Works 2.1 Traditional Index Selection Approaches 2.2 RL-based Index Selection Approaches Index Selection Problem Methodology 4.1 Formulation of the DRL Problem 4.2 Instance-Aware Deep Reinforcement Learning for Efficient Index Selection System Framework of IA2 5.1 Preprocessing Phase 5.2 RL Training and Application Phase Experiments 6.1 Experimental Setting 6.2 Experimental Results 6.3 End-to-End Performance Comparison 6.4 Key Insights Conclusion and Future Work, and References 5.1 Preprocessing Phase The preprocessing phase is […]

Ver mais

Like 0

Liked Liked

technocracy

Line-based Event Preprocessing: Towards Low-Energy Neuromorphic Computer Vision

digitado ⋅ 19 de January de 2026

arXiv:2601.10742v1 Announce Type: new Abstract: Neuromorphic vision made significant progress in recent years, thanks to the natural match between spiking neural networks and event data in terms of biological inspiration, energy savings, latency and memory use for dynamic visual data processing. However, optimising its energy requirements still remains a challenge within the community, especially for embedded applications. One solution may reside in preprocessing events to optimise data quantity thus lowering the energy cost on neuromorphic hardware, proportional to […]

Ver mais

Like 0

Liked Liked

technocracy

A practical guide to Amazon Nova Multimodal Embeddings

digitado ⋅ 5 de February de 2026

Embedding models power many modern applications—from semantic search and Retrieval-Augmented Generation (RAG) to recommendation systems and content understanding. However, selecting an embedding model requires careful consideration—after you’ve ingested your data, migrating to a different model means re-embedding your entire corpus, rebuilding vector indexes, and validating search quality from scratch. The right embedding model should deliver strong baseline performance, adapt to your specific use-case, and support the modalities you need now and in the future. The Amazon Nova Multimodal […]

Ver mais

Like 0

Liked Liked

technocracy

Koopman Operator Identification of Model Parameter Trajectories for Temporal Domain Generalization (KOMET)

digitado ⋅ 31 de March de 2026

arXiv:2603.26923v1 Announce Type: new Abstract: Parametric models deployed in non-stationary environments degrade as the underlying data distribution evolves over time (a phenomenon known as temporal domain drift). In the current work, we present KOMET (Koopman Operator identification of Model parameter Evolution under Temporal drift), a model-agnostic, data-driven framework that treats the sequence of trained parameter vectors as the trajectory of a nonlinear dynamical system and identifies its governing linear operator via Extended Dynamic Mode Decomposition (EDMD). A warm-start […]

Ver mais

Like 0

Liked Liked

technocracy

On damage of interpolation to adversarial robustness in regression

digitado ⋅ 23 de January de 2026

arXiv:2601.16070v1 Announce Type: new Abstract: Deep neural networks (DNNs) typically involve a large number of parameters and are trained to achieve zero or near-zero training error. Despite such interpolation, they often exhibit strong generalization performance on unseen data, a phenomenon that has motivated extensive theoretical investigations. Comforting results show that interpolation indeed may not affect the minimax rate of convergence under the squared error loss. In the mean time, DNNs are well known to be highly vulnerable to […]

Ver mais

Like 0

Liked Liked

technocracy

Provably Convergent Actor-Critic in Risk-averse MARL

digitado ⋅ 16 de February de 2026

arXiv:2602.12386v1 Announce Type: new Abstract: Learning stationary policies in infinite-horizon general-sum Markov games (MGs) remains a fundamental open problem in Multi-Agent Reinforcement Learning (MARL). While stationary strategies are preferred for their practicality, computing stationary forms of classic game-theoretic equilibria is computationally intractable — a stark contrast to the comparative ease of solving single-agent RL or zero-sum games. To bridge this gap, we study Risk-averse Quantal response Equilibria (RQE), a solution concept rooted in behavioral game theory that incorporates […]

Ver mais

Like 0

Liked Liked