digitado

MaS-VQA: A Mask-and-Select Framework for Knowledge-Based Visual Question Answering

digitado ⋅ 19 de February de 2026

arXiv:2602.15915v1 Announce Type: new Abstract: Knowledge-based Visual Question Answering (KB-VQA) requires models to answer questions by integrating visual information with external knowledge. However, retrieved knowledge is often noisy, partially irrelevant, or misaligned with the visual content, while internal model knowledge is difficult to control and interpret. Naive aggregation of these sources limits reasoning effectiveness and reduces answer accuracy. To address this, we propose MaS-VQA, a selection-driven framework that tightly couples explicit knowledge filtering with implicit knowledge reasoning. MaS-VQA […]

Ver mais

Like 0

Liked Liked

technocracy

Switching water sources improved hygiene of Pompeii’s public baths

digitado ⋅ 12 de January de 2026

The eruption of Mount Vesuvius in 79 CE released thermal energy roughly equivalent to 100,000 times the atomic bombs dropped on Hiroshima and Nagasaki at the end of World War II, spewing molten rock, pumice, and hot ash over Pompeii. Pompeii’s public baths, aqueduct, and water towers were among the preserved structures frozen in time. A new paper published in the Proceedings of the National Academy of Sciences analyzed calcium carbonate deposits from those structures to learn more […]

Ver mais

Like 0

Liked Liked

technocracy

Simulation-Based Inference via Regression Projection and Batched Discrepancies

digitado ⋅ 4 de February de 2026

arXiv:2602.03613v1 Announce Type: cross Abstract: We analyze a lightweight simulation-based inference method that infers simulator parameters using only a regression-based projection of the observed data. After fitting a surrogate linear regression once, the procedure simulates small batches at the proposed parameter values and assigns kernel weights based on the resulting batch-residual discrepancy, producing a self-normalized pseudo-posterior that is simple, parallelizable, and requires access only to the fitted regression coefficients rather than raw observations. We formalize the construction as […]

Ver mais

Like 0

Liked Liked

technocracy

Learning to Communicate Across Modalities: Perceptual Heterogeneity in Multi-Agent Systems

digitado ⋅ 29 de January de 2026

Emergent communication offers insight into how agents develop shared structured representations, yet most research assumes homogeneous modalities or aligned representational spaces, overlooking the perceptual heterogeneity of real-world settings. We study a heterogeneous multi-step binary communication game where agents differ in modality and lack perceptual grounding. Despite perceptual misalignment, multimodal systems converge to class-consistent messages grounded in perceptual input. Unimodal systems communicate more efficiently, using fewer bits and achieving lower classification entropy, while multimodal agents require greater information exchange […]

Ver mais

Like 0

Liked Liked

technocracy

Review Beats Planning: Dual-Model Interaction Patterns for Code Synthesis

digitado ⋅ 5 de March de 2026

arXiv:2603.03406v1 Announce Type: new Abstract: How should two language models interact to produce better code than either can alone? The conventional approach — a reasoning model plans, a code specialist implements — seems natural but fails: on HumanEval+, plan-then-code degrades performance by 2.4 percentage points versus the code specialist alone. We show that reversing the interaction changes everything. When the code specialist generates freely and the reasoning model reviews instead of plans, the same two models on the […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

digitado ⋅ 3 de March de 2026

Graph-based tasks in the zero-shot setting remain a significant challenge due to data scarcity and the inability of traditional Graph Neural Networks (GNNs) to generalize to unseen domains or label spaces. While recent advancements have transitioned toward leveraging Large Language Models (LLMs) as predictors to enhance GNNs, these methods often suffer from cross-modal alignment issues. A recent paradigm (i.e., Graph-R1) overcomes the aforementioned architectural dependencies by adopting a purely text-based format and utilizing LLM-based graph reasoning, showing improved […]

Ver mais

Like 0

Liked Liked

technocracy

Guide to Hugging Face AutoModelFor** Classes and Tokenizers

digitado ⋅ 11 de January de 2026

Understanding SentenceTransformer Vs AutoTokenizer + AutoModel A tokenizer such as AutoTokenizer simply converts the words into tokens ( A numerical representation of text) however this alone doesnt produce sentence embeddings Sentencetransformer() does both tokenization and embedding computations automatically it also applies pooling(typically mean pooling) to hidden states resulting a final sentence embedding that can be directly used for various NLP tasks from sentence_transformers import SentenceTransformermodel = SentenceTransformer(“sentence-transformers/all-MiniLM-L6-v2”)sentences = [“I love machine learning”, “I am expert in AI”]embeddings = model.encode(sentences) […]

Ver mais

Like 0

Liked Liked

technocracy

A Systematic Method for Evaluating the Generalizability of Mobile-Specific Research: Green Computing as a Case Study

digitado ⋅ 6 de January de 2026

Mobile Software Engineering has emerged as a distinct subfield, raising questions about the transferability of its research findings to general Software Engineering. This paper addresses the challenge of evaluating the generalizability of mobile-specific research, using Green Computing as a representative case. We propose a systematic method that combines a mapping study to identify potentially overlooked mobile-specific papers with a focused literature review to assess their broader relevance. Applying this approach, we find that several mobile-specific studies offer insights […]

Ver mais

Like 0

Liked Liked

technocracy

Verdict: Yes, you should go see Project Hail Mary as soon as possible

digitado ⋅ 11 de March de 2026

First, in the plainest language, before we get to anything else, Project Hail Mary is a fantastic film. It does right by its source material, and it also easily stands on its own for folks who haven’t read the book. It comes out on March 20, and if you’re a regular Ars Technica reader, you will almost certainly enjoy the crap out of it. Go see it as soon as you can, and see it in a theater […]

Ver mais

Like 0

Liked Liked

technocracy

Structured Matching via Cost-Regularized Unbalanced Optimal Transport

digitado ⋅ 9 de January de 2026

arXiv:2511.19075v2 Announce Type: replace Abstract: Unbalanced optimal transport (UOT) provides a flexible way to match or compare nonnegative finite Radon measures. However, UOT requires a predefined ground transport cost, which may misrepresent the data’s underlying geometry. Choosing such a cost is particularly challenging when datasets live in heterogeneous spaces, often motivating practitioners to adopt Gromov-Wasserstein formulations. To address this challenge, we introduce cost-regularized unbalanced optimal transport (CR-UOT), a framework that allows the ground cost to vary while allowing […]

Ver mais

Like 0

Liked Liked