digitado

About digitado

https://www.digitado.com.br

Pasted File Editor

digitado ⋅ June 2, 2026

Tool: Pasted File Editor. I really like how you can paste a large volume of text into claude.ai (or the Claude desktop/mobile apps) and it will detect it as a large paste and turn it into a file attachment instead. I decided to have Codex desktop build me a version of that as a prototype. You can also open files directly, including images, which are shown as thumbnails, or drag files onto the textarea. Tags: […]


technocracy

Riemannian Stochastic Optimization for Sufficient Dimension Reduction

digitado ⋅ June 2, 2026

arXiv:2606.00413v1 Announce Type: new Abstract: Sufficient dimension reduction (SDR) makes high-dimensional regression tractable by projecting the covariates onto a low-dimensional subspace that preserves the conditional mean of the response. Existing gradient-based estimators either operate in the ambient space and suffer from the curse of dimensionality, or localize in the reduced space at a per-outer-iteration cost at least quadratic in the sample size. We show that minimizers of the population Minimum Average Variance Estimation (MAVE) risk approximate the same […]


technocracy

ERICA: Quantifying Replicability of Cluster Analysis

digitado ⋅ June 2, 2026

arXiv:2606.00302v1 Announce Type: new Abstract: Despite being ubiquitous in science, clustering remains a technique whose results are not quantitatively scrutinized via a framework. We present an analysis called evaluating replicability via iterative clustering assignments (ERICA) that is applied to a dataset to determine whether clusters are identified in a replicable manner. The pipeline computes a statistic that describes whether structure is found in a dataset. Quantitative visualization methods are presented to answer important questions such as the similarity […]


technocracy

Is Zero-Shot Super-Resolution Possible in Operator Learning?

digitado ⋅ June 2, 2026

arXiv:2606.00296v1 Announce Type: new Abstract: Neural operators are often reported to exhibit zero-shot super-resolution, a phenomenon in which a model trained on coarse grids produces accurate predictions on finer testing grids without additional retraining. Despite strong empirical evidence, the theoretical foundations of this phenomenon remain unclear. In this work, we provide a systematic theoretical study of zero-shot super-resolution in operator learning. We first show that zero-shot super-resolution can be information-theoretically impossible even in benign settings such as when […]


technocracy

Out-of-Distribution generalization of quantile regression with heavy tailed inputs: an SVM approach

digitado ⋅ June 2, 2026

arXiv:2606.00265v1 Announce Type: new Abstract: We study quantile regression in an extrapolation regime where the covariate takes unusually large values. Under regular variation assumptions, extreme observations can be effectively characterized through their angular components, enabling learning strategies that focus on the angle of the most extreme observations. This approach is formalized through the minimization of an asymptotic conditional risk that localizes learning in the tail of the covariate distribution. We propose a novel Support Vector Machine (SVM) framework […]


technocracy

Interpreting FCDNNs via RG on Exponential Family

digitado ⋅ June 2, 2026

arXiv:2606.00157v1 Announce Type: new Abstract: We consider establishing the interpretability theory of deep learning through constructing a corresponding relationship between the renormalization group (RG) method in statistical physics and the training process of deep neural networks (DNNs). We have proved the constructed relationship using the one-dimensional Ising model as the input data. In this paper we generalize our results to the case of continuous input data, which is a necessary preparation for applying the corresponding framework to real-world […]


technocracy

Learning Multi-Modal Trajectory Policies for Data-Efficient Robotic Manipulation

digitado ⋅ June 2, 2026

arXiv:2606.01047v1 Announce Type: new Abstract: Robotic manipulation requires the effective integration of heterogeneous inputs, including visual observations, language instructions, and trajectory representations, to generate accurate actions. Existing transformer-based policies typically process these heterogeneous modalities within a shared parameter space, which often leads to modality interference and inefficient representation learning, especially in data-scarce scenarios. While Mixture-of-Experts (MoE) offers a scalable solution through expert specialization, conventional routing mechanisms are often sensitive to such cross-modal representation discrepancies, resulting in unstable expert […]


technocracy

TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents

digitado ⋅ June 2, 2026

arXiv:2606.01046v1 Announce Type: new Abstract: The development of Large Language Models (LLMs) has significantly improved travel planning applications, yet evaluating such models is hampered by three limitations of existing benchmarks: 1) overemphasis on constraint compliance, neglecting multi-dimensional qualities like spatio-temporal cost; 2) datasets lacking real-world authenticity and coverage in key areas (e.g., lodging, transport); and 3) isolated daily plan assessments that miss critical details (e.g., the impact of daily accommodation and visit pacing) needed to evaluate the entire plan. To address […]


technocracy

Child-directed speech facilitates production, not comprehension, in BabyLMs

digitado ⋅ June 2, 2026

arXiv:2606.01045v1 Announce Type: new Abstract: Recent studies suggest that child-directed speech (CDS) is not conducive to language learning in BabyLMs. However, current evaluations focus predominantly on comprehension rather than production, which is central to usage-based theories of language acquisition; these theories argue that CDS facilitates early language use through constructional “frames” (frequent lexical patterns with open slots). We introduce a novel generation-based evaluation inspired by such theories in the form of a frame-completion task, and compare Llama models trained with CDS, […]


technocracy

Ask4VG: Risk-Aware Question Selection for Reducing Prior-Driven Answers in Medical VQA

digitado ⋅ June 2, 2026

arXiv:2606.01044v1 Announce Type: new Abstract: Medical visual question answering requires models to ground their responses in image evidence, because visually unsupported answers can mislead downstream interpretation. However, many medical VQA questions are generic, template-like, or highly similar in form, which can encourage models to learn question-answer shortcuts instead of image-dependent reasoning and thereby increase the risk of hallucinated responses. We propose Ask4VG, a label-free pilot framework for risk-aware question selection. Ask4VG estimates question-induced hallucination risk through counterfactual visual […]
