digitado – Page 24

Training-Conditional Coverage Bounds under Covariate Shift

digitado ⋅ 9 de February de 2026

arXiv:2405.16594v4 Announce Type: replace Abstract: Conformal prediction methodology has recently been extended to the covariate shift setting, where the distribution of covariates differs between training and test data. While existing results ensure that the prediction sets from these methods achieve marginal coverage above a nominal level, their coverage rate conditional on the training dataset (referred to as training-conditional coverage) remains unexplored. In this paper, we address this gap by deriving upper bounds on the tail of the training-conditional […]

Ver mais

Like 0

Liked Liked

technocracy

Synapse Compendium Aware Federated Knowledge Exchange for Tool Routed LLMs

digitado ⋅ 3 de February de 2026

arXiv:2602.00911v1 Announce Type: new Abstract: Collaborative learning among LLM-based agents under federated learning faces challenges, including communication costs, heterogeneity in data, and tool-usage, limiting their effectiveness. We introduce Synapse, a framework that trains a shared global knowledge model of tool-usage behavior. Client agents with fixed LLMs learn tool-usage patterns locally, and transmit artifacts for federated aggregation through coordinators. A global tool compendium is updated and redistributed, enabling convergence toward stable tool selection. Synapse uses templated representations, embedding retrieval […]

Ver mais

Like 0

Liked Liked

technocracy

How Does Unfaithful Reasoning Emerge from Autoregressive Training? A Study of Synthetic Experiments

digitado ⋅ 3 de February de 2026

arXiv:2602.01017v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning generated by large language models (LLMs) is often unfaithful: intermediate steps can be logically inconsistent or fail to reflect the causal relationship leading to the final answer. Despite extensive empirical observations, a fundamental understanding of CoT is lacking–what constitutes faithful CoT reasoning, and how unfaithfulness emerges from autoregressive training. We study these questions using well-controlled synthetic experiments, training small transformers on noisy data to solve modular arithmetic expressions step by […]

Ver mais

Like 0

Liked Liked

technocracy

VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors

digitado ⋅ 7 de April de 2026

arXiv:2604.02486v1 Announce Type: new Abstract: Vision Language Models (VLMs) achieve impressive performance across a wide range of multimodal tasks. However, on some tasks that demand fine-grained visual perception, they often fail even when the required information is present in their internal representations. In this work, we demonstrate that this gap arises from their narrow training pipeline which focuses on moving visual information to the textual space. Consequently, VLMs can only reason about visual entities that can be mapped […]

Ver mais

Like 0

Liked Liked

technocracy

Temperature Scaling Attack Disrupting Model Confidence in Federated Learning

digitado ⋅ 6 de February de 2026

Predictive confidence serves as a foundational control signal in mission-critical systems, directly governing risk-aware logic such as escalation, abstention, and conservative fallback. While prior federated learning attacks predominantly target accuracy or implant backdoors, we identify confidence calibration as a distinct attack objective. We present the Temperature Scaling Attack (TSA), a training-time attack that degrades calibration while preserving accuracy. By injecting temperature scaling with learning rate-temperature coupling during local training, malicious updates maintain benign-like optimization behavior, evading accuracy-based monitoring […]

Ver mais

Like 0

Liked Liked

technocracy

ACL 2026 first author with weak GPA. How should I approach PhD applications? [D]

digitado ⋅ 17 de June de 2026

Hi everyone, I have a fairly weak undergraduate: a 3.3/5 GPA in Computer Engineering from an average Nigerian university. For my Master’s, I studied Artificial Intelligence at an average European university, where I finished with an 8/10 GPA. A condensed version of my Master’s thesis was recently accepted at ACL 2026, with a meta-review score of 8/10 and a confidence score of 5/5. It’s scheduled for presentation next month. I want to pursue a PhD focused on expanding […]

Ver mais

Like 0

Liked Liked

technocracy

Bias-Aware Conformal Prediction for Metric-Based Imaging Pipelines

digitado ⋅ 27 de January de 2026

arXiv:2410.05263v2 Announce Type: replace Abstract: Reliable confidence measures of metrics derived from medical imaging reconstruction pipelines would improve the standard of decision-making in many clinical workflows. Conformal Prediction (CP) provides a robust framework for producing calibrated prediction intervals, but standard CP formulations face a critical challenge in the imaging pipeline: common mismatches between image reconstruction objectives and downstream metrics can introduce systematic prediction deviations from ground truth values, known as bias. These biases in turn compromise the efficiency […]

Ver mais

Like 0

Liked Liked

technocracy

Google details new 24-hour process to sideload unverified Android apps

digitado ⋅ 19 de March de 2026

Google is planning big changes for Android in 2026 aimed at combating malware across the entire device ecosystem. Starting in September, Google will begin restricting application sideloading with its developer verification program, but not everyone is on board. Android Ecosystem President Sameer Samat tells Ars that the company has been listening to feedback, and the result is the newly unveiled advanced flow, which will allow power users to skip app verification. With its new limits on sideloading, Android […]

Ver mais

Like 0

Liked Liked

technocracy

Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards

digitado ⋅ 5 de March de 2026

Recently, Automatic Speech Recognition (ASR) systems (e.g., Whisper) have achieved remarkable accuracy improvements but remain highly sensitive to real-world unseen data (data with large distribution shifts), including noisy environments and diverse accents. To address this issue, test-time adaptation (TTA) has shown great potential in improving the model adaptability at inference time without ground-truth labels, and existing TTA methods often rely on pseudo-labeling or entropy minimization. However, by treating model confidence as a learning signal, these methods may reinforce […]

Ver mais

Like 0

Liked Liked

technocracy

“The last straw”—RFK Jr.’s anti-vaccine ally angrily quits CDC panel after spat

digitado ⋅ 25 de March de 2026

One of the federal vaccine advisors hand-selected by anti-vaccine Health Secretary Robert F. Kennedy Jr. has angrily resigned from his position, complaining of “drama” amid a spat with a spokesperson. Robert Malone—a former researcher turned outspoken anti-vaccine activist and conspiracy theorist—confirmed he was stepping down Tuesday afternoon to CQ Roll Call, which first reported the news. He told the outlet that his decision to quit came after a “miscommunication” about the fate of the Centers for Disease Control […]

Ver mais

Like 0

Liked Liked