digitado – Page 197

REVEAL: Multimodal Vision-Language Alignment of Retinal Morphometry and Clinical Risks for Incident AD and Dementia Prediction

digitado ⋅ 22 de April de 2026

arXiv:2604.18757v1 Announce Type: new Abstract: The retina provides a unique, noninvasive window into Alzheimer’s disease (AD) and dementia, capturing early structural changes through morphometric features, while systemic and lifestyle risk factors reflect well-established contributors to disease susceptibility long before clinical symptom onset. However, current retinal analysis frameworks typically model imaging and risk factors separately, limiting their ability to capture joint multimodal patterns critical for early risk prediction. Moreover, existing methods rarely incorporate mechanisms to organize or align patients […]

Ver mais

Like 0

Liked Liked

technocracy

Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings

digitado ⋅ 6 de March de 2026

arXiv:2508.11847v3 Announce Type: replace Abstract: We propose a method for evaluating the robustness of widely used LLM ranking systems — variants of a Bradley–Terry model — to dropping a worst-case very small fraction of preference data. Our approach is computationally fast and easy to adopt. When we apply our method to matchups from popular LLM ranking platforms, including Chatbot Arena and derivatives, we find that the rankings of top-performing models can be remarkably sensitive to the removal of […]

Ver mais

Like 0

Liked Liked

technocracy

AI-based Prediction of Biochemical Recurrence from Biopsy and Prostatectomy Samples

digitado ⋅ 30 de January de 2026

arXiv:2601.21022v1 Announce Type: new Abstract: Biochemical recurrence (BCR) after radical prostatectomy (RP) is a surrogate marker for aggressive prostate cancer with adverse outcomes, yet current prognostic tools remain imprecise. We trained an AI-based model on diagnostic prostate biopsy slides from the STHLM3 cohort (n = 676) to predict patient-specific risk of BCR, using foundation models and attention-based multiple instance learning. Generalizability was assessed across three external RP cohorts: LEOPARD (n = 508), CHIMERA (n = 95), and TCGA-PRAD […]

Ver mais

Like 0

Liked Liked

technocracy

What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

digitado ⋅ 16 de February de 2026

arXiv:2602.12395v1 Announce Type: new Abstract: Reinforcement learning (RL) with verifiable rewards has become a standard post-training stage for boosting visual reasoning in vision-language models, yet it remains unclear what capabilities RL actually improves compared with supervised fine-tuning as cold-start initialization (IN). End-to-end benchmark gains conflate multiple factors, making it difficult to attribute improvements to specific skills. To bridge the gap, we propose a Frankenstein-style analysis framework including: (i) functional localization via causal probing; (ii) update characterization via parameter […]

Ver mais

Like 0

Liked Liked

technocracy

YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

digitado ⋅ 7 de April de 2026

arXiv:2604.02378v1 Announce Type: new Abstract: Forecasting startup success is notoriously difficult, partly because meaningful outcomes, such as exits, large funding rounds, and sustained revenue growth, are rare and can take years to materialize. As a result, signals are sparse and evaluation cycles are slow. Y Combinator batches offer a unique mitigation: each batch comprises around 200 startups, funded simultaneously, with evaluation at Demo Day only three months later. We introduce YC Bench, a live benchmark for forecasting early […]

Ver mais

Like 0

Liked Liked

technocracy

CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging

digitado ⋅ 3 de March de 2026

arXiv:2603.00573v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance on diverse downstream and domain-specific tasks via parameter-efficient fine-tuning (PEFT). However, existing PEFT methods, particularly MoE-LoRA architectures, suffer from limited parameter efficiency and coarse-grained adaptation due to the proliferation of LoRA experts and instance-level routing. To address these issues, we propose Core Space Mixture of LoRA (textbf{CoMoL}), a novel MoE-LoRA framework that incorporates expert diversity, parameter efficiency, and fine-grained adaptation. Specifically, CoMoL introduces two key components: […]

Ver mais

Like 0

Liked Liked

technocracy

Q&A: Expanding MIT’s global reach through Universal Learning

digitado ⋅ 12 de May de 2026

MIT’s Universal Learning is a new initiative from MIT Open Learning designed to prepare learners everywhere to tackle complex global challenges through boundary-crossing thinking. Universal Learning offerings combine subject matter expertise from MIT faculty and experts and Open Learning’s more than 25 years of innovation in online education to deliver a learning experience centered on real-world stories, practical exercises, and the needs of global learners. It is delivered on the MIT Learn platform, leveraging the capabilities of the AskTIM […]

Ver mais

Like 0

Liked Liked

technocracy

From PDFs to Proof Pipelines: Building Audit-Grade Traceability in Regulated Deep-Tech

digitado ⋅ 9 de February de 2026

I’ve seen configuration managed on paper, with change notices stacked on drawings like a to-do pile. I’ve seen teams lose the ability to rebuild the same product in another country because they couldn’t produce a complete BOM. The problem wasn’t “conservatism.” The problem was a broken evidence supply chain. Certification-ready digital threads fix that. Here’s what changed when we rebuilt that supply chain: Audit pack assembly: 2 months → 2 weeks Evidence control: Google Drive (no traceability, low access control) […]

Ver mais

Like 0

Liked Liked

technocracy

Computational Arbitrage in AI Model Markets

digitado ⋅ 25 de March de 2026

arXiv:2603.22404v1 Announce Type: new Abstract: Consider a market of competing model providers selling query access to models with varying costs and capabilities. Customers submit problem instances and are willing to pay up to a budget for a verifiable solution. An arbitrageur efficiently allocates inference budget across providers to undercut the market, thus creating a competitive offering with no model-development risk. In this work, we initiate the study of arbitrage in AI model markets, empirically demonstrating the viability of […]

Ver mais

Like 0

Liked Liked

technocracy

Mango: Multi-Agent Web Navigation via Global-View Optimization

digitado ⋅ 22 de April de 2026

arXiv:2604.18779v1 Announce Type: new Abstract: Existing web agents typically initiate exploration from the root URL, which is inefficient for complex websites with deep hierarchical structures. Without a global view of the website’s structure, agents frequently fall into navigation traps, explore irrelevant branches, or fail to reach target information within a limited budget. We propose Mango, a multi-agent web navigation method that leverages the website structure to dynamically determine optimal starting points. We formulate URL selection as a multi-armed […]

Ver mais

Like 0

Liked Liked