January 2026

PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient Diagnosis

digitado ⋅ January 19, 2026

arXiv:2601.10945v1 Announce Type: new Abstract: Traditionally, AI research in medical diagnosis has largely centered on image analysis. While this has led to notable advancements, the absence of patient-reported symptoms continues to hinder diagnostic accuracy. To address this, we propose a Pre-Consultation Dialogue Framework (PCDF) that mimics real-world diagnostic procedures, where doctors iteratively query patients before reaching a conclusion. Specifically, we simulate diagnostic dialogues between two vision-language models (VLMs): a DocVLM, which generates follow-up questions based on the image […]

technocracy

PRISM: Personalized Recommendation via Information Synergy Module

digitado ⋅ January 19, 2026

arXiv:2601.10944v1 Announce Type: new Abstract: Multimodal sequential recommendation (MSR) leverages diverse item modalities to improve recommendation accuracy, while achieving effective and adaptive fusion remains challenging. Existing MSR models often overlook synergistic information that emerges only through modality combinations. Moreover, they typically assume a fixed importance for different modality interactions across users. To address these limitations, we propose Personalized Recommendation via Information Synergy Module (PRISM), a plug-and-play framework for sequential recommendation (SR). PRISM explicitly decomposes multimodal information into unique, […]

Change And Cover: Last-Mile, Pull Request-Based Regression Test Augmentation

digitado ⋅ January 19, 2026

arXiv:2601.10942v1 Announce Type: new Abstract: Software is in constant evolution, with developers frequently submitting pull requests (PRs) to introduce new features or fix bugs. Testing PRs is critical to maintaining software quality. Yet, even in projects with extensive test suites, some PR-modified lines remain untested, leaving a “last-mile” regression test gap. Existing test generators typically aim to improve overall coverage, but do not specifically target the uncovered lines in PRs. We present Change And Cover (ChaCo), an LLM-based […]

HOSL: Hybrid-Order Split Learning for Memory-Constrained Edge Training

digitado ⋅ January 19, 2026

arXiv:2601.10940v1 Announce Type: new Abstract: Split learning (SL) enables collaborative training of large language models (LLMs) between resource-constrained edge devices and compute-rich servers by partitioning model computation across the network boundary. However, existing SL systems predominantly rely on first-order (FO) optimization, which requires clients to store intermediate quantities such as activations for backpropagation. This results in substantial memory overhead, largely negating benefits of model partitioning. In contrast, zeroth-order (ZO) optimization eliminates backpropagation and significantly reduces memory usage, but […]

Can Instructed Retrieval Models Really Support Exploration?

digitado ⋅ January 19, 2026

arXiv:2601.10936v1 Announce Type: new Abstract: Exploratory searches are characterized by under-specified goals and evolving query intents. In such scenarios, retrieval models that can capture user-specified nuances in query intent and adapt results accordingly are desirable — instruction-following retrieval models promise such a capability. In this work, we evaluate instructed retrievers for the prevalent yet under-explored application of aspect-conditional seed-guided exploration using an expert-annotated test collection. We evaluate both recent LLMs fine-tuned for instructed retrieval and general-purpose LLMs prompted […]

Tail-Aware Data Augmentation for Long-Tail Sequential Recommendation

digitado ⋅ January 19, 2026

arXiv:2601.10933v1 Announce Type: new Abstract: Sequential recommendation (SR) learns user preferences based on their historical interaction sequences and provides personalized suggestions. In real-world scenarios, most users can only interact with a handful of items, while the majority of items are seldom consumed. This pervasive long-tail challenge limits the model’s ability to learn user preferences. Despite previous efforts to enrich tail items/users with knowledge from head parts or improve tail learning through additional contextual information, they still face the […]

Sparse Data Tree Canopy Segmentation: Fine-Tuning Leading Pretrained Models on Only 150 Images

digitado ⋅ January 19, 2026

arXiv:2601.10931v1 Announce Type: new Abstract: Tree canopy detection from aerial imagery is an important task for environmental monitoring, urban planning, and ecosystem analysis. Simulating real-life data annotation scarcity, the Solafune Tree Canopy Detection competition provides a small and imbalanced dataset of only 150 annotated images, posing significant challenges for training deep models without severe overfitting. In this work, we evaluate five representative architectures, YOLOv11, Mask R-CNN, DeepLabv3, Swin-UNet, and DINOv2, to assess their suitability for canopy segmentation under […]

Where to Touch, How to Contact: Hierarchical RL-MPC Framework for Geometry-Aware Long-Horizon Dexterous Manipulation

digitado ⋅ January 19, 2026

arXiv:2601.10930v1 Announce Type: new Abstract: A key challenge in contact-rich dexterous manipulation is the need to jointly reason over geometry, kinematic constraints, and intricate, nonsmooth contact dynamics. End-to-end visuomotor policies bypass this structure, but often require large amounts of data, transfer poorly from simulation to reality, and generalize weakly across tasks/embodiments. We address those limitations by leveraging a simple insight: dexterous manipulation is inherently hierarchical – at a high level, a robot decides where to touch (geometry) and […]

Secure Data Bridging in Industry 4.0: An OPC UA Aggregation Approach for Including Insecure Legacy Systems

digitado ⋅ January 19, 2026

arXiv:2601.10929v1 Announce Type: new Abstract: The increased connectivity of industrial networks has led to a surge in cyberattacks, emphasizing the need for cybersecurity measures tailored to the specific requirements of industrial systems. Modern Industry 4.0 technologies, such as OPC UA, offer enhanced resilience against these threats. However, widespread adoption remains limited due to long installation times, proprietary technology, restricted flexibility, and formal process requirements (e.g. safety certifications). Consequently, many systems do not yet implement these technologies, or only […]

Selecting Language Models for Social Science: Start Small, Start Open, and Validate

digitado ⋅ January 19, 2026

arXiv:2601.10926v1 Announce Type: new Abstract: Currently, there are thousands of large pretrained language models (LLMs) available to social scientists. How do we select among them? Using validity, reliability, reproducibility, and replicability as guides, we explore the significance of: (1) model openness, (2) model footprint, (3) training data, and (4) model architectures and fine-tuning. While ex-ante tests of validity (i.e., benchmarks) are often privileged in these discussions, we argue that social scientists cannot altogether avoid validating computational measures (ex-post). […]
