digitado

About digitado

https://www.digitado.com.br

Posts by :

TelcoAgent-Bench: A Multilingual Benchmark for Telecom AI Agents

digitado ⋅ 9 de April de 2026

arXiv:2604.06209v1 Announce Type: new Abstract: The integration of large language model (LLM) agents into telecom networks introduces new challenges, related to intent recognition, tool execution, and resolution generation, while taking into consideration different operational constraints. In this paper, we introduce TelcoAgent-Bench and TelcoAgent-Metrics, a Telecom-specific benchmarking framework for evaluating multilingual telecom LLM agents. The proposed framework assesses the semantic understanding as well as process-level alignment with structured troubleshooting flows and stability across repeated scenario variations. Our contribution includes […]

Ver mais

Like 0

Liked Liked

technocracy

Extracting Breast Cancer Phenotypes from Clinical Notes: Comparing LLMs with Classical Ontology Methods

digitado ⋅ 9 de April de 2026

arXiv:2604.06208v1 Announce Type: new Abstract: A significant amount of data held in Oncology Electronic Medical Records (EMRs) is contained in unstructured provider notes — including but not limited to the chemotherapy (or cancer treatment) outcome, different biomarkers, the tumor’s location, sizes, and growth patterns of a patient. The clinical studies show that the majority of oncologists are comfortable providing these valuable insights in their notes in a natural language rather than the relevant structured fields of an EMR. […]

Ver mais

Like 0

Liked Liked

technocracy

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

digitado ⋅ 9 de April de 2026

arXiv:2604.06207v1 Announce Type: new Abstract: This paper investigates demonstration selection strategies for predicting a user’s next point-of-interest (POI) using large language models (LLMs), aiming to accurately forecast a user’s subsequent location based on historical check-in data. While in-context learning (ICL) with LLMs has recently gained attention as a promising alternative to traditional supervised approaches, the effectiveness of ICL significantly depends on the selected demonstration. Although previous studies have examined methods such as random selection, embedding-based selection, and task-specific […]

Ver mais

Like 0

Liked Liked

technocracy

The Human Condition as Reflected in Contemporary Large Language Models

digitado ⋅ 9 de April de 2026

arXiv:2604.06206v1 Announce Type: new Abstract: This study seeks to uncover evidence of a latent structure in evolved human culture as it is refracted through contemporary large language models (LLMs). Drawing on parallel responses from six leading generative models to a prompt which asks directly what their training corpora reveal about human culture and behavior, we identify a robust cross-model consensus on a limited set of recurring cultural themes. The themes include narrative meaning-making, affect-first cognition, coalition psychology, status […]

Ver mais

Like 0

Liked Liked

technocracy

Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation

digitado ⋅ 9 de April de 2026

arXiv:2604.06205v1 Announce Type: new Abstract: The growth of online platforms and user content requires strong content moderation systems that can handle complex inputs from various media types. While large language models (LLMs) are effective, their high computational cost and latency present significant challenges for scalable deployment. To address this, we introduce Tool-MCoT, a small language model (SLM) fine-tuned for content safety moderation leveraging external framework. By training our model on tool-augmented chain-of-thought data generated by LLM, we demonstrate […]

Ver mais

Like 0

Liked Liked

technocracy

SensorPersona: An LLM-Empowered System for Continual Persona Extraction from Longitudinal Mobile Sensor Streams

digitado ⋅ 9 de April de 2026

arXiv:2604.06204v1 Announce Type: new Abstract: Personalization is essential for Large Language Model (LLM)-based agents to adapt to users’ preferences and improve response quality and task performance. However, most existing approaches infer personas from chat histories, which capture only self-disclosed information rather than users’ everyday behaviors in the physical world, limiting the ability to infer comprehensive user personas. In this work, we introduce SensorPersona, an LLM-empowered system that continuously infers stable user personas from multimodal longitudinal sensor streams unobtrusively […]

Ver mais

Like 0

Liked Liked

technocracy

Front-End Ethics for Sensor-Fused Health Conversational Agents: An Ethical Design Space for Biometrics

digitado ⋅ 9 de April de 2026

arXiv:2604.06203v1 Announce Type: new Abstract: The integration of continuous data from built-in sensors and Large Language Models (LLMs) has fueled a surge of “Sensor-Fused LLM agents” for personal health and well-being support. While recent breakthroughs have demonstrated the technical feasibility of this fusion (e.g., Time-LLM, SensorLLM), research primarily focuses on “Ethical Back-End Design for Generative AI”, concerns such as sensing accuracy, bias mitigation in training data, and multimodal fusion. This leaves a critical gap at the front end, […]

Ver mais

Like 0

Liked Liked

technocracy

Cross-Lingual Transfer and Parameter-Efficient Adaptation in the Turkic Language Family: A Theoretical Framework for Low-Resource Language Models

digitado ⋅ 9 de April de 2026

arXiv:2604.06202v1 Announce Type: new Abstract: Large language models (LLMs) have transformed natural language processing, yet their capabilities remain uneven across languages. Most multilingual models are trained primarily on high-resource languages, leaving many languages with large speaker populations underrepresented in both training data and evaluation benchmarks. This imbalance is particularly visible in the Turkic language family. This paper proposes a theoretical framework for studying cross-lingual transfer and parameter-efficient adaptation of multilingual LLMs within the Turkic language family, focusing on […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models

digitado ⋅ 9 de April de 2026

arXiv:2604.06201v1 Announce Type: new Abstract: While most reading comprehension benchmarks for LLMs focus on factual information that can be answered by localizing specific textual evidence, many real-world tasks require understanding distributional information, such as population-level trends and preferences expressed across collections of text. We introduce Text2DistBench, a reading comprehension benchmark for evaluating LLMs’ ability to infer distributional knowledge from natural language. Built from real-world YouTube comments about movie and music entities, the benchmark provides models with entity metadata […]

Ver mais

Like 0

Liked Liked

technocracy

Thinking in Graphs with CoMAP: A Shared Visual Workspace for Designing Project-Based Learning

digitado ⋅ 9 de April de 2026

arXiv:2604.06200v1 Announce Type: new Abstract: Designing project-based learning (PBL) demands managing highly interdependent components, a task that both traditional linear tools and purely conversational AI struggle with. Traditional tools fail to capture the non-linear nature of creative design, while conversational systems lack the persistent, shared context necessary for reflective collaboration. Grounded in theories of distributed cognition, we introduce CoMAP, a system that embodies a graph-based collaboration paradigm. By providing a shared visual workspace with dual-modality AI support, CoMAP […]

Ver mais

Like 0

Liked Liked