January 2026

Explaining Generalization of AI-Generated Text Detectors Through Linguistic Analysis

digitado ⋅ 14 de January de 2026

arXiv:2601.07974v1 Announce Type: new Abstract: AI-text detectors achieve high accuracy on in-domain benchmarks, but often struggle to generalize across different generation conditions such as unseen prompts, model families, or domains. While prior work has reported these generalization gaps, there are limited insights about the underlying causes. In this work, we present a systematic study aimed at explaining generalization behavior through linguistic analysis. We construct a comprehensive benchmark that spans 6 prompting strategies, 7 large language models (LLMs), and […]

Ver mais

Like 0

Liked Liked

technocracy

Cultural Compass: A Framework for Organizing Societal Norms to Detect Violations in Human-AI Conversations

digitado ⋅ 14 de January de 2026

arXiv:2601.07973v1 Announce Type: new Abstract: Generative AI models ought to be useful and safe across cross-cultural contexts. One critical step toward this goal is understanding how AI models adhere to sociocultural norms. While this challenge has gained attention in NLP, existing work lacks both nuance and coverage in understanding and evaluating models’ norm adherence. We address these gaps by introducing a taxonomy of norms that clarifies their contexts (e.g., distinguishing between human-human norms that models should recognize and […]

Ver mais

Like 0

Liked Liked

technocracy

Knowing But Not Doing: Convergent Morality and Divergent Action in LLMs

digitado ⋅ 14 de January de 2026

arXiv:2601.07972v1 Announce Type: new Abstract: Value alignment is central to the development of safe and socially compatible artificial intelligence. However, how Large Language Models (LLMs) represent and enact human values in real-world decision contexts remains under-explored. We present ValAct-15k, a dataset of 3,000 advice-seeking scenarios derived from Reddit, designed to elicit ten values defined by Schwartz Theory of Basic Human Values. Using both the scenario-based questions and the traditional value questionnaire, we evaluate ten frontier LLMs (five from […]

Ver mais

Like 0

Liked Liked

technocracy

Sesame Plant Segmentation Dataset: A YOLO Formatted Annotated Dataset

digitado ⋅ 14 de January de 2026

arXiv:2601.07970v1 Announce Type: new Abstract: This paper presents the Sesame Plant Segmentation Dataset, an open source annotated image dataset designed to support the development of artificial intelligence models for agricultural applications, with a specific focus on sesame plants. The dataset comprises 206 training images, 43 validation images, and 43 test images in YOLO compatible segmentation format, capturing sesame plants at early growth stages under varying environmental conditions. Data were collected using a high resolution mobile camera from farms […]

Ver mais

Like 0

Liked Liked

technocracy

Efficient Synthesis for Two-Dimensional Strand Arrays with Row Constraints

digitado ⋅ 14 de January de 2026

arXiv:2601.07968v1 Announce Type: new Abstract: We study the theoretical problem of synthesizing multiple DNA strands under spatial constraints, motivated by large-scale DNA synthesis technologies. In this setting, strands are arranged in an array and synthesized according to a fixed global synthesis sequence, with the restriction that at most one strand per row may be synthesized in any synthesis cycle. We focus on the basic case of two strands in a single row and analyze the expected completion time […]

Ver mais

Like 0

Liked Liked

technocracy

Scattered Data Histopolation in Averaging Kernel Hilbert Spaces

digitado ⋅ 14 de January de 2026

arXiv:2601.07967v1 Announce Type: new Abstract: Kernel-based methods offer a powerful and flexible mathematical framework for addressing histopolation problems. In histopolation, the available input data does not consist of pointwise function samples but of averages taken over intervals or higher-dimensional regions, and these mean values serve as a basis for reconstructing or approximating the target function. While classical interpolation requires continuity of the underlying function, histopolation can be performed in larger function spaces. In the framework of kernel methods, […]

Ver mais

Like 0

Liked Liked

technocracy

DataScribe: An AI-Native, Policy-Aligned Web Platform for Multi-Objective Materials Design and Discovery

digitado ⋅ 14 de January de 2026

arXiv:2601.07966v1 Announce Type: new Abstract: The acceleration of materials discovery requires digital platforms that go beyond data repositories to embed learning, optimization, and decision-making directly into research workflows. We introduce DataScribe, an AI-native, cloud-based materials discovery platform that unifies heterogeneous experimental and computational data through ontology-backed ingestion and machine-actionable knowledge graphs. The platform integrates FAIR-compliant metadata capture, schema and unit harmonization, uncertainty-aware surrogate modeling, and native multi-objective multi-fidelity Bayesian optimization, enabling closed-loop propose-measure-learn workflows across experimental and computational […]

Ver mais

Like 0

Liked Liked

technocracy

When Models Know When They Do Not Know: Calibration, Cascading, and Cleaning

digitado ⋅ 14 de January de 2026

arXiv:2601.07965v1 Announce Type: new Abstract: When a model knows when it does not know, many possibilities emerge. The first question is how to enable a model to recognize that it does not know. A promising approach is to use confidence, computed from the model’s internal signals, to reflect its ignorance. Prior work in specific domains has shown that calibration can provide reliable confidence estimates. In this work, we propose a simple, effective, and universal training-free method that applies […]

Ver mais

Like 0

Liked Liked

technocracy

Executable Ontologies in Game Development: From Algorithmic Control to Semantic World Modeling

digitado ⋅ 14 de January de 2026

arXiv:2601.07964v1 Announce Type: new Abstract: This paper examines the application of Executable Ontologies (EO), implemented through the boldsea framework, to game development. We argue that EO represents a paradigm shift: a transition from algorithmic behavior programming to semantic world modeling, where agent behavior emerges naturally from declarative domain rules rather than being explicitly coded. Using a survival game scenario (Winter Feast), we demonstrate how EO achieves prioritybased task interruption through dataflow conditions rather than explicit preemption logic. Comparison […]

Ver mais

Like 0

Liked Liked

technocracy

3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing

digitado ⋅ 14 de January de 2026

arXiv:2601.07963v1 Announce Type: new Abstract: The transformative potential of 3D content creation has been progressively unlocked through advancements in generative models. Recently, intuitive drag editing with geometric changes has attracted significant attention in 2D editing yet remains challenging for 3D scenes. In this paper, we introduce 3DGS-Drag — a point-based 3D editing framework that provides efficient, intuitive drag manipulation of real 3D scenes. Our approach bridges the gap between deformation-based and 2D-editing-based 3D editing methods, addressing their limitations […]

Ver mais

Like 0

Liked Liked