technocracy

Selecting Language Models for Social Science: Start Small, Start Open, and Validate

digitado ⋅ 19 de January de 2026

arXiv:2601.10926v1 Announce Type: new Abstract: Currently, there are thousands of large pretrained language models (LLMs) available to social scientists. How do we select among them? Using validity, reliability, reproducibility, and replicability as guides, we explore the significance of: (1) model openness, (2) model footprint, (3) training data, and (4) model architectures and fine-tuning. While ex-ante tests of validity (i.e., benchmarks) are often privileged in these discussions, we argue that social scientists cannot altogether avoid validating computational measures (ex-post). […]

Ver mais

Like 0

Liked Liked

technocracy

A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives

digitado ⋅ 26 de February de 2026

arXiv:2602.21351v1 Announce Type: new Abstract: The rapid accumulation of Earth science data has created a significant scalability challenge; while repositories like PANGAEA host vast collections of datasets, citation metrics indicate that a substantial portion remains underutilized, limiting data reusability. Here we present PANGAEA-GPT, a hierarchical multi-agent framework designed for autonomous data discovery and analysis. Unlike standard Large Language Model (LLM) wrappers, our architecture implements a centralized Supervisor-Worker topology with strict data-type-aware routing, sandboxed deterministic code execution, and self-correction […]

Ver mais

Like 0

Liked Liked

technocracy

CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification

digitado ⋅ 7 de May de 2026

The rapid expansion of the Internet of Things (IoT) and Industrial IoT (IIoT) has created a massive, heterogeneous attack surface that challenges traditional network security mechanisms. While Federated Learning (FL) offers a privacy-preserving alternative to centralized Intrusion Detection Systems (IDS), standard approaches struggle to generalize across diverse device behaviors and typically fail to utilize the vast amounts of unlabeled data present in realistic edge environments. To bridge these gaps, we propose CLAD, a holistic framework that seamlessly incorporates […]

Ver mais

Like 0

Liked Liked

technocracy

Cleaner energy microgrids under market power and limited regulation in developing countries

digitado ⋅ 19 de March de 2026

arXiv:2603.16893v1 Announce Type: new Abstract: In many low-income countries, neighborhood diesel generators are widely used to compensate for unreliable or unavailable national electricity grids. These diesel-based microgrids are typically characterized by market power, significant pollution, and weak regulatory oversight. In parallel, households increasingly deploy off-grid solar photovoltaic (PV) systems to gain control over electricity supply. However, these systems suffer from curtailed excess generation during peak solar hours and unreliable access at other times. While prior studies have optimized […]

Ver mais

Like 0

Liked Liked

technocracy

A Better Way to Give AI Agents Code Context

digitado ⋅ 19 de March de 2026

Last week, a post about my open-source project CocoIndex Code hit 54K+ views on X after @RoundtableSpace shared it. The tweet was simple: “CocoIndex Code gives your coding agent a brain.” That one line captured exactly what we built and why it matters. The problem is straightforward. Every time your AI coding agent needs context about your codebase, it pulls in entire files. Function signatures, import statements, docstrings, blank lines, comments you wrote at 2am — everything gets […]

Ver mais

Like 0

Liked Liked

technocracy

BXRL: Behavior-Explainable Reinforcement Learning

digitado ⋅ 24 de March de 2026

A major challenge of Reinforcement Learning is that agents often learn undesired behaviors that seem to defy the reward structure they were given. Explainable Reinforcement Learning (XRL) methods can answer queries such as "explain this specific action", "explain this specific trajectory", and "explain the entire policy". However, XRL lacks a formal definition for behavior as a pattern of actions across many episodes. We provide such a definition, and use it to enable a new query: "Explain this behavior". […]

Ver mais

Like 0

Liked Liked

technocracy

I Built a RAG System for Our Analytics Team. It Worked Great Until We Added Real Data.

digitado ⋅ 18 de March de 2026

I’ll tell you the moment I knew our RAG implementation was in trouble. A product manager asked our internal knowledge assistant: “What’s our refund policy for enterprise customers?” The system retrieved three chunks from three different documents. One was from 2022. One was from 2024. One was a draft that never got approved. It combined all three into a confident, well-formatted answer that was wrong in ways that would have cost us money if anyone had acted on […]

Ver mais

Like 0

Liked Liked

technocracy

Comparative Analysis of Neural Retriever-Reranker Pipelines for Retrieval-Augmented Generation over Knowledge Graphs in E-commerce Applications

digitado ⋅ 27 de February de 2026

arXiv:2602.22219v1 Announce Type: new Abstract: Recent advancements in Large Language Models (LLMs) have transformed Natural Language Processing (NLP), enabling complex information retrieval and generation tasks. Retrieval-Augmented Generation (RAG) has emerged as a key innovation, enhancing factual accuracy and contextual grounding by integrating external knowledge sources with generative models. Although RAG demonstrates strong performance on unstructured text, its application to structured knowledge graphs presents challenges: scaling retrieval across connected graphs and preserving contextual relationships during response generation. Cross-encoders refine […]

Ver mais

Like 0

Liked Liked

technocracy

Online Learning for Multi-Layer Hierarchical Inference under Partial and Policy-Dependent Feedback

digitado ⋅ 4 de March de 2026

Hierarchical inference systems route tasks across multiple computational layers, where each node may either finalize a prediction locally or offload the task to a node in the next layer for further processing. Learning optimal routing policies in such systems is challenging: inference loss is defined recursively across layers, while feedback on prediction error is revealed only at a terminal oracle layer. This induces a partial, policy-dependent feedback structure in which observability probabilities decay with depth, causing importance-weighted estimators […]

Ver mais

Like 0

Liked Liked

technocracy

A Three- and a Four- Body Problem

digitado ⋅ 9 de April de 2026

Last week I wrote about the orbit of Artemis II. The orbit of Artemis I was much more interesting. Because Artemis I was unmanned, it could spend a lot more time in orbit. The Artemis I mission took 25 days while Artemis II will take 10 days. Artemis I took an unusual path, orbiting the moon the opposite direction of the moon’s orbit around earth. This video by Primal Space demonstrates the orbit both from the perspective of […]

Ver mais

Like 0

Liked Liked