When simulations look right but causal effects go wrong: Large language models as behavioral simulators
arXiv:2604.02458v1 Announce Type: new

Abstract: Behavioral simulation is increasingly used to anticipate responses to interventions. Large language models (LLMs) enable researchers to specify population characteristics and intervention context in natural language, but it remains unclear to what extent LLMs can use these inputs to infer intervention effects. We evaluated three LLMs on 11 climate-psychology interventions using a dataset of 59,508 participants from 62 countries, and replicated the main analysis in two additional datasets (12 and 27 countries). LLMs […]
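The abstract describes specifying population characteristics and intervention context in natural language and querying an LLM for a simulated response. A minimal sketch of what such a setup might look like is below; the prompt template, field names, and rating scale are illustrative assumptions, not the paper's actual protocol.

```python
# Hypothetical sketch of LLM-based behavioral simulation: a persona and an
# intervention are rendered into a natural-language prompt, which would then
# be sent to an LLM to obtain a simulated outcome rating. All names and the
# 1-7 scale here are assumptions for illustration.

def build_simulation_prompt(persona: dict, intervention: str, outcome: str) -> str:
    """Compose a natural-language prompt describing one simulated participant."""
    traits = ", ".join(f"{k}: {v}" for k, v in persona.items())
    return (
        f"You are simulating a survey participant ({traits}). "
        f"After the following intervention: {intervention} "
        f"Rate your {outcome} on a 1-7 scale. Answer with a single number."
    )

prompt = build_simulation_prompt(
    {"country": "Germany", "age": 34, "political leaning": "moderate"},
    "reading a message emphasizing the scientific consensus on climate change.",
    "belief that climate change is real",
)
print(prompt)
```

In a study like the one described, the returned ratings would be collected across many simulated personas per condition, and the difference between intervention and control conditions would serve as the estimated intervention effect.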