digitado – Page 15

From Business Problems to AI Solutions: Where Does Transformation Support Fail

digitado ⋅ 22 de April de 2026

arXiv:2604.18770v1 Announce Type: new Abstract: Translating business problems into well-specified machine learning solutions is a prerequisite for successful AI systems, yet this upstream translation is still one of the least supported steps in existing methodologies. We conduct a structured narrative literature review of 18 approaches spanning requirements engineering (RE), machine learning (ML) project management, and automation. We organize these approaches into a taxonomy of four families and compare them across six input artifact categories, six output artifact categories, […]

Ver mais

Like 0

Liked Liked

technocracy

Haskell in Production: e-bot7

digitado ⋅ 12 de December de 2022

In this edition of our Haskell in Production series, we feature e-bot7 – a low-code conversational AI platform designed for customer service and support. The questions in the article were answered collaboratively by Max Gerer, the co-founder and CTO of e-bot7, and Andreas Scharf, who is the tech lead of e-bot7’s Conversational Engine team. Read further to learn where e-bot7 uses Haskell, why they decided to adopt it, and what was their experience using it. How e-bot7 uses […]

Ver mais

Like 0

Liked Liked

technocracy

WM Arena: Compare world model predictions across 26 Atari games with blind battles and a perception quiz

digitado ⋅ 15 de April de 2026

I built WM Arena (arena.worldflux.ai), an interactive benchmark for visual world models on the Atari 100k suite. Three modes: – Visual Explorer: side-by-side real vs predicted frames across 26 games – Blind Battle: ELO-ranked voting on anonymous model outputs – Real or Predicted? Quiz: a perception test Currently evaluating DIAMOND (NeurIPS ’24 Spotlight), TWISTER (ICLR ’25), IRIS (ICLR ’23), and STORM (NeurIPS ’23). Every model runs its official code at a pinned commit. No re-implementations. Try it: arena.worldflux.ai […]

Ver mais

Like 0

Liked Liked

technocracy

LLMs, MCPs, and Agents in Decision Intelligence

digitado ⋅ 12 de January de 2026

Where language models add leverage — and where they deliberately stop TL;DR: Large Language Models add the most value in Decision Intelligence systems when they improve how decisions are understood, not when they take over how decisions are made. In mature DI setups, LLMs act as an interpretation layer — enabling natural language querying, automated insights, clearer metadata, and better explanations — while decision authority remains with deterministic rules and models. MCPs and agents help scale this safely by controlling access and coordinating tasks […]

Ver mais

Like 0

Liked Liked

technocracy

Guideline2Graph: Profile-Aware Multimodal Parsing for Executable Clinical Decision Graphs

digitado ⋅ 7 de April de 2026

arXiv:2604.02477v1 Announce Type: new Abstract: Clinical practice guidelines are long, multimodal documents whose branching recommendations are difficult to convert into executable clinical decision support (CDS), and one-shot parsing often breaks cross-page continuity. Recent LLM/VLM extractors are mostly local or text-centric, under-specifying section interfaces and failing to consolidate cross-page control flow across full documents into one coherent decision graph. We present a decomposition-first pipeline that converts full-guideline evidence into an executable clinical decision graph through topology-aware chunking, interface-constrained chunk […]

Ver mais

Like 0

Liked Liked

technocracy

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

digitado ⋅ 21 de January de 2026

arXiv:2601.12215v1 Announce Type: new Abstract: Wearable foundation models have the potential to transform digital health by learning transferable representations from large-scale biosignals collected in everyday settings. While recent progress has been made in large-scale pretraining, most approaches overlook the spectral structure of photoplethysmography (PPG) signals, wherein physiological rhythms unfold across multiple frequency bands. Motivated by the insight that many downstream health-related tasks depend on multi-resolution features spanning fine-grained waveform morphology to global rhythmic dynamics, we introduce Masked Multiscale […]

Ver mais

Like 0

Liked Liked

technocracy

In Defense of Capitalism, Even After Its Worst Excesses

digitado ⋅ 20 de January de 2026

We live in what are described as post-capitalist times, where the economic system that promotes the virtues of creating individual wealth has been variously described as broken, defunct, and even failed. It has, according to many, morphed into a system where the oligarchs control the resources required to make immense wealth and leave the rest to fend for themselves and fight over scraps. Some go to the extent of romanticizing the concept of a welfare state, where basic […]

Ver mais

Like 0

Liked Liked

technocracy

Redes sociales: un negocio obsceno que empieza a pasar factura

digitado ⋅ 26 de March de 2026

A veces la realidad tiene un sentido del timing difícil de mejorar. Meta anunciaba esta semana un nuevo paquete de stock options para sus ejecutivos, condicionado a que la compañía alcance una valoración de 9.4 billones de dólares en 2031, desde los aproximadamente 1.5 actuales. Horas después, despedía a cientos de empleados. Y apenas unas horas más tarde, un jurado en Los Angeles declaraba a Meta y a YouTube responsables por negligencia en un caso centrado, precisamente, en […]

Ver mais

Like 0

Liked Liked

technocracy

How Thomson Reuters built an Agentic Platform Engineering Hub with Amazon Bedrock AgentCore

digitado ⋅ 21 de January de 2026

This post was co-written with Naveen Pollamreddi and Seth Krause from Thomson Reuters. Thomson Reuters (TR) is a leading AI and technology company dedicated to delivering trusted content and workflow automation solutions. With over 150 years of expertise, TR provides essential solutions across legal, tax, accounting, risk, trade, and media sectors in a fast-evolving world. AI plays a critical role at TR. It’s embedded in how it helps create, enhance, connect, and deliver trusted information to customers. It powers […]

Ver mais

Like 0

Liked Liked

technocracy

Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine

digitado ⋅ 10 de March de 2026

arXiv:2603.06665v1 Announce Type: new Abstract: Large vision-language models (VLMs) often benefit from chain-of-thought (CoT) prompting in general domains, yet its efficacy in medical vision-language tasks remains underexplored. We report a counter-intuitive trend: on medical visual question answering, CoT frequently underperforms direct answering (DirA) across general-purpose and medical-specific models. We attribute this to a emph{medical perception bottleneck}: subtle, domain-specific cues can weaken visual grounding, and CoT may compound early perceptual uncertainty rather than correct it. To probe this hypothesis, […]

Ver mais

Like 0

Liked Liked