digitado – Page 96

Microsoft Research’s World-R1 Uses Flow-GRPO and 3D-Aware Rewards to Inject Geometric Consistency Into Wan 2.1 Without Architectural Changes

digitado ⋅ 1 May 2026

Video foundation models can paint a beautiful frame. They are still notoriously bad at remembering it. Push the camera through a corridor in Wan 2.1 or CogVideoX and walls warp, objects morph, and details vanish — the giveaway that these models are fitting 2D pixel correlations rather than simulating a coherent 3D scene. A team of researchers from Microsoft Research and Zhejiang University introduced World-R1: a framework that aligns video generation with 3D constraints through reinforcement learning. The […]

technocracy

A Domain-Specific Language for LLM-Driven Trigger Generation in Multimodal Data Collection

digitado ⋅ 16 April 2026

arXiv:2604.13046v1. Abstract: Data-driven systems depend on task-relevant data, yet data collection pipelines remain passive and indiscriminate. Continuous logging of multimodal sensor streams incurs high storage costs and captures irrelevant data. This paper proposes a declarative framework for intent-driven, on-device data collection that enables selective collection of multimodal sensor data based on high-level user requests. The framework combines natural language interaction with a formally specified domain-specific language (DSL). Large language models translate user-defined requirements into verifiable […]

technocracy

Haskell in Production: Chordify

digitado ⋅ 28 December 2023

In this edition of our “Haskell in Production” series, we interview Jeroen Bransen from Chordify, an online platform that turns any music or song into chords. Jeroen has been working at Chordify since 2016. Before joining, he obtained his PhD in Software Technology from Utrecht University, where he became acquainted with two of the company’s founders. Initially hired as a Haskell developer, Bransen continues to write Haskell code. Over the years, his responsibilities expanded to include […]

technocracy

AMC26: VSSEA robust position control

digitado ⋅ 7 January 2026

arXiv:2601.02557v1. Abstract: This paper presents robust position control strategies for the novel VSSEA. By employing a constructed state-space model, two control schemes are developed in a unified framework: a state-feedback controller and a sliding mode controller, both integrated with a second-order DOb. The proposed framework achieves high-performance motion control by precisely estimating and compensating for internal and external disturbances, while preserving the nominal dynamic response. Simulation results demonstrate that pole-placement-based controllers are highly sensitive to […]

technocracy

Evaluating Multimodal LLMs for Inpatient Diagnosis: Real-World Performance, Safety, and Cost Across Ten Frontier Models

digitado ⋅ 21 April 2026

arXiv:2604.16980v1. Abstract: Background: Large language models (LLMs) are increasingly proposed for diagnostic support, but few evaluations use real-world multimodal inpatient data, particularly in low- and middle-income country (LMIC) public hospitals. Methods: We conducted VALID, a retrospective evaluation of 539 multimodal inpatient cases from a tertiary public hospital in South Africa. Inputs included radiology imaging (CT, MRI, CXR) and reports, laboratory results, clinical notes, and vital signs. Expert panels adjudicated 300 cases (balanced and discordant subsets) […]

technocracy

Learning Beyond Optimization: Stress-Gated Dynamical Regime Regulation in Autonomous Systems

digitado ⋅ 20 February 2026

Despite their apparent diversity, modern machine learning methods can be reduced to a remarkably simple core principle: learning is achieved by continuously optimizing parameters to minimize or maximize a scalar objective function. This paradigm has been extraordinarily successful for well-defined tasks where goals are fixed and evaluation criteria are explicit. However, if artificial systems are to move toward true autonomy, operating over long horizons and across evolving contexts, objectives may become ill-defined, shifting, or entirely absent. In such settings, a […]

technocracy

How CMOs Win CFO Buy-In at Scale

digitado ⋅ 16 February 2026

How CMOs win CFO buy-in is no longer just a leadership challenge. It is a financial one that directly impacts budgets, capital allocation, and revenue durability. In today’s capital-constrained markets, CFOs evaluate whether marketing is discretionary spend or a disciplined investment that drives measurable economic return. Many marketing teams still report in impressions and engagement, while finance leaders focus on contribution margin, EBITDA, and cash flow timing. Bridging this measurement gap from activity metrics to financial outcomes, including […]

technocracy

TOON: Beyond JSON for LLMs

digitado ⋅ 8 June 2026

Author(s): Sourav Ghosh. Originally published on Towards AI. Is JSON Finally Getting a Token-Efficient Alternative for LLMs? For years, JSON has been the default language for APIs, integrations, configuration files, event payloads, and other kinds of application-to-application communication. It is easy to understand, robust, and easy for developers to work with. But when we transition from traditional software systems to Large Language Model applications, we start to see how JSON comes with an […]

technocracy

Teaching Vision-Language Models to Speak Cinema

digitado ⋅ 14 May 2026

A year of building a video caption pipeline with 100+ professional creators, and what it taught us about scaling supervision instead of models. By Zhiqiu Lin and Chancharik Mitra. Based on our CVPR 2026 work, Building a Precise Video Language with Human-AI Oversight (Highlight, Top 3%). How close is today’s video generator to a Hollywood cinematographer? Hollywood directors reach for certain shots because they make a scene land. They cue a specific feeling in the viewer that flat […]

technocracy

Sporadic Gradient Tracking over Directed Graphs: A Theoretical Perspective on Decentralized Federated Learning

digitado ⋅ 31 January 2026

Decentralized Federated Learning (DFL) enables clients with local data to collaborate in a peer-to-peer manner to train a generalized model. In this paper, we unify two branches of work that have separately solved important challenges in DFL: (i) gradient tracking techniques for mitigating data heterogeneity and (ii) accounting for diverse availability of resources across clients. We propose $\textit{Sporadic Gradient Tracking}$ ($\texttt{Spod-GT}$), the first DFL algorithm that incorporates these factors over general directed graphs by allowing (i) client-specific gradient […]
