March 2026

M-RAG: Making RAG Faster, Stronger, and More Efficient

digitado ⋅ March 31, 2026

arXiv:2603.26667v1. Abstract: Retrieval-Augmented Generation (RAG) has become a widely adopted paradigm for enhancing the reliability of large language models (LLMs). However, RAG systems are sensitive to retrieval strategies that rely on text chunking to construct retrieval units, which often introduce information fragmentation, retrieval noise, and reduced efficiency. Recent work has even questioned the necessity of RAG, arguing that long-context LLMs may eliminate multi-stage retrieval pipelines by directly processing full documents. Nevertheless, expanded context capacity alone […]


technocracy

Quality-Controlled Active Learning via Gaussian Processes for Robust Structure-Property Learning in Autonomous Microscopy

digitado ⋅ March 31, 2026

Autonomous experimental systems are increasingly used in materials research to accelerate scientific discovery, but their performance is often limited by low-quality, noisy data. This issue is especially problematic in data-intensive structure-property learning tasks such as Image-to-Spectrum (Im2Spec) and Spectrum-to-Image (Spec2Im) translation, where standard active learning strategies can mistakenly prioritize poor-quality measurements. We introduce a gated active learning framework that combines curiosity-driven sampling with a physics-informed quality control filter based on Simple Harmonic Oscillator model fits, allowing the […]


technocracy

HCLSM: Hierarchical Causal Latent State Machines for Object-Centric World Modeling

digitado ⋅ March 31, 2026

World models that predict future states from video remain limited by flat latent representations that entangle objects, ignore causal structure, and collapse temporal dynamics into a single scale. We present HCLSM, a world model architecture that operates on three interconnected principles: object-centric decomposition via slot attention with spatial broadcast decoding, hierarchical temporal dynamics through a three-level engine combining selective state space models for continuous physics, sparse transformers for discrete events, and compressed transformers for abstract goals, and causal […]


technocracy

datasette-files 0.1a3

digitado ⋅ March 31, 2026

Release: datasette-files 0.1a3. I’m working on integrating datasette-files into other plugins, such as datasette-extract. This necessitated a new release of the base plugin. The owners_can_edit and owners_can_delete configuration options, plus the files-edit and files-delete actions, are now scoped to a new FileResource, which is a child of FileSourceResource (#18). The file picker UI is now available as a <datasette-file-picker> Web Component. Thanks, Alex Garcia (#19). New "from datasette_files import get_file" Python API for other plugins that need to access […]


technocracy

Realistic Market Impact Modeling for Reinforcement Learning Trading Environments

digitado ⋅ March 31, 2026

Reinforcement learning (RL) has shown promise for trading, yet most open-source backtesting environments assume negligible or fixed transaction costs, causing agents to learn trading behaviors that fail under realistic execution. We introduce three Gymnasium-compatible trading environments — MACE (Market-Adjusted Cost Execution) stock trading, margin trading, and portfolio optimization — that integrate nonlinear market impact models grounded in the Almgren-Chriss framework and the empirically validated square-root impact law. Each environment provides pluggable cost models, permanent impact tracking with exponential […]
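For readers unfamiliar with the square-root impact law mentioned in the abstract, the following is a minimal sketch of its commonly cited form, in which relative price impact scales as volatility times the square root of the order's share of daily volume. It is not taken from the MACE environments: the function name, the parameters, and the coefficient `y_coeff` (set to 1.0 here purely for illustration) are all assumptions.

```python
import math


def sqrt_impact_cost(order_shares, daily_volume, daily_volatility, price, y_coeff=1.0):
    """Estimate the execution cost of an order under a square-root impact law.

    Hypothetical helper for illustration only. `y_coeff` is an assumed
    dimensionless constant, not a value reported by the paper.
    """
    if daily_volume <= 0:
        raise ValueError("daily_volume must be positive")
    # Fraction of the day's volume that this order represents.
    participation = abs(order_shares) / daily_volume
    # Relative price impact: Y * sigma * sqrt(Q / V).
    impact_fraction = y_coeff * daily_volatility * math.sqrt(participation)
    # Cost in currency units: impact fraction applied to the traded notional.
    return impact_fraction * price * abs(order_shares)


small = sqrt_impact_cost(10_000, 1_000_000, 0.02, 50.0)
large = sqrt_impact_cost(40_000, 1_000_000, 0.02, 50.0)
print(f"cost for 10k shares: {small:.2f}, for 40k shares: {large:.2f}")
```

Under this form, quadrupling the order size doubles the relative impact and multiplies the total cost by eight, which is the kind of nonlinearity a fixed per-trade cost assumption cannot capture.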


technocracy

Salesforce AI Research Releases VoiceAgentRAG: A Dual-Agent Memory Router that Cuts Voice RAG Retrieval Latency by 316x

digitado ⋅ March 31, 2026

In the world of voice AI, the difference between a helpful assistant and an awkward interaction is measured in milliseconds. While text-based Retrieval-Augmented Generation (RAG) systems can afford a few seconds of ‘thinking’ time, voice agents must respond within a 200ms budget to maintain a natural conversational flow. Standard production vector database queries typically add 50-300ms of network latency, which can consume the entire budget before an LLM even begins generating a response. The Salesforce AI Research team has released […]


technocracy

Quantum, chaotic and fractal types of algorithmic convergence

digitado ⋅ March 31, 2026

For centuries, mathematicians worked on problems where convergence is either smooth or does not happen. Now the concept of chaotic convergence is mainstream, popularized by stochastic gradient descent in deep neural networks, which is central to LLMs. In my most recent book, I discuss many cases involving various types of chaotic convergence. In some examples, the chaos exhibits a fractal structure. In others, patterns are strikingly similar to quantum dynamics. This article, a new addition to […]


technocracy

Interesting Problems

digitado ⋅ March 31, 2026

In your opinion, what are some of the most interesting or relevant open questions in RL right now? This could be in any topic, such as inverse RL, imitation learning, or model-based RL, or in more frontier-lab-focused areas such as model-free deep RL or RLHF-related questions. Submitted by /u/sassafrassar.


technocracy

Water utility announces it’s ditching fluoride—then reveals it did so years ago

digitado ⋅ March 31, 2026

Residents of Birmingham, Alabama, were abruptly informed earlier this month that their water utility had decided to stop adding fluoride to city water. Then, days later, they learned that the utility had actually stopped adding fluoride years ago. On March 20, Central Alabama Water (CAW) made an announcement that it had discontinued water fluoridation. The announcement cited “aging equipment” and “increasing maintenance and component replacement” as justifications for the removal of fluoride, which it indicated had already occurred. […]


technocracy

A Latent Risk-Aware Machine Learning Approach for Predicting Operational Success in Clinical Trials based on TrialsBank

digitado ⋅ March 31, 2026

Clinical trials are characterized by high costs, extended timelines, and substantial operational risk, yet reliable prospective methods for predicting trial success before initiation remain limited. Existing artificial intelligence approaches often focus on isolated metrics or specific development stages and frequently rely on variables unavailable at the trial design phase, limiting real-world applicability. We present a hierarchical latent risk-aware machine learning framework for prospective prediction of clinical trial operational success using a curated subset of TrialsBank, a proprietary AI-ready […]
