digitado – Page 439

Adaptive Stopping for Multi-Turn LLM Reasoning

digitado ⋅ April 3, 2026

arXiv:2604.01413v1 Announce Type: new Abstract: Large Language Models (LLMs) increasingly rely on multi-turn reasoning and interaction, such as adaptive retrieval-augmented generation (RAG) and ReAct-style agents, to answer difficult questions. These methods improve accuracy by iteratively retrieving information, reasoning, or acting, but introduce a key challenge: When should the model stop? Existing approaches rely on heuristic stopping rules or fixed turn budgets and provide no formal guarantees that the final prediction still contains the correct answer. This limitation is […]

technocracy

CapBench: A Multi-PDK Dataset for Machine-Learning-Based Post-Layout Capacitance Extraction

digitado ⋅ April 13, 2026

We present CapBench, a fully reproducible, multi-PDK dataset for capacitance extraction. The dataset is derived from open-source designs, including single-core CPUs, systems-on-chip, and media accelerators. All designs are fully placed and routed using 14 independent OpenROAD flow runs spanning three technology nodes: ASAP7, NanGate45, and Sky130HD. From these layouts, we extract 61,855 3D windows across three size tiers to enable transfer learning and scalability studies. High-fidelity capacitance labels are generated using RWCap, a state-of-the-art random-walk solver, and validated […]

technocracy

QoS-QoE Translation with Large Language Model

digitado ⋅ April 13, 2026

arXiv:2604.08703v1 Announce Type: new Abstract: QoS-QoE translation is a fundamental problem in multimedia systems because it characterizes how measurable system and network conditions affect user-perceived experience. Although many prior studies have examined this relationship, their findings are often developed for specific setups and remain scattered across papers, experimental settings, and reporting formats, limiting systematic reuse, cross-scenario generalization, and large-scale analysis. To address this gap, we first introduce the QoS-QoE Translation dataset, a source-grounded dataset of structured QoS-QoE relationships from […]

technocracy

DQN agent not moving after performing technique?

digitado ⋅ March 10, 2026

The agent learned and performed a difficult technique, but stops moving afterwards, even though there are more points to be had. What could explain this behavior?

Stable Baselines3 DQN:

    from stable_baselines3 import DQN

    # train_env and TENSORBOARD_DIR are defined elsewhere in the poster's script.
    model = DQN(
        policy="CnnPolicy",
        env=train_env,
        learning_rate=1e-4,
        buffer_size=500_000,
        optimize_memory_usage=True,
        replay_buffer_kwargs={"handle_timeout_termination": False},
        learning_starts=10_000,  # Warm up with random actions first
        batch_size=32,
        gamma=0.99,
        target_update_interval=1_000,
        train_freq=4,
        gradient_steps=1,
        exploration_fraction=0.3,
        exploration_initial_eps=1.0,
        exploration_final_eps=0.01,
        tensorboard_log=TENSORBOARD_DIR,
        verbose=1,
    )

submitted by /u/Handy_Cap
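
The exploration settings in the configuration above (exploration_fraction, exploration_initial_eps, exploration_final_eps) imply a linear epsilon schedule. The following is a minimal sketch of that schedule in plain Python, not the library's own code. The helper name linear_epsilon and the value of TOTAL_TIMESTEPS are assumptions for illustration, since the post does not state how long training ran. It only shows how quickly random exploration falls away under these settings; it does not by itself establish why the agent stops moving.

```python
def linear_epsilon(step, total_timesteps, exploration_fraction=0.3,
                   initial_eps=1.0, final_eps=0.01):
    """Linearly anneal epsilon over the first `exploration_fraction` of training.

    Hypothetical helper written for illustration; defaults copy the posted config.
    """
    progress = step / total_timesteps
    if progress >= exploration_fraction:
        # After the annealing window, epsilon stays at its final value.
        return final_eps
    return initial_eps + (progress / exploration_fraction) * (final_eps - initial_eps)


# Assumed training length; the post does not give one.
TOTAL_TIMESTEPS = 1_000_000

for step in (0, 150_000, 300_000, 1_000_000):
    print(step, round(linear_epsilon(step, TOTAL_TIMESTEPS), 4))
```

Under these assumptions, epsilon reaches its floor after the first 30% of training, so for the remaining 70% the agent acts almost entirely greedily on its current Q-values. Comparing the logged exploration rate against the point where the agent starts idling is one inexpensive check.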

technocracy

Enabling small language models to solve complex reasoning tasks

digitado ⋅ December 12, 2025

As language models (LMs) improve at tasks like image generation, trivia questions, and simple math, you might think that human-like reasoning is around the corner. In reality, they still trail us by a wide margin on complex tasks. Try playing Sudoku with one, for instance, where you fill in numbers one through nine in such a way that each appears only once across the columns, rows, and sections of a nine-by-nine grid. Your AI opponent will either fail […]
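
The Sudoku rule described in the excerpt (each digit from one through nine appears exactly once in every row, column, and three-by-three section) can be checked mechanically. The sketch below is an illustration of that rule only, not something taken from the article. The function name is_valid_sudoku and the sample grids are assumptions made for the example.

```python
def is_valid_sudoku(grid):
    """Return True if a completed 9x9 grid has 1-9 exactly once per row, column, and 3x3 box."""
    if len(grid) != 9 or any(len(row) != 9 for row in grid):
        return False
    digits = set(range(1, 10))
    rows = [set(row) for row in grid]
    cols = [set(col) for col in zip(*grid)]
    boxes = [
        {grid[r + dr][c + dc] for dr in range(3) for dc in range(3)}
        for r in range(0, 9, 3)
        for c in range(0, 9, 3)
    ]
    # Every group of nine cells must contain exactly the digits 1..9.
    return all(group == digits for group in rows + cols + boxes)


# A valid grid built from a simple shifting pattern (illustrative data).
solved = [[(3 * (r % 3) + r // 3 + c) % 9 + 1 for c in range(9)] for r in range(9)]

# Copy the grid and duplicate one digit in the first row to break the rule.
broken = [row[:] for row in solved]
broken[0][0] = broken[0][1]

print(is_valid_sudoku(solved), is_valid_sudoku(broken))
```

Checking a finished grid is simple; the difficulty the excerpt points to is producing a filled grid that satisfies all 27 constraints at once.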

technocracy

A guide to APIs, MCPs, and MCP Gateways

digitado ⋅ April 30, 2026

APIs and MCPs are often mentioned in the same breath as ways that systems can exchange information, but they are designed differently and serve different purposes. This article aims to explain the differences and how software developers and users should approach interaction with each. An API is mainly found in software applications, while an MCP (Model Context Protocol) is used by large language models. APIs let one application talk to another, and an MCP lets an AI model […]

technocracy

LDDMM stochastic interpolants: an application to domain uncertainty quantification in hemodynamics

digitado ⋅ March 31, 2026

arXiv:2603.28324v1 Announce Type: new Abstract: We introduce a novel conditional stochastic interpolant framework for generative modeling of three-dimensional shapes. The method builds on a recent LDDMM-based registration approach to learn the conditional drift between geometries. By leveraging the resulting pull-back and push-forward operators, we extend this formulation beyond standard Cartesian grids to complex shapes and random variables defined on distinct domains. We present an application in the context of cardiovascular simulations, where aortic shapes are generated from an […]

technocracy

Language Model Representations for Efficient Few-Shot Tabular Classification

digitado ⋅ February 19, 2026

arXiv:2602.15844v1 Announce Type: new Abstract: The Web is a rich source of structured data in the form of tables, from product catalogs and knowledge bases to scientific datasets. However, the heterogeneity of the structure and semantics of these tables makes it challenging to build a unified method that can effectively leverage the information they contain. Meanwhile, large language models (LLMs) are becoming an increasingly integral component of web infrastructure for tasks like semantic search. This raises a crucial […]

technocracy

EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Deep Learning

digitado ⋅ January 12, 2026

arXiv:2601.05379v1 Announce Type: new Abstract: Deploying deep neural networks on edge devices is often limited by the memory traffic and compute cost of dense linear operators. While quaternion neural networks improve parameter efficiency by coupling multiple channels through Hamilton products, they typically retain unstructured dense weights; conversely, structured matrices enable fast computation but are usually applied in the real domain. This paper introduces EdgeLDR, a practical framework for quaternion block-circulant linear and convolutional layers that combines quaternion channel […]

technocracy

Polynomial Convergence of Riemannian Diffusion Models

digitado ⋅ January 7, 2026

arXiv:2601.02499v1 Announce Type: new Abstract: Diffusion models have demonstrated remarkable empirical success in recent years and are considered one of the state-of-the-art generative models in modern AI. These models consist of a forward process, which gradually diffuses the data distribution to a noise distribution spanning the whole space, and a backward process, which inverts this transformation to recover the data distribution from noise. Most of the existing literature assumes that the underlying space is Euclidean. However, in […]
