AutoThinkRAG: Complexity-Aware Control of Retrieval-Augmented Reasoning for Image-Text Interaction
arXiv:2603.05551v1 Announce Type: new Abstract: Information-intensive Document Question Answering (DocQA) is often constrained by long contexts and information overload, which hinder Vision-Language Models (VLMs) from performing precise, direct reasoning. Although multimodal GraphRAG has achieved preliminary breakthroughs, existing frameworks still face two challenges: (1) the need for large-scale models to handle queries of diverse complexity and (2) the inherent reasoning bottlenecks of end-to-end VLMs. To address these issues, we propose AutoThinkRAG, a framework that enhances the understanding of complex […]
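The abstract is truncated before the method details, but its stated motivation (avoiding a single large model for queries of every complexity) suggests a complexity-aware routing scheme along the following lines. This is a minimal sketch under assumptions, not the paper's implementation: the scorer, the threshold, and all interfaces (score_complexity, retrieve, small_vlm, large_vlm) are hypothetical placeholders.

```python
# Hypothetical sketch of complexity-aware routing for retrieval-augmented
# DocQA. None of these names come from the paper.
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RoutingConfig:
    # Assumed fixed cutoff; a real system might learn or calibrate this.
    threshold: float = 0.5


def score_complexity(query: str) -> float:
    """Toy proxy: longer, multi-clause queries score as more complex.
    A deployed router would likely use a trained classifier instead."""
    tokens = query.split()
    clauses = query.count(",") + query.count("?")
    return min(1.0, 0.05 * len(tokens) + 0.1 * clauses)


def answer(
    query: str,
    retrieve: Callable[[str], List[str]],
    small_vlm: Callable[[str, List[str]], str],
    large_vlm: Callable[[str, List[str]], str],
    cfg: RoutingConfig = RoutingConfig(),
) -> str:
    """Send simple queries to a lightweight model; escalate complex ones
    to a larger model, both conditioned on retrieved evidence."""
    evidence = retrieve(query)  # GraphRAG-style retrieval, abstracted away
    if score_complexity(query) < cfg.threshold:
        return small_vlm(query, evidence)  # cheap path for easy queries
    return large_vlm(query, evidence)      # heavy path for hard queries
```

The design point this illustrates is the abstract's first challenge: rather than paying large-model cost on every query, a router spends it only where a complexity estimate says it is needed.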