technocracy

DISPO: Enhancing Training Efficiency and Stability in Reinforcement Learning for Large Language Model Mathematical Reasoning

digitado ⋅ 1 February 2026

Reinforcement learning with verifiable rewards has emerged as a promising paradigm for enhancing the reasoning capabilities of large language models, particularly in mathematics. Current approaches in this domain present a clear trade-off: PPO-style methods (e.g., GRPO/DAPO) offer training stability but exhibit slow learning trajectories due to their trust-region constraints on policy updates, while REINFORCE-style approaches (e.g., CISPO) demonstrate improved learning efficiency but suffer from performance instability as they clip importance sampling weights while still permitting non-zero gradients outside […]

GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure

digitado ⋅ 18 March 2026

arXiv:2603.15643v1. Abstract: Green Stormwater Infrastructure (GSI) systems, such as permeable pavement, rain gardens, and bioretention facilities, require continuous inspection and maintenance to ensure long-term performance. However, domain knowledge about GSI is often scattered across municipal manuals, regulatory documents, and inspection forms. As a result, non-expert users and maintenance staff may struggle to obtain reliable and actionable guidance from field observations. Although Large Language Models (LLMs) have demonstrated strong general reasoning and language generation capabilities, they […]

Your Management Model Is the New Bottleneck

digitado ⋅ 12 February 2026

Agentic AI has solved the coding constraint. Now your processes, approvals, and org structure are what’s slowing you down. For the last 50 years, project management has evolved alongside software development. New methodologies promised better outcomes: waterfall gave way to Agile, Scrum replaced traditional planning, velocity became the measure of progress. Yet despite these advances, the fundamental constraint remained the same. Projects were limited by human capacity to execute work. That constraint no longer exists. Software development has achieved what […]

Building an Internal Coding Agent at Zup: Lessons and Open Questions

digitado ⋅ 15 April 2026

arXiv:2604.09805v1. Abstract: Enterprise teams building internal coding agents face a gap between prototype performance and production readiness. The root cause is that technical model quality alone is insufficient: tool design, safety enforcement, state management, and human trust calibration are equally decisive, yet underreported in the literature. We present CodeGen, an internal coding agent at Zup, and show that targeted tool design (e.g., string-replacement edits over full-file rewrites) and layered safety guardrails improved agent reliability […]

A Reduced Order Model approach for First-Principles Molecular Dynamics Computations

digitado ⋅ 27 February 2026

arXiv:2602.22390v1. Abstract: To leverage the redundancy between the electronic structure computed at each step of first-principles molecular dynamics, we present a data-driven modeling framework for Kohn-Sham Density Functional Theory that bypasses the explicit optimization of electronic wavefunctions. We sample a priori representative atomic configurations and construct a low-dimensional basis that efficiently approximates the electronic structure subspace. Subsequently, we employ this reduced basis in a direct solver for the electronic single particle density matrix, thereby enabling […]

ZeroS: Zero-Sum Linear Attention for Efficient Transformers

digitado ⋅ 6 February 2026

arXiv:2602.05230v1. Abstract: Linear attention methods offer Transformers $O(N)$ complexity but typically underperform standard softmax attention. We identify two fundamental limitations affecting these approaches: the restriction to convex combinations that only permits additive information blending, and uniform accumulated weight bias that dilutes attention in long contexts. We propose Zero-Sum Linear Attention (ZeroS), which addresses these limitations by removing the constant zero-order term $1/t$ and reweighting the remaining zero-sum softmax residuals. This modification creates mathematically stable weights, […]

Game-Based and Gamified Robotics Education: A Comparative Systematic Review and Design Guidelines

digitado ⋅ 2 February 2026

arXiv:2601.22199v1. Abstract: Robotics education fosters computational thinking, creativity, and problem-solving, but remains challenging due to technical complexity. Game-based learning (GBL) and gamification offer engagement benefits, yet their comparative impact remains unclear. We present the first PRISMA-aligned systematic review and comparative synthesis of GBL and gamification in robotics education, analyzing 95 studies from 12,485 records across four databases (2014-2025). We coded each study’s approach, learning context, skill level, modality, pedagogy, and outcomes (κ = .918). Three […]

Beyond Reward Suppression: Reshaping Steganographic Communication Protocols in MARL via Dynamic Representational Circuit Breaking

digitado ⋅ 18 March 2026

arXiv:2603.15655v1. Abstract: In decentralized Multi-Agent Reinforcement Learning (MARL), steganographic collusion, where agents develop private protocols to evade monitoring, presents a critical AI safety threat. Existing defenses, limited to behavioral or reward layers, fail to detect coordination in latent communication channels. We introduce the Dynamic Representational Circuit Breaker (DRCB), an architectural defense operating at the optimization substrate. Building on the AI Mother Tongue (AIM) framework, DRCB utilizes a Vector Quantized Variational Autoencoder (VQ-VAE) bottleneck […]

Revisiting Continuous-Time Trajectory Estimation via Gaussian Processes and the Magnus Expansion

digitado ⋅ 8 January 2026

arXiv:2601.03360v1. Abstract: Continuous-time state estimation has been shown to be an effective means of (i) handling asynchronous and high-rate measurements, (ii) introducing smoothness to the estimate, (iii) post hoc querying the estimate at times other than those of the measurements, and (iv) addressing certain observability issues related to scanning-while-moving sensors. A popular means of representing the trajectory in continuous time is via a Gaussian process (GP) prior, with the prior’s mean and covariance functions generated […]

Architectural Design and Performance Analysis of FPGA based AI Accelerators: A Comprehensive Review

digitado ⋅ 11 March 2026

arXiv:2603.08740v1. Abstract: Deep learning (DL) has emerged as a rapidly developing advanced technology, enabling the performance of complex tasks involving image recognition, natural language processing, and autonomous decision-making with high levels of accuracy. However, as these technologies evolve and strive to meet the growing demands of real-life applications, the complexity of DL models continues to increase. These models require processing of massive volumes of data, demanding substantial computational power and memory bandwidth. This gives rise […]
