digitado – Page 85

Transformer-Based Reinforcement Learning for Autonomous Orbital Collision Avoidance in Partially Observable Environments

digitado ⋅ 9 de February de 2026

arXiv:2602.06088v1 Announce Type: new Abstract: We introduce a Transformer-based Reinforcement Learning framework for autonomous orbital collision avoidance that explicitly models the effects of partial observability and imperfect monitoring in space operations. The framework combines a configurable encounter simulator, a distance-dependent observation model, and a sequential state estimator to represent uncertainty in relative motion. A central contribution of this work is the use of transformer-based Partially Observable Markov Decision Process (POMDP) architecture, which leverage long-range temporal attention to interpret […]

Ver mais

Like 0

Liked Liked

technocracy

Interoperability Effects: Extending DeFi Lending Risk Models to Multi-Chain Environments

digitado ⋅ 14 de May de 2026

arXiv:2605.12508v1 Announce Type: new Abstract: On-chain lending has expanded across multiple distributed ledgers as DeFi becomes increasingly multi-chain. This environment introduces novel technical and financial mechanisms, particularly cross-blockchain communication and asset transfer protocols, yet cross-chain elements remain understudied in lending protocol risk management. To address this gap, we applied panel regression fixed effects and OLS models to empirically analyze cross-blockchain interoperability solutions, using TVL and total revenue as performance proxies from October 2022 to January 2025. Our data […]

Ver mais

Like 0

Liked Liked

technocracy

Learning to Reason with Curriculum I: Provable Benefits of Autocurriculum

digitado ⋅ 20 de March de 2026

arXiv:2603.18325v1 Announce Type: cross Abstract: Chain-of-thought reasoning, where language models expend additional computation by producing thinking tokens prior to final responses, has driven significant advances in model capabilities. However, training these reasoning models is extremely costly in terms of both data and compute, as it involves collecting long traces of reasoning behavior from humans or synthetic generators and further post-training the model via reinforcement learning. Are these costs fundamental, or can they be reduced through better algorithmic design? […]

Ver mais

Like 0

Liked Liked

technocracy

Why AI in Revenue Operations Fails Without Governed No-Code Architecture

digitado ⋅ 23 de April de 2026

Every enterprise revenue team has seen the demonstration at some point. The AI suggests a price, routes an approval, and flags a risk with apparent confidence, and the room is impressed by the fluency of it. What happens six months after go-live tends to be a different conversation, where the team is managing exceptions, investigating outputs that conflict with internal policy, and reconciling AI-generated recommendations against business rules the system had no reliable way to access in the […]

Ver mais

Like 0

Liked Liked

technocracy

VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

digitado ⋅ 3 de March de 2026

arXiv:2406.16227v2 Announce Type: replace Abstract: Effective clustering of biomedical data is crucial in precision medicine, enabling accurate stratifiction of patients or samples. However, the growth in availability of high-dimensional categorical data, including `omics data, necessitates computationally efficient clustering algorithms. We present VICatMix, a variational Bayesian finite mixture model designed for the clustering of categorical data. The use of variational inference (VI) in its training allows the model to outperform competitors in term of efficiency, while maintaining high accuracy. […]

Ver mais

Like 0

Liked Liked

technocracy

Robust Bayesian Optimisation with Unbounded Corruptions

digitado ⋅ 17 de February de 2026

arXiv:2511.15315v2 Announce Type: replace Abstract: Bayesian Optimization is critically vulnerable to extreme outliers. Existing provably robust methods typically assume a bounded cumulative corruption budget, which makes them defenseless against even a single corruption of sufficient magnitude. To address this, we introduce a new adversary whose budget is only bounded in the frequency of corruptions, not in their magnitude. We then derive RCGP-UCB, an algorithm coupling the famous upper confidence bound (UCB) approach with a Robust Conjugate Gaussian Process […]

Ver mais

Like 0

Liked Liked

technocracy

Why it’s critical to move beyond overly aggregated machine-learning metrics

digitado ⋅ 20 de January de 2026

MIT researchers have identified significant examples of machine-learning model failure when those models are applied to data other than what they were trained on, raising questions about the need to test whenever a model is deployed in a new setting. “We demonstrate that even when you train models on large amounts of data, and choose the best average model, in a new setting this ‘best model’ could be the worst model for 6-75 percent of the new data,” […]

Ver mais

Like 0

Liked Liked

technocracy

AI Is Changing Cyber Risk. Here’s How SMBs Can Respond.

digitado ⋅ 26 de June de 2026

Amid a surge in cyberattacks, security expert Daniel Dobrygowski shares steps every small to midsize business can take to avoid being an easy target.

Ver mais

Like 0

Liked Liked

technocracy

Challenges and Future Directions in Agentic Reverse Engineering Systems

digitado ⋅ 18 de April de 2026

arXiv:2604.14317v1 Announce Type: new Abstract: Agentic systems built on large language models (LLMs) are increasingly being used for complex security tasks, including binary reverse engineering (RE). Despite recent growth in popularity and capability, these systems continue to face limitations in realistic settings. Cutting-edge systems still fail in complex RE scenarios that involve obfuscation, timing, and unique architecture. In this work, we examine how agentic systems perform reverse engineering tasks with static, dynamic, and hybrid agents. Through an analysis […]

Ver mais

Like 0

Liked Liked

technocracy

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations

digitado ⋅ 9 de April de 2026

arXiv:2604.06277v1 Announce Type: new Abstract: Existing hallucination detection methods for large language models (LLMs) rely on external verification at inference time, requiring gold answers, retrieval systems, or auxiliary judge models. We ask whether this external supervision can instead be distilled into the model’s own representations during training, enabling hallucination detection from internal activations alone at inference time. We introduce a weak supervision framework that combines three complementary grounding signals: substring matching, sentence embedding similarity, and an LLM as […]

Ver mais

Like 0

Liked Liked