February 2026

Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

digitado ⋅ 16 February 2026

arXiv:2602.12444v1 Abstract: Reinforcement learning (RL) is a powerful framework for optimal decision-making and control but often lacks provable guarantees for safety-critical applications. In this paper, we introduce a novel recovery-based shielding framework that enables safe RL with a provable safety lower bound for unknown and non-linear continuous dynamical systems. The proposed approach integrates a backup policy (shield) with the RL agent, leveraging Gaussian process (GP) based uncertainty quantification to predict potential violations of safety constraints, […]

SHAPR: A Solo Human-Centred and AI-Assisted Practice Framework for Research Software Development

digitado ⋅ 16 February 2026

arXiv:2602.12443v1 Abstract: Research software has become a central vehicle for inquiry and learning in many Higher Degree Research (HDR) contexts, where solo researchers increasingly develop software-based artefacts as part of their research methodology. At the same time, generative artificial intelligence is reshaping development practice, offering powerful forms of assistance while introducing new challenges for accountability, reflection, and methodological rigour. Although Action Design Research (ADR) provides a well-established foundation for studying and constructing socio-technical artefacts, it […]

Prototype-driven fusion of pathology and spatial transcriptomics for interpretable survival prediction

digitado ⋅ 16 February 2026

arXiv:2602.12441v1 Abstract: Whole slide images (WSIs) enable weakly supervised prognostic modeling via multiple instance learning (MIL). Spatial transcriptomics (ST) preserves in situ gene expression, providing a spatial molecular context that complements morphology. As paired WSI-ST cohorts scale to population level, leveraging their complementary spatial signals for prognosis becomes crucial; however, principled cross-modal fusion strategies remain limited for this paradigm. To this end, we introduce PathoSpatial, an interpretable end-to-end framework integrating co-registered WSIs and ST to […]

Interpolation-Inspired Closure Certificates

digitado ⋅ 16 February 2026

arXiv:2602.12436v1 Abstract: Barrier certificates, a form of state invariants, provide an automated approach to the verification of the safety of dynamical systems. Similarly to barrier certificates, recent works explore the notion of closure certificates, a form of transition invariants, to verify dynamical systems against $\omega$-regular properties including safety. A closure certificate, defined over state pairs of a dynamical system, is a real-valued function whose zero superlevel set characterizes an inductive transition invariant of the system. […]

DRAMatic Speedup: Accelerating HE Operations on a Processing-in-Memory System

digitado ⋅ 16 February 2026

arXiv:2602.12433v1 Abstract: Homomorphic encryption (HE) is a promising technology for confidential cloud computing, as it allows computations on encrypted data. However, HE is computationally expensive and often memory-bound on conventional computer architectures. Processing-in-Memory (PIM) is an alternative hardware architecture that integrates processing units and memory on the same chip or memory module. PIM enables higher memory bandwidth than conventional architectures and could thus be suitable for accelerating HE. In this work, we present DRAMatic, which […]

KeySense: LLM-Powered Hands-Down, Ten-Finger Typing on Commodity Touchscreens

digitado ⋅ 16 February 2026

arXiv:2602.12432v1 Abstract: Existing touchscreen software keyboards prevent users from resting their hands, forcing slow and fatiguing index-finger tapping (“chicken typing”) instead of familiar hands-down ten-finger typing. We present KeySense, a purely software solution that preserves physical keyboard motor skills. KeySense isolates intentional taps from resting-finger noise using cognitive-motor timing patterns, and then uses a fine-tuned LLM decoder to convert the resulting noisy letter sequence into the intended word. In controlled component tests, the decoder substantially […]

Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

digitado ⋅ 16 February 2026

arXiv:2602.12430v1 Abstract: The transition from monolithic language models to modular, skill-equipped agents marks a defining shift in how large language models (LLMs) are deployed in practice. Rather than encoding all procedural knowledge within model weights, agent skills (composable packages of instructions, code, and resources that agents load on demand) enable dynamic capability extension without retraining. It is formalized in a paradigm of progressive disclosure, portable skill definitions, and integration with the Model Context […]

Stabilizing Native Low-Rank LLM Pretraining

digitado ⋅ 16 February 2026

arXiv:2602.12429v1 Abstract: Foundation models have achieved remarkable success, yet their growing parameter counts pose significant computational and memory challenges. Low-rank factorization offers a promising route to reduce training and inference costs, but the community lacks a stable recipe for training models from scratch using exclusively low-rank weights while matching the performance of the dense model. We demonstrate that Large Language Models (LLMs) can be trained from scratch using exclusively low-rank factorized weights for all non-embedding […]

RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

digitado ⋅ 16 February 2026

arXiv:2602.12424v1 Abstract: Benchmarks establish a standardized evaluation framework to systematically assess the performance of large language models (LLMs), facilitating objective comparisons and driving advancements in the field. However, existing benchmarks fail to differentiate question difficulty, limiting their ability to effectively distinguish models’ capabilities. To address this limitation, we propose RankLLM, a novel framework designed to quantify both question difficulty and model competency. RankLLM introduces difficulty as the primary criterion for differentiation, enabling a more fine-grained […]

CacheMind: From Miss Rates to Why — Natural-Language, Trace-Grounded Reasoning for Cache Replacement

digitado ⋅ 16 February 2026

arXiv:2602.12422v1 Abstract: Cache replacement remains a challenging problem in CPU microarchitecture, often addressed using hand-crafted heuristics, limiting cache performance. Cache data analysis requires parsing millions of trace entries with manual filtering, making the process slow and non-interactive. To address this, we introduce CacheMind, a conversational tool that uses Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) to enable semantic reasoning over cache traces. Architects can now ask natural language questions like, “Why is the memory […]
