technocracy

How to save the policy with best performance during training with CleanRL ?

digitado ⋅ 26 de February de 2026

Hi guys, I’m new to the libary CleanRL. I have run some training scripts by using the `uv run python cleanrl/….py` command. I’m not sure if this can save the best policy (e.g. the policy returns best episode rewards) during training. I just went through the documentation of CleanRL and found no information about this. Do you know how can I save the best policy during training and load it after training ? submitted by /u/ZitaLovesCats [link] [comments]

Ver mais

Like 0

Liked Liked

technocracy

Variational optimization approach for reconstruction of dielectric permittivity and conductivity functions using partial boundary measurements

digitado ⋅ 23 de February de 2026

arXiv:2602.17819v1 Announce Type: new Abstract: We present a variational optimization approach for the solution of a coefficient inverse problem of simultaneous reconstruction of the dielectric permittivity and conductivity functions in time-dependent Maxwell’s system using limited boundary observations of the electric field. The variational optimization approach is based on constructing a weak form of a Lagrangian which allows to use finite element based reconstruction algorithms. The optimality conditions for the Lagrangian and stability estimate for the adjoint problem are […]

Ver mais

Like 0

Liked Liked

technocracy

[P] I wrote a CUDA Locality Sensitive Hashing library with Python bindings

digitado ⋅ 6 de January de 2026

I’ve been working on cuLSH, a GPU-accelerated library for Locality Sensitive Hashing. Main Features: Scikit-Learn Style API: Uses a familiar fit() / query() style API for building and searching the LSH index. CUDA-native: All components (projection generation, hashing, indexing, querying), are performed on the GPU via custom kernels. End-to-End: Not just a hasher; includes bucketed searching and candidate neighbor collection. I know there are plenty of LSH implementations out there, but many focus purely on generating signatures rather […]

Ver mais

Like 0

Liked Liked

technocracy

AI deployment in financial services hits an inflection point as Singapore leads the shift to production

digitado ⋅ 13 de February de 2026

AI deployment in financial services has crossed a critical threshold, with only 2% of institutions globally reporting no AI use whatsoever—a dramatic indicator that the technology has moved decisively from boardroom discussion to operational reality. New research from Finastra surveying 1,509 senior leaders across 11 markets reveals that Singapore financial institutions are leading this transition, with nearly two-thirds already deploying AI in production environments rather than confining it to experimental pilots. The Financial Services State of the Nation […]

Ver mais

Like 0

Liked Liked

technocracy

Robert Wright: Nonzero: The Logic of Human Destiny

digitado ⋅ 1 de November de 2024

Robert Wright traces a grand pattern beneath history: life’s long drift toward cooperation. From single cells to civilizations, progress blooms when we play games where everyone can win. Evolution, technology, and morality, he suggests, are not accidents, but echoes of a deeper logic pulling humanity toward shared destiny.

Ver mais

Like 0

Liked Liked

technocracy

Machine Learning-Driven Crystal System Prediction for Perovskites Using Augmented X-ray Diffraction Data

digitado ⋅ 4 de February de 2026

Prediction of crystal system from X-ray diffraction (XRD) spectra is a critical task in materials science, particularly for perovskite materials which are known for their diverse applications in photovoltaics, optoelectronics, and catalysis. In this study, we present a machine learning (ML)-driven framework that leverages advanced models, including Time Series Forest (TSF), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and a simple feedforward neural network (NN), to […]

Ver mais

Like 0

Liked Liked

technocracy

Model-Free Monte Carlo-like Policy Evaluation

digitado ⋅ 31 de March de 2010

We propose an algorithm for estimating the finite-horizon expected return of a closed loop control policy from an a priori given (off-policy) sample of one-step transitions. It averages cumulated rewards along a set of “broken trajectories” made of one-step transitions selected from the sample on the basis of the control policy. Under some Lipschitz continuity assumptions on the system dynamics, reward function and control policy, we provide bounds on the bias and variance of the estimator that depend […]

Ver mais

Like 0

Liked Liked

technocracy

Sparse Autoencoders are Capable LLM Jailbreak Mitigators

digitado ⋅ 16 de February de 2026

arXiv:2602.12418v1 Announce Type: new Abstract: Jailbreak attacks remain a persistent threat to large language model safety. We propose Context-Conditioned Delta Steering (CC-Delta), an SAE-based defense that identifies jailbreak-relevant sparse features by comparing token-level representations of the same harmful request with and without jailbreak context. Using paired harmful/jailbreak prompts, CC-Delta selects features via statistical testing and applies inference-time mean-shift steering in SAE latent space. Across four aligned instruction-tuned models and twelve jailbreak attacks, CC-Delta achieves comparable or better safety-utility […]

Ver mais

Like 0

Liked Liked

technocracy

Researchers make “neuromorphic” artificial skin for robots

digitado ⋅ 29 de December de 2025

The nervous system does an astonishing job of tracking sensory information, and does so using signals that would drive many computer scientists insane: a noisy stream of activity spikes that may be transmitted to hundreds of additional neurons, where they are integrated with similar spike trains coming from still other neurons. Now, researchers have used spiking circuitry to build an artificial robotic skin, adopting some of the principles of how signals from our sensory neurons are transmitted and […]

Ver mais

Like 0

Liked Liked

technocracy

Degeneracy of Koszul Homological Series on Lie Algebroids. Production of All Affine Structures, Production of all Riemannian Foliations and Production of All Fedosov Structures

digitado ⋅ 14 de January de 2026

The framework of the research whose part of results are published in this work is the category of real vector bundles over finite dimensional differentiable manifolds. The objects of studies are ( textit{gauge structures on these vector bundles} ). We are interested in dynamical properties of the holonomy groups of Koszul connections as well as on their topological properties, i.e. properties that are of homological nature. For the most part the context is the subcategory of Lie algebroids. […]

Ver mais

Like 0

Liked Liked