digitado

About digitado

https://www.digitado.com.br

Posts by :

Reinforcement Learning with LLM-Guided Action Spaces for Synthesizable Lead Optimization

digitado ⋅ 9 de April de 2026

Lead optimization in drug discovery requires improving therapeutic properties while ensuring that proposed molecular modifications correspond to feasible synthetic routes. Existing approaches either prioritize property scores without enforcing synthesizability, or rely on expensive enumeration over large reaction networks, while direct application of Large Language Models (LLMs) frequently produces chemically invalid structures. We introduce MolReAct, a framework that formulates lead optimization as a Markov Decision Process over a synthesis-constrained action space defined by validated reaction templates. A tool-augmented LLM […]

Ver mais

Like 0

Liked Liked

technocracy

Root prime gap

digitado ⋅ 9 de April de 2026

I recently found out about Andrica’s conjecture: the square roots of consecutive primes are less than 1 apart. In symbols, Andrica’s conjecture says that if pn and pn+1 are consecutive prime numbers, then √pn+1 − √pn < 1. This has been empirically verified for primes up to 2 × 1019. If the conjecture is true, it puts an upper bound on how long you’d have to search to find the next prime: pn+1 < 1 + 2√pn + […]

Ver mais

Like 0

Liked Liked

technocracy

An Imperfect Verifier is Good Enough: Learning with Noisy Rewards

digitado ⋅ 9 de April de 2026

Reinforcement Learning with Verifiable Rewards (RLVR) has become a prominent method for post-training Large Language Models (LLMs). However, verifiers are rarely error-free; even deterministic checks can be inaccurate, and the growing dependence on model-based judges exacerbates the issue. The extent to which RLVR is robust to such noise and the verifier accuracy required for effective training remain unresolved questions. We investigate these questions in the domains of code generation and scientific reasoning by introducing noise into RL training. […]

Ver mais

Like 0

Liked Liked

technocracy

A Survey on Progressive Web Applications for Decentralized Systems

digitado ⋅ 9 de April de 2026

Progressive Web Applications (PWAs) have emerged as a transformative paradigm in modern software engineering, combining the reach of the web with the capabilities of native applications. Simulta- neously, decentralized systems—anchored by blockchain technology, distributed ledger frameworks, and peer-to-peer networking protocols—are reshaping trust architectures across industries ranging from finance and healthcare to supply chain and digital identity. Despite the clear synergies between these two technological pillars, the intersection of PWAs and decentralized systems remains relatively underexplored in the academic […]

Ver mais

Like 0

Liked Liked

technocracy

Cognitive-Causal Multi-Task Learning with Psychological State Conditioning for Assistive Driving Perception

digitado ⋅ 9 de April de 2026

Multi-task learning for advanced driver assistance systems requires modeling the complex interplay between driver internal states and external traffic environments. However, existing methods treat recognition tasks as flat and independent objectives, failing to exploit the cognitive causal structure underlying driving behavior. In this paper, we propose CauPsi, a cognitive science-grounded causal multi-task learning framework that explicitly models the hierarchical dependencies among Traffic Context Recognition (TCR), Vehicle Context Recognition (VCR), Driver Emotion Recognition (DER), and Driver Behavior Recognition (DBR). […]

Ver mais

Like 0

Liked Liked

technocracy

How to Deploy Open WebUI with Secure OpenAI API Integration, Public Tunneling, and Browser-Based Chat Access

digitado ⋅ 9 de April de 2026

In this tutorial, we build a complete Open WebUI setup in Colab, in a practical, hands-on way, using Python. We begin by installing the required dependencies, then securely provide our OpenAI API key through terminal-based secret input so that sensitive credentials are not exposed directly in the notebook. From there, we configure the environment variables needed for Open WebUI to communicate with the OpenAI API, define a default model, prepare a data directory for runtime storage, and launch […]

Ver mais

Like 0

Liked Liked

technocracy

Z.AI Introduces GLM-5.1: An Open-Weight 754B Agentic Model That Achieves SOTA on SWE-Bench Pro and Sustains 8-Hour Autonomous Execution

digitado ⋅ 9 de April de 2026

Z.AI, the AI platform developed by the team behind the GLM model family, has released GLM-5.1 — its next-generation flagship model developed specifically for agentic engineering. Unlike models optimized for clean, single-turn benchmarks, GLM-5.1 is built for agentic tasks, with significantly stronger coding capabilities than its predecessor, and achieves state-of-the-art performance on SWE-Bench Pro while leading GLM-5 by a wide margin on NL2Repo (repo generation) and Terminal-Bench 2.0 (real-world terminal tasks). Architecture: DSA, MoE, and Asynchronous RL Before […]

Ver mais

Like 0

Liked Liked

technocracy

Meet OSGym: A New OS Infrastructure Framework That Manages 1,000+ Replicas at $0.23/Day for Computer Use Agent Research

digitado ⋅ 9 de April de 2026

Training AI agents that can actually use a computer — opening apps, clicking buttons, browsing the web, writing code — is one of the hardest infrastructure problems in modern AI. It’s not a data problem. It’s not a model problem. It’s a plumbing problem. You need to spin up hundreds, potentially thousands, of full operating system environments with actual graphical user interfaces. Each one needs to run real software. Each one needs to handle unpredictable crashes. And you […]

Ver mais

Like 0

Liked Liked

technocracy

A Comprehensive Implementation Guide to ModelScope for Model Search, Inference, Fine-Tuning, Evaluation, and Export

digitado ⋅ 9 de April de 2026

In this tutorial, we explore ModelScope through a practical, end-to-end workflow that runs smoothly on Colab. We begin by setting up the environment, verifying dependencies, and confirming GPU availability so we can work with the framework reliably from the start. From there, we interact with the ModelScope Hub to search for models, download snapshots, load datasets, and understand how its ecosystem connects with familiar tools such as Hugging Face Transformers. As we move forward, we apply pretrained pipelines […]

Ver mais

Like 0

Liked Liked

technocracy

A Three- and a Four- Body Problem

digitado ⋅ 9 de April de 2026

Last week I wrote about the orbit of Artemis II. The orbit of Artemis I was much more interesting. Because Artemis I was unmanned, it could spend a lot more time in orbit. The Artemis I mission took 25 days while Artemis II will take 10 days. Artemis I took an unusual path, orbiting the moon the opposite direction of the moon’s orbit around earth. This video by Primal Space demonstrates the orbit both from the perspective of […]

Ver mais

Like 0

Liked Liked