digitado

About digitado

https://www.digitado.com.br

Posts by :

Building a Code Dataset Pipeline from NVIDIA Nemotron-Pretraining-Code-v3 Metadata with Streaming, Pandas, and tiktoken

digitado ⋅ 11 de June de 2026

In this tutorial, we work with NVIDIA’s Nemotron-Pretraining-Code-v3 dataset as a large-scale metadata index for code pretraining research. Instead of downloading the full multi-gigabyte dataset, we stream it, inspect its schema, and build a manageable sample for analysis. We then explore the dataset by studying languages, file extensions, repository frequency, and directory depth, which helps us understand how the index is structured. After that, we reconstruct the raw GitHub URLs from the metadata, attempt to fetch the actual […]

Ver mais

Like 0

Liked Liked

technocracy

Anthropic Releases Claude Fable 5 and Claude Mythos 5: Same Underlying Model, Different Safeguards, New Mythos-Class Tier

digitado ⋅ 11 de June de 2026

Anthropic released two models on June 9, 2026: Claude Fable 5 and Claude Mythos 5. Both belong to a tier called “Mythos-class.” This tier sits above the Opus class in capability. Fable 5 is the version claimed to be made safe for general use. Mythos 5 is the same model with some safeguards lifted, kept in limited release. Claude Fable 5 and Mythos 5 Mythos-class models are a tier of Claude models. They sit above the Opus class […]

Ver mais

Like 0

Liked Liked

technocracy

Top AI Coding Agents and Development Platforms in 2026: Atoms, Devin, Windsurf, Cursor, Warp, and More Compared

digitado ⋅ 11 de June de 2026

Software development has changed. Engineers no longer type most code by hand. They describe intent, and AI agents do the work. Modern tools plan tasks, edit across files, run tests, and open pull requests. Many now ship to production with limited supervision. No single tool fits every need. This guide covers the AI coding agents and platforms shaping development in 2026. Developer Tools Guide Top AI Coding Agents & Platforms — 2026 A practitioner’s field guide to the […]

Ver mais

Like 0

Liked Liked

technocracy

5 Useful Python Scripts to Automate Boring PDF Tasks

digitado ⋅ 11 de June de 2026

PDFs are used everywhere, and these five Python scripts help you automate the most common PDF tasks.

Ver mais

Like 0

Liked Liked

technocracy

Local Agentic Programming on the Cheap: Claude Code + Ollama + Gemma4

digitado ⋅ 11 de June de 2026

This article builds a full local agentic programming stack using Ollama, Gemma 4, and Claude Code.

Ver mais

Like 0

Liked Liked

technocracy

Formally proving a calculation with Claude and Lean

digitado ⋅ 11 de June de 2026

I ran an experiment today to see whether Claude [1] could generate Lean code to prove a calculation at the bottom of this post, six lines of calculus. I started with this prompt This page contains a mathematical proof that a Fourier coefficient, a_n, is given in terms of a Bessel function. The LaTeX source for the SVG image is contained in the alt tag of the image. Generate a formal proof of the result using Lean. and […]

Ver mais

Like 0

Liked Liked

technocracy

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

digitado ⋅ 11 de June de 2026

A former xAI engineer is suing the company and SpaceX, alleging he was fired for raising AI safety concerns about Grok days before SpaceX’s historic IPO.

Ver mais

Like 0

Liked Liked

technocracy

A Coding Implementation on Microsoft SkillOpt for Instrumented Prompt Optimization, Skill Evolution Analysis, and Baseline Comparison

digitado ⋅ 11 de June de 2026

In this tutorial, we implement an instrumented workflow for Microsoft SkillOpt. We set up the SkillOpt repository, connect it to OpenAI-compatible model access, configure the optimizer and target models, and run the SearchQA optimization pipeline with a controlled sample limit to keep costs manageable. We first evaluate the original seed skill as a baseline, then run a real optimization loop in which SkillOpt improves the skill through rollout, reflection, aggregation, selection, updating, and validation-based gating. Along the way, […]

Ver mais

Like 0

Liked Liked

technocracy

Fresh off bond sale, Amazon borrows $17.5 billion from banks as AI spending continues

digitado ⋅ 10 de June de 2026

As AI spending continues to climb, the e-commerce giant has taken out a fresh $17.5 billion loan from a small coterie of banks.

Ver mais

Like 0

Liked Liked

technocracy

DiffusionGemma

digitado ⋅ 10 de June de 2026

DiffusionGemma Last May Google briefly released an experimental Gemini Diffusion model. I tried the preview at the time and recorded it running at 857 tokens/second. It was an exciting model, but Google made no further announcements about it. That research has returned in the best possible way: as a new open weight (Apache 2 licensed) Gemma model, google/diffusiongemma-26B-A4B-it. NVIDIA are currently hosting the model for free on their NIM cloud API. I used that API to generate this […]

Ver mais

Like 0

Liked Liked