digitado

About digitado

https://www.digitado.com.br

Posts by :

technocracy

Local Agentic Programming on the Cheap: Claude Code + Ollama + Gemma4

digitado ⋅ June 11, 2026

This article builds a full local agentic programming stack using Ollama, Gemma 4, and Claude Code.

technocracy

Formally proving a calculation with Claude and Lean

digitado ⋅ June 11, 2026

I ran an experiment today to see whether Claude [1] could generate Lean code to prove a calculation at the bottom of this post, six lines of calculus. I started with this prompt: "This page contains a mathematical proof that a Fourier coefficient, a_n, is given in terms of a Bessel function. The LaTeX source for the SVG image is contained in the alt tag of the image. Generate a formal proof of the result using Lean." and […]

technocracy

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

digitado ⋅ June 11, 2026

A former xAI engineer is suing the company and SpaceX, alleging he was fired for raising AI safety concerns about Grok days before SpaceX’s historic IPO.

technocracy

A Coding Implementation on Microsoft SkillOpt for Instrumented Prompt Optimization, Skill Evolution Analysis, and Baseline Comparison

digitado ⋅ June 11, 2026

In this tutorial, we implement an instrumented workflow for Microsoft SkillOpt. We set up the SkillOpt repository, connect it to OpenAI-compatible model access, configure the optimizer and target models, and run the SearchQA optimization pipeline with a controlled sample limit to keep costs manageable. We first evaluate the original seed skill as a baseline, then run a real optimization loop in which SkillOpt improves the skill through rollout, reflection, aggregation, selection, updating, and validation-based gating. Along the way, […]

technocracy

Fresh off bond sale, Amazon borrows $17.5 billion from banks as AI spending continues

digitado ⋅ June 10, 2026

As AI spending continues to climb, the e-commerce giant has taken out a fresh $17.5 billion loan from a small coterie of banks.

technocracy

DiffusionGemma

digitado ⋅ June 10, 2026

Last May Google briefly released an experimental Gemini Diffusion model. I tried the preview at the time and recorded it running at 857 tokens/second. It was an exciting model, but Google made no further announcements about it. That research has returned in the best possible way: as a new open-weight (Apache 2 licensed) Gemma model, google/diffusiongemma-26B-A4B-it. NVIDIA are currently hosting the model for free on their NIM cloud API. I used that API to generate this […]

technocracy

Access OpenAI models and Codex through your Oracle cloud commitment

digitado ⋅ June 10, 2026

Access OpenAI models and Codex through Oracle Cloud, using existing commitments to build and deploy AI with enterprise security and governance.

technocracy

Logitech’s foldable mouse is for people who refuse to carry a mouse with them

digitado ⋅ June 10, 2026

I see it often. Hardworking professionals in cafés, airports, or parks hunched over a laptop while carefully dragging their fingers over their PC’s trackpad to navigate some email, project, or alert that can’t be ignored. They would prefer a mouse to a trackpad, but are reluctant to travel with one. When you’re on the go, carrying a mouse can seem burdensome or unnecessary. But I’d argue that it’s worth the boost in efficiency and comfort when navigating your […]

technocracy

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

digitado ⋅ June 10, 2026

Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model family, but it’s fundamentally different from the rest of the lineup. DiffusionGemma doesn’t generate outputs linearly like most AI models. Instead, it can produce an entire block of text in parallel. Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU. Most AI […]

technocracy

Routing LLMs by task verifiability: a small experiment (n=120, 3 models) inspired by Karpathy’s framework [D]

digitado ⋅ June 10, 2026

Full disclosure: this is directional, not a paper: n=120 tasks, one internal evaluator, not peer reviewed. I work at an LLM infrastructure company. This experiment was done on my own time and is not a company claim. Karpathy’s framework classifies tasks by verifiability: can the output be mechanically checked? High-verifiability tasks like code compilation and structured JSON extraction are safer because the verifier catches errors. Low-verifiability tasks like creative writing are riskier. I wondered if high verifiability […]
