May 2026

Google AI Releases Multi-Token Prediction (MTP) Drafters for Gemma 4: Delivering Up to 3x Faster Inference Without Quality Loss

digitado ⋅ 7 de May de 2026

Large language models are getting incredibly powerful, but let’s be honest—their inference speed is still a massive headache for anyone trying to use them in production. Google just launched Multi-Token Prediction (MTP) drafters for the Gemma 4 model family. This specialized speculative decoding architecture can actually triple (3x) your speed at inference time, all without sacrificing a bit of output quality or reasoning accuracy. The release comes just weeks after Gemma 4 surpassed 60 million downloads and directly […]

Ver mais

Like 0

Liked Liked

technocracy

A Groq-Powered Agentic Research Assistant with LangGraph, Tool Calling, Sub-Agents, and Agentic Memory: Lets Built It

digitado ⋅ 7 de May de 2026

In this tutorial, we build a Groq-powered agentic research workflow that runs directly using Groq’s free OpenAI-compatible inference endpoint. We configure LangChain’s ChatOpenAI interface to work with Groq by setting the Groq API key and base URL, allowing us to use fast hosted models such as llama-3.3-70b-versatile for tool-based reasoning. We then connect the model with practical tools for web search, webpage fetching, file handling, Python execution, skill loading, sub-agent delegation, and long-term memory. By the end of […]

Ver mais

Like 0

Liked Liked

technocracy

CopilotKit Introduces Enterprise Intelligence Platform That Gives Agentic Applications Persistent Memory Across Sessions and Devices

digitado ⋅ 6 de May de 2026

Most agentic applications today have a memory problem. Every time a user opens a new session, the agent starts from zero. There is no recollection of what was discussed, what workflows were in progress, or what decisions were already made. The session ends, and everything disappears. For dev teams shipping production agentic applications, the only way around this has been to hand-roll a storage layer from scratch, picking a database, serializing state, managing session IDs, and connecting it […]

Ver mais

Like 0

Liked Liked

technocracy

Triangular analog of the squircle

digitado ⋅ 6 de May de 2026

TimF left a comment on my guitar pick post saying the image was a “squircle-ish analog for an isosceles triangle.” That made me wonder what a more direct analog of the squircle might be for a triangle. A squircle is not exactly a square with rounded corners. The sides are continuously curved, but curved most at the corners. See, for example, this post. Suppose the sides of our triangle are given by L1(x, y) = 1 for i […]

Ver mais

Like 0

Liked Liked

technocracy

Hackable PyTorch RL library with distributional algorithms (D4PG, DSAC, DPPO)

digitado ⋅ 6 de May de 2026

I published a paper on distributional RL for legged locomotion a while back and recently resurfaced and cleaned up the code into a standalone repo: https://github.com/e3ntity/e3rl Here’s a DPPO policy trained with this library running on a real robot: https://sites.google.com/leggedrobotics.com/risk-aware-locomotion The library is based on rsl_rl but contains readable PyTorch implementations of the most popular continuous control algorithms (PPO, SAC, TD3, DDPG), plus their distributional counterparts DPPO, DSAC, D4PG. Runs on CUDA, Apple Silicon, or CPU. pip install […]

Ver mais

Like 0

Liked Liked

technocracy

Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

digitado ⋅ 6 de May de 2026

Pre-trained transformers are able to learn from examples provided as part of the prompt without any weight updates, a remarkable ability known as in-context learning (ICL). Despite its demonstrated efficacy across various domains, the theoretical understanding of ICL is still developing. Whereas most existing theory has focused on linear models, we study ICL in the nonlinear regression setting. Through the interaction mechanism in attention, we explicitly construct transformer networks to realize nonlinear features, such as polynomial or spline […]

Ver mais

Like 0

Liked Liked

technocracy

Report: SpaceX IPO gives Musk unchecked power and forbids investor lawsuits

digitado ⋅ 6 de May de 2026

SpaceX’s plan to go public will reportedly give CEO Elon Musk “virtually unchecked executive authority” and limit the rights of shareholders to sue the company. The plan, reported by Reuters today, could prevent shareholder lawsuits like the one that held up a lucrative Musk pay package at Tesla. “Excerpts of SpaceX’s IPO registration statement reviewed by Reuters show the company is combining supervoting shares, mandatory arbitration, stricter rules on shareholder proposals and Texas corporate law to give Musk […]

Ver mais

Like 0

Liked Liked

technocracy

What Matters in Practical Learned Image Compression

digitado ⋅ 6 de May de 2026

One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be optimized directly to appeal to the human visual system. Despite this potential, a perceptual yet practical image codec is yet to be proposed. In this work, we aim to close this gap. We conduct a comprehensive study of the key modeling choices that govern the design of a practical learned image codec, jointly optimized for perceptual quality and […]

Ver mais

Like 0

Liked Liked

technocracy

Human-AI Co-Mentorship in Project-Based Learning: A Case Study in Financial Forecasting

digitado ⋅ 6 de May de 2026

This paper reflects on a AI research project carried out by a team of high-school and early-undergraduate students under the mentorship of graduate researchers and ably assisted by AI tools. We share our experience in not only on the learning experience for the high school students, but also on how AI tools accelerated the process that enabled the high school students to focus on higher order problem formulation and solution. Although the participants entered the project with limited […]

Ver mais

Like 0

Liked Liked

technocracy

Google DeepMind partners with EVE Online for AI model testing

digitado ⋅ 6 de May de 2026

Google’s AI-focused DeepMind division has taken a minority stake in the developer of popular sci-fi simulation EVE Online, saying it will use the game to study “intelligence in complex, dynamic, player-driven systems.” The research partnership comes as the management behind EVE Online developer CCP Games announced that they have spent $120 million to buy themselves out from their former owners at South Korean publisher Pearl Abyss (Crimson Desert). The newly independent entity is being rebranded as Fenris Creations, […]

Ver mais

Like 0

Liked Liked