digitado – Page 129

Omitted Variable Bias in Language Models Under Distribution Shift

digitado ⋅ 20 de February de 2026

arXiv:2602.16784v1 Announce Type: new Abstract: Despite their impressive performance on a wide variety of tasks, modern language models remain susceptible to distribution shifts, exhibiting brittle behavior when evaluated on data that differs in distribution from their training data. In this paper, we describe how distribution shifts in language models can be separated into observable and unobservable components, and we discuss how established approaches for dealing with distribution shift address only the former. Importantly, we identify that the resulting […]

Ver mais

Like 0

Liked Liked

technocracy

Google’s new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

digitado ⋅ 3 de June de 2026

The generative AI boom has driven the cost of memory into the stratosphere, and Google is a key part of that trend. So it’s only fitting that Google should offer some less RAM-hungry local AI models. The company has announced the release of a new Gemma 4 model that fills a gap in the lineup that launched earlier this year. The new model is efficient enough that you may be able to run it on a pretty average […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Deformable Object Manipulation Using Task-Level Iterative Learning Control

digitado ⋅ 26 de February de 2026

arXiv:2602.21302v1 Announce Type: new Abstract: Dynamic manipulation of deformable objects is challenging for humans and robots because they have infinite degrees of freedom and exhibit underactuated dynamics. We introduce a Task-Level Iterative Learning Control method for dynamic manipulation of deformable objects. We demonstrate this method on a non-planar rope manipulation task called the flying knot. Using a single human demonstration and a simplified rope model, the method learns directly on hardware without reliance on large amounts of demonstration […]

Ver mais

Like 0

Liked Liked

technocracy

Limits of n-gram Style Control for LLMs via Logit-Space Injection

digitado ⋅ 26 de January de 2026

arXiv:2601.16224v1 Announce Type: new Abstract: Large language models (LLMs) are typically personalized via prompt engineering or parameter-efficient fine-tuning such as LoRA. However, writing style can be difficult to distill into a single prompt, and LoRA fine-tuning requires computationally intensive training and infrastructure. We investigate a possible lightweight alternative: steering a frozen LLM with n-gram style priors injected in logit space at decoding time. We train an n-gram model on stylistically distinct corpora — including Don Quixote, CNN/DailyMail news […]

Ver mais

Like 0

Liked Liked

technocracy

Self Paced Gaussian Contextual Reinforcement Learning

digitado ⋅ 24 de March de 2026

Curriculum learning improves reinforcement learning (RL) efficiency by sequencing tasks from simple to complex. However, many self-paced curriculum methods rely on computationally expensive inner-loop optimizations, limiting their scalability in high-dimensional context spaces. In this paper, we propose Self-Paced Gaussian Curriculum Learning (SPGL), a novel approach that avoids costly numerical procedures by leveraging a closed-form update rule for Gaussian context distributions. SPGL maintains the sample efficiency and adaptability of traditional self-paced methods while substantially reducing computational overhead. We provide […]

Ver mais

Like 0

Liked Liked

technocracy

How I rolled out an AI automation stack for my Product Team and saved 30% of our working time

digitado ⋅ 2 de June de 2026

Building an AI Automation Stack for Your Product Team: Lessons From a Year of Trying For most of my career as a product manager, I assumed the boring parts of the job were just the cost of doing it. Writing the same ticket templates. Drafting rollout documents in the same format. Producing weekly status updates. All of it necessary, none of it interesting Then I spent a year building an automation stack for my team and learned that […]

Ver mais

Like 0

Liked Liked

technocracy

A Few Bad Neurons: Isolating and Surgically Correcting Sycophancy

digitado ⋅ 28 de January de 2026

arXiv:2601.18939v1 Announce Type: new Abstract: Behavioral alignment in large language models (LLMs) is often achieved through broad fine-tuning, which can result in undesired side effects like distributional shift and low interpretability. We propose a method for alignment that identifies and updates only the neurons most responsible for a given behavior, a targeted approach that allows for fine-tuning with significantly less data. Using sparse autoencoders (SAEs) and linear probes, we isolate the 3% of MLP neurons most predictive of […]

Ver mais

Like 0

Liked Liked

technocracy

Kernel Density Machines

digitado ⋅ 27 de March de 2026

arXiv:2504.21419v3 Announce Type: replace Abstract: We introduce kernel density machines (KDM), an agnostic kernel-based framework for learning the Radon-Nikodym derivative (density) between probability measures under minimal assumptions. KDM applies to general measurable spaces and avoids the structural requirements common in classical nonparametric density estimators. We construct a sample estimator and prove its consistency and a functional central limit theorem. To enable scalability, we develop Nystrom-type low-rank approximations and derive optimal error rates, filling a gap in the literature […]

Ver mais

Like 0

Liked Liked

technocracy

From Textbook to Talkbot: A Case Study of a Greek-Language RAG-Based Chatbot in Higher Education

digitado ⋅ 22 de January de 2026

arXiv:2601.14265v1 Announce Type: new Abstract: The integration of AI chatbots into educational settings has opened new pathways for transforming teaching and learning, offering enhanced support to both educators and learners. This study investigates the design and application of an AI chatbot as an educational tool in higher education. Designed to operate in the Greek language, the chatbot addresses linguistic challenges unique to Greek while delivering accurate, context grounded support aligned with the curriculum. The AI chatbot is built […]

Ver mais

Like 0

Liked Liked

technocracy

The Last Night of the Golden Threads

digitado ⋅ 14 de May de 2026

:::info Astounding Stories of Super-Science April 2004, by Astounding Stories is part of HackerNoon’s Book Blog Post series. You can jump to any chapter in this book here. THE COUNTRY OF THE BLIND – XXXIII. — THE BEAUTIFUL SUIT. Astounding Stories of Super-Science April 2004: THE COUNTRY OF THE BLIND – XXXIII. — THE BEAUTIFUL SUIT. By H. G. Wells ::: There was once a little man whose mother made him a beautiful suit of clothes. It was green […]

Ver mais

Like 0

Liked Liked